Details of Research Outputs

TitleA Versatility-Performance Balanced Hardware Architecture for Scene Text Detection
Creator
Date Issued2022
Conference Name2022 IEEE Smartworld, Ubiquitous Intelligence & Computing, Scalable Computing & Communications, Digital Twin, Privacy Computing, Metaverse, Autonomous & Trusted Vehicles (SmartWorld/UIC/ScalCom/DigitalTwin/PriComp/Meta)
Source PublicationProceedings - 2022 IEEE Smartworld, Ubiquitous Intelligence & Computing, Scalable Computing & Communications, Digital Twin, Privacy Computing, Metaverse, Autonomous & Trusted Vehicles (SmartWorld/UIC/ScalCom/DigitalTwin/PriComp/Meta)
ISBN979-8-3503-4655-8
Pages540-549
Conference Date15-18 December 2022
Conference PlaceHaikou, China
PublisherIEEE
Abstract

Detecting and extracting textual information from natural scene images needs Scene Text Detection (STD) algorithms. Fully Convolutional Neural Networks (FCNs) are usually utilized as the backbone model to extract features in these instance segmentation based STD algorithms. FCNs naturally come with high computational complexity. Furthermore, to keep up with the growing variety of models, flexible architectures are needed. In order to accelerate various STD algorithms efficiently, a versatility-performance balanced hardware architecture is proposed, together with a simple but efficient way of configuration. This architecture is able to compute different FCN models without hardware redesign. The optimization is focused on hardware with finely designed computing modules, while the versatility of different network reconfigurations is achieved by microcodes instead of a strenuously designed compiler. Multiple parallel techniques at different levels and several complexity-reduction methods are explored to speed up the FCN computation. Results from implementation show that, given the same tasks, the proposed system achieves a better throughput compared with the studied GPU. Particularly, our system reduces the comprehensive Operation Expense (OpEx) at GPU by 46%, while the power efficiency is enhanced by 32%. This work has been deployed in commercial applications and provided stable consumer text detection services.

KeywordFiled-Programmable-Gate-Array (FPGA) Fully Convolutional Neural Network (FCN) Hardware Acceleration Instance Segmentation Scene Text Detection (STD)
DOI10.1109/SmartWorld-UIC-ATC-ScalCom-DigitalTwin-PriComp-Metaverse56740.2022.00093
URLView source
Language英语English
Scopus ID2-s2.0-85168156335
Citation statistics
Document TypeConference paper
Identifierhttp://repository.uic.edu.cn/handle/39GCC9TT/11675
CollectionFaculty of Science and Technology
Corresponding AuthorChen, Donglong
Affiliation
1.Peng Cheng Laboratory,Shenzhen,China
2.National University of Defense Technology,Changsha,China
3.BNU-HKBU United International College,Zhuhai,China
4.Southern University of Science and Technology,Shenzhen,China
5.City University of Hong Kong,Hong Kong, China
6.Uc,Santa Barbara,United States
7.NUAA,China
8.Iǧdir University,Turkey
Corresponding Author AffilicationBeijing Normal-Hong Kong Baptist University
Recommended Citation
GB/T 7714
Xin, Yao,Tang, Guoming,Chen, Donglonget al. A Versatility-Performance Balanced Hardware Architecture for Scene Text Detection[C]: IEEE, 2022: 540-549.
Files in This Item:
There are no files associated with this item.
Related Services
Usage statistics
Google Scholar
Similar articles in Google Scholar
[Xin, Yao]'s Articles
[Tang, Guoming]'s Articles
[Chen, Donglong]'s Articles
Baidu academic
Similar articles in Baidu academic
[Xin, Yao]'s Articles
[Tang, Guoming]'s Articles
[Chen, Donglong]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Xin, Yao]'s Articles
[Tang, Guoming]'s Articles
[Chen, Donglong]'s Articles
Terms of Use
No data!
Social Bookmark/Share
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.