Details of Research Outputs

Status已发表Published
TitleA Convolutional Recurrent Neural-Network-Based Machine Learning for Scene Text Recognition Application
Creator
Date Issued2023-04-01
Source PublicationSymmetry
ISSN2073-8994
Volume15Issue:4
Abstract

Optical character recognition (OCR) is the process of acquiring text and layout information through analysis and recognition of text data image files. It is also a process to identify the geometric location and orientation of the texts and their symmetrical behavior. It usually consists of two steps: text detection and text recognition. Scene text recognition is a subfield of OCR that focuses on processing text in natural scenes, such as streets, billboards, license plates, etc. Unlike traditional document category photographs, it is a challenging task to use computer technology to locate and read text information in natural scenes. Imaging sequence recognition is a longstanding subject of research in the field of computer vision. Great progress has been made in this field; however, most models struggled to recognize text in images of complex scenes with high accuracy. This paper proposes a new pattern of text recognition based on the convolutional recurrent neural network (CRNN) as a solution to address this issue. It combines real-time scene text detection with differentiable binarization (DBNet) for text detection and segmentation, text direction classifier, and the Retinex algorithm for image enhancement. To evaluate the effectiveness of the proposed method, we performed experimental analysis of the proposed algorithm, and carried out simulation on complex scene image data based on existing literature data and also on several real datasets designed for a variety of nonstationary environments. Experimental results demonstrated that our proposed model performed better than the baseline methods on three benchmark datasets and achieved on-par performance with other approaches on existing datasets. This model can solve the problem that CRNN cannot identify text in complex and multi-oriented text scenes. Furthermore, it outperforms the original CRNN model with higher accuracy across a wider variety of application scenarios.

KeywordCRNN DBNet OCR Retinex
DOI10.3390/sym15040849
URLView source
Indexed BySCIE
Language英语English
WOS Research AreaScience & Technology - Other Topics
WOS SubjectMultidisciplinary Sciences
WOS IDWOS:000979610900001
Scopus ID2-s2.0-85156115004
Citation statistics
Cited Times:7[WOS]   [WOS Record]     [Related Records in WOS]
Document TypeJournal article
Identifierhttp://repository.uic.edu.cn/handle/39GCC9TT/10564
CollectionFaculty of Science and Technology
Corresponding AuthorShi, Hongjian
Affiliation
Guangdong Provincial Key Laboratory of Interdisciplinary Research and Application for Data Science,Beijing Normal University—Hong Kong Baptist University United International College,Zhuhai,519087,China
First Author AffilicationBeijing Normal-Hong Kong Baptist University
Corresponding Author AffilicationBeijing Normal-Hong Kong Baptist University
Recommended Citation
GB/T 7714
Liu, Yiyi,Wang, Yuxin,Shi, Hongjian. A Convolutional Recurrent Neural-Network-Based Machine Learning for Scene Text Recognition Application[J]. Symmetry, 2023, 15(4).
APA Liu, Yiyi, Wang, Yuxin, & Shi, Hongjian. (2023). A Convolutional Recurrent Neural-Network-Based Machine Learning for Scene Text Recognition Application. Symmetry, 15(4).
MLA Liu, Yiyi,et al."A Convolutional Recurrent Neural-Network-Based Machine Learning for Scene Text Recognition Application". Symmetry 15.4(2023).
Files in This Item:
There are no files associated with this item.
Related Services
Usage statistics
Google Scholar
Similar articles in Google Scholar
[Liu, Yiyi]'s Articles
[Wang, Yuxin]'s Articles
[Shi, Hongjian]'s Articles
Baidu academic
Similar articles in Baidu academic
[Liu, Yiyi]'s Articles
[Wang, Yuxin]'s Articles
[Shi, Hongjian]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Liu, Yiyi]'s Articles
[Wang, Yuxin]'s Articles
[Shi, Hongjian]'s Articles
Terms of Use
No data!
Social Bookmark/Share
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.