科研成果详情

发表状态已发表Published
题名A Convolutional Recurrent Neural-Network-Based Machine Learning for Scene Text Recognition Application
作者
发表日期2023-04-01
发表期刊Symmetry
ISSN/eISSN2073-8994
卷号15期号:4
摘要

Optical character recognition (OCR) is the process of acquiring text and layout information through analysis and recognition of text data image files. It is also a process to identify the geometric location and orientation of the texts and their symmetrical behavior. It usually consists of two steps: text detection and text recognition. Scene text recognition is a subfield of OCR that focuses on processing text in natural scenes, such as streets, billboards, license plates, etc. Unlike traditional document category photographs, it is a challenging task to use computer technology to locate and read text information in natural scenes. Imaging sequence recognition is a longstanding subject of research in the field of computer vision. Great progress has been made in this field; however, most models struggled to recognize text in images of complex scenes with high accuracy. This paper proposes a new pattern of text recognition based on the convolutional recurrent neural network (CRNN) as a solution to address this issue. It combines real-time scene text detection with differentiable binarization (DBNet) for text detection and segmentation, text direction classifier, and the Retinex algorithm for image enhancement. To evaluate the effectiveness of the proposed method, we performed experimental analysis of the proposed algorithm, and carried out simulation on complex scene image data based on existing literature data and also on several real datasets designed for a variety of nonstationary environments. Experimental results demonstrated that our proposed model performed better than the baseline methods on three benchmark datasets and achieved on-par performance with other approaches on existing datasets. This model can solve the problem that CRNN cannot identify text in complex and multi-oriented text scenes. Furthermore, it outperforms the original CRNN model with higher accuracy across a wider variety of application scenarios.

关键词CRNN DBNet OCR Retinex
DOI10.3390/sym15040849
URL查看来源
收录类别SCIE
语种英语English
WOS研究方向Science & Technology - Other Topics
WOS类目Multidisciplinary Sciences
WOS记录号WOS:000979610900001
Scopus入藏号2-s2.0-85156115004
引用统计
被引频次:8[WOS]   [WOS记录]     [WOS相关记录]
文献类型期刊论文
条目标识符https://repository.uic.edu.cn/handle/39GCC9TT/10564
专题理工科技学院
通讯作者Shi, Hongjian
作者单位
Guangdong Provincial Key Laboratory of Interdisciplinary Research and Application for Data Science,Beijing Normal University—Hong Kong Baptist University United International College,Zhuhai,519087,China
第一作者单位北师香港浸会大学
通讯作者单位北师香港浸会大学
推荐引用方式
GB/T 7714
Liu, Yiyi,Wang, Yuxin,Shi, Hongjian. A Convolutional Recurrent Neural-Network-Based Machine Learning for Scene Text Recognition Application[J]. Symmetry, 2023, 15(4).
APA Liu, Yiyi, Wang, Yuxin, & Shi, Hongjian. (2023). A Convolutional Recurrent Neural-Network-Based Machine Learning for Scene Text Recognition Application. Symmetry, 15(4).
MLA Liu, Yiyi,et al."A Convolutional Recurrent Neural-Network-Based Machine Learning for Scene Text Recognition Application". Symmetry 15.4(2023).
条目包含的文件
条目无相关文件。
个性服务
查看访问统计
谷歌学术
谷歌学术中相似的文章
[Liu, Yiyi]的文章
[Wang, Yuxin]的文章
[Shi, Hongjian]的文章
百度学术
百度学术中相似的文章
[Liu, Yiyi]的文章
[Wang, Yuxin]的文章
[Shi, Hongjian]的文章
必应学术
必应学术中相似的文章
[Liu, Yiyi]的文章
[Wang, Yuxin]的文章
[Shi, Hongjian]的文章
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。