Details of Research Outputs

TitleBNU-HKBU UIC NLP team 2 at SemEval-2019 task 6: Detecting offensive language using BERT model
Creator
Date Issued2019
Conference NameNAACL HLT 2019 - International Workshop on Semantic Evaluation
Source PublicationNAACL HLT 2019 - International Workshop on Semantic Evaluation - Proceedings of the 13th Workshop
ISBN9781950737062
Pages551-555
Conference DateJune 6–June 7, 2019
Conference PlaceMinneapolis, Minnesota, USA
Abstract

In this study we deal with the problem of identifying and categorizing offensive language in social media. Our group, BNU-HKBU UIC NLP Team2, use supervised classification along with multiple version of data generated by different ways of pre-processing the data. We then use the state-of-the-art model Bidirectional Encoder Representations from Transformers, or BERT (Devlin et al. (2018)), to capture linguistic, syntactic and semantic features. Long range dependencies between each part of a sentence can be captured by BERT's bidirectional encoder representations. Our results show 85.12% accuracy and 80.57% F1 scores in Subtask A (offensive language identification), 87.92% accuracy and 50% F1 scores in Subtask B (categorization of offense types), and 69.95% accuracy and 50.47% F1 score in Subtask C (offense target identification). Analysis of the results shows that distinguishing between targeted and untargeted offensive language is not a simple task. More work needs to be done on the unbalance data problem in Subtasks B and C. Some future work is also discussed.

URLView source
Language英语English
Scopus ID2-s2.0-85091389212
Citation statistics
Document TypeConference paper
Identifierhttp://repository.uic.edu.cn/handle/39GCC9TT/6839
CollectionFaculty of Science and Technology
Affiliation
Computer Science and Technology,Division of Science and Technology,BNU-HKBU United International College,Zhuhai,Guangdong,China
First Author AffilicationBeijing Normal-Hong Kong Baptist University
Recommended Citation
GB/T 7714
Wu, Zhenghao,Zheng, Hao,Wang, Jianminget al. BNU-HKBU UIC NLP team 2 at SemEval-2019 task 6: Detecting offensive language using BERT model[C], 2019: 551-555.
Files in This Item:
There are no files associated with this item.
Related Services
Usage statistics
Google Scholar
Similar articles in Google Scholar
[Wu, Zhenghao]'s Articles
[Zheng, Hao]'s Articles
[Wang, Jianming]'s Articles
Baidu academic
Similar articles in Baidu academic
[Wu, Zhenghao]'s Articles
[Zheng, Hao]'s Articles
[Wang, Jianming]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Wu, Zhenghao]'s Articles
[Zheng, Hao]'s Articles
[Wang, Jianming]'s Articles
Terms of Use
No data!
Social Bookmark/Share
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.