发表状态 | 已发表Published |
题名 | Multi-class Twitter sentiment classification with emojis |
作者 | |
发表日期 | 2018-09-28 |
发表期刊 | Industrial Management and Data Systems
![]() |
ISSN/eISSN | 0263-5577 |
卷号 | 118期号:9页码:1804-1820 |
摘要 | Purpose: Recently, various Twitter Sentiment Analysis (TSA) techniques have been developed, but little has paid attention to the microblogging feature – emojis, and few works have been conducted on the multi-class sentiment analysis of tweets. The purpose of this paper is to consider the popularity of emojis on Twitter and investigate the feasibility of an emoji training heuristic for multi-class sentiment classification of tweets. Tweets from the “2016 Orlando nightclub shooting” were used as a source of study. Besides, this study also aims to demonstrate how mapping can contribute to interpreting sentiments. Design/methodology/approach: The authors presented a methodological framework to collect, pre-process, analyse and map public Twitter postings related to the shooting. The authors designed and implemented an emoji training heuristic, which automatically prepares the training data set, a feature needed in Big Data research. The authors improved upon the previous framework by advancing the pre-processing techniques, enhancing feature engineering and optimising the classification models. The authors constructed the sentiment model with a logistic regression classifier and selected features. Finally, the authors presented how to visualise citizen sentiments on maps dynamically using Mapbox. Findings: The sentiment model constructed with the automatically annotated training sets using an emoji approach and selected features performs well in classifying tweets into five different sentiment classes, with a macro-averaged F-measure of 0.635, a macro-averaged accuracy of 0.689 and the MAEM of 0.530. Compared to those experimental results in related works, the results are satisfactory, indicating the model is effective and the proposed emoji training heuristic is useful and feasible in multi-class TSA. The maps authors created, provide a much easier-to-understand visual representation of the data, and make it more efficient to monitor citizen sentiments and distributions. Originality/value: This work appears to be the first to conduct multi-class sentiment classification on Twitter with automatic annotation of training sets using emojis. Little attention has been paid to applying TSA to monitor the public’s attitudes towards terror attacks and country’s gun policies, the authors consider this work to be a pioneering work. Besides, the authors have introduced a new data set of 2016 Orlando Shooting tweets, which will be made available for other researchers to mine the public’s political opinions about gun policies. |
关键词 | Emojis Sentiment analysis Twitter |
DOI | 10.1108/IMDS-12-2017-0582 |
URL | 查看来源 |
收录类别 | SCIE |
语种 | 英语English |
WOS研究方向 | Computer Science ; Engineering |
WOS类目 | Computer Science, Interdisciplinary Applications ; Engineering, Industrial |
WOS记录号 | WOS:000446474100005 |
Scopus入藏号 | 2-s2.0-85053283315 |
引用统计 | |
文献类型 | 期刊论文 |
条目标识符 | https://repository.uic.edu.cn/handle/39GCC9TT/10983 |
专题 | 个人在本单位外知识产出 |
通讯作者 | Li, Mengdi |
作者单位 | 1.NVIDIA Joint,Lab on Mixed Reality (Visualisation and AI),University of Nottingham Ningbo China,Ningbo,China 2.International Doctoral Innovation Centre,University of Nottingham Ningbo China,Ningbo,China 3.Nottingham University Business School China,University of Nottingham Ningbo China,Ningbo,China 4.NVIDIA Technology Centre APJ,Singapore |
推荐引用方式 GB/T 7714 | Li, Mengdi,Ch'ng, Eugene,Chong, Alain Yee Loonget al. Multi-class Twitter sentiment classification with emojis[J]. Industrial Management and Data Systems, 2018, 118(9): 1804-1820. |
APA | Li, Mengdi, Ch'ng, Eugene, Chong, Alain Yee Loong, & See, Simon. (2018). Multi-class Twitter sentiment classification with emojis. Industrial Management and Data Systems, 118(9), 1804-1820. |
MLA | Li, Mengdi,et al."Multi-class Twitter sentiment classification with emojis". Industrial Management and Data Systems 118.9(2018): 1804-1820. |
条目包含的文件 | 条目无相关文件。 |
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。
修改评论