Details of Research Outputs

TitleModeling of Information Diffusion in Sina Weibo Based on Random Forest Classifier and SIR Model
Creator
Date Issued2020
Conference Name15th International Conference on Natural Computation, Fuzzy Systems and Knowledge Discovery, ICNC-FSKD 2019
Source PublicationAdvances in Intelligent Systems and Computing
ISSN2194-5357
Volume1074
Pages569-576
Conference DateJuly 20-22, 2019
Conference PlaceKunming, PEOPLES R CHINA
Abstract

Recent developments in information diffusion model for social network have not taken into account its topological structures. Characteristics such as the degree of connections and clustering of nodes in a network are known to influence the speed of information propagation. Yet, existing models (such as SIR with an average probability to repost received message) are not sophisticate enough to reflect the fine-grain characteristics. Differences among nodes are often overlooked, leading to inaccurate description of the information dissemination process. In this work, a new approach to predict the information diffusion probability in social network is studied. We combine the Random Forest classification and the SIR model together to analyze the dissemination of information in Weibo. Python crawlers are employed to obtain a total of 316,329 microblogs concerning major news events in 2018, together with related features of nodes from Sina Weibo. The unbalanced positive and negative repost behavior together with 15 features that characterize the nodes and edges data are rebalanced by SMOTE resampling, then used to train a Random Forest classifier to predict individual user’s forwarding behavior. For comparison, we find the performance of the Random Forest classifier, judging from the AUC of receiver operating characteristic (ROC) curve, is higher than a comparable SVM model. Finally, a Susceptible Infected Recovered (SIR) information propagation model with the forwarding rates obtained from the Random Forest classifier as input parameter is used to simulate the information dissemination process of Weibo. The predicted time behaviors of the Susceptible, Infected, and Recovered populations are in good agreement with real-life data obtained from Sina Weibo.

KeywordInformation diffusion Machine learning Random Forest classifier SIR model SMOTE resampling Social network
DOI10.1007/978-3-030-32456-8_62
URLView source
Language英语English
Scopus ID2-s2.0-85077004622
Citation statistics
Cited Times [WOS]:0   [WOS Record]     [Related Records in WOS]
Document TypeConference paper
Identifierhttp://repository.uic.edu.cn/handle/39GCC9TT/6217
CollectionFaculty of Science and Technology
Affiliation
Beijing Normal University-Hong Kong Baptist University United International College,Zhuhai,China
First Author AffilicationBeijing Normal-Hong Kong Baptist University
Recommended Citation
GB/T 7714
Zhang, Jianyi,He, Ping,Tsang, Ken K.T.et al. Modeling of Information Diffusion in Sina Weibo Based on Random Forest Classifier and SIR Model[C], 2020: 569-576.
Files in This Item:
There are no files associated with this item.
Related Services
Usage statistics
Google Scholar
Similar articles in Google Scholar
[Zhang, Jianyi]'s Articles
[He, Ping]'s Articles
[Tsang, Ken K.T.]'s Articles
Baidu academic
Similar articles in Baidu academic
[Zhang, Jianyi]'s Articles
[He, Ping]'s Articles
[Tsang, Ken K.T.]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Zhang, Jianyi]'s Articles
[He, Ping]'s Articles
[Tsang, Ken K.T.]'s Articles
Terms of Use
No data!
Social Bookmark/Share
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.