Title | Modeling of Information Diffusion in Sina Weibo Based on Random Forest Classifier and SIR Model |
Creator | |
Date Issued | 2020 |
Conference Name | 15th International Conference on Natural Computation, Fuzzy Systems and Knowledge Discovery, ICNC-FSKD 2019 |
Source Publication | Advances in Intelligent Systems and Computing |
ISSN | 2194-5357 |
Volume | 1074 |
Pages | 569-576 |
Conference Date | July 20-22, 2019 |
Conference Place | Kunming, People's Republic of China |
Abstract | Recent developments in information diffusion models for social networks have not taken their topological structures into account. Characteristics such as the degree of connections and the clustering of nodes in a network are known to influence the speed of information propagation. Yet existing models (such as SIR with an average probability of reposting a received message) are not sophisticated enough to reflect these fine-grained characteristics. Differences among nodes are often overlooked, leading to an inaccurate description of the information dissemination process. In this work, a new approach to predicting the information diffusion probability in social networks is studied. We combine Random Forest classification and the SIR model to analyze the dissemination of information in Weibo. Python crawlers are employed to obtain a total of 316,329 microblogs concerning major news events in 2018, together with related features of nodes from Sina Weibo. The imbalanced positive and negative repost samples, together with 15 features that characterize the node and edge data, are rebalanced by SMOTE resampling and then used to train a Random Forest classifier to predict an individual user's forwarding behavior. For comparison, we find that the performance of the Random Forest classifier, judged by the area under the receiver operating characteristic (ROC) curve (AUC), is higher than that of a comparable SVM model. Finally, a Susceptible-Infected-Recovered (SIR) information propagation model, with the forwarding rates obtained from the Random Forest classifier as input parameters, is used to simulate the information dissemination process of Weibo. The predicted time behaviors of the Susceptible, Infected, and Recovered populations are in good agreement with real-life data obtained from Sina Weibo. |
Keyword | Information diffusion; Machine learning; Random Forest classifier; SIR model; SMOTE resampling; Social network |
DOI | 10.1007/978-3-030-32456-8_62 |
URL | View source |
Language | English |
Scopus ID | 2-s2.0-85077004622 |
Citation statistics | Cited Times [WOS]: 0 |
Document Type | Conference paper |
Identifier | http://repository.uic.edu.cn/handle/39GCC9TT/6217 |
Collection | Faculty of Science and Technology |
Affiliation | Beijing Normal University-Hong Kong Baptist University United International College,Zhuhai,China |
First Author Affiliation | Beijing Normal-Hong Kong Baptist University |
Recommended Citation GB/T 7714 | Zhang, Jianyi, He, Ping, Tsang, Ken K.T. et al. Modeling of Information Diffusion in Sina Weibo Based on Random Forest Classifier and SIR Model[C], 2020: 569-576. |
Files in This Item: | There are no files associated with this item. |
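
The abstract describes feeding a forwarding rate into an SIR propagation model. As a rough illustration only, and not the authors' implementation, the following Python sketch integrates the classic SIR equations with forward Euler steps. The function name `simulate_sir`, the step size, and all numeric values are hypothetical; in particular, `beta` here is a fixed placeholder standing in for a forwarding rate that the paper derives from its Random Forest classifier.

```python
def simulate_sir(beta, gamma, s0, i0, r0, steps, dt=0.1):
    """Integrate the classic SIR equations with forward Euler steps.

    beta  : forwarding (infection) rate; a placeholder value, not from the paper
    gamma : rate at which spreaders lose interest (recovery rate); also assumed
    """
    n = s0 + i0 + r0  # total population stays constant
    s, i, r = float(s0), float(i0), float(r0)
    history = [(s, i, r)]
    for _ in range(steps):
        new_infections = beta * s * i / n * dt  # S -> I transitions this step
        new_recoveries = gamma * i * dt         # I -> R transitions this step
        s -= new_infections
        i += new_infections - new_recoveries
        r += new_recoveries
        history.append((s, i, r))
    return history


# Hypothetical parameters chosen only to make the sketch runnable.
history = simulate_sir(beta=0.4, gamma=0.1, s0=9990, i0=10, r0=0, steps=500)
final_s, final_i, final_r = history[-1]
print(f"S={final_s:.1f} I={final_i:.1f} R={final_r:.1f}")
```

A per-user forwarding probability, as the abstract suggests, would replace the single `beta` with node-specific rates; this sketch keeps one average rate for brevity.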
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.