Title | MCFEND: A Multi-source Benchmark Dataset for Chinese Fake News Detection |
Creator | |
Date Issued | 2024-05-13 |
Conference Name | WWW '24: The ACM Web Conference 2024 |
Source Publication | WWW 2024 - Proceedings of the ACM Web Conference
![]() |
ISBN | 979-8-4007-0171-9 |
Pages | 4018-4027 |
Conference Date | May 13 - 17, 2024 |
Conference Place | Singapore |
Country | Singapore |
Abstract | The prevalence of fake news across various online sources has had a significant influence on the public. Existing Chinese fake news detection datasets are limited to news sourced solely from Weibo. However, fake news originating from multiple sources exhibits diversity in various aspects, including its content and social context. Methods trained on purely one single news source can hardly be applicable to real-world scenarios. Our pilot experiment demonstrates that the F1 score of the state-of-the-art method that learns from a large Chinese fake news detection dataset, Weibo-21, drops significantly from 0.943 to 0.470 when the test data is changed to multi-source news data, failing to identify more than one-third of the multi-source fake news. To address this limitation, we constructed the first multi-source benchmark dataset for Chinese fake news detection, termed MCFEND, which is composed of news we collected from diverse sources such as social platforms, messaging apps, and traditional online news outlets. Notably, such news has been fact-checked by 14 authoritative fact-checking agencies worldwide. In addition, various existing Chinese fake news detection methods are thoroughly evaluated on our proposed dataset in cross-source, multi-source, and unseen source ways. MCFEND, as a benchmark dataset, aims to advance Chinese fake news detection approaches in real-world scenarios. |
Keyword | chinese fake news detection cross-source evaluation multi-source benchmark dataset multi-source evaluation |
DOI | 10.1145/3589334.3645385 |
URL | View source |
Language | 英语English |
Scopus ID | 2-s2.0-85194096251 |
Citation statistics | |
Document Type | Conference paper |
Identifier | http://repository.uic.edu.cn/handle/39GCC9TT/11776 |
Collection | Beijing Normal-Hong Kong Baptist University |
Corresponding Author | He, Haorui |
Affiliation | 1.Department of Interactive Media, Hong Kong Baptist University, Hong Kong 2.School of Data Science, The Chinese University of Hong Kong, Shenzhen, Shenzhen, China 3.Department of Computer Science, Beijing Normal University-Hong Kong Baptist University, United International College, Zhuhai, China 4.Department of Computer Science, The University of Hong Kong, Hong Kong |
Recommended Citation GB/T 7714 | Li, Yupeng,He, Haorui,Bai, Jinet al. MCFEND: A Multi-source Benchmark Dataset for Chinese Fake News Detection[C], 2024: 4018-4027. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment