基于多层次情感与语义特征融合的虚假新闻检测方法

doi:10.11871/jfdc.issn.2096-742X.2026.02.009

数据与计算发展前沿 ›› 2026, Vol. 8 ›› Issue (2): 111-122.

CSTR: 32002.14.jfdc.CN10-1649/TP.2026.02.009

doi: 10.11871/jfdc.issn.2096-742X.2026.02.009

基于多层次情感与语义特征融合的虚假新闻检测方法

安真仟(),刘为军^*()

中国人民公安大学，侦查学院，北京 100038

收稿日期:2025-01-15 出版日期:2026-04-20 发布日期:2026-04-23
通讯作者: *刘为军（E-mail: liuweijun@ppsuc.edu.cn）
作者简介:安真仟，中国人民公安大学，博士研究生，主要研究方向为网络安全。
本文中主要负责论文中模型的实现与文章的写作。
AN Zhenqian is a Ph.D. candidate at the People’s Public Security University of China. His main research interests include cybersecurity.
In this paper, he is mainly responsible for the implementation of the model and the writing of the article.
E-mail: 2023111010@stu.ppsuc.edu.cn|刘为军，中国人民公安大学网络安全与法治协同创新中心教授，侦查学院学术委员会主任，博士生导师，主要研究方向为网络犯罪治理。
本文中主要负责文章思路与写作的指导。
LIU Weijun is a professor at the Collaborative Innovation Center for Cybersecurity and Rule of Law at the People’s Public Security University of China, Academic Committee Director of the School of Investigation, and a Ph.D. supervisor. His research interests include the governance of cybercrime.
In this paper, he is mainly responsible for providing guidance on the overall framework and writing of the paper.
E-mail: liuweijun@ppsuc.edu.cn
基金资助:
中国人民公安大学侦查学双一流专项(2023SYL02)

A Fake News Detection Method Based on Multi-Level Sentiment and Semantic Feature Fusion

AN Zhenqian(),LIU Weijun^*()

Institute of Criminal Investigation, China People’s Public Security University, Beijing 100038, China

Received:2025-01-15 Online:2026-04-20 Published:2026-04-23

摘要/Abstract

摘要：

【目的】 随着互联网的快速发展和生成式人工智能技术的迅猛进步，虚假新闻传播问题日益严峻，如何高效、准确开展虚假新闻检测成为学界关注的热点问题。传统虚假新闻检测方法主要依赖于新闻文本的语义特征，忽视了对文本其他维度特征以及评论等信息的深入挖掘。【方法】 本文提出基于BERT-SentiMHCA（Bidirectional Encoder Representations from Transformer-Sentiment Features with Multi-Head Cross Attention）的虚假新闻检测方法，在考虑新闻正文语义特征的基础上，综合考虑正文、评论的情感特征，及二者情感相似度，通过多头自注意力机制和多头交叉注意力机制实现多层次特征有效融合，融合多层次文本与情感特征以提升虚假新闻检测的准确性与鲁棒性。具体方法包括：（1）使用预训练语言模型BERT提取新闻文本的深层语义表示；（2）构建情感分类器，分别提取正文及评论的情感特征；（3）设计融合模块，通过注意力机制实现特征融合并采用神经网络分类器构建虚假新闻检测模型。【结论】 在四个公开虚假新闻数据集上的实验结果表明，本文提出的BERT-SentiMHCA模型在准确率、精确率、召回率和F1值等评价指标上，较多个主流基线模型分别实现了较为明显的性能提升。结果验证了本文方法在多层次文本特征提取方面的能力，能够显著提升虚假新闻检测的性能。

关键词: 虚假新闻检测, 情感特征, BERT, 特征融合

Abstract:

[Objective] With the rapid development of the internet and the accelerated advancement of generative artificial intelligence technologies, the spread of fake news has become increasingly severe. Detecting fake news efficiently and accurately has emerged as a critical research focus in academia. Traditional fake news detection methods primarily rely on semantic features of news texts, while neglecting in-depth exploration of features from other dimensions of the text as well as information such as user comments. [Methods] This paper proposes a fake news detection method based on BERT-SentiMHCA (Bidirectional Encoder Representations from Transformer-Sentiment Features with Multi-Head Cross Attention) model. Building on semantic features of news content, the method comprehensively integrates sentiment features of both news content and user comments, as well as their sentimen similarity. Through multi-head self-attention and multi-head cross-attention mechanisms, it achieves effective fusion of multi-level features, combining textual and emotional characteristics to enhance the accuracy and robustness of fake news detection. The specific methodology includes: (1) utilizing the pre-trained language model BERT to extract deep semantic representations of news texts; (2) constructing sentiment classifiers to extract sentiment features from both news content and comments; (3) designing a fusion module that integrates features via attention mechanisms and employs a neural network classifier to build the fake news detection model. [Conclusions] Experimental results on four public fake news datasets demonstrate that the proposed BERT-SentiMHCA model achieves notable performance improvements in terms of accuracy, precision, recall, and F1-score compared to several mainstream baseline models. These results validate the capability of the proposed method in multi-level textual feature extraction, significantly enhancing the performance of fake news detection.

Key words: fake news detection, sentiment features, BERT, feature fusion

安真仟, 刘为军. 基于多层次情感与语义特征融合的虚假新闻检测方法[J]. 数据与计算发展前沿, 2026, 8(2): 111-122.

AN Zhenqian, LIU Weijun. A Fake News Detection Method Based on Multi-Level Sentiment and Semantic Feature Fusion[J]. Frontiers of Data and Computing, 2026, 8(2): 111-122, https://cstr.cn/32002.14.jfdc.CN10-1649/TP.2026.02.009.

图/表 8

图1

图2

图3

图4

表1

表2

表3

表4

参考文献 31

[1]	第55次《中国互联网络发展状况统计报告》发布[J]. 传媒论坛, 2025, 8(2):121.
[2]	NOTARMUZI D, CASTELLANO C, FLAMMINI A, et al. Universality, criticality and complexity of information propagation in social media[J]. Nature Communications, 2022, 13(1): 1308. doi: 10.1038/s41467-022-28964-8 pmid: 35288567
[3]	李明德, 李聿哲. 赋能和消解: 生成式人工智能与新闻真实的碰撞[J/OL]. 西安交通大学学报(社会科学版), 1-14[2025-04-23]. http://kns.cnki.net/kcms/detail/61.1329.C.20250325.1523.002.html.
[4]	ZHANG X, GHORBANI A A. An overview of online fake news: Characterization, detection, and discussion[J/OL]. Information Processing & Management, 2020, 57(2): 102025.
[5]	WANG C, ZHOU Z, JIN X L, et al. The influence of affective cues on positive emotion in predicting instant information sharing on microblogs: Gender as a moderator[J]. Information Processing & Management, 2017, 53(3): 721-734. doi: 10.1016/j.ipm.2017.02.003
[6]	CASTILLO C, MENDOZA M, POBLETE B. Information credibility on twitter[C/OL]. Proceedings of the 20th international conference on World wide web. Hyderabad India: ACM, 2011: 675-684.
[7]	PEREZROSAS V, KLEINBERG B, LEFEVRE A, et al. Automatic Detection of Fake News[M/OL]. Proceedings of the 27th International Conference on Computational Linguistics, pages 3391-3401 Santa Fe, New Mexico, USA, August 20-26, 2018.
[8]	MA J, GAO W, MITRA P, et al. Detecting rumors from microblogs with recurrent neural networks[C]. Proceedings of the 25th International Joint Conference on Artificial Intelligence (IJCAI2016). 2016:3818-3824.
[9]	WANG W Y. “Liar, Liar Pants on Fire”: A New Benchmark Dataset for Fake News Detection[M/OL]. Annual Meeting of the Association for Computational Linguistics, 2017.
[10]	李家毅. 基于LSTM的虚假新闻检测系统的研究与实现[D]. 哈尔滨: 哈尔滨工业大学, 2022.
[11]	唐雪涛. 基于神经网络嵌入模型的中文文本分类方法研究[D]. 合肥: 合肥工业大学, 2021.
[12]	刘晓明, 李丞正旭, 吴少聪, 等. 文本分类算法及其应用场景研究综述[J]. 计算机学报, 2024, 47(6): 1244-1287.
[13]	刘鑫楠, 洪鑫宇, 曹振洋, 等. 社交媒体谣言检测: 方法、挑战与趋势[J/OL]. 计算机工程与应用, 1-26[2025-05-18]. http://kns.cnki.net/kcms/detail/11.2127.TP.20241209.1359.006.html.
[14]	TRUICĂ C-O, APOSTOL E-S, KARRAS P. DANES: Deep Neural Network Ensemble Architecture for Social and Textual Context-aware Fake News Detection[J]. Knowledge-Based Systems, 2024, 294: 111715. doi: 10.1016/j.knosys.2024.111715
[15]	SHU K, SLIVA A, WANG S, et al. Fake News Detection on Social Media: A Data Mining Perspective[M/OL]. ACM SIGKDD explorations newsletter, 2017: 22-36.
[16]	FARHOUDINIA B, OZTURKCAN S, KASAP N. Emotions unveiled: detecting COVID-19 fake news on social media[J]. Humanities and Social Sciences Communications, 2024, 11: 640. doi: 10.1057/s41599-024-03083-5
[17]	WANG J, YU L C, LAI K R, et al. Dimensional Sentiment Analysis Using a Regional CNN-LSTM Model[C/OL]. Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers). Berlin, Germany: Association for Computational Linguistics, 2016: 225-230.
[18]	ZAEEM R N, LI C, BARBER K S. On Sentiment of Online Fake News[C/OL]// 2020 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM). The Hague, Netherlands: IEEE, 2020: 760-767.
[19]	LÜHRING J, SHETTY A, KOSCHMIEDER C, et al. Emotions in misinformation studies: distinguishing affective state from emotional response and misinformation recognition from acceptance[J]. Cognitive Research, 2024, 9: 82.
[20]	VOSOUGHI S, ROY D, ARAL S. The spread of true and false news online[J]. Science, 2018, 359(6380): 1146-1151. doi: 10.1126/science.aap9559 pmid: 29590045
[21]	GHANEM B, PONZETTO S P, ROSSO P, et al. FakeFlow: Fake News Detection by Modeling the Flow of Affective Information[C]. In: Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021: 679-689.
[22]	LIU F, ZHANG X, LIU Q. An Emotion-Aware Approach for Fake News Detection[J]. IEEE Transactions on Computational Social Systems, 2024, 11(3): 3516-3524. doi: 10.1109/TCSS.2023.3335269
[23]	REN L, DUAN G, HUANG T, et al. Multi-local feature relation network for few-shot learning[J]. Neural Computing and Applications, 2022, 34(10): 7393-7403. doi: 10.1007/s00521-021-06840-8
[24]	DEVLIN J, CHANG M W, LEE K, et al. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding[C]. Proceedings of the 2019 Conference of the North. Minneapolis, Minnesota: Association for Computational Linguistics, 2019: 4171-4186.
[25]	NAN Q, CAO J, ZHU Y, et al. MDFEND: Multi-domain Fake News Detection[C/OL]. Proceedings of the 30th ACM International Conference on Information & Knowledge Management. Virtual Event Queensland Australia: ACM, 2021: 3343-3347.
[26]	YANG C, ZHOU X, ZAFARANI R. CHECKED: COVID-19 fake news dataset[J]. Social Network Analysis and Mining, 2021, 11(1): 58. doi: 10.1007/s13278-021-00766-8
[27]	MA Z, LIU M, FANG G, et al. LTCR: Long-Text Chinese Rumor Detection Dataset[J]. arxiv preprint arxiv:2306.07201, 2023.
[28]	LI Y, HE H, BAI J, et al. Mcfend: a multi-source benchmark dataset for Chinese fake news detection[C]. In: Proceedings of the ACM Web Conference 2024, 2024: 4018-4027.
[29]	CHO K, VAN MERRIENBOER B, GULCEHRE C, et al. Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation[M]. arXiv, 2014.
[30]	KIM Y. Convolutional neural networks for sentence classification[C]// Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing. Doha: EMNLP, 2014: 1746-1751.
[31]	WANG Y, MA F, JIN Z, et al. EANN: Event Adversarial Neural Networks for Multi-Modal Fake News Detection[C]// Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. London United Kingdom: ACM, 2018: 849-857.

数据集		虚假新闻数量	真实新闻数量
weibo21	训练集	2,693	2,784
	验证集	898	928
	测试集	898	928
CHECKED	训练集	206	1,056
	验证集	69	352
	测试集	69	352
LTCR	训练集	310	1,039
	验证集	103	346
	测试集	103	346
MCFEND	训练集	10,629	3,644
	验证集	3,543	1,215
	测试集	3,543	1,215

实验环境	名称及参数
操作系统	Window 10
内存	2×8 GB
处理器	Intel Core i5-12400F
核心数	6
显卡	NVIDIA GeForce RTX 4060

数据集	方法	准确率	精确率	召回率	F1-值
weibo21	Bi-GRU	0.833	0.833	0.833	0.832
	TextCNN	0.881	0.884	0.881	0.881
	EANNT	0.898	0.898	0.897	0.898
	MDFEND	0.903	0.904	0.903	0.903
	Ours	0.924	0.925	0.924	0.923
CHECKED	Bi-GRU	0.832	0.692	0.832	0.756
	TextCNN	0.867	0.752	0.867	0.806
	EANNT	0.891	0.793	0.891	0.839
	MDFEND	0.887	0.786	0.886	0.833
	Ours	0.910	0.918	0.909	0.913
LTCR	Bi-GRU	0.855	0.821	0.855	0.838
	TextCNN	0.879	0.844	0.878	0.859
	EANNT	0.883	0.855	0.883	0.867
	MDFEND	0.910	0.885	0.909	0.897
	Ours	0.938	0.922	0.938	0.930
MCFEND	Bi-GRU	0.893	0.797	0.893	0.842
	TextCNN	0.890	0.793	0.890	0.839
	EANNT	0.887	0.787	0.887	0.834
	MDFEND	0.894	0.799	0.894	0.844
	Ours	0.901	0.810	0.901	0.853

数据集	方法	准确率	精确率	召回率	F1值
weibo21	BERT-NN	0.892	0.894	0.892	0.892
	BERT-CSM	0.905	0.907	0.905	0.905
	BERT-TSM	0.906	0.908	0.906	0.905
	Ours	0.924	0.925	0.924	0.923
CHECKED	BERT-NN	0.869	0.853	0.868	0.860
	BERT-CSM	0.883	0.870	0.882	0.876
	BERT-TSM	0.878	0.857	0.877	0.867
	Ours	0.910	0.918	0.909	0.913
LTCR	BERT-NN	0.883	0.846	0.883	0.860
	BERT-CSM	-	-	-	-
	BERT-TSM	0.902	0.874	0.902	0.888
	Ours	0.938	0.922	0.938	0.930
MCFEND	BERT-NN	0.877	0.768	0.877	0.819
	BERT-CSM	-	-	-	-
	BERT-TSM	0.886	0.784	0.886	0.832
	Ours	0.901	0.810	0.901	0.853

基于多层次情感与语义特征融合的虚假新闻检测方法

A Fake News Detection Method Based on Multi-Level Sentiment and Semantic Feature Fusion

RichHTML

PDF

可视化

摘要/Abstract

引用本文

使用本文

图/表 8

参考文献 31

相关文章 14

编辑推荐

Metrics

本文评价

[1]	琚子政, 陈鹏, 隋晋光, 朱隆昇. 基于多尺度时空图融合的警情预测模型[J]. 数据与计算发展前沿, 2026, 8(2): 154-170.
[2]	潘语泉,袁得嵛,贾源,王安然. 基于融合特征的VGAT-VGAN跨社交网络身份关联算法[J]. 数据与计算发展前沿, 2026, 8(1): 103-118.
[3]	朱冬,杨小渝,唐述杰,朱锋锋,孔潇,郭艳峰,李兵,秦志鹏. 基于小样本数据的晶体合成工艺智能推荐研究[J]. 数据与计算发展前沿, 2026, 8(1): 219-231.
[4]	令狐荣微, 张瑜, 石元泉, 杨玉军. 基于多特征融合的PE恶意代码检测与分类研究[J]. 数据与计算发展前沿, 2025, 7(6): 77-91.
[5]	强威,罗向东,张希莹. 基于BERTopic模型的产品需求演化与功能映射应用研究[J]. 数据与计算发展前沿, 2025, 7(5): 198-211.
[6]	李勇,任勇毛,殷卓然,周旭. 一种基于深度学习的轻量化流量识别模型[J]. 数据与计算发展前沿, 2025, 7(2): 3-11.
[7]	卢成浩,陈秀宏. 基于隐式分区学习深度特征融合重建曲面网络[J]. 数据与计算发展前沿, 2024, 6(6): 19-31.
[8]	赵小丹, 胡林. 基于深度学习的农业科技政策知识抽取方法研究[J]. 数据与计算发展前沿, 2024, 6(4): 106-115.
[9]	王桂江, 黄润才, 黄勃. 基于K-BERT和残差循环单元的中文情感分析[J]. 数据与计算发展前沿, 2023, 5(4): 127-138.
[10]	刘琦玮,李俊,顾蓓蓓,赵泽方. TSAIE：图像增强文本的多模态情感分析模型[J]. 数据与计算发展前沿, 2022, 4(3): 131-140.
[11]	陈雨,玄宇航,张玉志. 基于深度学习和指代消解的中文人名识别[J]. 数据与计算发展前沿, 2022, 4(2): 63-73.
[12]	李贞贞,钟永恒,王辉,刘佳,孙源. 基于深度学习与统计信息的领域术语抽取方法研究[J]. 数据与计算发展前沿, 2022, 4(2): 87-98.
[13]	陈涛,安俊秀. 基于特征融合的微博短文本情感分类研究[J]. 数据与计算发展前沿, 2020, 2(6): 21-29.
[14]	冷佳旭,刘莹. 基于深度学习的小目标检测与识别[J]. 数据与计算发展前沿, 2020, 2(2): 120-135.