Frontiers of Data and Computing ›› 2026, Vol. 8 ›› Issue (1): 183-194.

CSTR: 32002.14.jfdc.CN10-1649/TP.2026.01.015

doi: 10.11871/jfdc.issn.2096-742X.2026.01.015

• Technology and Application • Previous Articles     Next Articles

Research on Short Video Popularity Prediction Based on Multimodal Features: A Case Study of Douyin Platform

MI Saixue(),ZHANG Qi,ZHANG Shihao*(),LI Gen   

  1. School of Information and Cyber Security, People’s Public Security University of China, Beijing 100038, China
  • Received:2025-06-05 Online:2026-02-20 Published:2026-02-02
  • Contact: ZHANG Shihao E-mail:1026081425@qq.com;zhangshihao@ppsuc.edu.cn

Abstract:

[Objective] Short videos have become a crucial medium for online public opinion dissemination, making accurate popularity prediction vital for content moderation and public sentiment analysis. However, existing studies exhibit limitations in feature extraction and temporal modeling: First, the unidimensional feature analysis fails to fully leverage multimodal data sources. Second, conventional linear approaches prove inadequate in characterizing the nonlinear popularity dynamics of short videos, particularly the distinctive “cold-start-explosion-decay” lifecycle patterns. To address these gaps, this study proposes a multimodal feature-based approach for short video popularity prediction. [Methods] First, a multidimensional feature system is constructed, encompassing user influence, author influence, audiovisual quality and content features, comment features, and interaction features. Second, the Random Forest model is employed for nonlinear modeling to capture complex feature interactions and improve the ability to predict video heat. [Results] Experimental results demonstrate the superior performance of the proposed method in short video popularity prediction tasks, achieving an F1-score of 69.3%, representing a 13.7 percentage point improvement over the baseline model. The AUC value reaches 71.3%, showing a 16 percentage point enhancement compared to the baseline. [Conclusions] The multimodal feature-based approach significantly improves prediction accuracy, offering a robust technical solution for online public opinion analysis and content governance..

Key words: short video, popularity prediction, multimodal features, user influence, random forest