Frontiers of Data and Computing ›› 2022, Vol. 4 ›› Issue (6): 118-128.

CSTR: 32002.14.jfdc.CN10-1649/TP.2022.06.011

doi: 10.11871/jfdc.issn.2096-742X.2022.06.011

• Technology and Application • Previous Articles     Next Articles

An Improved Sentiment Analysis Model Incorporating Textual Topic Features

ZHANG Shuai(),HUANG Bo*(),JU Jiaji   

  1. School of Electrical and Electronic Engineering, Shanghai University of Engineering Science, Shanghai 201620, China
  • Received:2021-12-20 Online:2022-12-20 Published:2022-12-20
  • Contact: HUANG Bo E-mail:854400656@qq.com;huangbosues@sues.edu.cn

Abstract:

[Objective] Massive user reviews are of great value to consumers and related enterprises. This paper addresses the problems of sparse data, unclear topics, and poor classification accuracy caused by the short length of review information. [Methods] This paper proposes a Bi-LSTM self-attention mechanism online review sentiment analysis model (TSC-BiLSTM) incorporating topic features. Compared with the traditional LSTM method, this method uses the Latent Dirichlet Allocation (LDA) topic model to obtain the topic word distribution of comments, stitches it with the comment word vector as input, mines the full-text feature information through Bi-LSTM, and combines with self-attention mechanism to dynamically assign weights. [Results] This model expands the feature space of the original short review text, reduces the sparsity of the data, clarifies the topic, and improves the accuracy of sentiment classification. [Conclusions] Experiments on review datasets of a hotel and a takeaway platform show that the proposed method achieves better performances compared with other related models. It provides a novel view of topic sentiment analysis methods.

Key words: LDA, topic words, Bi-LSTM, self-attention, sentiment analysis