Frontiers of Data and Computing ›› 2026, Vol. 8 ›› Issue (1): 158-167.

CSTR: 32002.14.jfdc.CN10-1649/TP.2026.01.013

doi: 10.11871/jfdc.issn.2096-742X.2026.01.013

• Technology and Application • Previous Articles     Next Articles

An Implicit Transcript Enhancement Method for Unbalanced Crimes Based on PolyLoss Function

LI Wenbang(),YAN Jinghua*(),DONG Ze   

  1. School of Information and Network Security, People’s Public Security University of China, Beijing 100038, China
  • Received:2025-04-01 Online:2026-02-20 Published:2026-02-02
  • Contact: YAN Jinghua E-mail:1053058921@qq.com;yanjing hua@ppsuc.edu.cn

Abstract:

[Objective] This study aims to address the imbalance issue in the criminal cant dataset and further improve the classification performance of criminal cant texts. [Method] For long texts, we present SimPoly, while for short texts, EDAPoly is proposed. [Result] Experimental results demonstrate that SimPoly and EDAPoly significantly improve classification model performance on imbalanced criminal cant datasets, achieving notable gains in accuracy, recall, and F1-score compared to baseline methods without text augmentation. [Conclusion] The proposed method not only provides an effective solution for the practical application of criminal cant recognition technology, but also offers new ideas and support for similar imbalanced text classification tasks.

Key words: criminal cant, text augmentation, loss function, unbalanced text processing