Frontiers of Data and Computing ›› 2024, Vol. 6 ›› Issue (5): 126-138.

CSTR: 32002.14.jfdc.CN10-1649/TP.2024.05.012

doi: 10.11871/jfdc.issn.2096-742X.2024.05.012

Previous Articles     Next Articles

Robust AdaBoost Regression Model Based on Double LOF and Inverse-Cross-Validation

ZENG Fanbei1(),YANG Lianqiang2,*()   

  1. 1. School of Big Data and Statistics, Anhui University, Hefei, Anhui 230601, China
    2. School of Artificial Intelligence, Anhui University, Hefei, Anhui 230601, China
  • Received:2023-01-03 Online:2024-10-20 Published:2024-10-21

Abstract:

[Objective] The robustness of the traditional AdaBoost regression model is insufficient. The improved AdaBoost.RT+ and AdaBoost.RS algorithms hold insignificant suppression on abnormal data and low identification accuracy of abnormal data. It is meaningful to enhance the robustness of AdaBoost algorithms. [Methods] First, dual LOF and inverse cross validation algorithms are proposed, the abnormal degree of data is characterized by probability based on these two algorithms. Then, appropriate weight coefficients are given according to the abnormal degree of the data to suppress its influence and keep no effect on the normal data. [Results] This AdaBoost.R_LOF model holds better robustness and less mean squared error on prediction. [Limitations] However, more hyperparameters are needed. [Conclusions] Simulations and real applications show that the new model has better robustness and estimation under the different proportions of outliers compared with AdaBoost.R2, AdaBoost.RT+ and AdaBoost.RS algorithms.

Key words: oAdaBoost, double LOF, Inverse-Cross-Validation, AdaBoost.R_ LOF