> 于2024年4月27日,在期刊Journal of Computer Technology and Applied Mathematics (JCTAM)正式发表。 ##Title Research on the Effectiveness of Different Outlier Detection Methods in Common Data Distribution Types ## Author Qingqing Song Belarusian State University Shaoliang Xia Belarusian State University ## Abstract Outlier detection are widely applied in areas such as network performance optimization and pre-processing of machine learning data. In the field of machine learning, the objective is to enhance data quality, thereby improving the performance of subsequent statistical analyses or machine learning models. Currently, there are numerous effective and reliable outlier analysis methods, and their effectiveness varies significantly when dealing with different types of data distributions. Therefore, it is essential to select an appropriate outlier analysis method. In this study, we conducted outlier detection on sample data from five continuous probability distributions (including Normal, Chi-square, Exponential, Gamma, and T distributions) and four discrete probability distributions (including Binomial, Poisson, Geometric, and Hypergeometric distributions). This paper employs five outlier detection methods, namely Z-Score, IQR, DBScan, Isolation Forest, and Random Forest, and evaluates the detection effectiveness of these methods. Through comparison and analysis, this paper summarizes the characteristics of various outlier detection methods when dealing with sample data from different types of distributions. These findings will assist us in making more rational method selections when facing different outlier detection scenarios. ## Keywords Outlier Detection, Outlier Analysis, Machine Learning, Data Distribution, Performance Evaluation, Data Preprocessing Last modification:October 28, 2024 © Allow specification reprint Support Appreciate the author AliPayWeChat Like 1 If you think my article is useful to you, please feel free to appreciate