ChauBoxplot and AdaptiveBoxplot: two R packages for boxplot-based outlier detection

📅 2026-01-20
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This study addresses the limitation of the traditional Tukey boxplot, which tends to over-identify outliers in large samples, thereby compromising detection reliability. To mitigate this issue, the authors propose two R packages—ChauBoxplot and AdaptiveBoxplot—that dynamically adjust outlier detection criteria according to sample size. ChauBoxplot employs an improved interquartile range estimator, while AdaptiveBoxplot implements an adaptive thresholding mechanism. Extensive simulation experiments demonstrate that both methods substantially reduce false positive rates in large samples without sacrificing interpretability or statistical robustness. The paper further provides practical guidance for selecting between the two approaches, offering data analysts more accurate and stable tools for outlier detection in real-world applications.

Technology Category

Application Category

📝 Abstract
Tukey's boxplot is widely used for outlier detection; however, its classic fixed-fence rule tends to flag an excessive number of outliers as the sample size grows. To address this limitation, we introduce two new R packages, ChauBoxplot and AdaptiveBoxplot, which implement more robust methods for outlier detection. We also provide practical guidance, drawn from simulation results, to help practitioners choose suitable boxplot methods and balance interpretability with statistical reliability.
Problem

Research questions and friction points this paper is trying to address.

outlier detection
boxplot
Tukey's boxplot
sample size
fixed-fence rule
Innovation

Methods, ideas, or system contributions that make the work stand out.

boxplot
outlier detection
R package
adaptive fence
robust statistics
🔎 Similar Papers
No similar papers found.
Tiejun Tong
Tiejun Tong
Professor of Statistics, Hong Kong Baptist University
StatisticsBiostatisticsMeta-analysisEvidence-based Practice
H
Hongmei Lin
School of Statistics and Data Science, Shanghai University of International Business and Economics, Shanghai, China
B
Bowen Gang
Department of Statistics and Data Science, Fudan University, Shanghai, China
R
Riquan Zhang
School of Statistics and Data Science, Shanghai University of International Business and Economics, Shanghai, China