The BACON Approach for Rank-Deï¬cient Data

Athanassios Kondylis; Ali S. Hadi; Mark Werner

doi:10.18187/pjsor.v8i3.514

Download

PDF

Statistic

Read Counter : 1017 Download : 445

Downloads

Download data is not yet available.

Metrics

Metrics Loading ...

Abstract

Rank-deficient data are not uncommon in practice. They result from highly collinear variables and/or high-dimensional data. A special case of the latter occurs when the number of recorded variables exceeds the number of observations. The use of the BACON algorithm for outlier detection in multivariate data is extended here to include rank-deficient data. We present two approaches to identifying outliers in rank-deficient data based on the original BACON algorithm. The first algorithm projects the data onto a robust subspace of reduced dimension, while the second employs a ridge type regularization on the covariance matrix. Both algorithms are tested on real as well as simulated data sets with good results in terms of their effectiveness in outlier detection. They are also examined in terms of computational efficiency and found to be very fast, with particularly good scaling properties for increasing dimension.

Keywords

High-dimensional data Mahalanobis distance Outlier detection Spatial median.

Authors who publish with this journal agree to the following License

CC BY: This license allows reusers to distribute, remix, adapt, and build upon the material in any medium or format, so long as attribution is given to the creator. The license allows for commercial use.

How to Cite

Kondylis, A., Hadi, A. S., & Werner, M. (2012). The BACON Approach for Rank-Deï¬cient Data. Pakistan Journal of Statistics and Operation Research, 8(3), 359-379. https://doi.org/10.18187/pjsor.v8i3.514

The BACON Approach for Rank-Deï¬cient Data

Article Sidebar

Downloads

Metrics

Main Article Content

Abstract

Keywords

Article Details

The BACON Approach for Rank-Deï¬cient Data