Eintrag weiter verarbeiten

A methodology for automatised outlier detection in high-dimensional datasets: an application to euro area banks' supervisory data

Gespeichert in:

Personen und Körperschaften: Farnè, Matteo (VerfasserIn), Vouldis, Angelos T. (VerfasserIn)
Titel: A methodology for automatised outlier detection in high-dimensional datasets: an application to euro area banks' supervisory data/ Matteo Farnè, Angelos T. Vouldis
Format: E-Book
Sprache: Englisch
veröffentlicht:
Frankfurt am Main, Germany European Central Bank [2018]
Gesamtaufnahme: Europäische Zentralbank: Working paper series ; no 2171 (July 2018)
Quelle: Verbunddaten SWB
Lizenzfreie Online-Ressourcen
Details
Zusammenfassung: Outlier detection in high-dimensional datasets poses new challenges that have not been investigated in the literature. In this paper, we present an integrated methodology for the identification of outliers which is suitable for datasets with higher number of variables than observations. Our method aims to utilise the entire relevant information present in a dataset to detect outliers in an automatized way, a feature that renders the method suitable for application in large dimensional datasets. Our proposed five-step procedure for regression outlier detection entails a robust selection stage of the most explicative variables, the estimation of a robust regression model based on the selected variables, and a criterion to identify outliers based on robust measures of the residuals' dispersion. The proposed procedure deals also with data redundancy and missing observations which may inhibit the statistical processing of the data due to the ill-conditioning of the covariance matrix. The method is validated in a simulation study and an application to actual supervisory data on banks' total assets.
Umfang: 1 Online-Ressource (circa 57 Seiten); Illustrationen
ISBN: 9789289932769
9289932767
DOI: 10.2866/357467