Publication detail

Semiparametric Outlier Detection in Nonstationary Times Series: Case Study for Atmospheric Pollution in Brno, Czech Republic

HOLEŠOVSKÝ, J. ČAMPULOVÁ, M. MICHÁLEK, J.

Original Title

Semiparametric Outlier Detection in Nonstationary Times Series: Case Study for Atmospheric Pollution in Brno, Czech Republic

Type

journal article in Web of Science

Language

English

Original Abstract

Large environmental datasets usually include outliers which can have significant effects on further analysis and modelling. There exist various outlier detection methods that depend on the distribution of the analysed variable. However quite often the distribution of environmental variables can not be estimated. This paper presents an approach for identification of outliers in environmental time series which does not impose restrictions on the distribution of observed variables. The suggested algorithm combines kernel smoothing and extreme value estimation techniques for stochastic processes within considerations of nonstationary expected value of the process. The nonstationarity in variance is evaded by change point analysis which precedes the proposed algorithm. Possible outliers are identified as observations with rare occurrence and, in correspondence to extreme value methodology, the confidence limits for high values of observed variables are constructed. The proposed methodology can be especially convenient for cases where validation of the data has to be carried out manually, since it significantly reduces the number of implausible observations. For a case study, the technique is applied for outlier detection in time series of hourly PM10 concentrations in Brno, Czech Republic. The methodology is derived on solid theoretical results and seems to perform well for the series of PM10. However its flexibility makes it generally applicable not only to series of atmospheric pollutants. On the other hand, the choice of return level turns out to be crucial in sensitivity to the outliers. This issue should be left to the practitioners to decide with respect to specific application conditions.

Keywords

Air pollution; outlier; kernel smoothing; local bandwidth; extreme value; extremal index

Authors

HOLEŠOVSKÝ, J.; ČAMPULOVÁ, M.; MICHÁLEK, J.

Released

2. 1. 2018

Publisher

Elsevier

ISBN

1309-1042

Periodical

Atmospheric Pollution Research

Year of study

9

Number

1

State

Republic of Turkey

Pages from

27

Pages to

36

Pages count

10

BibTex

@article{BUT137510,
  author="Jan {Holešovský} and Martina {Čampulová} and Jaroslav {Michálek}",
  title="Semiparametric Outlier Detection in Nonstationary Times Series: Case Study for Atmospheric Pollution in Brno, Czech Republic",
  journal="Atmospheric Pollution Research",
  year="2018",
  volume="9",
  number="1",
  pages="27--36",
  doi="10.1016/j.apr.2017.06.005",
  issn="1309-1042"
}