Machine Learning Method for Changepoint Detection in Short Time Series Data
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216305%3A26210%2F23%3APU150028" target="_blank" >RIV/00216305:26210/23:PU150028 - isvavai.cz</a>
Result on the web
<a href="https://www.mdpi.com/2504-4990/5/4/71" target="_blank" >https://www.mdpi.com/2504-4990/5/4/71</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.3390/make5040071" target="_blank" >10.3390/make5040071</a>
Alternative languages
Result language
angličtina
Original language name
Machine Learning Method for Changepoint Detection in Short Time Series Data
Original language description
Analysis of data is crucial in waste management to improve effective planning from both short- and long-term perspectives. Real-world data often presents anomalies, but in the waste management sector, anomaly detection is seldom performed. The main goal and contribution of this paper is a proposal of a complex machine learning framework for changepoint detection in a large number of short time series from waste management. In such a case, it is not possible to use only an expert-based approach due to the time-consuming nature of this process and subjectivity. The proposed framework consists of two steps: (1) outlier detection via outlier test for trend-adjusted data, and (2) changepoints are identified via comparison of linear model parameters. In order to use the proposed method, it is necessary to have a sufficient number of experts’ assessments of the presence of anomalies in time series. The proposed framework is demonstrated on waste management data from the Czech Republic. It is observed that certain waste categories in specific regions frequently exhibit changepoints. On the micro-regional level, approximately 31.1% of time series contain at least one outlier and 16.4% exhibit changepoints. Certain groups of waste are more prone to the occurrence of anomalies. The results indicate that even in the case of aggregated data, anomalies are not rare, and their presence should always be checked.
Czech name
—
Czech description
—
Classification
Type
J<sub>imp</sub> - Article in a specialist periodical, which is included in the Web of Science database
CEP classification
—
OECD FORD branch
10102 - Applied mathematics
Result continuities
Project
Result was created during the realization of more than one project. More information in the Projects tab.
Continuities
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)
Others
Publication year
2023
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Name of the periodical
Machine Learning and Knowledge Extraction
ISSN
2504-4990
e-ISSN
—
Volume of the periodical
5
Issue of the periodical within the volume
4
Country of publishing house
CH - SWITZERLAND
Number of pages
26
Pages from-to
1407-1432
UT code for WoS article
001130875400001
EID of the result in the Scopus database
2-s2.0-85180490104