Machine learning aided noise filtration and signal classification for CREDO experiment
Identifikátory výsledku
Kód výsledku v IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F47813059%3A19630%2F22%3AA0000245" target="_blank" >RIV/47813059:19630/22:A0000245 - isvavai.cz</a>
Výsledek na webu
<a href="https://pos.sissa.it/395/227" target="_blank" >https://pos.sissa.it/395/227</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.22323/1.395.0227" target="_blank" >10.22323/1.395.0227</a>
Alternativní jazyky
Jazyk výsledku
angličtina
Název v původním jazyce
Machine learning aided noise filtration and signal classification for CREDO experiment
Popis výsledku v původním jazyce
The wealth of smartphone data collected by the Cosmic Ray Extremely Distributed Observatory (CREDO) greatly surpasses the capabilities of manual analysis. So, efficient means of rejecting the non-cosmic-ray noise and identification of signals attributable to extensive air showers are necessary. To address these problems we discuss a Convolutional Neural Network-based method of artefact rejection and complementary method of particle identification based on common statistical classifiers as well as their ensemble extensions. These approaches are based on supervised learning, so we need to provide a representative subset of the CREDO dataset for training and validation. According to this approach over 2300 images were chosen and manually labeled by 5 judges. The images were split into spot, track, worm (collectively named signals) and artefact classes. Then the preprocessing consisting of luminance summation of RGB channels (grayscaling) and background removal by adaptive thresholding was performed. For purposes of artefact rejection the binary CNN-based classifier was proposed which is able to distinguish between artefacts and signals. The classifier was fed with input data in the form of Daubechies wavelet transformed images. In the case of cosmic ray signal classification, the well-known feature-based classifiers were considered. As feature descriptors, we used Zernike moments with additional feature related to total image luminance. For the problem of artefact rejection, we obtained an accuracy of 99%. For the 4-class signal classification, the best performing classifiers achieved a recognition rate of 88%.
Název v anglickém jazyce
Machine learning aided noise filtration and signal classification for CREDO experiment
Popis výsledku anglicky
The wealth of smartphone data collected by the Cosmic Ray Extremely Distributed Observatory (CREDO) greatly surpasses the capabilities of manual analysis. So, efficient means of rejecting the non-cosmic-ray noise and identification of signals attributable to extensive air showers are necessary. To address these problems we discuss a Convolutional Neural Network-based method of artefact rejection and complementary method of particle identification based on common statistical classifiers as well as their ensemble extensions. These approaches are based on supervised learning, so we need to provide a representative subset of the CREDO dataset for training and validation. According to this approach over 2300 images were chosen and manually labeled by 5 judges. The images were split into spot, track, worm (collectively named signals) and artefact classes. Then the preprocessing consisting of luminance summation of RGB channels (grayscaling) and background removal by adaptive thresholding was performed. For purposes of artefact rejection the binary CNN-based classifier was proposed which is able to distinguish between artefacts and signals. The classifier was fed with input data in the form of Daubechies wavelet transformed images. In the case of cosmic ray signal classification, the well-known feature-based classifiers were considered. As feature descriptors, we used Zernike moments with additional feature related to total image luminance. For the problem of artefact rejection, we obtained an accuracy of 99%. For the 4-class signal classification, the best performing classifiers achieved a recognition rate of 88%.
Klasifikace
Druh
D - Stať ve sborníku
CEP obor
—
OECD FORD obor
10308 - Astronomy (including astrophysics,space science)
Návaznosti výsledku
Projekt
—
Návaznosti
I - Institucionalni podpora na dlouhodoby koncepcni rozvoj vyzkumne organizace
Ostatní
Rok uplatnění
2022
Kód důvěrnosti údajů
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Údaje specifické pro druh výsledku
Název statě ve sborníku
Proceedings of Science
ISBN
—
ISSN
1824-8039
e-ISSN
—
Počet stran výsledku
9
Strana od-do
„227-1“-„227-9“
Název nakladatele
Sissa Medialab Srl
Místo vydání
Itálie
Místo konání akce
Berlin
Datum konání akce
12. 7. 2021
Typ akce podle státní příslušnosti
WRD - Celosvětová akce
Kód UT WoS článku
—