High-throughput spike detection in greenhouse cultivated grain crops with attention mechanisms based deep learning models
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216224%3A14310%2F24%3A00135605" target="_blank" >RIV/00216224:14310/24:00135605 - isvavai.cz</a>
Result on the web
<a href="https://spj.science.org/doi/10.34133/plantphenomics.0155" target="_blank" >https://spj.science.org/doi/10.34133/plantphenomics.0155</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.34133/plantphenomics.0155" target="_blank" >10.34133/plantphenomics.0155</a>
Alternative languages
Result language
angličtina
Original language name
High-throughput spike detection in greenhouse cultivated grain crops with attention mechanisms based deep learning models
Original language description
Detection of spikes is the first important step towards image-based quantitative assessment of crop yield. However, spikes of grain plants occupy only a tiny fraction of the image area and often emerge in the middle of the mass of plant leaves that exhibit similar colors as spike regions. Consequently, accurate detection of grain spikes renders, in general, a non-trivial task even for advanced, state-of-the-art deep learning neural networks (DNNs). To improve pattern detection in spikes, we propose architectural changes to Faster-RCNN (FRCNN) by reducing feature extraction layers and introducing a global attention module. The performance of our extended FRCNN-A vs. conventional FRCNN was compared on images of different European wheat cultivars, including ’difficult’ bushy phenotypes from two different phenotyping facilities and optical setups. Our experimental results show that introduced architectural adaptations in FRCNN-A helped to improve spike detection accuracy in inner regions. The mAP of FRCNN and FRCNN-A on inner spikes is 76.0% and 81.0%, respectively, while on the state-of-the-art detection DNNs, Swin Transformer mAP is 83.0%. As a lightweight network, FRCNN-A is faster than FRCNN and Swin Transformer on both baseline and augmented training datasets. On the FastGAN augmented dataset, FRCNN achieved mAP of 84.24%, FRCNN-A 85.0%, and the Swin Transformer 89.45%. The increase in mAP of DNNs on the augmented datasets is proportional to the amount of the IPK original and augmented images. Overall, this study indicates a superior performance of attention mechanisms-based deep learning models in detecting small and subtle features of grain spikes.
Czech name
—
Czech description
—
Classification
Type
J<sub>imp</sub> - Article in a specialist periodical, which is included in the Web of Science database
CEP classification
—
OECD FORD branch
10201 - Computer sciences, information science, bioinformathics (hardware development to be 2.2, social aspect to be 5.8)
Result continuities
Project
<a href="/en/project/EF16_026%2F0008446" target="_blank" >EF16_026/0008446: Signal integration and epigenetic reprograming for plant productivity</a><br>
Continuities
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)<br>S - Specificky vyzkum na vysokych skolach
Others
Publication year
2024
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Name of the periodical
Plant Phenomics
ISSN
2643-6515
e-ISSN
2643-6515
Volume of the periodical
6
Issue of the periodical within the volume
March
Country of publishing house
US - UNITED STATES
Number of pages
11
Pages from-to
1-11
UT code for WoS article
001231155500001
EID of the result in the Scopus database
2-s2.0-85191439858