The assignment success for 22 horse breeds registered in the Czech Republic: The machine learning perspective
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F62156489%3A43210%2F21%3A43919188" target="_blank" >RIV/62156489:43210/21:43919188 - isvavai.cz</a>
Alternative codes found
RIV/00216305:26220/21:PU138702
Result on the web
<a href="https://doi.org/10.17221/120/2020-CJAS" target="_blank" >https://doi.org/10.17221/120/2020-CJAS</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.17221/120/2020-CJAS" target="_blank" >10.17221/120/2020-CJAS</a>
Alternative languages
Result language
angličtina
Original language name
The assignment success for 22 horse breeds registered in the Czech Republic: The machine learning perspective
Original language description
The paper demonstrates the dependability of assignment testing in the identification of an appropriate breed to monitor comprehensive genetic information from molecular markers to analyse the collection of real population data covering 22 horse breeds registered in the Czech Republic, including native breeds and genetic resources. If 17 microsatellites are used, the mean number of alleles per locus corresponds to 10.4. The count of alleles at the individual loci ranges between five (HTG07) and 17 (ASB17). The loci ASB02, ASB23, HMS03, HTG10, and VHL20 exhibit the highest gene diversity and observed heterozygosity (both above 80%), with the mean value of 0.77 and 0.73, respectively. The moderate total inbreeding coefficient (5.2%) is estimated across all the loci and breeds. The levels of apparent breed differentiation span from zero between the Czech Warmblood and Slovak Warmblood to 0.15 between the Shetland Pony and Standardbred. The phylogenetic breed relationships are revealed via the NeighbourNet dendrogram constructed from Reynolds' genetic distances, which clearly separate the Coldblood draught, Hot/Warmblood, and Pony horses. Our results reveal that the Bayesian approach (the Rannala and Mountain technique) provides the most intensive prediction power (83.6%) out of the GENECLASS tools and that the Bayes Net algorithm exhibits the best efficiency (78.4%) from the WEKA machine learning workbench options, considering the use of the five-fold cross validation technique. The algorithms could be trained on large real reference data sets, and thus there appears another viable perspective for machine learning in horse ancestry testing. In this context, it is also important to stress the fact that innovated computational tools will potentially lead towards structuring a novel web server to allow the identification of horse breeds.
Czech name
—
Czech description
—
Classification
Type
J<sub>imp</sub> - Article in a specialist periodical, which is included in the Web of Science database
CEP classification
—
OECD FORD branch
10603 - Genetics and heredity (medical genetics to be 3)
Result continuities
Project
<a href="/en/project/QH92277" target="_blank" >QH92277: Genetic diversity and its conservation in selected horse populations in the Czech Republic</a><br>
Continuities
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)
Others
Publication year
2021
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Name of the periodical
Czech Journal of Animal Science
ISSN
1212-1819
e-ISSN
—
Volume of the periodical
66
Issue of the periodical within the volume
1
Country of publishing house
CZ - CZECH REPUBLIC
Number of pages
12
Pages from-to
1-12
UT code for WoS article
000613949300001
EID of the result in the Scopus database
2-s2.0-85102807507