The Dark Matter of Large Cereal Genomes: Long Tandem Repeats
Identifikátory výsledku
Kód výsledku v IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F61389030%3A_____%2F19%3A00506096" target="_blank" >RIV/61389030:_____/19:00506096 - isvavai.cz</a>
Nalezeny alternativní kódy
RIV/60077344:_____/19:00506096
Výsledek na webu
<a href="http://doi.org/10.3390/ijms20102483" target="_blank" >http://doi.org/10.3390/ijms20102483</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.3390/ijms20102483" target="_blank" >10.3390/ijms20102483</a>
Alternativní jazyky
Jazyk výsledku
angličtina
Název v původním jazyce
The Dark Matter of Large Cereal Genomes: Long Tandem Repeats
Popis výsledku v původním jazyce
Reference genomes of important cereals, including barley, emmer wheat and bread wheat, were released recently. Their comparison with genome size estimates obtained by flow cytometry indicated that the assemblies represent not more than 88-98% of the complete genome. This work is aimed at identifying the missing parts in two cereal genomes and proposing techniques to make the assemblies more complete. We focused on tandemly organised repetitive sequences, known to be underrepresented in genome assemblies generated from short-read sequence data. Our study found arrays of three tandem repeats with unit sizes of 1242 to 2726 bp present in the bread wheat reference genome generated from short reads. However, this and another wheat genome assembly employing long PacBio reads failed in integrating correctly the 2726-bp repeat in the pseudomolecule context. This suggests that tandem repeats of this size, frequently incorporated in unassigned scaffolds, may contribute to shrinking of pseudomolecules without reducing size of the entire assembly. We demonstrate how this missing information may be added to the pseudomolecules with the aid of nanopore sequencing of individual BAC clones and optical mapping. Using the latter technique, we identified and localised a 470-kb long array of 45S ribosomal DNA absent from the reference genome of barley.
Název v anglickém jazyce
The Dark Matter of Large Cereal Genomes: Long Tandem Repeats
Popis výsledku anglicky
Reference genomes of important cereals, including barley, emmer wheat and bread wheat, were released recently. Their comparison with genome size estimates obtained by flow cytometry indicated that the assemblies represent not more than 88-98% of the complete genome. This work is aimed at identifying the missing parts in two cereal genomes and proposing techniques to make the assemblies more complete. We focused on tandemly organised repetitive sequences, known to be underrepresented in genome assemblies generated from short-read sequence data. Our study found arrays of three tandem repeats with unit sizes of 1242 to 2726 bp present in the bread wheat reference genome generated from short reads. However, this and another wheat genome assembly employing long PacBio reads failed in integrating correctly the 2726-bp repeat in the pseudomolecule context. This suggests that tandem repeats of this size, frequently incorporated in unassigned scaffolds, may contribute to shrinking of pseudomolecules without reducing size of the entire assembly. We demonstrate how this missing information may be added to the pseudomolecules with the aid of nanopore sequencing of individual BAC clones and optical mapping. Using the latter technique, we identified and localised a 470-kb long array of 45S ribosomal DNA absent from the reference genome of barley.
Klasifikace
Druh
J<sub>imp</sub> - Článek v periodiku v databázi Web of Science
CEP obor
—
OECD FORD obor
10608 - Biochemistry and molecular biology
Návaznosti výsledku
Projekt
Výsledek vznikl pri realizaci vícero projektů. Více informací v záložce Projekty.
Návaznosti
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)
Ostatní
Rok uplatnění
2019
Kód důvěrnosti údajů
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Údaje specifické pro druh výsledku
Název periodika
International Journal of Molecular Sciences
ISSN
1422-0067
e-ISSN
—
Svazek periodika
20
Číslo periodika v rámci svazku
10
Stát vydavatele periodika
CH - Švýcarská konfederace
Počet stran výsledku
12
Strana od-do
2483
Kód UT WoS článku
000471001400114
EID výsledku v databázi Scopus
2-s2.0-85067311944