High-throughput Selection of Human de novo-emerged sORFs with High Folding Potential
Identifikátory výsledku
Kód výsledku v IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F61388963%3A_____%2F24%3A00585603" target="_blank" >RIV/61388963:_____/24:00585603 - isvavai.cz</a>
Nalezeny alternativní kódy
RIV/00216208:11310/24:10483116
Výsledek na webu
<a href="https://doi.org/10.1093/gbe/evae069" target="_blank" >https://doi.org/10.1093/gbe/evae069</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.1093/gbe/evae069" target="_blank" >10.1093/gbe/evae069</a>
Alternativní jazyky
Jazyk výsledku
angličtina
Název v původním jazyce
High-throughput Selection of Human de novo-emerged sORFs with High Folding Potential
Popis výsledku v původním jazyce
De novo genes emerge from previously noncoding stretches of the genome. Their encoded de novo proteins are generally expected to be similar to random sequences and, accordingly, with no stable tertiary fold and high predicted disorder. However, structural properties of de novo proteins and whether they differ during the stages of emergence and fixation have not been studied in depth and rely heavily on predictions. Here we generated a library of short human putative de novo proteins of varying lengths and ages and sorted the candidates according to their structural compactness and disorder propensity. Using Forster resonance energy transfer combined with Fluorescence-activated cell sorting, we were able to screen the library for most compact protein structures, as well as most elongated and flexible structures. We find that compact de novo proteins are on average slightly shorter and contain lower predicted disorder than less compact ones. The predicted structures for most and least compact de novo proteins correspond to expectations in that they contain more secondary structure content or higher disorder content, respectively. Our experiments indicate that older de novo proteins have higher compactness and structural propensity compared with young ones. We discuss possible evolutionary scenarios and their implications underlying the age-dependencies of compactness and structural content of putative de novo proteins.
Název v anglickém jazyce
High-throughput Selection of Human de novo-emerged sORFs with High Folding Potential
Popis výsledku anglicky
De novo genes emerge from previously noncoding stretches of the genome. Their encoded de novo proteins are generally expected to be similar to random sequences and, accordingly, with no stable tertiary fold and high predicted disorder. However, structural properties of de novo proteins and whether they differ during the stages of emergence and fixation have not been studied in depth and rely heavily on predictions. Here we generated a library of short human putative de novo proteins of varying lengths and ages and sorted the candidates according to their structural compactness and disorder propensity. Using Forster resonance energy transfer combined with Fluorescence-activated cell sorting, we were able to screen the library for most compact protein structures, as well as most elongated and flexible structures. We find that compact de novo proteins are on average slightly shorter and contain lower predicted disorder than less compact ones. The predicted structures for most and least compact de novo proteins correspond to expectations in that they contain more secondary structure content or higher disorder content, respectively. Our experiments indicate that older de novo proteins have higher compactness and structural propensity compared with young ones. We discuss possible evolutionary scenarios and their implications underlying the age-dependencies of compactness and structural content of putative de novo proteins.
Klasifikace
Druh
J<sub>imp</sub> - Článek v periodiku v databázi Web of Science
CEP obor
—
OECD FORD obor
10608 - Biochemistry and molecular biology
Návaznosti výsledku
Projekt
<a href="/cs/project/GB14-36098G" target="_blank" >GB14-36098G: Centrum pro tropickou biologii</a><br>
Návaznosti
I - Institucionalni podpora na dlouhodoby koncepcni rozvoj vyzkumne organizace
Ostatní
Rok uplatnění
2024
Kód důvěrnosti údajů
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Údaje specifické pro druh výsledku
Název periodika
Genome Biology and Evolution
ISSN
1759-6653
e-ISSN
1759-6653
Svazek periodika
16
Číslo periodika v rámci svazku
4
Stát vydavatele periodika
US - Spojené státy americké
Počet stran výsledku
17
Strana od-do
evae069
Kód UT WoS článku
001204479300001
EID výsledku v databázi Scopus
2-s2.0-85191082529