Exploration of protein sequence embeddings for protein-ligand binding site detection

Identifikátory výsledku

Kód výsledku v IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216208%3A11320%2F22%3A10455486" target="_blank" >RIV/00216208:11320/22:10455486 - isvavai.cz</a>
Výsledek na webu
<a href="https://doi.org/10.1109/BIBM55620.2022.9995025" target="_blank" >https://doi.org/10.1109/BIBM55620.2022.9995025</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.1109/BIBM55620.2022.9995025" target="_blank" >10.1109/BIBM55620.2022.9995025</a>

Alternativní jazyky

Jazyk výsledku
angličtina
Název v původním jazyce
Exploration of protein sequence embeddings for protein-ligand binding site detection
Popis výsledku v původním jazyce
Detection of protein-ligand binding sites is essential not only for protein function investigation but also in fields such as drug discovery or bioengineering. In this paper, we show that the recently-developed pre-trained language models can be used for protein-ligand binding site prediction. Specifically, we present a neural network architecture where inputs correspond to amino acids embeddings obtained from a protein language model. We show that increasing complexity of the language model improves the predictive performance of the method, eventually leading to results comparable to or surpassing state-of-the-art approaches. Unlike the existing methods, the presented approach does not require time-consuming computation of evolutionary information, resulting in faster running times.
Název v anglickém jazyce
Exploration of protein sequence embeddings for protein-ligand binding site detection
Popis výsledku anglicky
Detection of protein-ligand binding sites is essential not only for protein function investigation but also in fields such as drug discovery or bioengineering. In this paper, we show that the recently-developed pre-trained language models can be used for protein-ligand binding site prediction. Specifically, we present a neural network architecture where inputs correspond to amino acids embeddings obtained from a protein language model. We show that increasing complexity of the language model improves the predictive performance of the method, eventually leading to results comparable to or surpassing state-of-the-art approaches. Unlike the existing methods, the presented approach does not require time-consuming computation of evolutionary information, resulting in faster running times.

Klasifikace

Druh
D - Stať ve sborníku
CEP obor
—
OECD FORD obor
10201 - Computer sciences, information science, bioinformathics (hardware development to be 2.2, social aspect to be 5.8)

Návaznosti výsledku

Projekt
<a href="/cs/project/GA22-21696S" target="_blank" >GA22-21696S: Hluboké vizuální reprezentace nestrukturovaných dat</a><br>
Návaznosti
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)

Ostatní

Rok uplatnění
2022
Kód důvěrnosti údajů
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů

Údaje specifické pro druh výsledku

Název statě ve sborníku
2022 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)
ISBN
978-1-66546-819-0
ISSN
—
e-ISSN
—
Počet stran výsledku
6
Strana od-do
3356-3361
Název nakladatele
IEEE
Místo vydání
USA
Místo konání akce
Las Vegas, NV, USA
Datum konání akce
6. 12. 2022
Typ akce podle státní příslušnosti
WRD - Celosvětová akce
Kód UT WoS článku
—

Podobné výsledky(10)

Cryptic binding site prediction with protein language models CryptoBench: cryptic protein-ligand binding sites dataset and benchmark Improving protein-ligand binding site prediction accuracy by classification of inner pocket points using local features

Co hledáte?

Rychlé hledání

Chytré vyhledávání

Exploration of protein sequence embeddings for protein-ligand binding site detection

Identifikátory výsledku

Alternativní jazyky

Klasifikace

Návaznosti výsledku

Ostatní

Údaje specifické pro druh výsledku

Podobné výsledky(10)

Co hledáte?

Rychlé hledání

Chytré vyhledávání

Popis výsledku

Identifikátory výsledku

Identifikátory výsledku

Alternativní jazyky

Alternativní jazyky

Klasifikace

Klasifikace

Návaznosti výsledku

Návaznosti výsledku

Ostatní

Ostatní

Údaje specifické pro druh výsledku

Údaje specifické pro druh výsledku

Podobné výsledky(10)