Multiple Conformer Descriptors for QSAR Modeling
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F61989592%3A15110%2F21%3A73610368" target="_blank" >RIV/61989592:15110/21:73610368 - isvavai.cz</a>
Result on the web
<a href="https://onlinelibrary.wiley.com/doi/10.1002/minf.202060030" target="_blank" >https://onlinelibrary.wiley.com/doi/10.1002/minf.202060030</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.1002/minf.202060030" target="_blank" >10.1002/minf.202060030</a>
Alternative languages
Result language
angličtina
Original language name
Multiple Conformer Descriptors for QSAR Modeling
Original language description
The most widely used QSAR approaches are mainly based on 2D molecular representation which ignores stereoconfiguration and conformational flexibility of compounds. 3D QSAR uses a single conformer of each compound which is difficult to choose reasonably. 4D QSAR uses multiple conformers to overcome the issues of 2D and 3D methods. However, many of existing 4D QSAR models suffer from the necessity to pre-align conformers, while alignment-independent approaches often ignore stereoconfiguration of compounds. In this study we propose a QSAR modeling approach based on transforming chirality-aware 3D pharmacophore descriptors of individual conformers into a set of latent variables representing the whole conformer set of a molecule. This is achieved by clustering together all conformers of all training set compounds. The final representation of a compound is a bit string encoding cluster membership of its conformers. In our study we used Random Forest, but this representation can be used in combination with any machine learning method. We compared this approach with conventional 2D and 3D approaches using multiple data sets and investigated the sensitivity of the approach proposed to tuning parameters: number of conformers and clusters.
Czech name
—
Czech description
—
Classification
Type
J<sub>imp</sub> - Article in a specialist periodical, which is included in the Web of Science database
CEP classification
—
OECD FORD branch
10608 - Biochemistry and molecular biology
Result continuities
Project
<a href="/en/project/LTARF18013" target="_blank" >LTARF18013: Improve the output of primary screening of biologically active compounds using computational models</a><br>
Continuities
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)
Others
Publication year
2021
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Name of the periodical
Molecular Informatics
ISSN
1868-1743
e-ISSN
—
Volume of the periodical
40
Issue of the periodical within the volume
11
Country of publishing house
DE - GERMANY
Number of pages
11
Pages from-to
"nestránkováno"
UT code for WoS article
000680535600001
EID of the result in the Scopus database
2-s2.0-85112051056