SoluProt: Prediction of Protein Solubility
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216305%3A26230%2F18%3APU130777" target="_blank" >RIV/00216305:26230/18:PU130777 - isvavai.cz</a>
Result on the web
<a href="http://www.fit.vutbr.cz/research/pubs/all.php?id=11808" target="_blank" >http://www.fit.vutbr.cz/research/pubs/all.php?id=11808</a>
DOI - Digital Object Identifier
—
Alternative languages
Result language
angličtina
Original language name
SoluProt: Prediction of Protein Solubility
Original language description
Protein solubility poses a major bottleneck in production of many therapeutic and industrially attractive proteins. Experimental solubilization attempts are plagued by relatively low success rates and often lead to the loss of biological activity. Therefore, any advance in computational prediction of protein solubility may reduce the cost of experimental studies significantly. Here, we propose a novel software tool SoluProt for prediction of solubility from protein sequence based on machine learning and TargetTrack database. SoluProt achieved the best accuracy 58.2% and AUC 0.61 of all available tools at an independent balanced test set derived from NESG database. While the absolute prediction performance is rather low, SoluProt can still help to reduce costs of experimental studies significantly by efficient prioritization of protein sequences. The main SoluProt contribution lies in improved preprocessing of noisy training data and sensible selection of sequence features included in the prediction model.
Czech name
—
Czech description
—
Classification
Type
D - Article in proceedings
CEP classification
—
OECD FORD branch
10201 - Computer sciences, information science, bioinformathics (hardware development to be 2.2, social aspect to be 5.8)
Result continuities
Project
<a href="/en/project/LQ1602" target="_blank" >LQ1602: IT4Innovations excellence in science</a><br>
Continuities
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)<br>S - Specificky vyzkum na vysokych skolach
Others
Publication year
2018
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Article name in the collection
DAZ & WIKT 2018 Proceedings
ISBN
978-80-214-5679-2
ISSN
—
e-ISSN
—
Number of pages
5
Pages from-to
261-265
Publisher name
Brno University of Technology
Place of publication
Brno
Event location
Brno
Event date
Oct 11, 2018
Type of event by nationality
CST - Celostátní akce
UT code for WoS article
—