SoluProt: prediction of soluble protein expression in Escherichia coli
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216224%3A14310%2F21%3A00119188" target="_blank" >RIV/00216224:14310/21:00119188 - isvavai.cz</a>
Alternative codes found
RIV/00159816:_____/21:00075155 RIV/00216305:26230/21:PU138927
Result on the web
<a href="https://academic.oup.com/bioinformatics/article/37/1/23/6070085" target="_blank" >https://academic.oup.com/bioinformatics/article/37/1/23/6070085</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.1093/bioinformatics/btaa1102" target="_blank" >10.1093/bioinformatics/btaa1102</a>
Alternative languages
Result language
angličtina
Original language name
SoluProt: prediction of soluble protein expression in Escherichia coli
Original language description
Motivation: Poor protein solubility hinders the production of many therapeutic and industrially useful proteins. Experimental efforts to increase solubility are plagued by low success rates and often reduce biological activity. Computational prediction of protein expressibility and solubility in Escherichia coli using only sequence information could reduce the cost of experimental studies by enabling prioritization of highly soluble proteins. Results: A new tool for sequence-based prediction of soluble protein expression in E.coli, SoluProt, was created using the gradient boosting machine technique with the TargetTrack database as a training set. When evaluated against a balanced independent test set derived from the NESG database, SoluProt's accuracy of 58.5% and AUC of 0.62 exceeded those of a suite of alternative solubility prediction tools. There is also evidence that it could significantly increase the success rate of experimental protein studies.
Czech name
—
Czech description
—
Classification
Type
J<sub>imp</sub> - Article in a specialist periodical, which is included in the Web of Science database
CEP classification
—
OECD FORD branch
10602 - Biology (theoretical, mathematical, thermal, cryobiology, biological rhythm), Evolutionary biology
Result continuities
Project
Result was created during the realization of more than one project. More information in the Projects tab.
Continuities
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)<br>I - Institucionalni podpora na dlouhodoby koncepcni rozvoj vyzkumne organizace
Others
Publication year
2021
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Name of the periodical
Bioinformatics
ISSN
1367-4803
e-ISSN
1460-2059
Volume of the periodical
37
Issue of the periodical within the volume
1
Country of publishing house
GB - UNITED KINGDOM
Number of pages
6
Pages from-to
23-28
UT code for WoS article
000649437800004
EID of the result in the Scopus database
—