fDETECT webserver: fast predictor of propensity for protein production, purification, and crystallization
Abstract. Background: Development of predictors of propensity of protein sequences for successful crystallization has been actively pursued for over a decade. A few novel methods that expanded the scope of these predictions to address additional steps of protein production and structure determination...
Main Authors: | Fanchi Meng, Chen Wang, Lukasz Kurgan |
---|---|
Format: | Article |
Language: | English |
Published: | BMC 2018-01-01 |
Series: | BMC Bioinformatics |
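Subjects: | X-ray crystallography; Protein production; Protein structure determination; Target selection; Structural genomics; Prediction |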
Online Access: | http://link.springer.com/article/10.1186/s12859-017-1995-z |
_version_ | 1818961631570296832 |
---|---|
author | Fanchi Meng; Chen Wang; Lukasz Kurgan |
author_facet | Fanchi Meng; Chen Wang; Lukasz Kurgan |
author_sort | Fanchi Meng |
collection | DOAJ |
description | Abstract. Background: Development of predictors of propensity of protein sequences for successful crystallization has been actively pursued for over a decade. A few novel methods that expanded the scope of these predictions to address additional steps of protein production and structure determination pipelines were released in recent years. The predictive performance of the current methods is modest. This is because the only input that they use is the protein sequence and since the experimental annotations of these data might be inconsistent given that they were collected across many laboratories and centers. However, even these modest levels of predictive quality are still practical compared to the reported low success rates of crystallization, which are below 10%. We focus on another important aspect related to a high computational cost of running the predictors that offer the expanded scope. Results: We introduce a novel fDETECT webserver that provides very fast and modestly accurate predictions of the success of protein production, purification, crystallization, and structure determination. Empirical tests on two datasets demonstrate that fDETECT is more accurate than the only other similarly fast method, and similarly accurate and three orders of magnitude faster than the currently most accurate predictors. Our method predicts a single protein in about 120 milliseconds and needs less than an hour to generate the four predictions for an entire human proteome. Moreover, we empirically show that fDETECT secures similar levels of predictive performance when compared with four representative methods that only predict success of crystallization, while it also provides the other three predictions. A webserver that implements fDETECT is available at http://biomine.cs.vcu.edu/servers/fDETECT/ . Conclusions: fDETECT is a computational tool that supports target selection for protein production and X-ray crystallography-based structure determination. It offers predictive quality that matches or exceeds other state-of-the-art tools and is especially suitable for the analysis of large protein sets. |
first_indexed | 2024-12-20T12:16:30Z |
format | Article |
id | doaj.art-387c5955bc97425a8175bc8ee1c59edf |
institution | Directory Open Access Journal |
issn | 1471-2105 |
language | English |
last_indexed | 2024-12-20T12:16:30Z |
publishDate | 2018-01-01 |
publisher | BMC |
record_format | Article |
series | BMC Bioinformatics |
spelling | doaj.art-387c5955bc97425a8175bc8ee1c59edf; 2022-12-21T19:41:06Z; eng; BMC; BMC Bioinformatics; 1471-2105; 2018-01-01; 181111; 10.1186/s12859-017-1995-z; fDETECT webserver: fast predictor of propensity for protein production, purification, and crystallization; Fanchi Meng (0), Chen Wang (1), Lukasz Kurgan (2); (0) Department of Electrical and Computer Engineering, University of Alberta; (1) Department of Computer Science, Virginia Commonwealth University; (2) Department of Computer Science, Virginia Commonwealth University; Abstract. Background: Development of predictors of propensity of protein sequences for successful crystallization has been actively pursued for over a decade. A few novel methods that expanded the scope of these predictions to address additional steps of protein production and structure determination pipelines were released in recent years. The predictive performance of the current methods is modest. This is because the only input that they use is the protein sequence and since the experimental annotations of these data might be inconsistent given that they were collected across many laboratories and centers. However, even these modest levels of predictive quality are still practical compared to the reported low success rates of crystallization, which are below 10%. We focus on another important aspect related to a high computational cost of running the predictors that offer the expanded scope. Results: We introduce a novel fDETECT webserver that provides very fast and modestly accurate predictions of the success of protein production, purification, crystallization, and structure determination. Empirical tests on two datasets demonstrate that fDETECT is more accurate than the only other similarly fast method, and similarly accurate and three orders of magnitude faster than the currently most accurate predictors. Our method predicts a single protein in about 120 milliseconds and needs less than an hour to generate the four predictions for an entire human proteome. Moreover, we empirically show that fDETECT secures similar levels of predictive performance when compared with four representative methods that only predict success of crystallization, while it also provides the other three predictions. A webserver that implements fDETECT is available at http://biomine.cs.vcu.edu/servers/fDETECT/ . Conclusions: fDETECT is a computational tool that supports target selection for protein production and X-ray crystallography-based structure determination. It offers predictive quality that matches or exceeds other state-of-the-art tools and is especially suitable for the analysis of large protein sets.; http://link.springer.com/article/10.1186/s12859-017-1995-z; X-ray crystallography; Protein production; Protein structure determination; Target selection; Structural genomics; Prediction |
spellingShingle | Fanchi Meng; Chen Wang; Lukasz Kurgan; fDETECT webserver: fast predictor of propensity for protein production, purification, and crystallization; BMC Bioinformatics; X-ray crystallography; Protein production; Protein structure determination; Target selection; Structural genomics; Prediction |
title | fDETECT webserver: fast predictor of propensity for protein production, purification, and crystallization |
title_sort | fdetect webserver fast predictor of propensity for protein production purification and crystallization |
topic | X-ray crystallography; Protein production; Protein structure determination; Target selection; Structural genomics; Prediction |
url | http://link.springer.com/article/10.1186/s12859-017-1995-z |
work_keys_str_mv | AT fanchimeng fdetectwebserverfastpredictorofpropensityforproteinproductionpurificationandcrystallization AT chenwang fdetectwebserverfastpredictorofpropensityforproteinproductionpurificationandcrystallization AT lukaszkurgan fdetectwebserverfastpredictorofpropensityforproteinproductionpurificationandcrystallization |
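
The timing figures quoted in the abstract (about 120 milliseconds per protein, and under an hour for an entire human proteome) can be sanity-checked with simple arithmetic. The sketch below is illustrative only and is not part of fDETECT: the function name is hypothetical, the proteome size of roughly 20,000 proteins is an assumption that this record does not state, and the factor of 1000 merely stands in for the "three orders of magnitude" comparison.

```python
# Back-of-the-envelope runtime estimate based on the abstract's timing claim.
# ASSUMPTION: ~20,000 proteins in a human proteome (not stated in the record).
ASSUMED_HUMAN_PROTEOME_SIZE = 20_000

# From the abstract: about 120 milliseconds per protein.
SECONDS_PER_PROTEIN = 0.12

# ASSUMPTION: "three orders of magnitude" slower is modeled as exactly 1000x.
SLOWDOWN_FACTOR = 1_000


def batch_runtime_seconds(n_proteins: int, seconds_per_protein: float) -> float:
    """Estimate total sequential runtime for n_proteins predictions (hypothetical helper)."""
    return n_proteins * seconds_per_protein


if __name__ == "__main__":
    fast = batch_runtime_seconds(ASSUMED_HUMAN_PROTEOME_SIZE, SECONDS_PER_PROTEIN)
    slow = batch_runtime_seconds(
        ASSUMED_HUMAN_PROTEOME_SIZE, SECONDS_PER_PROTEIN * SLOWDOWN_FACTOR
    )
    print(f"fast predictor: about {fast / 60:.0f} minutes")
    print(f"1000x slower predictor: about {slow / 86_400:.1f} days")
```

Under these assumptions the fast estimate falls below one hour, which is consistent with the abstract, while a method 1000 times slower would need weeks for the same set when run sequentially.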