Comprehensive large-scale assessment of intrinsic protein disorder


MOTIVATION Intrinsically disordered regions are key for the function of numerous proteins. Due to the difficulties in experimental disorder characterization, many computational predictors have been developed with various disorder flavors. Their performance is generally measured on small sets mainly from experimentally solved structures, e.g. Protein Data Bank (PDB) chains. MobiDB has only recently started to collect disorder annotations from multiple experimental structures. RESULTS MobiDB annotates disorder for UniProt sequences, allowing us to conduct the first large-scale assessment of fast disorder predictors on 25 833 different sequences with X-ray crystallographic structures. In addition to a comprehensive ranking of predictors, this analysis produced the following interesting observations. (i) The predictors cluster according to their disorder definition, with a consensus giving more confidence. (ii) Previous assessments appear over-reliant on data annotated at the PDB chain level and performance is lower on entire UniProt sequences. (iii) Long disordered regions are harder to predict. (iv) Depending on the structural and functional types of the proteins, differences in prediction performance of up to 10% are observed. AVAILABILITY The datasets are available from Web site at URL: SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.

DOI: 10.1093/bioinformatics/btu625

12 Figures and Tables

Citations per Year

57 Citations

Semantic Scholar estimates that this publication has 57 citations based on the available data.

See our FAQ for additional information.

Cite this paper

@article{Walsh2015ComprehensiveLA, title={Comprehensive large-scale assessment of intrinsic protein disorder}, author={Ian Walsh and Manuel Giollo and Tom{\'a}s Di Domenico and Carlo Ferrari and Olav Zimmermann and Silvio C. E. Tosatto}, journal={Bioinformatics}, year={2015}, volume={31 2}, pages={201-8} }