TY - JOUR
AU - Walsh, I.
AU - Giollo, M.
AU - Di Domenico, T.
AU - Ferrari, C.
AU - Zimmermann, O.
AU - Tosatto, S. C. E.
TI - Comprehensive large-scale assessment of intrinsic protein disorder
JO - Bioinformatics
VL - 31
IS - 2
SN - 0266-7061
CY - Oxford
PB - Oxford Univ. Press
M1 - FZJ-2015-00109
SP - 201-208
PY - 2015
AB - Motivation: Intrinsically disordered regions are key for the function of numerous proteins. Due to the difficulties in experimental disorder characterization, many computational predictors have been developed with various disorder flavors. Their performance is generally measured on small sets mainly from experimentally solved structures, e.g. Protein Data Bank (PDB) chains. MobiDB has only recently started to collect disorder annotations from multiple experimental structures.Results: MobiDB annotates disorder for UniProt sequences, allowing us to conduct the first large-scale assessment of fast disorder predictors on 25 833 different sequences with X-ray crystallographic structures. In addition to a comprehensive ranking of predictors, this analysis produced the following interesting observations. (i) The predictors cluster according to their disorder definition, with a consensus giving more confidence. (ii) Previous assessments appear over-reliant on data annotated at the PDB chain level and performance is lower on entire UniProt sequences. (iii) Long disordered regions are harder to predict. (iv) Depending on the structural and functional types of the proteins, differences in prediction performance of up to 10% are observed.
LB - PUB:(DE-HGF)16
UR - <Go to ISI:>//WOS:000347832300008
C6 - pmid:25246432
DO - DOI:10.1093/bioinformatics/btu625
UR - https://juser.fz-juelich.de/record/186000
ER -