Disorder Consensus-based Predictor (disCoP)

disCoP webserver

This consensus-based method is designed for in-silico prediction of per-residue protein disorder propensities. It combines four rationally selected input predictors (DISOclust, DISOPRED2, MD, and SPINE-D) using custom-designed features that aggregate their predictions and a binomial deviance-based regression model.

References

Users of disCoP are requested to cite the following article:

Fan X, Kurgan LA, 2014. Accurate prediction of disorder in protein chains with a comprehensive and empirically designed consensus. Journal of Biomolecular Structure and Dynamics, 32(3): 448-464.

Materials

The supplement to the article is available at link (disCoP-Supplement.pdf).

TRAINING dataset - available for download at link (TRAINING.txt). The format of this dataset is as follows:

>Protein_ID fold_x(x:1-3)
Protein sequence
True annotation (1:Disorder, 0:Order)
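The three-line layout above can be read with a few lines of Python. The following is only a minimal sketch under stated assumptions: that the header carries the fold as a literal token such as `fold_2`, that every record occupies exactly three non-empty lines, and that the annotation holds one character per residue. The names `TrainingRecord` and `parse_training` are hypothetical and are not part of disCoP.

```python
from typing import Iterator, NamedTuple


class TrainingRecord(NamedTuple):
    protein_id: str
    fold: int          # fold index from the header, 1 to 3
    sequence: str
    annotation: str    # '1' = disorder, '0' = order, one character per residue


def parse_training(text: str) -> Iterator[TrainingRecord]:
    """Yield records from text in the three-line TRAINING layout."""
    # Assumption: blank lines carry no meaning and can be skipped.
    lines = [ln.strip() for ln in text.splitlines() if ln.strip()]
    if len(lines) % 3 != 0:
        raise ValueError("expected three lines per record")
    for i in range(0, len(lines), 3):
        header, sequence, annotation = lines[i:i + 3]
        if not header.startswith(">"):
            raise ValueError(f"record header must start with '>': {header!r}")
        # Assumption: the header is '>ID fold_x' with a single space-separated fold token.
        protein_id, fold_label = header[1:].split()
        fold = int(fold_label.rsplit("_", 1)[1])
        if len(sequence) != len(annotation):
            raise ValueError(f"{protein_id}: sequence/annotation length mismatch")
        yield TrainingRecord(protein_id, fold, sequence, annotation)


# Made-up record used only to illustrate the layout.
example = ">P1 fold_2\nMKTAYIAK\n11000011\n"
records = list(parse_training(example))
```

Grouping the resulting records by their `fold` field is then a one-line dictionary operation.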

TEST dataset - available for download at link (TEST.txt). The format of this dataset is as follows:

>Protein_ID
Protein sequence
True annotation (1:Disorder, 0:Order)
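Since the annotation line is a string of 1s and 0s aligned with the sequence, disordered segments can be recovered as runs of consecutive 1s. The sketch below assumes one annotation character per residue and reports 1-based, inclusive positions; the function name `disordered_regions` is hypothetical.

```python
from itertools import groupby
from typing import List, Tuple


def disordered_regions(annotation: str) -> List[Tuple[int, int]]:
    """Return (start, end) pairs, 1-based and inclusive, for each run of '1'."""
    regions = []
    position = 1
    for label, run in groupby(annotation):
        length = len(list(run))
        if label == "1":
            regions.append((position, position + length - 1))
        position += length
    return regions


# Made-up annotation string used only for illustration.
annotation = "1100001110"
regions = disordered_regions(annotation)
disorder_fraction = annotation.count("1") / len(annotation)
```

For the string above, `regions` holds two segments (residues 1 to 2 and 7 to 9) and `disorder_fraction` is 0.5.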

TEST_FUNCTION dataset - available for download at link (TEST_FUNCTION.txt). The format of this dataset is as follows:

>Protein_ID
Protein sequence
True annotation (1:Protein-protein binding disorder, 0:Order or other functional disorder)
True annotation (1:Substrate or ligand binding disorder, 0:Order or other functional disorder)
True annotation (1:Protein-DNA binding disorder, 0:Order or other functional disorder)
True annotation (1:Flexible linkers or spacers disorder, 0:Order or other functional disorder)
True annotation (1:Phosphorylation disorder, 0:Order or other functional disorder)
True annotation (1:Autoregulatory disorder, 0:Order or other functional disorder)
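Each TEST_FUNCTION record therefore spans eight lines: a header, the sequence, and six annotation tracks in the order listed above. A minimal sketch of a reader follows, assuming that this order is fixed, that blank lines can be ignored, and that every track has the same length as the sequence. The track keys and the function `parse_test_function` are hypothetical labels chosen for this example.

```python
from typing import Dict

# Hypothetical short names for the six annotation lines, in the listed order.
FUNCTION_TRACKS = (
    "protein_protein_binding",
    "substrate_or_ligand_binding",
    "protein_dna_binding",
    "flexible_linker_or_spacer",
    "phosphorylation",
    "autoregulatory",
)
LINES_PER_RECORD = 2 + len(FUNCTION_TRACKS)  # header + sequence + six tracks


def parse_test_function(text: str) -> Dict[str, dict]:
    """Map each protein ID to its sequence and its six annotation tracks."""
    lines = [ln.strip() for ln in text.splitlines() if ln.strip()]
    records = {}
    for i in range(0, len(lines), LINES_PER_RECORD):
        chunk = lines[i:i + LINES_PER_RECORD]
        if len(chunk) != LINES_PER_RECORD or not chunk[0].startswith(">"):
            raise ValueError(f"malformed record starting at non-empty line {i + 1}")
        protein_id, sequence = chunk[0][1:].strip(), chunk[1]
        tracks = dict(zip(FUNCTION_TRACKS, chunk[2:]))
        for name, track in tracks.items():
            if len(track) != len(sequence):
                raise ValueError(f"{protein_id}: track {name} length mismatch")
        records[protein_id] = {"sequence": sequence, "tracks": tracks}
    return records


# Made-up record used only to illustrate the layout.
example = ">P1\nMKTAYI\n110000\n000000\n000011\n001100\n000000\n000000\n"
function_records = parse_test_function(example)
```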

Help

disCoP accepts either single or multiple protein sequences. The input is limited to 5 protein sequences at a time. The user should submit the protein sequence(s) in FASTA format.

The format of the input file is as follows (example):

>protein name (The server will trim protein names to the first 12 characters)
protein sequence (one letter amino acid code only)
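Before submitting, it may help to check a FASTA file locally against the limits described here. The sketch below is not the server's own validation; it assumes that "one letter amino acid code" means the 20 standard residues, that sequences may be wrapped over several lines, and that lower-case letters can be upper-cased. The function `check_fasta` and the constant names are hypothetical.

```python
from typing import List, Tuple

MAX_SEQUENCES = 5    # stated limit per submission
NAME_LENGTH = 12     # the server keeps the first 12 characters of each name
STANDARD_AA = set("ACDEFGHIKLMNPQRSTVWY")  # assumption: 20 standard residues only


def check_fasta(text: str) -> List[Tuple[str, str]]:
    """Return (trimmed_name, sequence) pairs, or raise ValueError on a problem."""
    entries = []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # Preview the name as it would look after trimming.
            entries.append([line[1:].strip()[:NAME_LENGTH], ""])
        elif not entries:
            raise ValueError("sequence data found before the first '>' header")
        else:
            entries[-1][1] += line.upper()
    if not entries:
        raise ValueError("no sequences found")
    if len(entries) > MAX_SEQUENCES:
        raise ValueError(f"{len(entries)} sequences given, limit is {MAX_SEQUENCES}")
    for name, seq in entries:
        unexpected = sorted(set(seq) - STANDARD_AA)
        if not seq or unexpected:
            raise ValueError(f"{name}: empty sequence or unexpected letters {unexpected}")
    return [(name, seq) for name, seq in entries]


# Made-up input used only for illustration.
checked = check_fasta(">example_protein_one\nMKTAYIAK\nQRQISFVK\n")
```

Because names are cut to 12 characters, two long names that share a prefix would become indistinguishable in the results, so it is worth choosing short, unique identifiers.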

Acknowledgments

We acknowledge with thanks the following software used as a part of this server:

disCoP webserver

1. Upload a file with protein sequences, or paste them into the text area

2. Provide your e-mail address (required)

3. Predict:
