Disorder Consensus-based Predictor (disCoP)

disCoP webserver

This consensus-based method is designed for in-silico prediction of per-residue protein disorder propensities. It combines four rationally selected input predictors: DISOclust, DISOPRED2, MD, and SPINE-D, using custom-designed features that aggregate their predictions and binomial deviance-based regression model.

Please follow the three steps below to make predictions:

1. Upload a file with protein sequences, or paste them into text area

Server accepts up to 5 (FASTA formated) protein sequences. Either upload a file or enter each protein in a new line in the following text field (see Help for details):

2. Provide your e-mail address (required)

Please provide your e-mail address to be notified when results are ready.

3. Predict:

Click button to launch prediction.

Materials

    Datasets:
  • TRAINING dataset - available for download at link (TRAINING.txt). Format of this dataset is as follow:
  1. >Protein_ID fold_x(x:1-3)
  2. Protein sequence
  3. True annotation (1:Disorder, 0:Order)
  • TEST dataset - available for download at link (TEST.txt). Format of this dataset is as follow:
  1. >Protein_ID
  2. Protein sequence
  3. True annotation (1:Disorder, 0:Order)
  • TEST_FUNCTION dataset - available for download at link (TEST_FUNCTION.txt). Format of this dataset is as follow:
  1. >Protein_ID
  2. Protein sequence
  3. True annotation (1:Protein-protein binding disorder, 0:Order or other functional disorder)
  4. True annotation (1:Substrate or ligand binding disorder, 0:Order or other functional disorder)
  5. True annotation (1:Protein-DNA binding disorder, 0:Order or other functional disorder)
  6. True annotation (1:Flexible linkers or spacers disorder, 0:Order or other functional disorder)
  7. True annotation (1:Phosphorylation disorder, 0:Order or other functional disorder)
  8. True annotation (1:Autoregulatory disorder, 0:Order or other functional disorder)

Help

disCoP accepts either single or multiple protein sequences. The input is limited to 5 protein sequences at the time. The user should submit the protein sequence(s) in FASTA format.

The format of the input file is as follows (example):

  1. >protein name (The server will trim protein names to first 12 characters)
  2. protein sequence (one letter amino acid code only)

Acknowledgments

We acknowledge with thanks the following software used as a part of this server: