PROSO tries to answer the following question:

"Which of my cloned proteins have the best/worst chances to be soluble upon heterologous expression?"

The prediction is based on a classifier exploiting subtle differences between soluble proteins from TargetDB and PDB and notoriously insoluble proteins from TargetDB and literature mining. For more details please read the reference or brief

method description. You can also download supplementary materials.

The classifier will correctly evaluate only proteins without trans-membrane segment.

The input protein sequences are categorized into two classes: YES - soluble; NO - insoluble. Additionally the probability of a class (from 0.5 to 1.0) is provided. The probability threshold value is set by default to 0.5. By increasing it one can expect higher classification precision (selectivity) and lower recall (sensitivity). If the result cannot be calculated, a comment is written.

If you use this server, please cite:
Smialowski P, Martin-Galiano AJ, Mikolajka A, Girschick T, Holak TA and Frishman D. Protein solubility: sequence based prediction and experimental verification. Bioinformatics. 2006 Dec 6; [Epub ahead of print]

Submit your sequence:

Input sequences
Please provide upto 5 unambiguous protein sequences (only 20 amino acids symbols are allowed) of length between 10-2000 amino acids in multiple FASTA format

Sample of input sequence