| Veröffentlichte Version Download ( PDF | 586kB) | Lizenz: Creative Commons Namensnennung 3.0 de |
CLIPS-1D: Analysis of multiple sequence alignments to deduce for residue-positions a role in catalysis, ligand-binding, or protein structure
Janda, Jan-Oliver, Busch, Markus, Kück, Fabian, Porfenenko, Mikhail und Merkl, Rainer
(2012)
CLIPS-1D: Analysis of multiple sequence alignments to deduce for residue-positions a role in catalysis, ligand-binding, or protein structure.
BMC Bioinformatics 13, S. 55.
Veröffentlichungsdatum dieses Volltextes: 12 Apr 2012 11:08
Artikel
DOI zum Zitieren dieses Dokuments: 10.5283/epub.23790
Zusammenfassung
Background: One aim of the in silico characterization of proteins is to identify all residue-positions, which are crucial for function or structure. Several sequence-based algorithms exist, which predict functionally important sites. However, with respect to sequence information, many functionally and structurally important sites are hard to distinguish and consequently a large number of ...
Background: One aim of the in silico characterization of proteins is to identify all residue-positions, which are crucial for function or structure. Several sequence-based algorithms exist, which predict functionally important sites. However, with respect to sequence information, many functionally and structurally important sites are hard to distinguish and consequently a large number of incorrectly predicted functional sites have to be expected. This is why we were interested to design a new classifier that differentiates between functionally and structurally important sites and to assess its performance on representative datasets. Results: We have implemented CLIPS-1D, which predicts a role in catalysis, ligand-binding, or protein structure for residue-positions in a mutually exclusive manner. By analyzing a multiple sequence alignment, the algorithm scores conservation as well as abundance of residues at individual sites and their local neighborhood and categorizes by means of a multiclass support vector machine. A cross-validation confirmed that residue-positions involved in catalysis were identified with state-of-the-art quality; the mean MCC-value was 0.34. For structurally important sites, prediction quality was considerably higher (mean MCC = 0.67). For ligand-binding sites, prediction quality was lower (mean MCC = 0.12), because binding sites and structurally important residue-positions share conservation and abundance values, which makes their separation difficult. We show that classification success varies for residues in a class-specific manner. This is why our algorithm computes residue-specific p-values, which allow for the statistical assessment of each individual prediction. CLIPS-1D is available as a Web service at http://www-bioinf.uni-regensburg.de/. Conclusions: CLIPS-1D is a classifier, whose prediction quality has been determined separately for catalytic sites, ligand-binding sites, and structurally important sites. It generates hypotheses about residue-positions important for a set of homologous proteins and focuses on conservation and abundance signals. Thus, the algorithm can be applied in cases where function cannot be transferred from well-characterized proteins by means of sequence comparison.
Alternative Links zum Volltext
Beteiligte Einrichtungen
Details
| Dokumentenart | Artikel | ||||
| Titel eines Journals oder einer Zeitschrift | BMC Bioinformatics | ||||
| Verlag: | BIOMED CENTRAL LTD | ||||
|---|---|---|---|---|---|
| Ort der Veröffentlichung: | LONDON | ||||
| Band: | 13 | ||||
| Seitenbereich: | S. 55 | ||||
| Datum | 5 April 2012 | ||||
| Institutionen | Biologie und Vorklinische Medizin > Institut für Biophysik und physikalische Biochemie > Prof. Dr. Rainer Merkl | ||||
| Identifikationsnummer |
| ||||
| Stichwörter / Keywords | INDOLE-3-GLYCEROL PHOSPHATE SYNTHASE; FUNCTIONALLY IMPORTANT RESIDUES; EXCHANGE MASS-SPECTROMETRY; STRUCTURE PREDICTION; SULFOLOBUS-SOLFATARICUS; CONSERVED RESIDUES; NUCLEIC-ACIDS; 3D STRUCTURE; SERVER; SITES; | ||||
| Dewey-Dezimal-Klassifikation | 500 Naturwissenschaften und Mathematik > 570 Biowissenschaften, Biologie 500 Naturwissenschaften und Mathematik > 570 Biowissenschaften, Biologie | ||||
| Status | Veröffentlicht | ||||
| Begutachtet | Ja, diese Version wurde begutachtet | ||||
| An der Universität Regensburg entstanden | Zum Teil | ||||
| URN der UB Regensburg | urn:nbn:de:bvb:355-epub-237902 | ||||
| Dokumenten-ID | 23790 |
Downloadstatistik
Downloadstatistik