Direkt zum Inhalt

Kandlinger, Florian ; Plach, Maximilian G. ; Merkl, Rainer

AGeNNT: annotation of enzyme families by means of refined neighborhood networks

Kandlinger, Florian, Plach, Maximilian G. und Merkl, Rainer (2017) AGeNNT: annotation of enzyme families by means of refined neighborhood networks. BMC Bioinformatics 18 (1), S. 1-13.

Veröffentlichungsdatum dieses Volltextes: 30 Jan 2018 09:01
Artikel
DOI zum Zitieren dieses Dokuments: 10.5283/epub.36657


Zusammenfassung

Background: Large enzyme families may contain functionally diverse members that give rise to clusters in a sequence similarity network (SSN). In prokaryotes, the genome neighborhood of a gene-product is indicative of its function and thus, a genome neighborhood network (GNN) deduced for an SSN provides strong clues to the specific function of enzymes constituting the different clusters. The ...

Background: Large enzyme families may contain functionally diverse members that give rise to clusters in a sequence similarity network (SSN). In prokaryotes, the genome neighborhood of a gene-product is indicative of its function and thus, a genome neighborhood network (GNN) deduced for an SSN provides strong clues to the specific function of enzymes constituting the different clusters. The Enzyme Function Initiative (http://enzymefunction.org/) offers services that compute SSNs and GNNs. Results: We have implemented AGeNNT that utilizes these services, albeit with datasets purged with respect to unspecific protein functions and overrepresented species. AGeNNT generates refined GNNs (rGNNs) that consist of cluster-nodes representing the sequences under study and Pfam-nodes representing enzyme functions encoded in the respective neighborhoods. For cluster-nodes, AGeNNT summarizes the phylogenetic relationships of the contributing species and a statistic indicates how unique nodes and GNs are within this rGNN. Pfam-nodes are annotated with additional features like GO terms describing protein function. For edges, the coverage is given, which is the relative number of neighborhoods containing the considered enzyme function (Pfam-node). AGeNNT is available at https://github.com/kandlinf/agennt. Conclusions: An rGNN is easier to interpret than a conventional GNN, which commonly contains proteins without enzymatic function and overly specific neighborhoods due to phylogenetic bias. The implemented filter routines and the statistic allow the user to identify those neighborhoods that are most indicative of a specific metabolic capacity. Thus, AGeNNT facilitates to distinguish and annotate functionally different members of enzyme families.



Beteiligte Einrichtungen


Details

DokumentenartArtikel
Titel eines Journals oder einer ZeitschriftBMC Bioinformatics
Verlag:BIOMED CENTRAL LTD
Ort der Veröffentlichung:LONDON
Band:18
Nummer des Zeitschriftenheftes oder des Kapitels:1
Seitenbereich:S. 1-13
Datum25 Mai 2017
InstitutionenBiologie und Vorklinische Medizin > Institut für Biophysik und physikalische Biochemie > Prof. Dr. Rainer Merkl
Identifikationsnummer
WertTyp
10.1186/s12859-017-1689-6DOI
Stichwörter / KeywordsSEQUENCE SIMILARITY NETWORKS; PROKARYOTIC GENOMES; METABOLIC PATHWAYS; ESCHERICHIA-COLI; WEB TOOL; BIOSYNTHESIS; DATABASE; CLASSIFICATION; VISUALIZATION; SPECIFICITY; Sequence similarity network; SSN; Genome neighborhood network; GNN; Genome content; Enzyme function; Homology-free annotation
Dewey-Dezimal-Klassifikation500 Naturwissenschaften und Mathematik > 530 Physik
StatusVeröffentlicht
BegutachtetJa, diese Version wurde begutachtet
An der Universität Regensburg entstandenJa
URN der UB Regensburgurn:nbn:de:bvb:355-epub-366573
Dokumenten-ID36657

Bibliographische Daten exportieren

Nur für Besitzer und Autoren: Kontrollseite des Eintrags

nach oben