Direkt zum Inhalt

Lottaz, Claudio ; Iseli, Christian ; Jongeneel, C. Victor ; Bucher, Philipp

Modeling sequencing errors by combining Hidden Markov models

Lottaz, Claudio, Iseli, Christian, Jongeneel, C. Victor and Bucher, Philipp (2003) Modeling sequencing errors by combining Hidden Markov models. Bioinformatics 19 (Suppl2), ii103-ii112.

Date of publication of this fulltext: 02 Dec 2015 10:12
Article
DOI to cite this document: 10.5283/epub.32949


Abstract

Among the largest resources for biological sequence data is the large amount of expressed sequence tags (ESTs) available in public and proprietary databases. ESTs provide information on transcripts but for technical reasons they often contain sequencing errors. Therefore, when analyzing EST sequences computationally, such errors must be taken into account. Earlier attempts to model error prone ...

Among the largest resources for biological sequence data is the large amount of expressed sequence tags (ESTs) available in public and proprietary databases. ESTs provide information on transcripts but for technical reasons they often contain sequencing errors. Therefore, when analyzing EST sequences computationally, such errors must be taken into account. Earlier attempts to model error prone coding regions have shown good performance in detecting and predicting these while correcting sequencing errors using codon usage frequencies. In the research presented here, we improve the detection of translation start and stop sites by integrating a more complex mRNA model with codon usage bias based error correction into one hidden Markov model (HMM), thus generalizing this error correction approach to more complex HMMs. We show that our method maintains the performance in detecting coding sequences.



Involved Institutions


Details

Item typeArticle
Journal or Publication TitleBioinformatics
Publisher:Oxford Univ. Press
Volume:19
Number of Issue or Book Chapter:Suppl2
Page Range:ii103-ii112
Date9 June 2003
InstitutionsMedicine > Institut für Funktionelle Genomik > Lehrstuhl für Statistische Bioinformatik (Prof. Spang)
Informatics and Data Science > Department Computational Life Science > Lehrstuhl für Statistische Bioinformatik (Prof. Spang)
Identification Number
ValueType
10.1093/bioinformatics/btg1067DOI
Keywordscoding region prediction, sequencing errors, expressed sequence tags, hidden Markov models
Dewey Decimal Classification000 Computer science, information & general works > 004 Computer science
StatusPublished
RefereedYes, this version has been refereed
Created at the University of RegensburgNo
URN of the UB Regensburgurn:nbn:de:bvb:355-epub-329497
Item ID32949

Export bibliographical data

Owner only: item control page

nach oben