Modeling sequencing errors by combining Hidden Markov models

Lottaz, C.; Iseli, C.; Jongeneel, C. V.; Bucher, P.

doi:10.1093/bioinformatics/btg1067

Lottaz, C.; Iseli, C.; Jongeneel, C. V.; Bucher, P.

2003

Formats

Format
BibTeX
MARC
MARCXML
DublinCore
EndNote
NLM
RefWorks
RIS

Abstract

Among the largest resources for biological sequence data is the large amount of expressed sequence tags (ESTs) available in public and proprietary databases. ESTs provide information on transcripts but for technical reasons they often contain sequencing errors. Therefore, when analyzing EST sequences computationally, such errors must be taken into account. Earlier attempts to model error prone coding regions have shown good performance in detecting and predicting these while correcting sequencing errors using codon usage frequencies. In the research presented here, we improve the detection of translation start and stop sites by integrating a more complex mRNA model with codon usage bias based error correction into one hidden Markov model (HMM), thus generalizing this error correction approach to more complex HMMs. We show that our method maintains the performance in detecting coding sequences.

Details

Title Modeling sequencing errors by combining Hidden Markov models

Author(s) Lottaz, C. ; Iseli, C. ; Jongeneel, C. V. ; Bucher, P.

Published in Bioinformatics

Volume 19 Suppl 2

Pages ii103-ii112

Date 2003

Note Swiss Institute of Bioinformatics, Switzerland. Claudio.Lottaz@molgen.mpg.de

DOI https://doi.org/10.1093/bioinformatics/btg1067

Laboratories GR-BUCHER

Record Appears in Scientific production and competences > SV - School of Life Sciences > ISREC - Swiss Institute for Experimental Cancer Research > GR-BUCHER - Bucher Group
Peer-reviewed publications
Work outside EPFL
Journal Articles
Published

Record creation date 2007-12-17

Abstract

Details

Actions