Abstract

Sequencing has revolutionized biology by permitting the analysis of genomic variation at unprecedented resolution. High-throughput sequencing is fast and inexpensive, making it accessible to a wide range of research questions. However, the data produced contain subtle but complex errors, biases and uncertainties that pose several statistical and computational challenges for the reliable detection of variants. To tap the full potential of high-throughput sequencing, a thorough understanding of the data produced, as well as of the available methodologies, is required. Here, I review several commonly used methods for generating and processing next-generation resequencing data, discuss the influence of errors and biases together with their implications for downstream analyses, and provide general guidelines and recommendations for producing high-quality single-nucleotide polymorphism data sets from raw reads, highlighting several sophisticated reference-based methods that represent the current state of the art.
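
As a concrete illustration of the kind of reference-based workflow discussed here, the sketch below outlines a minimal read-mapping and SNP-calling pipeline. The specific tools (BWA-MEM, SAMtools, BCFtools), file names (ref.fa, reads_1.fastq.gz, reads_2.fastq.gz) and filter thresholds are illustrative assumptions rather than methods prescribed by the text; any comparable mapper and variant caller could be substituted.

```python
# Minimal sketch of a reference-based SNP-calling pipeline (illustrative only;
# tool choices and thresholds are assumptions, not prescribed by the text).
import subprocess

REF = "ref.fa"                                     # hypothetical reference genome (FASTA)
READS = ["reads_1.fastq.gz", "reads_2.fastq.gz"]   # hypothetical paired-end reads
BAM = "sample.sorted.bam"
VCF = "sample.calls.vcf.gz"


def run(cmd):
    """Run a shell command and fail loudly if it exits non-zero."""
    print("+", cmd)
    subprocess.run(cmd, shell=True, check=True)


# 1. Index the reference once so the mapper can use it.
run(f"bwa index {REF}")

# 2. Map reads to the reference and sort the alignments by coordinate.
run(f"bwa mem {REF} {' '.join(READS)} | samtools sort -o {BAM} -")
run(f"samtools index {BAM}")

# 3. Compute genotype likelihoods from the pileup and call variants;
#    '-v' restricts the output to variant sites (candidate SNPs/indels).
run(f"bcftools mpileup -f {REF} {BAM} | bcftools call -mv -Oz -o {VCF}")
run(f"bcftools index {VCF}")

# 4. A simple hard filter on site quality and depth as a first-pass QC step
#    (thresholds are placeholders; appropriate values are data-dependent).
run(f"bcftools view -e 'QUAL<30 || DP<10' {VCF} -Oz -o sample.filtered.vcf.gz")
```

In practice, each step introduces its own errors and biases (mapping ambiguity, duplicate reads, miscalibrated base qualities), which is why the processing and filtering choices reviewed in the article matter for the quality of the final SNP data set.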
