Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. A fast and exact algorithm for the exemplar breakpoint distance
 
conference paper

A fast and exact algorithm for the exemplar breakpoint distance

Shao, Mingfu  
•
Moret, Bernard
2015
Proc. 19th Int'l Conf. on Research in Comput. Molecular Bio. RECOMB'15
19th Int't Conf. on Research in Comput. Molecular Bio. RECOMB'15

A fundamental problem in comparative genomics is to compute the distance between two genomes. For two genomes without duplicate genes, we can easily compute a variety of distance measures in linear time, but the problem is NP-hard under most models when genomes contain duplicate genes. Sankoff proposed the use of exemplars to tackle the problem of duplicates genes and gene families: each gene family is represented by a single gene (the exemplar for that family), chosen so as to optimize some metric. Unfortunately, choosing exemplars is itself an NP-hard problem. In this paper, we propose a very fast and exact algorithm to compute the exemplar breakpoint distance, based on new insights in the underlying structure of genome rearrangements and exemplars. We evaluate the performance of our algorithm on simulation data and compare its performance to the best effort to date (a divide-and-conquer approach), showing that our algorithm runs much faster and scales much better. We also devise a new algorithm for the generalized breakpoint distance problem, which can then be applied to assign orthologs. We compare our algorithm with the state-of-the-art method MSOAR by assigning orthologs among five well annotated mammalian genomes, showing that our algorithm runs much faster and is slightly more accurate than MSOAR.

  • Details
  • Metrics
Type
conference paper
DOI
10.1007/978-3-319-16706-0_31
Web of Science ID

WOS:000361983900031

Author(s)
Shao, Mingfu  
Moret, Bernard
Date Issued

2015

Publisher

Springer

Publisher place

Berlin

Published in
Proc. 19th Int'l Conf. on Research in Comput. Molecular Bio. RECOMB'15
ISBN of the book

978-3-319-16706-0

978-3-319-16705-3

Total of pages

14

Series title/Series vol.

Lecture Notes in Computer Science; 9029

Start page

309

End page

322

Subjects

Exemplar

•

Breakpoint distance

•

ILP

•

Orthology assignment

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
LCBB  
Event name
19th Int't Conf. on Research in Comput. Molecular Bio. RECOMB'15
Available on Infoscience
June 9, 2015
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/114958
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés