Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Journal articles
  4. An Exact Algorithm to Compute the Double-Cut-and-Join Distance for Genomes with Duplicate Genes
 
research article

An Exact Algorithm to Compute the Double-Cut-and-Join Distance for Genomes with Duplicate Genes

Shao, Mingfu  
•
Lin, Yu  
•
Moret, Bernard M. E.  
2015
Journal of Computational Biology

Computing the edit distance between two genomes is a basic problem in the study of genome evolution. The double-cut-and-join (DCJ) model has formed the basis for most algorithmic research on rearrangements over the last few years. The edit distance under the DCJ model can be computed in linear time for genomes without duplicate genes, while the problem becomes NP-hard in the presence of duplicate genes. In this article, we propose an integer linear programming (ILP) formulation to compute the DCJ distance between two genomes with duplicate genes. We also provide an efficient preprocessing approach to simplify the ILP formulation while preserving optimality. Comparison on simulated genomes demonstrates that our method outperforms MSOAR in computing the edit distance, especially when the genomes contain long duplicated segments. We also apply our method to assign orthologous gene pairs among human, mouse, and rat genomes, where once again our method outperforms MSOAR.

  • Details
  • Metrics
Type
research article
DOI
10.1089/cmb.2014.0096
Web of Science ID

WOS:000353778500009

Author(s)
Shao, Mingfu  
Lin, Yu  
Moret, Bernard M. E.  
Date Issued

2015

Publisher

Mary Ann Liebert

Published in
Journal of Computational Biology
Volume

22

Issue

5

Start page

425

End page

435

Subjects

orthology assignment

•

maximum cycle decomposition

•

adjacency graph

•

DCJ distance

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
LCBB  
Available on Infoscience
May 29, 2015
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/114207
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés