Indexing a sequence for mapping reads with a single mismatch

Maxime Crochemore; Alessio Langiu; M. Sohel Rahman

doi:10.1098/rsta.2013.0167

Article Dans Une Revue Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences Année : 2014

Indexing a sequence for mapping reads with a single mismatch

(1, 2) , (3, 1) , (1)

1
2
3

Maxime Crochemore

Fonction : Auteur
PersonId : 5397
IdHAL : maximecrochemore
ORCID : 0000-0003-1087-1419
IdRef : 034037357

Department of Computer Science [London]

Laboratoire d'Informatique Gaspard-Monge

Alessio Langiu

Fonction : Auteur
PersonId : 767062
ORCID : 0000-0002-5706-7500
IdRef : 168242427

Dipartimento di Energia, ingegneria dell'Informazione e modelli Matematici [Palermo]

Department of Computer Science [London]

M. Sohel Rahman

Fonction : Auteur

Department of Computer Science [London]

Résumé

Mapping reads against a genome sequence is an interesting and useful problem in Computational Molecular Biology and Bioinformatics. In this paper, we focus on the problem of indexing a sequence for mapping reads with a single mismatch. We first focus on a simpler problem where the length of the pattern is given beforehand during the data structure construction. This version of the problem is interesting in its own right in the context of the Next Generation Sequencing (NGS). In the sequel we show how to solve the more general problem. In both cases, our algorithm can construct an efficient data structure in O(n log^(1+ε)n) time and space and can answer subsequent queries in O(m log log n + K) time. Here, n is the length of the sequence, m is the length of the read, 0 < ε < 1 and K is the optimal output size.

Mots clés

algorithms genome sequence indexing mapping reads mismatch pattern matching

Domaines

Bio-informatique [q-bio.QM] Algorithme et structure de données [cs.DS]

Fichier principal

CrochemoreLangiuRahman2014.pdf (245.24 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Philippe Gambette : Connectez-vous pour contacter le contributeur

https://hal.science/hal-01375933

Soumis le : lundi 3 octobre 2016-18:38:16

Dernière modification le : jeudi 4 avril 2024-03:28:34

Archivage à long terme le : vendredi 3 février 2017-14:01:23

Dates et versions

hal-01375933 , version 1 (03-10-2016)

Identifiants

HAL Id : hal-01375933 , version 1
DOI : 10.1098/rsta.2013.0167

Citer

Maxime Crochemore, Alessio Langiu, M. Sohel Rahman. Indexing a sequence for mapping reads with a single mismatch. Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences, 2014, 2 (20130167), pp.1-18. ⟨10.1098/rsta.2013.0167⟩. ⟨hal-01375933⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

ENPC CNRS PARISTECH LIGM LIGM_MOA ESIEE-PARIS UNIV-EIFFEL LIGM_ADA JSE2024

184 Consultations

157 Téléchargements

Indexing a sequence for mapping reads with a single mismatch

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager