Indexing Arbitrary-Length k-Mers in Sequencing Reads.

Título

Indexing Arbitrary-Length k-Mers in Sequencing Reads.

Autor

Tomasz Kowalski, Szymon Grabowski, Sebastian Deorowicz

Descripción

We propose a lightweight data structure for indexing and querying collections of NGS reads data in main memory. The data structure supports the interface proposed in the pioneering work by Philippe et al. for counting and locating k-mers in sequencing reads. Our solution, PgSA (pseudogenome suffix array), based on finding overlapping reads, is competitive to the existing algorithms in the space use, query times, or both. The main applications of our index include variant calling, error correction and analysis of reads from RNA-seq experiments.

Fecha

2015

Identificador

DOI: 10.1371/journal.pone.0133198

Fuente

PLoS ONE

Editor

Public Library of Science (PLoS)

Cobertura

Science, Medicine

Idioma

EN

Archivos

https://socictopen.socict.org/files/to_import/pdfs/article 1426.pdf

Colección

Citación

Tomasz Kowalski, Szymon Grabowski, Sebastian Deorowicz, “Indexing Arbitrary-Length k-Mers in Sequencing Reads.,” SOCICT Open, consulta 18 de abril de 2026, https://www.socictopen.socict.org/items/show/1383.

Formatos de Salida

Position: 9436 (26 views)