Minimal perfect hash functions in large scale bioinformatics Problem

Name: Minimal perfect hash functions in large scale bioinformatics Problem
Keywords: [INFO.INFO-DS] Computer Science [cs]/Data Structures and Algorithms [cs.DS], [INFO.INFO-BI] Computer Science [cs]/Bioinformatics [q-bio.QM]

Limasset, Antoine; Marchet, Camille; Peterlongo, Pierre; Bittner, Lucie

Found an issue? Give us feedback

INRIA2arrow_drop_down

INRIA2

Conference object . 2016

Data sources: INRIA2

HAL-Rennes 1

Conference object . 2016

Data sources: HAL-Rennes 1

HAL Sorbonne Université

Conference object . 2016

Data sources: HAL Sorbonne Université

INRIA a CCSD electronic archive server

Conference object . 2016

Data sources: INRIA a CCSD electronic archive server

Minimal perfect hash functions in large scale bioinformatics Problem

descriptionPublicationkeyboard_double_arrow_right Conference object 01 Jan 2016 France English

Authors: Limasset, Antoine; Marchet, Camille; Peterlongo, Pierre; Bittner, Lucie;

Minimal perfect hash functions in large scale bioinformatics Problem

- Summary
- Subjects
- Metrics

Abstract

. Genomic and metagenomic fields, generating huge sets ofshort genomic sequences, brought their own share of high performanceproblems. To extract relevant pieces of information from the huge datasets generated by current sequencing techniques, one must rely on extremelyscalable methods and solutions. Indexing billions of objects isa task considered too expensive while being a fundamental need in thisfield. In this paper we propose a straightforward indexing structure thatscales to billions of element and we propose two direct applications ingenomics and metagenomics. We show that our proposal solves probleminstances for which no other known solution scales-up. We believe thatmany tools and applications could benefit from either the fundamentaldata structure we provide or from the applications developed from thisstructure.

Country

France

Related Organizations

University of Rennes 1
France
French National Centre for Scientific Research
France
French Institute for Research in Computer Science and Automation
France
Inserm
France
Institut de Recherche en Informatique et Systèmes Aléatoires
France

View all View all

Keywords

[INFO.INFO-DS] Computer Science [cs]/Data Structures and Algorithms [cs.DS], [INFO.INFO-BI] Computer Science [cs]/Bioinformatics [q-bio.QM]

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	0
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

0

Average

Green

Related to Research communities

INRIA