Comparing methods for constructing and representing human pangenome graphs

Name: Comparing methods for constructing and representing human pangenome graphs
Keywords: Genome, Pangenomics, QH301-705.5, Research, Sequence analysis, [INFO.INFO-DS] Computer Science [cs]/Data Structures and Algorithms [cs.DS], Sequence Analysis, DNA, Genomics, QH426-470, Variation graphs

Andreace, Francesco; Lechat, Pierre; Dufresne, Yoann; Chikhi, Rayan

Found an issue? Give us feedback

downloadFull-Text

Genome Biologyarrow_drop_down

Genome Biology

Article

License: CC BY

Full-Text: https://link.springer.com/content/pdf/10.1186/s13059-023-03098-2.pdf

Data sources: Sygma

Genome Biology

Article . 2023 . Peer-reviewed

License: CC BY

Data sources: Crossref

Genome Biology

Article . 2023

Data sources: Europe PubMed Central

PubMed Central

Other literature type . 2023

License: http://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (http://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (http://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Data sources: PubMed Central

Genome Biology

Article . 2023

Data sources: DOAJ

HAL-Pasteur

Article . 2023

License: CC BY

Data sources: HAL-Pasteur

SPIRE - Sciences Po Institutional REpository

Article . 2023

License: CC BY

Data sources: SPIRE - Sciences Po Institutional REpository

HAL Descartes

Article . 2023

License: CC BY

Data sources: HAL Descartes

HAL Sorbonne Université

Article . 2023

License: CC BY

Data sources: HAL Sorbonne Université

Genome Biology

Article . 2023 . Peer-reviewed

Data sources: European Union Open Data Portal

Comparing methods for constructing and representing human pangenome graphs

descriptionPublicationkeyboard_double_arrow_right Article , Other literature type 30 Nov 2023 English Publisher:Springer Science and Business Media LLCJournal:Genome Biology, volume 24 (eissn: 1474-760X,

Copyright policy )Funded by:EC | PANGAIA, EC | ALPACA, ANR | PRAIRIE +2 projects

Authors: Andreace, Francesco; Lechat, Pierre; Dufresne, Yoann; Chikhi, Rayan;

doi: 10.1186/s13059-023-03098-2

pmid: 38037131

pmc: PMC10691155

Comparing methods for constructing and representing human pangenome graphs

- Summary
- Subjects
- Related research
  (4)
- Metrics

Abstract

Abstract Background As a single reference genome cannot possibly represent all the variation present across human individuals, pangenome graphs have been introduced to incorporate population diversity within a wide range of genomic analyses. Several data structures have been proposed for representing collections of genomes as pangenomes, in particular graphs. Results In this work, we collect all publicly available high-quality human haplotypes and construct the largest human pangenome graphs to date, incorporating 52 individuals in addition to two synthetic references (CHM13 and GRCh38). We build variation graphs and de Bruijn graphs of this collection using five of the state-of-the-art tools: , , , and . We examine differences in the way each of these tools represents variations between input sequences, both in terms of overall graph structure and representation of specific genetic loci. Conclusion This work sheds light on key differences between pangenome graph representations, informing end-users on how to select the most appropriate graph type for their application.

Related Organizations

View all View all

Keywords

Genome, Pangenomics, QH301-705.5, Research, Sequence analysis, [INFO.INFO-DS] Computer Science [cs]/Data Structures and Algorithms [cs.DS], Sequence Analysis, DNA, Genomics, QH426-470, Variation graphs, Genetics, Humans, Biology (General), de Bruijn graphs, Algorithms, Software, [INFO.INFO-BI] Computer Science [cs]/Bioinformatics [q-bio.QM]

4 Research products, page 1 of 1

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	31
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 10%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Top 10%
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Top 10%