
This directory contains highly accurate somatic benchmarking single nucleotide variants (SNVs), small indels (indels), structural variations (SVs), mobile element insertions (MEIs) and mitochondrial variants(mito) for the HapMap cell line mixture. The HapMap mixture consists of 83.5% of HG005 (Chinese male), 10% of HG02622 (European/Ashkenazi Jewish male), 2% of HG002 (African female), 2% of HG02257 (African male), 2% of HG02486 (East Asian female), and 0.5% HG00438 (Chinese male). The vcf.gz and bed files are intended to be used together to benchmark the performance of somatic variant callers. The benchmarking region is defined as the intersection of reliable regions across the 12 haploid assemblies from the 6 HapMap samples, encompassing 83.9% of GRCh38 and 84.0% of CHM13, respectively. Each benchmark set was produced from a minigraph-cactus graph constructed using: 1) The selected reference assembly (GRCh38, T2T-CHM13, HG005 maternal/paternal assemblies, or a personalized HapMap mixture assembly) and 2) Twelve haploid assemblies from the six HapMap samples. The following paper(s) can be cited for use of the benchmark variant set: Kong, Nahyun, et al. "A Pangenomic Method for Establishing a Somatic Variant Detection Resource in HapMap Mixtures." bioRxiv (2025): 2025-09.
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
