Powered by OpenAIRE graph
Found an issue? Give us feedback
ZENODOarrow_drop_down
ZENODO
Dataset . 2026
License: CC BY
Data sources: Datacite
ZENODO
Dataset . 2026
License: CC BY
Data sources: Datacite
ZENODO
Dataset . 2026
License: CC BY
Data sources: Datacite
versions View all 3 versions
addClaim

Genomic Variant Dataset (VCF) from Ethiopian Sorghum Founder Lines and Key Landrace Accessions

Authors: Wondifraw, Meseret; Sheehan, Moira; Beil, Craig; Zhao, Dongyan; Crain, Jared; Legesse, Tokuma;

Genomic Variant Dataset (VCF) from Ethiopian Sorghum Founder Lines and Key Landrace Accessions

Abstract

This dataset contains high-quality genome-wide variant calls (VCF format) generated from whole-genome sequencing (WGS) of 185 Ethiopian sorghum (Sorghum bicolor) accessions that passed quality control filtering. The original dataset included 188 accessions representing founder lines prioritized by the Ethiopian Institute of Agricultural Research (EIAR) breeding program and diverse landraces spanning major agroecological zones of Ethiopia, including arid, semi-arid, sub-humid, and humid environments. After quality assessment using FastQC and read filtering, 185 samples were retained for downstream variant calling. Raw FASTQ files for all 188 accessions have been deposited in the NCBI Sequence Read Archive (SRA) under BioProject accession PRJNA1428287. Reads were aligned to the Sorghum bicolor reference genome assembly NCBIv3 (GCF_000003195.3). Variant calling was performed using a standardized pipeline, followed by stringent filtering to retain high-confidence biallelic SNPs located on primary chromosomes. The final VCF file includes variants filtered using the following criteria: • Depth (DP) between 10 and 50• Missing rate ≤ 20%• Minor allele frequency (MAF) ≥ 0.05• Main chromosomes only The file provided here: Sorghum_WGS_185samples_MAINCHR_DP10_50_MISS20_MAF05_NCBIv3.vcf.gz contains a curated genome-wide variant dataset suitable for analyses of genetic diversity, population structure, genome-wide association studies, and genomics-assisted breeding applications. This version includes accession metadata and sequencing coverage summaries for the sorghum panel. These data are provided in Ethiopian_sorghum_accession_metadata.csv and Ethiopian_sorghum_sequencing_coverage.csv.

Keywords

Ethiopian Sorghum, sorghum bicolor, VCF, Whole genome sequencing, SNP, Agroecological adaptation, Climate resilence, Population genomics, landrace, WGS, Accession

  • BIP!
    Impact byBIP!
    selected citations
    These citations are derived from selected sources.
    This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    0
    popularity
    This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
    Average
    influence
    This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    Average
    impulse
    This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
    Average
Powered by OpenAIRE graph
Found an issue? Give us feedback
selected citations
These citations are derived from selected sources.
This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Citations provided by BIP!
popularity
This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
BIP!Popularity provided by BIP!
influence
This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Influence provided by BIP!
impulse
This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
BIP!Impulse provided by BIP!
0
Average
Average
Average