
This dataset contains high-quality genome-wide variant calls (VCF format) generated from whole-genome sequencing (WGS) of 185 Ethiopian sorghum (Sorghum bicolor) accessions that passed quality control filtering. The original dataset included 188 accessions representing founder lines prioritized by the Ethiopian Institute of Agricultural Research (EIAR) breeding program and diverse landraces spanning major agroecological zones of Ethiopia, including arid, semi-arid, sub-humid, and humid environments. After quality assessment using FastQC and read filtering, 185 samples were retained for downstream variant calling. Raw FASTQ files for all 188 accessions have been deposited in the NCBI Sequence Read Archive (SRA) under BioProject accession PRJNA1428287. Reads were aligned to the Sorghum bicolor reference genome assembly NCBIv3 (GCF_000003195.3). Variant calling was performed using a standardized pipeline, followed by stringent filtering to retain high-confidence biallelic SNPs located on primary chromosomes. The final VCF file includes variants filtered using the following criteria: • Depth (DP) between 10 and 50• Missing rate ≤ 20%• Minor allele frequency (MAF) ≥ 0.05• Main chromosomes only The file provided here: Sorghum_WGS_185samples_MAINCHR_DP10_50_MISS20_MAF05_NCBIv3.vcf.gz contains a curated genome-wide variant dataset suitable for analyses of genetic diversity, population structure, genome-wide association studies, and genomics-assisted breeding applications. This version includes accession metadata and sequencing coverage summaries for the sorghum panel. These data are provided in Ethiopian_sorghum_accession_metadata.csv and Ethiopian_sorghum_sequencing_coverage.csv.
Ethiopian Sorghum, sorghum bicolor, VCF, Whole genome sequencing, SNP, Agroecological adaptation, Climate resilence, Population genomics, landrace, WGS, Accession
Ethiopian Sorghum, sorghum bicolor, VCF, Whole genome sequencing, SNP, Agroecological adaptation, Climate resilence, Population genomics, landrace, WGS, Accession
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
