
This record contains the data used to perform the GWAS described in the research paper "A genome-wide association study of arabinoxylan content in flour of triticale (×Triticosecale Wittmack)". dart_curated.csv contains the curated DArTseq SNP panel. SNPs are coded to represent allele dosage, where: 0 = homozygous reference allele, 1 = heterozygous, 2 = homozygous alternative allele. Filtering: MAF >= 0.05, call rate >= 0.85, frequency of heterozygous calls per SNP < frequency homozygous reference or homozygous alternative calls, frequency homozygous calls per SNP != 0. NA imputed by kNN algorithm. phenotypes_blue.csv contains the BLUEs per genotype of the four phenotypes tested: total arabinoxylan content (TOT-AX), water-extractable arabinoxylan content (WE-AX), water-unextractable arabinoxylan content (WU-AX), and the proportion of water-extractable arabinoxylan to total arabinoxylan content (WE/TOT-AX).
