
All example datasets for 'The Portable Microhaplotype Object and Tools' manuscript are available. The archive contains five folders, each corresponding to one dataset: Dataset1: Public genomic surveillance data of Plasmodium falciparum from four countries: Eswatini, Namibia, South Africa, and Zambia. ANOSPP: Combined Anopheles and Plasmodium data. mips_v_mad4hatter: Data from the MAD4HatTeR amplicon sequencing assay and the DR23K molecular inversion probe (MIP) assay comparison. E_coli: Escherichia coli datasets sourced from the Sequence Read Archive (SRA). S_aureus: Staphylococcus aureus datasets sourced from SRA. For Dataset1 and ANOSPP, the archive includes all raw data files as well as Jupyter notebooks used to generate the PMO. Dataset1 additionally includes the notebook used to produce Figure 3. PMOs for all five datasets are included. Further details about the file structure and contents are provided in the README.txt file with the data.
