Powered by OpenAIRE graph
Found an issue? Give us feedback
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/ ZENODOarrow_drop_down
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
ZENODO
Dataset . 2024
License: CC 0
Data sources: ZENODO
DRYAD
Dataset . 2024
License: CC 0
Data sources: Datacite
versions View all 2 versions
addClaim

Data from: Minimally destructive hDNA extraction method for retrospective genetics of pinned historical Lepidoptera specimens

Authors: Rayo, Enrique; Ulrich, Gabriel; Zemp, Niklaus; Greeff, Michael; Schuenemann, Verena J.; Widmer, Alex; Fischer, Martin C.;

Data from: Minimally destructive hDNA extraction method for retrospective genetics of pinned historical Lepidoptera specimens

Abstract

# Data from: Minimally destructive hDNA extraction method for retrospective genetics of pinned historical Lepidoptera specimens [https://doi.org/10.5061/dryad.pzgmsbcvf](https://doi.org/10.5061/dryad.pzgmsbcvf) ***Melitaea diamina*** **draft reference genome assembly** **Martin C. Fischer (ETH Zurich) and Natalia Zajac (Functional Genomic Center Zurich)** The draft *de novo* *Melitaea* *diamina* reference genome (butterfly_v1.asm.bp.p_ctg_mtDNA_masked.fa) was based on 6.2 Gb PacBio HiFi reads and assembled and sequenced at the Functional Genomics Center Zurich, FGCZ. The genome was assembled and purged for duplicates with hifiasm (Cheng* et al.* 2021) and the parameters -l3 -s 0.55. The assembled genome is 805 Mb long and encompasses 3,918 contigs and has a BUSCO (v5.2.2 arthropoda_odb10; (Manni* et al.* 2021)value of 96.2%. The mtDNA contigs were masked except for one contiguous mtDNA sequence to ensure consistent mapping on mtDNA. | **Summery statistics of butterfly\_v1.asm.bp.p\_ctg\_mtDNA\_masked.fa** | | | :---------------------------------------------------------------------- | ------------------------------------------------------------------------------------------- | | COMPOSITION | A = 264817474 (32.9%) | | \*\* \*\* | \*\* \*\* | | **SCAFFOLD** | **sum = 805928632, n = 3918**, mean = 205698.987238387, largest = 2493802, smallest = 15168 | | SCAFFOLD | **N50 = 317515**, L50 = 772 | | SCAFFOLD | N60 = 251763, L60 = 1057 | | SCAFFOLD | N70 = 199969, L70 = 1416 | | SCAFFOLD | N80 = 150164, L80 = 1879 | | SCAFFOLD | N90 = 96505, **L90 = 2543** | | SCAFFOLD | N100 = 15168, L100 = 3918 | | | | | CONTIG | sum = 805890766, n = 3924, mean = 205374.812945974, largest = 2493802, smallest = 15168 | | CONTIG | N50 = 317249, L50 = 773 | | CONTIG | N60 = 251236, L60 = 1058 | | CONTIG | N70 = 199883, L70 = 1419 | | CONTIG | N80 = 149656, L80 = 1883 | | CONTIG | N90 = 96400, L90 = 2548 | | CONTIG | N100 = 15168, L100 = 3924 | | **GAP** | sum = **37866, n = 6**, mean = 6311, largest = 15149, smallest = 1075 | | | | | \*\*BUSCO \*\*v5.2.2 arthropoda\_odb10 (eukaryota, 2020-09-10) | C:**96.2%**\[S:68.8%,D:27.4%],F:1.4%,M:2.4%,n:1013 | **RAW PacBio Hifi reads** The raw MiSeq reads and PacBio HiFi longreads for the Melitaea diamina individuals can be found on the European Nucleotide Archive (ENA, [www.ebi.ac.uk](http://www.ebi.ac.uk), PRJEB72438). Cheng H, Concepcion GT, Feng X, Zhang H, Li H (2021) Haplotype-resolved de novo assembly using phased assembly graphs with hifiasm. *Nat Methods* **18**, 170-175. Manni M, Berkeley MR, Seppey M, Simao FA, Zdobnov EM (2021) BUSCO update: novel and streamlined workflows along with broader and deeper phylogenetic coverage for scoring of eukaryotic, prokaryotic, and viral genomes. *Mol Biol Evol* **38**, 4647-4654.

The millions of specimens stored in entomological collections provide a unique opportunity to study historical insect diversity. Current technologies allow to sequence entire genomes of historical specimens and estimate past genetic diversity of present-day endangered species, advancing our understanding of anthropogenic impact on genetic diversity and enabling the implementation of conservation strategies. A limiting challenge is the extraction of historical DNA (hDNA) of adequate quality for sequencing platforms. We tested four hDNA extraction protocols on five body parts of pinned false heath fritillary butterflies, Melitaea diamina, aiming to minimise specimen damage, preserve their scientific value to the collections, and maximise DNA quality and yield for whole-genome re-sequencing. We developed a very effective approach that successfully recovers hDNA appropriate for short-read sequencing from a single leg of pinned specimens using silica-based DNA extraction columns and an extraction buffer that includes SDS, Tris, Proteinase K, EDTA, NaCl, PTB, and DTT. We observed substantial variation in the ratio of nuclear to mitochondrial DNA in extractions from different tissues, indicating that optimal tissue choice depends on project aims and anticipated downstream analyses. We found that sufficient DNA for whole genome re-sequencing can reliably be extracted from a single leg, opening the possibility to monitor changes in genetic diversity maintaining the scientific value of specimens while supporting current and future conservation strategies.

The draft de novo Melitaea diamina reference genome (butterfly_v1.asm.bp.p_ctg_mtDNA_masked.fa) was based on 6.2 Gb PacBio HiFi reads and assembled and sequenced at the Functional Genomics Center Zurich, FGCZ. The genome was assembled and purged for duplicates with hifiasm (Cheng et al. 2021) and the parameters -l3 -s 0.55. The assembled genome is 805 Mb long and encompasses 3,918 contigs and has a BUSCO (v5.2.2 arthropoda_odb10; (Manni et al. 2021)value of 96.2%. The mtDNA contigs were masked except for one contiguous mtDNA sequence to ensure consistent mapping on mtDNA.

Related Organizations
Keywords

butterfly, FOS: Biological sciences, Draft Reference genome assembly, Melitaea diamina

  • BIP!
    Impact byBIP!
    selected citations
    These citations are derived from selected sources.
    This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    0
    popularity
    This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
    Average
    influence
    This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    Average
    impulse
    This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
    Average
Powered by OpenAIRE graph
Found an issue? Give us feedback
selected citations
These citations are derived from selected sources.
This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Citations provided by BIP!
popularity
This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
BIP!Popularity provided by BIP!
influence
This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Influence provided by BIP!
impulse
This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
BIP!Impulse provided by BIP!
0
Average
Average
Average