
pmid: 40842725
pmc: PMC12365672
As part of its extensive Diptera holdings, the Royal Museum for Central Africa (RMCA) houses over 100,000 specimens of Tephritidae and Syrphidae, which represent a critical resource for taxonomic and systematic research. Here, we present a feasibility study evaluating streamlined workflows for genomic data mining and archiving in museum collections. We analysed DNA yield, quality and sequencing performance from more than 1,400 insect vouchers and found few predictable trends, reflecting the nature of heterogeneous and skewed groups of samples collected under largely unknown field conditions. Regardless, our results show that Illumina short read whole genome sequencing can work well even with degraded insect material. In this context, routine short-read sequencing offers a practical first step for genomic data mining, particularly for large collections. It enables us to reserve more complex and resource-intensive methods for the subset of samples that fail initial sequencing (7% of specimens, in our case). As an outcome of this work, RMCA’s archiving system has been adapted to integrate genomic data and metadata alongside traditional specimen records. We argue that genomic data should be treated as an integral component of collection management, enhancing scientific value, supporting long term preservation and improving traceability of genetic resources in natural history collections.
whole genome sequencing, natural history collections, true fruit flies, hoverflies, museomics, Research Article
whole genome sequencing, natural history collections, true fruit flies, hoverflies, museomics, Research Article
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
