
Abstract

Each holotype specimen provides the only objective link to a particular Linnean binomen. Sequence information from holotypes is increasingly valuable due to the growing use of DNA barcodes in taxonomy. As type specimens are often old, it may only be possible to recover fragmentary sequence information from them. We tested the efficacy of short sequences from type specimens in the resolution of a challenging taxonomic puzzle: the Elachista dispunctella complex, which includes 64 described species with minuscule morphological differences. We applied a multistep procedure to resolve the taxonomy of this species complex. First, we sequenced a large number of newly collected specimens and as many holotypes as possible. Second, we used all sequences >400 bp to examine species boundaries. We employed three unsupervised methods (BIN, ABGD, GMYC), with specified criteria on how to handle discordant results, and examined diagnostic bases from each delineated putative species (operational taxonomic units, OTUs). Third, we evaluated the morphological characters of each OTU. Finally, we associated short barcodes from types with the delineated OTUs. In this step, we employed various supervised methods, including distance‐based, tree‐based and character‐based approaches. We recovered 658 bp barcode sequences from 194 of 215 fresh specimens and recovered an average of 141 bp from 33 of 42 holotypes. We observed strong congruence among all methods and good correspondence with morphology. We demonstrate potential pitfalls with tree‐, distance‐ and character‐based approaches when associating sequences of varied length. Our results suggest that sequences as short as 56 bp can often provide valuable taxonomic information. The results support significant taxonomic oversplitting of species in the Elachista dispunctella complex.
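
The distance-based association step can be illustrated with a minimal Python sketch. This is not the authors' pipeline: the function names, the `offset` argument (the start position of the fragment within the full barcode alignment), and the toy sequences are all hypothetical. The default `min_overlap` of 56 simply borrows the shortest informative length quoted in the abstract and is an assumption, not a threshold taken from the study. The sketch computes an uncorrected p-distance over only the sites where a short fragment overlaps a reference, which is one way to compare sequences of varied length.

```python
def p_distance(query, ref, offset):
    """Uncorrected p-distance over the overlap of a fragment and a reference.

    `offset` is the (assumed known) start position of `query` within `ref`.
    Sites with anything other than A, C, G or T in either sequence are skipped.
    Returns (distance, compared_sites), or (None, 0) if nothing is comparable.
    """
    compared = 0
    mismatches = 0
    for i, q in enumerate(query):
        j = offset + i
        if j < 0 or j >= len(ref):
            continue  # fragment extends beyond the reference
        r = ref[j]
        if q not in "ACGT" or r not in "ACGT":
            continue  # ambiguous or missing base
        compared += 1
        if q != r:
            mismatches += 1
    if compared == 0:
        return None, 0
    return mismatches / compared, compared


def assign_to_otu(query, offset, references, min_overlap=56):
    """Return (otu, distance, compared_sites) for the nearest reference.

    `references` maps a hypothetical OTU label to a list of aligned sequences.
    References sharing fewer than `min_overlap` comparable sites are ignored.
    Returns None if no reference qualifies.
    """
    best = None
    for otu, seqs in references.items():
        for ref in seqs:
            d, n = p_distance(query, ref, offset)
            if d is None or n < min_overlap:
                continue
            if best is None or d < best[1]:
                best = (otu, d, n)
    return best


# Toy data only: real barcodes are up to 658 bp.
toy_refs = {"OTU1": ["ACGTACGTAC"], "OTU2": ["ACGTTCGTAA"]}
match = assign_to_otu("TACG", 3, toy_refs, min_overlap=4)
```

Reporting the number of compared sites alongside the distance matters here: a distance of zero over a handful of bases is much weaker evidence than the same distance over a full-length barcode, which is one of the pitfalls of distance-based association that the abstract alludes to.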
Automatic Barcode Gap Discovery, species delineation, MITOCHONDRIAL-DNA, GMYC, CLASSIFICATION, DELIMITATION, GELECHIOIDEA, Animals, DNA Barcoding, Taxonomic, ELACHISTIDAE ELACHISTINAE, MORPHOLOGY REVEAL, IDENTIFICATION, RESOURCE ARTICLES, Computational Biology, DNA, Sequence Analysis, DNA, SIRCOM COMPLEX LEPIDOPTERA, REVISION, DELINEATION, Lepidoptera, Barcode Index Number, species delimitation, Haplotypes, Ecology, evolutionary biology, Elachista
| citations | 57 |
| popularity | Top 10% |
| influence | Top 10% |
| impulse | Top 10% |
