Downloads provided by UsageCounts
The international DNA sequence databases abound in fungal sequences not annotated beyond the kingdom level, typically bearing names such as “uncultured fungus”. These sequences beget low-resolution mycological results and invite further deposition of similarly poorly annotated entries. What do these sequences represent? This study uses a 767,918-sequence corpus of public full-length fungal ITS sequences to estimate what proportion of the 95,055 “uncultured fungus” sequences that represent truly unidentifiable fungal taxa – and what proportion of them that would have been straightforward to annotate to some more meaningful taxonomic level at the time of sequence deposition. Our results suggest that more than 70% of these sequences would have been trivial to identify to at least the order/family level at the time of sequence deposition, hinting that factors other than poor availability of relevant reference sequences explain the low-resolution names. We speculate that researchers’ perceived lack of time and lack of insight into the ramifications of this problem are the main explanations for the low-resolution names. We were surprised to find that more than a fifth of these sequences seem to have been deposited by mycologists rather than researchers unfamiliar with the consequences of poorly annotated fungal sequences in molecular repositories. The proportion of these needlessly poorly annotated sequences does not decline over time, suggesting that this problem must not be left unchecked.
570, Biologisk systematik, taxonomic, species identification, Botany, taxonomic annotation, Biological Systematics, data mining, annotation, QK1-989, Data interoperability, DNA barcoding, Data mining, scientific practice, Research Article, ddc: ddc:
570, Biologisk systematik, taxonomic, species identification, Botany, taxonomic annotation, Biological Systematics, data mining, annotation, QK1-989, Data interoperability, DNA barcoding, Data mining, scientific practice, Research Article, ddc: ddc:
| citations This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 19 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
| views | 106 | |
| downloads | 34 |

Views provided by UsageCounts
Downloads provided by UsageCounts