Powered by OpenAIRE graph
Found an issue? Give us feedback
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/ Biodiversity Informa...arrow_drop_down
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
Biodiversity Information Science and Standards
Article . 2023 . Peer-reviewed
License: CC BY
Data sources: Crossref
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
ZENODO
Article . 2023
License: CC BY
Data sources: ZENODO
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
Pensoft
Conference object . 2023
Data sources: Pensoft
versions View all 3 versions
addClaim

Planetary Knowledge Base: Semantic Transcription Using Graph Neural Networks

Authors: Qianqian Gu; Ben Scott; Vincent Smith;

Planetary Knowledge Base: Semantic Transcription Using Graph Neural Networks

Abstract

The Natural History Museum, London (NHM), in collaboration with Amazon Web Services (AWS), has embarked on a project to build the Planetary Knowledge Base (PKB), a comprehensive graph network comprising data on all specimens, collectors, and localities. In the initial prototype, we have concentrated on botanical specimens, using all plant taxa and specimens within the Global Biodiversity Information Facility (GBIF), combined with geographic data from GeoNames and biographic data from WikiData, Bionomia, Harvard Index of Botany, TL2, and Tropicos. Development of the PKB is a huge undertaking—our first proof of concept has more than 100 million nodes. The primary application of this knowledge graph (KG) is powering the automated transcription of specimen labels. Using Graph Convolutional Neural Networks, textual information from labels can be aligned to the entities in the graph, creating structured semantic data from the raw text. Text is extracted from images using services from the AWS ecosystem, including Optical Character Recognition and Natural Language Processing to identify the units of information, creating a high-throughput auto-digitisation workflow for extracting structured data. The PKB graph network enables new ways to interrogate collections. It can help identify species that may require re-examination or re-identification due to taxonomic updates or inconsistencies. It can also flag potential discrepancies or conflicts in the data, such as cases where the same species is recorded under different names or classifications across various sources. Moreover, the PKB can detect possible errors and outliers in the knowledge graph and point out specimens that could represent new species misidentified within the collection. By cross-validating species with the International Union for Conservation of Nature (IUCN) Red List, it can also assist in analysing species populations with insufficient data. The PKB is being developed as a cloud service, so researchers and other institutions can experiment with this transformative technology, using it to support their own digitisation efforts.

Related Organizations
Keywords

machine learning, knowledge graph,  machine learning, knowledge base, cloud service,  knowledge base

  • BIP!
    Impact byBIP!
    selected citations
    These citations are derived from selected sources.
    This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    0
    popularity
    This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
    Average
    influence
    This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    Average
    impulse
    This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
    Average
    OpenAIRE UsageCounts
    Usage byUsageCounts
    visibility views 8
    download downloads 7
  • 8
    views
    7
    downloads
    Powered byOpenAIRE UsageCounts
Powered by OpenAIRE graph
Found an issue? Give us feedback
visibility
download
selected citations
These citations are derived from selected sources.
This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Citations provided by BIP!
popularity
This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
BIP!Popularity provided by BIP!
influence
This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Influence provided by BIP!
impulse
This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
BIP!Impulse provided by BIP!
views
OpenAIRE UsageCountsViews provided by UsageCounts
downloads
OpenAIRE UsageCountsDownloads provided by UsageCounts
0
Average
Average
Average
8
7
gold
Related to Research communities
Italian National Biodiversity Future Center