Powered by OpenAIRE graph
Found an issue? Give us feedback
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/ ZENODOarrow_drop_down
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
ZENODO
Presentation . 2020
License: CC BY
Data sources: Datacite
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
ZENODO
Other literature type . 2020
License: CC BY
Data sources: ZENODO
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
ZENODO
Presentation . 2020
License: CC BY
Data sources: Datacite
versions View all 2 versions
addClaim

The Plazi Workflow to liberate PDF-imprisoned biodiversity data

Authors: Guidoti, Marcus; Agosti, Donat; Miller, Jeremy A.; Sautter, Guido;

The Plazi Workflow to liberate PDF-imprisoned biodiversity data

Abstract

Scholarly knowledge about biodiversity is published in a rapidly increasing corpus of scientific publications. A substantial portion of taxonomic literature is available as unstructured portable document format (PDF) electronic documents. But the data contained, including occurrence records and characteristics, are locked up within these PDFs. Semantic enhancement, applied either retrospectively to legacy literature or prospectively through innovative cybertaxonomic publishing models, allows for the electronic mobilization of these data. Of course, a large portion of legacy taxonomic literature is not even digitized in the sense of having the pages scanned and converted into a PDF electronic document, much less having the underlying data mobilized. Even when we publish open access, the data remain closed. The Plazi semantic enhancement workflow facilitates the conversion of publications into findable, open accessible, interoperable, reusable (FAIR) data. So far, over 32K publications have been converted, resulting in 350K figures, 375K taxonomic treatments, 406K materials cited accessible through the Biodiversity Literature Repository. The data is directly reused by the Global Biodiversity Information Facility (GBIF) where it is the sole contributor of over 45,000 species, mainly insects, for which no other digital data is available. The system permits the building of applications focused on images (e.g. Ocellus) or taxonomic names (e.g. Synospecies). This workflow described in this presentation can be applied manually to selected legacy publications, or employ automation to process related articles in batches. Further changes in taxonomic publishing practices could lead to having electronic data elements routinely linked and mobilized.

Keywords

taxonomy, taxonomic treatment, TreatmentBank, FAIR, data liberation

  • BIP!
    Impact byBIP!
    selected citations
    These citations are derived from selected sources.
    This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    0
    popularity
    This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
    Average
    influence
    This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    Average
    impulse
    This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
    Average
    OpenAIRE UsageCounts
    Usage byUsageCounts
    visibility views 7
    download downloads 3
  • 7
    views
    3
    downloads
    Powered byOpenAIRE UsageCounts
Powered by OpenAIRE graph
Found an issue? Give us feedback
visibility
download
selected citations
These citations are derived from selected sources.
This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Citations provided by BIP!
popularity
This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
BIP!Popularity provided by BIP!
influence
This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Influence provided by BIP!
impulse
This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
BIP!Impulse provided by BIP!
views
OpenAIRE UsageCountsViews provided by UsageCounts
downloads
OpenAIRE UsageCountsDownloads provided by UsageCounts
0
Average
Average
Average
7
3
Green
Related to Research communities
Italian National Biodiversity Future Center