<script type="text/javascript">
<!--
document.write('<div id="oa_widget"></div>');
document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=undefined&type=result"></script>');
-->
</script>
The Biodiversity Heritage Library (BHL) is a consortium of natural history and botanical libraries that cooperate to digitize taxonomic literature and to make that literature available to a global audience for open access and responsible use as a part of a global "biodiversity commons". In partnership with the Internet Archive and through local digitization efforts, BHL has digitized more than 200,000 volumes of taxonomic literature. Using the Global Names Recognition and Discovery (GNRD) service, BHL has identified over 177 million instances of species names (including more than 29 million unique names) within the text. This content, which includes over 52 million pages of text, provides a rich unstructured source of biodiversity big data that is associated with taxonomic and bibliographic metadata. BHL allows users to search the collection, read the texts online, and download select pages or entire volumes as PDF files. More importantly, BHL makes the source data available for reuse and Big Data analysis via a number of different services. These services include direct downloads of data files and machine interfaces. This talk will describe the downloads and machine interfaces through which BHL source data is available. Each service will be introduced and described. Where feasible, the usage of each service will be demonstrated. Theoretical examples of how each service could be used to facilitate Big Data analysis will be provided.
OCR, Metadata, BHL, Big-Data
OCR, Metadata, BHL, Big-Data
citations This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |