Downloads provided by UsageCounts
Combining human expertise with information from book-consumer digital data may provide what it takes to face the coming changes in such a critical market. Like the publishing industry, researchers rely on book-related data to develop tools and applications and to draw conclusions that support better-informed, faster decisions. Such solutions range from best-seller prediction models to natural language processing for classifying raw text. Besides requiring complex Artificial Intelligence (AI) methods, all of them are essentially data-dependent, and mainly dependent on book-related data. Data, and more specifically data growth, is essential for developing and running such AI-powered applications. None of these efforts can be achieved without a preliminary collection of data on literary works, readers, and their reading habits. Therefore, it is critically important to build and make available datasets that fully cover the essential elements of the book industry ecosystem. Although some efforts have been made for English-language books, little has been done for other, less widely spoken languages, such as Portuguese. Evaluating language-specific data is fundamental for literary analysis, as Portuguese has its own literary peculiarities. Hence, we present PPORTAL, a Public domain PORTuguese-lAnguage Literature dataset. PPORTAL's contributions are summarized as follows: (i) data integration of numerous public domain works from three digital libraries; (ii) enriched metadata for works, authors, and online reviews extracted from Goodreads; (iii) feature engineering on the metadata to create meaningful additional features; and (iv) unrestricted access in two formats (SQL database and compressed .csv files).
Portuguese-language literature, books, public domain, Goodreads, digital libraries
| Indicator | Description | Value |
| --- | --- | --- |
| citations | An alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 1 |
| popularity | Reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average |
| influence | Reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average |
| impulse | Reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
| views | | 13 |
| downloads | | 5 |

Views provided by UsageCounts