Powered by OpenAIRE graph
Found an issue? Give us feedback
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/ ZENODOarrow_drop_down
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
ZENODO
Dataset . 2025
License: CC BY
Data sources: ZENODO
ZENODO
Dataset . 2025
License: CC BY
Data sources: Datacite
ZENODO
Dataset . 2025
License: CC BY
Data sources: Datacite
versions View all 2 versions
addClaim

Dataset of Spanish Mammographic Reports with BI-RADS Classification

Authors: Vázquez Noguera, José Luis; Gómez Adorno, Helena; Torres Hurtado, Alejandro; Mello Román, Julio César; Fleitas Alvarez, Enrique Javier; Espinola Schulze, Federico Fernando; Garcia Torres, Miguel; +4 Authors

Dataset of Spanish Mammographic Reports with BI-RADS Classification

Abstract

This dataset contains a total of 4,357 reports of mammographic studies in Spanish, obtained through several medical units in Paraguay. This dataset aims to help with the shortage of public datasets within the area of natural language processing applied to radiological reports. This dataset contains key information from the mammographic reports through the 15 variables that make up our dataset, the full text of the reports is included, but each of the sections of the report is also included separately, these sections are clinical observations, diagnostic conclusions and follow-up recommendations, in addition to the BI-RADS classification that has been assigned to each report, finally there are metadata related to the reports such as a unique identifier, year, month and patient information such as age, patient reasons for the analysis, last menstruation period, type of hormonal therapy received, family history and number of children This dataset, containing data not generated artificially, represents a real-world scenario, which can be used by researchers to replicate results from articles within the area, as well as to develop and test new models and algorithms specifically for the classification of the BI-RADS system.

Keywords

Mammography reports, BI-RADS classification, Clinical data, Natural Language Processing

  • BIP!
    Impact byBIP!
    selected citations
    These citations are derived from selected sources.
    This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    0
    popularity
    This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
    Average
    influence
    This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    Average
    impulse
    This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
    Average
Powered by OpenAIRE graph
Found an issue? Give us feedback
selected citations
These citations are derived from selected sources.
This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Citations provided by BIP!
popularity
This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
BIP!Popularity provided by BIP!
influence
This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Influence provided by BIP!
impulse
This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
BIP!Impulse provided by BIP!
0
Average
Average
Average
Related to Research communities