
This dataset contains 4,357 Spanish-language mammography reports collected from several medical units in Paraguay. It aims to help address the shortage of public datasets for natural language processing applied to radiological reports. Each report is described by 15 variables. The full text of the report is included, along with each of its sections separately: clinical observations, diagnostic conclusions, and follow-up recommendations. Each report also carries the BI-RADS classification assigned to it. The remaining variables are metadata: a unique identifier, the year and month of the report, and patient information such as age, reason for the study, last menstrual period, type of hormonal therapy received, family history, and number of children. Because the data were not artificially generated, the dataset represents a real-world scenario. Researchers can use it to replicate published results in this area and to develop and test new models and algorithms, particularly for BI-RADS classification.
Mammography reports, BI-RADS classification, Clinical data, Natural Language Processing
