Downloads provided by UsageCounts
This presentation reports on a case study investigating how Natural Language Processing technologies can support the measurement and evaluation of gender biased language in archival catalogs. Working with English descriptions from the catalog metadata of the University of Edinburgh’s Archives, we created an annotated dataset and classification models that identify types of gender biases in the descriptions. Though conducted with archival data, the case study holds relevance across Galleries, Libraries, Archives, and Museums (GLAM), particularly for institutions with catalog descriptions written in English. In addition to bringing Natural Language Processing methods to Archives, we identified opportunities to bring Archival Science methods, such as Cultural Humility (Tai, 2021) and Feminist Standpoint Appraisal (Caswell, 2022), to Natural Language Processing. Through this two-way disciplinary exchange, we demonstrate how Humanistic approaches to bias and uncertainty can upend legacies of gender-based oppression that most computational approaches to date uphold when working with data at scale.
text classification, machine learning, data, annotation, social bias, gender, GLAM, natural language processing, digital humanities, archives
text classification, machine learning, data, annotation, social bias, gender, GLAM, natural language processing, digital humanities, archives
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
| views | 44 | |
| downloads | 37 |

Views provided by UsageCounts
Downloads provided by UsageCounts