
In this paper, we aim to disentangle the dataset lens from the model architecture lens. We gather three multimodal datasets of the Japanese prints and paintings collections from the Tokyo National Museum, the Victoria and Albert Museum, and the Rijksmuseum. These datasets – similar but created with different implicit cultural frameworks – are used to highlight the ways in which inherent cultural biases are exaggerated and warped by different model architectures. We explore what can or cannot be projected in the model’s internal representation of the data (latent space), highlighting what is amplified, and what is underrepresented. We unpick the internal representations of different models to highlight how the relationships between data change depending on choices made during training and visualisation and how this consequently shapes our understanding of the data.
Bias, Cultural heritage, Computer vision, Digital humanities
Bias, Cultural heritage, Computer vision, Digital humanities
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
