
Dataset for the paper "Sameness entices, but novelty enchants in fanfiction online", a dataset of fanfiction collected from the platform Archive Of Our Own (AO3). Please refer to the paper for dataset details.

As authors on AO3 retain all rights to their works, we choose not to redistribute the fanfiction texts outside of AO3. Instead, for each fandom we analyze, we share the URLs of the works we collected in that fandom. Users can refer to this snippet for downloading content from AO3 using the URLs. The dataset contains the following fields:

- 'URL'

We also share a derived dataset of fanfiction metadata, along with the LDA topic distribution obtained by fitting an LDA model to the fanfiction, and the Jensen-Shannon divergence value between each work and the center (see paper for details). These fields can be used to reproduce the results in the paper. The dataset contains the following fields:

- 'AdditionalTags'
- 'ArchiveWarnings'
- 'Author'
- 'Category'
- 'Chapters'
- 'Characters'
- 'Fandoms'
- 'Kudos'
- 'Language'
- 'Rating'
- 'Relationship'
- 'Title'
- 'Words'
- 'PublishDate'
- 'UpdateDate'
- 'CompleteDate'
- 'Comments'
- 'Hits'
- 'Bookmarks'
- 'URL'
- 'Dist': LDA topic distribution
- 'JSD': Jensen-Shannon divergence value

Additional code for analysis can be found in this GitHub repo. Please direct questions about the dataset to the corresponding author of the paper.
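For readers unfamiliar with the 'JSD' field, the following is a minimal sketch of how a Jensen-Shannon divergence between a topic distribution and a center could be computed. It is not the authors' code. The toy distributions, the function names, the use of the element-wise mean as the "center", and the base-2 logarithm are all assumptions made for illustration. The paper defines the actual center and conventions, so values computed this way may not match the 'JSD' column.

```python
import math

def kl_divergence(p, q):
    """Kullback-Leibler divergence KL(p || q) in bits.

    Terms where p_i == 0 contribute 0 by convention.
    """
    return sum(pi * math.log2(pi / qi) for pi, qi in zip(p, q) if pi > 0)

def jensen_shannon_divergence(p, q):
    """Jensen-Shannon divergence between two discrete distributions.

    With base-2 logarithms the result lies in [0, 1].
    """
    # Mixture distribution; m_i > 0 wherever p_i > 0 or q_i > 0,
    # so the KL terms below never divide by zero.
    m = [(pi + qi) / 2 for pi, qi in zip(p, q)]
    return 0.5 * kl_divergence(p, m) + 0.5 * kl_divergence(q, m)

# Hypothetical topic distributions for three works (stand-ins for 'Dist').
dists = [
    [0.7, 0.2, 0.1],
    [0.1, 0.8, 0.1],
    [0.3, 0.3, 0.4],
]

# Assumption: the "center" is the element-wise mean of the distributions.
center = [sum(col) / len(dists) for col in zip(*dists)]

# One divergence value per work (a stand-in for 'JSD').
jsd_values = [jensen_shannon_divergence(d, center) for d in dists]
print(jsd_values)
```

Under these assumptions, a work whose topic mix is close to the center gets a value near 0, and a work with an unusual topic mix gets a larger value.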
fanfiction, computational narratology, topic modeling, text mining
