
This repository documents how to build a language corpus from the Farms to Freeways history project data.

The data is published on its own domain as an Omeka Classic site. This Omeka repository is considered the published version of the collection. The data is also archived at Western Sydney University. The archived copy does not appear to have a persistent ID, and its web page is "orphaned" in that it does not link to the data repository (which appears to be an instance of ReDBox, maintained by QCIF).

The transcripts in the Omeka repository are in PDF format, and speaker turns are indicated only by bold-face text. Some plain-text versions are available, but they do not indicate speaker turns.

This repository contains scripts to:

- Download the published version of Farms to Freeways as an RO-Crate
- Derive CSV-formatted transcripts from the PDF versions, which have been formatted to indicate which speaker is speaking in each turn (the interviewer is in bold text). These transcripts do not include speaker IDs, but they can be used to distinguish interviewer from interviewee.

If you downloaded this dataset from Zenodo, the data is already included in it.
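The bold-text convention described above can be illustrated with a minimal sketch. It is not the repository's actual script: it assumes the PDF text has already been extracted into `(text, is_bold)` spans by some PDF library, and the names `spans_to_turns` and `turns_to_csv`, the `speaker`/`text` column names, and the sample spans are all hypothetical.

```python
import csv
import io
from typing import Iterable, List, Tuple

# Assumption: each span is (text, is_bold), produced by a separate PDF extraction step.
Span = Tuple[str, bool]
Turn = Tuple[str, str]


def spans_to_turns(spans: Iterable[Span]) -> List[Turn]:
    """Merge consecutive spans of the same weight into one speaker turn."""
    turns: List[Turn] = []
    for text, is_bold in spans:
        text = " ".join(text.split())  # collapse whitespace left over from extraction
        if not text:
            continue  # skip empty spans so they do not split a turn
        # Bold text is the interviewer; everything else is the interviewee.
        speaker = "interviewer" if is_bold else "interviewee"
        if turns and turns[-1][0] == speaker:
            turns[-1] = (speaker, turns[-1][1] + " " + text)
        else:
            turns.append((speaker, text))
    return turns


def turns_to_csv(turns: Iterable[Turn]) -> str:
    """Serialise turns as CSV with a header row (column names are hypothetical)."""
    buf = io.StringIO()
    writer = csv.writer(buf, lineterminator="\n")
    writer.writerow(["speaker", "text"])
    writer.writerows(turns)
    return buf.getvalue()


if __name__ == "__main__":
    # Made-up sample spans, for illustration only.
    sample = [
        ("Where did you", True),
        ("grow up?", True),
        ("On a farm.", False),
    ]
    print(turns_to_csv(spans_to_turns(sample)))
```

Because only font weight is available, the output labels each turn by role (interviewer or interviewee) rather than by speaker identity, which matches the limitation noted above.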
