This database contains data on the proceedings of each sitting day of the House of Representatives in the Australian Parliament from 02 March 1998 to 31 July 2025, in Parquet format. These data were parsed entirely from the XML Hansard transcripts available on the Australian Parliament website. The database is stored in the folder corpus_1998_to_2025.parquet, which contains the full Hansard corpus.

Since the last version, released on 12 August 2025, we have made the following updates:

- Standardized and validated the electorate column.
- Standardized and validated the party abbreviation column.
- Added a column with the full party name.
- Improved question and answer flagging by identifying rows that start with text such as "My question is to", "My question goes to", etc. but were not flagged as questions, and corrected those flags.
- Fixed many instances of incorrectly flagged interjections.
- Separated detected interjections that were not on their own row into rows of their own.
- Validated that all MPs recorded as present on each sitting day were actually Members of Parliament on that day.
- Identified and fixed any rows with a null body.
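The question-flagging update described above can be illustrated with a minimal sketch. It is not the authors' code: the field names `body` and `question`, the helper names, and the regular expression are assumptions made for illustration, and the pattern covers only the two opening phrases quoted in the description (the "etc." in the original suggests the real list is longer).

```python
import re

# Hypothetical pattern: covers only the two openers quoted in the description.
QUESTION_OPENER = re.compile(r"^\s*my question (is|goes) to\b", re.IGNORECASE)


def looks_like_question(body):
    """Return True when a row's text opens with a known question phrase."""
    if body is None:
        return False
    return QUESTION_OPENER.match(body) is not None


def flag_questions(rows):
    """Return copies of the rows with "question" set where the opener matches.

    Rows already flagged as questions stay flagged; the input is not modified.
    The keys "body" and "question" are assumed names, not the real schema.
    """
    return [
        {
            **row,
            "question": bool(row.get("question"))
            or looks_like_question(row.get("body")),
        }
        for row in rows
    ]
```

Applied to a list of row dictionaries, `flag_questions` only ever adds flags, so it can be run repeatedly without undoing earlier corrections.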
| Indicator | Description | Value |
| --- | --- | --- |
| selected citations | These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 1 |
| popularity | This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average |
| influence | This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average |
| impulse | This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
| views | | 123 |
| downloads | | 23 |

Views provided by UsageCounts
Downloads provided by UsageCounts