Powered by OpenAIRE graph
Found an issue? Give us feedback
ZENODOarrow_drop_down
ZENODO
Dataset . 2025
License: CC BY
Data sources: Datacite
ZENODO
Dataset . 2025
License: CC BY
Data sources: Datacite
ZENODO
Dataset . 2025
License: CC BY
Data sources: Datacite
versions View all 3 versions
addClaim

A new, comprehensive database of all proceedings of the Australian Parliamentary Debates (1998-2025)

Authors: Katz, Lindsay; Alexander, Rohan;

A new, comprehensive database of all proceedings of the Australian Parliamentary Debates (1998-2025)

Abstract

This database contains data on the proceedings from each sitting day in the Australian Parliament by the House of Representatives from 02 March 1998 to 31 July 2025, in parquet form. These data were parsed entirely from the XML Hansard transcripts available on the Australian Parliament website. The database is stored in the folder corpus_1998_to_2025.parquet, which contains the full Hansard corpus. Since the last version released on 12 August 2025, we have made the following updates: Standardized and validated the electorate column. Standardized and validated the party abbreviation column. Added a column with the full party name. Improved question and answer flagging by identifying rows starting with text such as "My question is to", "My question goes to", etc. which were not flagged as questions, and corrected those. Fixed many instances of incorrectly flagged interjections Separated out interjections which had been detected that were not on their own row. Validated that all MPs who were present on each sitting day were actually Members of Parliament on that day. Identified and fixed any rows with a null body.

Related Organizations
  • BIP!
    Impact byBIP!
    selected citations
    These citations are derived from selected sources.
    This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    1
    popularity
    This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
    Average
    influence
    This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    Average
    impulse
    This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
    Average
    OpenAIRE UsageCounts
    Usage byUsageCounts
    visibility views 123
    download downloads 23
  • 123
    views
    23
    downloads
    Powered byOpenAIRE UsageCounts
Powered by OpenAIRE graph
Found an issue? Give us feedback
visibility
download
selected citations
These citations are derived from selected sources.
This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Citations provided by BIP!
popularity
This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
BIP!Popularity provided by BIP!
influence
This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Influence provided by BIP!
impulse
This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
BIP!Impulse provided by BIP!
views
OpenAIRE UsageCountsViews provided by UsageCounts
downloads
OpenAIRE UsageCountsDownloads provided by UsageCounts
1
Average
Average
Average
123
23
Related to Research communities