Cleaning Data with OpenRefine

Article English OPEN
Seth van Hooland; Ruben Verborgh; Max De Wilde
  • Publisher: Editorial Board of the Programming Historian
  • Journal: ISSN 2397-2068
  • Subject: OpenRefine | Data cleaning | data manipulation | History (General) | D1-2009 | Computer software | QA76.75-76.765

Duplicate records, empty values and inconsistent formats are phenomena we should be prepared to deal with when using historical data sets. This lesson will teach you how to discover inconsistencies in data contained within a spreadsheet or a database. As we increasingly...