publication . Article . 2012

Introduction to Beautiful Soup

Jeri Wieringa;
Open Access English
  • Published: 01 Dec 2012
  • Publisher: Editorial Board of the Programming Historian
Abstract
Beautiful Soup is a Python library for getting data out of HTML, XML, and other markup languages. Say you’ve found some webpages that display data relevant to your research, such as date or address information, but that do not provide any way of downloading the data directly. Beautiful Soup helps you pull particular content from a webpage, remove the HTML markup, and save the information. It is a tool for web scraping that helps you clean up and parse the documents you have pulled down from the web.
Subjects
ACM Computing Classification System: ComputingMethodologies_DOCUMENTANDTEXTPROCESSING
free text keywords: Python, data manipulation, XML, HTML, Beautiful Soup, History (General), D1-2009, Computer software, QA76.75-76.765
Download from
Powered by OpenAIRE Research Graph
Any information missing or wrong?Report an Issue