software . 2013

CTRimages

Hakimov, Kurban; Hartwell, Andrew; Ulmet, Robert
Open Access English
  • Published: 18 May 2013
  • Country: United States
Abstract
  • parse_images.py – a Python script that finds all URLs inside HTML image tags and writes two text documents, one listing the URLs and one listing the ALT tags.
  • bannedUrls.txt – a list of URLs from which no images will be downloaded.
  • ctrfilter – a bash file that runs the script on all .html and .htm files in the current directory and its subdirectories.
  • filter_images.py – a Python script that filters URLs for download based on banned URLs, image dimensions, ALT tags, and file types. It also downloads the images into a specified folder.
  • CS4624_Documentation.docx – documentation for the project.
  • ImageProperties.xlsx – Excel spreadsheet that has information on all images on th...
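The parsing and filtering steps described above can be illustrated with a minimal sketch. It is not the deposited code: the names ImageTagCollector, extract_images, and is_allowed are hypothetical, the ban list is assumed to contain host names (the actual format of bannedUrls.txt is not stated in this record), and the dimension and ALT-tag checks are omitted.

```python
from html.parser import HTMLParser
from urllib.parse import urlparse


class ImageTagCollector(HTMLParser):
    """Collect (src, alt) pairs from every <img> tag (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.images = []

    def handle_starttag(self, tag, attrs):
        # html.parser lowercases tag and attribute names before this call.
        if tag != "img":
            return
        attr_map = dict(attrs)
        src = attr_map.get("src")
        if src:  # skip image tags that carry no URL
            self.images.append((src, attr_map.get("alt") or ""))


def extract_images(html_text):
    """Return a list of (url, alt_text) tuples found in the HTML text."""
    collector = ImageTagCollector()
    collector.feed(html_text)
    collector.close()
    return collector.images


def is_allowed(url, banned_hosts,
               allowed_extensions=(".jpg", ".jpeg", ".png", ".gif")):
    """Reject URLs on a banned host or with an unexpected file type.

    Assumption: banned entries are host names; subdomains are banned too.
    """
    parsed = urlparse(url)
    host = (parsed.hostname or "").lower()
    if any(host == b or host.endswith("." + b) for b in banned_hosts):
        return False
    return parsed.path.lower().endswith(allowed_extensions)
```

Under these assumptions, a caller would run extract_images on each .html or .htm file, write the URLs and ALT texts to separate files, and keep only the URLs for which is_allowed returns True before downloading.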
Subjects
free text keywords: python script, image parsing, image filtering, CTR, Drupal gallery, Crisis, Tragedy, and Recovery Network Project
Download from
VTechWorks
Provider: VTechWorks