wikiextractor software on GitHub

Software OPEN SOURCE
  • Subject:
    acm: Software_OPERATINGSYSTEMS | Software_SOFTWAREENGINEERING | Data_GENERAL | ComputingMethodologies_DOCUMENTANDTEXTPROCESSING | ComputerApplications_COMPUTERSINOTHERSYSTEMS

Converts Wikipedia dumps into XML ready for indexing by Solr. By Walid Shalaby.