wikiextractor software on GitHub

Software OPEN SOURCE
  • Subject (ACM classification):
    Computing Methodologies: Document and Text Processing | Computer Applications: Computers in Other Systems | Information Systems: Information Storage and Retrieval

Converts Wikipedia dumps into XML ready for indexing by Solr. Author: Walid Shalaby
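
To illustrate the kind of conversion described above, here is a minimal sketch, not the tool's actual code. It assumes the standard MediaWiki export layout (page, title, id, revision text) and emits Solr's XML update format (add/doc/field). The function name dump_to_solr_xml and the field names id, title and text are hypothetical and would need to match the target Solr schema.

```python
import io
import xml.etree.ElementTree as ET


def local_name(tag):
    """Strip the '{namespace}' prefix that MediaWiki export files put on tags."""
    return tag.rsplit("}", 1)[-1]


def dump_to_solr_xml(dump_stream):
    """Read a MediaWiki XML dump and return a Solr <add> update document.

    The field names (id, title, text) are assumptions for illustration;
    they must match whatever the target Solr schema defines.
    """
    add = ET.Element("add")
    for _event, elem in ET.iterparse(dump_stream, events=("end",)):
        if local_name(elem.tag) != "page":
            continue
        values = {"id": "", "title": "", "text": ""}
        for child in elem.iter():
            name = local_name(child.tag)
            # The first <id> under <page> is the page id; later ones belong
            # to the revision and contributor, so keep only the first match.
            if name in values and not values[name]:
                values[name] = child.text or ""
        doc = ET.SubElement(add, "doc")
        for name, value in values.items():
            field = ET.SubElement(doc, "field", name=name)
            field.text = value
        elem.clear()  # release the page's children; real dumps are very large
    return ET.tostring(add, encoding="unicode")


# Usage with a tiny in-memory dump (a real dump would be opened as a file).
sample = (
    b'<mediawiki xmlns="http://www.mediawiki.org/xml/export-0.10/">'
    b"<page><title>Solr</title><ns>0</ns><id>42</id>"
    b"<revision><id>7</id><text>Search &amp; index</text></revision></page>"
    b"</mediawiki>"
)
print(dump_to_solr_xml(io.BytesIO(sample)))
```

The sketch streams the dump with iterparse rather than loading it whole, since full Wikipedia dumps do not fit comfortably in memory. It leaves the wiki markup in the text field untouched; stripping or parsing that markup is a separate step.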