Automated Disaster News Collection Classification and Geoparsing

Chelsea Fernandes; Joshua Fernandes; Sharon Mathew; Shubham Raorane; Anuradha Srinivasaraghavan

Found an issue? Give us feedback

SSRN Electronic Jour...arrow_drop_down

SSRN Electronic Journal

Article . 2021 . Peer-reviewed

Data sources: Crossref

https://dx.doi.org/10.2139/ssr...

Other literature type

Data sources: Microsoft Academic Graph

Automated Disaster News Collection Classification and Geoparsing

descriptionPublicationkeyboard_double_arrow_right Article , Other literature type 01 Jan 2021 English Publisher:Elsevier BVJournal:SSRN Electronic Journal (eissn: 1556-5068,

Copyright policy )

Authors: Chelsea Fernandes; Joshua Fernandes; Sharon Mathew; Shubham Raorane; Anuradha Srinivasaraghavan;

doi: 10.2139/ssrn.3852688

Automated Disaster News Collection Classification and Geoparsing

- Summary
- Metrics

Abstract

A disaster is an unforeseen event, which can have a tremendous impact on human life as well as on the environment. The Internet provides a lot of sources that generate huge amounts of news articles daily. With the increase in the number of online news articles, it has become difficult for users to access disaster relevant news, which makes it a necessity to extract and classify news so that they could be easily accessed. This paper presents an automated system that scraps news from various online sources and identifies disaster relevant news. The news articles are scraped with the help of a scrapy framework and a model is trained using Machine Learning algorithms to classify news as disaster and non-disaster. The system also uses a geoparsing model to identify the focus location from the extracted news articles. The geoparsing model is built using Named Entity Recognition (NER).

Related Organizations

St. Francis Institute of Technology
India
University of Mumbai
India
St. Francis Institute of Technology
India

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	4
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 10%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Top 10%