
doi: 10.2139/ssrn.3852688
A disaster is an unforeseen event, which can have a tremendous impact on human life as well as on the environment. The Internet provides a lot of sources that generate huge amounts of news articles daily. With the increase in the number of online news articles, it has become difficult for users to access disaster relevant news, which makes it a necessity to extract and classify news so that they could be easily accessed. This paper presents an automated system that scraps news from various online sources and identifies disaster relevant news. The news articles are scraped with the help of a scrapy framework and a model is trained using Machine Learning algorithms to classify news as disaster and non-disaster. The system also uses a geoparsing model to identify the focus location from the extracted news articles. The geoparsing model is built using Named Entity Recognition (NER).
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 4 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
