
handle: 10722/47074
Vertical search engines provide Web users with an alternative way to search for information on the Web by providing customized searching in particular domains. However, two issues need to be addressed when developing these search engines: how to locate relevant documents on the Web and how to filter out irrelevant documents from a set of documents collected from the Web. This paper reports the research in addressing the second issue. In this research a machine learning-based approach that combines Web content analysis and Web structure analysis is proposed.
Web page filtering, Support vector machines, Vertical search engines, Machine learning, Web analysis, Information retrieval, Web page classification, Neural networks
Web page filtering, Support vector machines, Vertical search engines, Machine learning, Web analysis, Information retrieval, Web page classification, Neural networks
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 3 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
