Comparing classification methods for link context based focused crawlers

descriptionPublicationkeyboard_double_arrow_right Article , Conference object 01 Nov 2013 Turkey Publisher:IEEEJournal:2013 International Conference on Electronics, Computer and Computation (ICECCO)

Authors: Caliskan, Kamil; Ozcan, Rifat;

doi: 10.1109/icecco.2013.6718249

handle: 20.500.12899/4121 , 20.500.12899/3135

Comparing classification methods for link context based focused crawlers

- Summary
- Subjects
- Metrics

Abstract

Focused crawlers aim to fetch pages only related to a specific subject area from millions of web pages on the Internet. The essential task in a focused crawler is to predict whether a page is related to the target subject area or not without actually fetching the page content itself. Link context based focused crawlers focus on the surrounding text around each link to classify the page pointed by the URL. In this paper, we aim to compare three different classification methods (naive bayes, decision tree, and support vector machines) for the task of link context based focused crawling.

Country

Turkey

Related Organizations

Malatya Turgut Özal Üniversitesi
Turkey
Turgut Özal University
Turkey

Keywords

focused crawling; classification; link context, classification, link context, focused crawling

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	2
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

2

Average

Fields of Science

engineering and technology

electrical engineering, electronic engineering, information engineering

Fields of Science

engineering and technology

electrical engineering, electronic engineering, information engineering

Upload OA version

Are you the author of this publication? Upload your Open Access version to Zenodo!

It’s fast and easy, just two clicks!

uploadUpload now