Focused web crawler for Indonesian recipes

Authors: Gusti Ahmad Fanshuri Alfarisy; Fitra A. Bachtiar

Abstract

Crawlers are commonly used to traverse and collect public web pages that are connected through links. General-purpose crawlers are poorly suited to collecting web pages on a particular topic, such as food recipes. This paper proposes a focused web crawler for Indonesian food recipes that uses a simple classifier based on an analysis of Indonesian recipes available on the internet, assigns priority levels to links based on their anchor text and URLs, and restricts traversal by depth. The focused crawler is tested on four different queries to collect 100 recipes each. The results show that the focused web crawler achieves a higher relevance (81.75%) than a general crawler that uses breadth-first traversal (16.00%). Furthermore, in the same amount of time, the focused web crawler collects more relevant web pages than the general crawler. Therefore, the proposed crawler can effectively collect recipes from the web based on a user query.
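
The three ingredients named in the abstract (a simple page classifier, link priorities derived from anchor text and URLs, and a depth limit) can be illustrated with a minimal sketch. This is not the authors' implementation: the keyword lists, the three priority levels, the `fetch` callback, and the in-memory `TOY_WEB` are all assumptions chosen so the example runs without network access.

```python
import heapq
from urllib.parse import urlparse

# Assumed topic keywords ("resep" is Indonesian for "recipe"); not taken from the paper.
TOPIC_KEYWORDS = ("resep", "masakan")


def link_priority(anchor_text, url, query_terms):
    """Return an assumed priority level for a link; lower values are visited first."""
    anchor = anchor_text.lower()
    path = urlparse(url).path.lower()
    terms = tuple(t.lower() for t in query_terms) + TOPIC_KEYWORDS
    in_anchor = any(t in anchor for t in terms)
    in_url = any(t in path for t in terms)
    if in_anchor and in_url:
        return 0
    if in_anchor or in_url:
        return 1
    return 2


def looks_like_recipe(page_text):
    """Toy classifier: assumes a recipe page mentions ingredients and a method."""
    text = page_text.lower()
    return "bahan" in text and "cara membuat" in text


def focused_crawl(seed, fetch, query_terms, max_depth=2, max_pages=100):
    """Best-first crawl from `seed`; `fetch(url)` returns (text, links) or None."""
    frontier = [(0, 0, 0, seed)]  # (priority, insertion order, depth, url)
    order = 1
    seen = {seed}
    relevant = []
    visited = 0
    while frontier and visited < max_pages:
        _, _, depth, url = heapq.heappop(frontier)
        page = fetch(url)
        if page is None:
            continue
        visited += 1
        text, links = page
        if looks_like_recipe(text):
            relevant.append(url)
        if depth >= max_depth:
            continue  # depth restriction: do not expand links from this page
        for anchor, target in links:
            if target in seen:
                continue
            seen.add(target)
            priority = link_priority(anchor, target, query_terms)
            heapq.heappush(frontier, (priority, order, depth + 1, target))
            order += 1
    return relevant, visited


# Hypothetical in-memory "web": url -> (page text, [(anchor text, target url)]).
TOY_WEB = {
    "http://example.com/": ("Beranda", [
        ("Tentang kami", "http://example.com/about"),
        ("Resep rendang", "http://example.com/resep/rendang"),
        ("Berita", "http://example.com/news"),
    ]),
    "http://example.com/resep/rendang": (
        "Resep rendang. Bahan: daging sapi. Cara membuat: masak perlahan.", []),
    "http://example.com/about": ("Tentang kami", [("Kontak", "http://example.com/contact")]),
    "http://example.com/news": ("Berita hari ini", []),
    "http://example.com/contact": ("Kontak", []),
}

relevant, visited = focused_crawl(
    "http://example.com/", TOY_WEB.get, ["rendang"], max_depth=1, max_pages=2)
```

With a budget of two pages, the sketch fetches the seed and then the recipe link, because that link matches the query in both its anchor text and its URL. A breadth-first crawler with the same budget would fetch the "about" page instead, since it appears first on the seed page. Relevance here would be the share of fetched pages judged to be recipes, which is one plausible reading of the percentages reported in the abstract.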

Impact (by BIP!)
  • citations: 4
  • popularity: Average
  • influence: Average
  • impulse: Average