<script type="text/javascript">
<!--
document.write('<div id="oa_widget"></div>');
document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=undefined&type=result"></script>');
-->
</script>
This document provides a detailed overview of the second deliverable for D1.2 "The OpenWebSearch Crawler and the Crawling Frontier", a deliverable of the OpenWebSearch.eu initiative, which is supported by the European Commission (EC) under the Horizon Europe Framework Programme grant agreement number GA 101070014. It outlines the achievements in developing and launching the Open Web Crawler (OWLer) and its related software components. These are reviewed in the context of their initial proposal stage as a Proof-of-Concept, whose main motivation is to investigate the feasibility of a distributed, heterogeneous and yet scalable crawling system. Our work sheds a light on how a future software system – backed up with more power in engineering and infrastructure – may look like in order to support the creation process of the Open Web Index with the continuous collection of relevant web documents.
The deliverable expresses the opinion of the authors and has not yet been approved by the EC.
Open Web Crawler, Centralized URL Management Tier, Distributed Crawling Tier
Open Web Crawler, Centralized URL Management Tier, Distributed Crawling Tier
citations This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |