
Research information, i.e., data about research projects, organisations, researchers or research outputs such as publications or patents, is spread across the web, usually residing on institutional and personal web pages or in semi-open databases and information systems. While a wealth of unstructured information exists, structured data is limited and often exposed through proprietary or less-established schemas and interfaces. Therefore, a holistic and consistent view of research information across organisational and national boundaries is not feasible. On the other hand, web crawling and information extraction techniques have matured over the last decade, allowing for automated approaches to harvesting, extracting and consolidating research information into a more coherent knowledge graph. In this work, we give an overview of the current state of the art in research information sharing on the web and present initial ideas towards a more holistic approach for bootstrapping research information from available web sources.
Information extraction, Linked datum, Information extraction techniques, Automated approach, Knowledge graphs, Information retrieval, Information systems, ddc:020, information extraction, web crawling, Data mining, Konferenzschrift, Holistic approach, Dewey Decimal Classification::000 | Allgemeines, Wissenschaft::000 | Informatik, Wissen, Systeme::000 | Informatik, Informationswissenschaft, allgemeine Werke, Information sharing, Information analysis, Information retrieval systems, Web Crawling, Linked data, Web crawling, Social networking (online), Dewey Decimal Classification::000 | Allgemeines, Wissenschaft::020 | Bibliotheks- und Informationswissenschaft, linked data, Websites, Research information, 004, Research outputs, World Wide Web, Sounding apparatus
