
Abstract The Lattes platform is the major scientific information system maintained by the National Council for Scientific and Technological Development (CNPq). This platform allows to manage the curricular information of researchers and institutions working in Brazil based on the so called Lattes Curriculum. However, the public information is individually available for each researcher, not providing the automatic creation of reports of several scientific productions for research groups. It is thus difficult to extract and to summarize useful knowledge for medium to large size groups of researchers. This paper describes the design, implementation and experiences with scriptLattes: an open-source system to create academic reports of groups based on curricula of the Lattes Database. The scriptLattes system is composed by the following modules: (a) data selection, (b) data preprocessing, (c) redundancy treatment, (d) collaboration graph generation among group members, (e) research map generation based on geographical information, and (f) automatic report creation of bibliographical, technical and artistic production, and academic supervisions. The system has been extensively tested for a large variety of research groups of Brazilian institutions, and the generated reports have shown an alternative to easily extract knowledge from data in the context of Lattes platform. The source code, usage instructions and examples are available at http://scriptlattes.sourceforge.net/.
FOS: Computer and information sciences, Data Quality Assessment and Improvement, Artificial intelligence, FOS: Political science, MEDLINE, knowledge discovery, Data extraction, Variety (cybernetics), Social Sciences, FOS: Law, Management Science and Operations Research, Knowledge Representation, Decision Sciences, Data science, Context (archaeology), Artificial Intelligence, Data Mining Techniques and Applications, Political science, Preprocessor, Geography, Open source, academic production report, Computer science, Programming language, World Wide Web, Archaeology, Computer Science, Physical Sciences, Lattes platform, Curriculum, Law, Semantic Web and Ontology Development, Software, Computer Science(all), Information Systems
FOS: Computer and information sciences, Data Quality Assessment and Improvement, Artificial intelligence, FOS: Political science, MEDLINE, knowledge discovery, Data extraction, Variety (cybernetics), Social Sciences, FOS: Law, Management Science and Operations Research, Knowledge Representation, Decision Sciences, Data science, Context (archaeology), Artificial Intelligence, Data Mining Techniques and Applications, Political science, Preprocessor, Geography, Open source, academic production report, Computer science, Programming language, World Wide Web, Archaeology, Computer Science, Physical Sciences, Lattes platform, Curriculum, Law, Semantic Web and Ontology Development, Software, Computer Science(all), Information Systems
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 65 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 10% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
