85 Research products, page 1 of 9

  • Publications
  • 2018-2022
  • Open Access
  • English
  • DARIAH EU

  • Open Access English
    Authors: 
    Bowers, Jack; Herold, Axel; Romary, Laurent; Tasovac, Toma;
    Publisher: HAL CCSD
    Country: France

    The present paper describes the etymological component of the TEI Lex-0 initiative, which aims at defining a terser subset of the TEI guidelines for the representation of etymological features in dictionary entries. Going beyond the basic provision of etymological mechanisms in the TEI guidelines, TEI Lex-0 Etym proposes a systematic representation of etymological and cognate descriptions by means of embedded constructs based on the <etym> (for etymologies) and <cit> (for etymons and cognates) elements. In particular, given that all the potential contents of etymons are highly analogous to those of dictionary entries in general, the contents presented herein heavily re-use many of the corresponding features and constraints introduced in other components of the TEI Lex-0 for the encoding of etymologies and etymons. The TEI Lex-0 Etym model is also closely aligned to ISO 24613-3 on modelling etymological data and the corresponding TEI serialisation available in ISO 24613-4.

  • Publication . Other literature type . Book . Part of book or chapter of book . 2020
    Open Access English
    Authors: 
    Edmond, Jennifer; Romary, Laurent;
    Publisher: HAL CCSD
    Country: France

    Introduction. The scholarly monograph has been compared to the Hapsburg monarchy in that it seems to have been in decline forever! It was in 2002 that Stephen Greenblatt, in his role as president of the US Modern Language Association, urged his membership to recognise what he called a ‘crisis in scholarly publication’. It is easy to forget now that this crisis, as he then saw it, had nothing to do with the rise of digital technologies, e-publishing, or open access. Indeed, it puts his words in...

  • Open Access English
    Authors: 
    van Nispen, Annelies;
    Publisher: HAL CCSD
    Country: Netherlands

    The European Holocaust Research Infrastructure (EHRI) started in October 2010 to build on a network that connects both people (Holocaust researchers, archivists, curators, librarians and digital humanists) and dispersed Holocaust source material and collections. EHRI’s aim is to make sources visible in a systematic way in order to counteract the fragmentation of the sources and to reveal interconnections. EHRI focuses on archive and collection descriptions, which are available through the EHRI Portal. EHRI is currently in its second phase and is on the ESFRI Roadmap for a more sustainable future. EHRI has developed a set of controlled vocabularies that serves as both a retrieval and a cataloguing tool for the multilingual and highly heterogeneous data of the EHRI portal. These vocabularies were partly implemented in the first phase of the project. In the current phase of EHRI the vocabularies are undergoing quality improvement to improve and enrich the existing terms, add new terms, disambiguate, remove mistakes (deduplication, merging, adding multilingual labels, consistency checks, multiple parent relations, etc.) and increase their coverage. In the EHRI portal the subject terms are currently not available to the public, as they are used only for retrieval purposes.

  • Publication . Article . Other literature type . Conference object . 2020
    Open Access English
    Authors: 
    Stefan Bornhofen; Marten Düring;
    Publisher: HAL CCSD
    Country: France
    Project: ANR | BLIZAAR (ANR-15-CE23-0002)

    The paper presents Intergraph, a graph-based visual analytics technical demonstrator for the exploration and study of content in historical document collections. The designed prototype is motivated by a practical use case on a corpus of circa 15,000 digitized resources about European integration since 1945. The corpus allowed generating a dynamic multilayer network which represents different kinds of named entities appearing and co-appearing in the collections. To our knowledge, Intergraph is one of the first interactive tools to visualize dynamic multilayer graphs for collections of digitized historical sources. Graph visualization and interaction methods have been designed based on user requirements for content exploration by non-technical users without a strong background in network science, and to compensate for common flaws in the annotation of named entities. Users work with self-selected subsets of the overall data by interacting with a scene of small graphs which can be added, altered and compared. This allows an interest-driven navigation in the corpus and the discovery of the interconnections of its entities across time.

  • Open Access English
    Authors: 
    Bernard, Loup;
    Publisher: HAL CCSD
    Country: France

    After more than a decade online, the ArkeoGIS project illustrates the benefits of data sharing. Thanks to free software building blocks, and with the invaluable help of the CNRS’s Huma-Num infrastructure, this spreadsheet sharing platform has shown its efficiency. Users can freely select their language, chronology and the data they wish to share. With over 100 database extracts from professionals, research grants and advanced students, the tool now offers more than 100,000 spatialized data units about the past, in the Upper Rhine valley and also worldwide depending on users’ needs. In this contribution, good practices, hindrances and accelerators of data sharing among archaeologists and (paleo-)environmentalists on the ArkeoGIS platform will be discussed, with the hope of generating more sharing in the digital humanities.

  • Publication . Article . Preprint . 2020
    Open Access English
    Authors: 
    Del Gratta, Riccardo;

    In this article, we propose a Category Theory approach to (syntactic) interoperability between linguistic tools. The resulting category consists of textual documents, including any linguistic annotations, NLP tools that analyze texts and add further linguistic information, and format converters. Format converters are necessary to enable the tools both to read and to produce different formats, which is the key to interoperability. The idea behind this document is the parallelism between the concepts of composition and associativity in Category Theory and NLP pipelines. We show how pipelines of linguistic tools can be modeled in the conceptual framework of Category Theory and we successfully apply this method to two real-life examples. Paper submitted to Applied Category Theory 2020 and accepted for the Virtual Poster Session.

  • Open Access English
    Authors: 
    Kolar, Jana; Cugmas, Marjan; Ferligoj, Anuška;
    Project: EC | ACCELERATE (731112)

    In 2018, the European Strategy Forum on Research Infrastructures (ESFRI) was tasked by the Competitiveness Council, a configuration of the Council of the EU, to develop a common approach for monitoring the performance of Research Infrastructures. To this end, ESFRI established a working group, which has proposed 21 Key Performance Indicators (KPIs) to monitor the progress of the Research Infrastructures (RIs) addressed towards their objectives. The RIs were then asked to assess the relevance of these KPIs for their institution. The paper aims to identify the relevance of certain indicators for particular groups of RIs by using cluster and discriminant analysis. This could contribute to the development of a monitoring system tailored to particular RIs. To obtain a typology of the RIs, we first performed cluster analysis of the RIs according to their properties, which revealed clusters of RIs with similar characteristics, based on the domain of operation, such as food, environment or engineering. Then, discriminant analysis was used to study how the relevance of the KPIs differs among the obtained clusters. This analysis revealed that the percentage of RIs correctly classified into five clusters, using the KPIs, is 80%. Such a high percentage indicates that there are significant differences in the relevance of certain indicators, depending on the ESFRI domain of the RI. The indicators therefore need to be adapted to the type of infrastructure. It is therefore proposed that the Strategic Working Groups of ESFRI addressing specific domains should be involved in the tailored development of the monitoring of pan-European RIs. 15 pages, 8 tables, 3 figures.

  • Publication . Part of book or chapter of book . 2019
    Open Access English
    Authors: 
    Gelati, Francesco;
    Publisher: Zenodo
    Project: EC | EHRI (654164)

    The European Holocaust Research Infrastructure (EHRI) portal website aims to aggregate digitally available archival descriptions concerning the Holocaust. This portal is in effect a meta-catalogue, or an information aggregator, whose main goal is to keep its information up to date by building sustainable data pipelines between EHRI and its content providers. Just as in similar archival information aggregators (e.g. Archives Portal Europe or Monasterium), the XML-based metadata standard Encoded Archival Description (EAD) plays a key role. The article presents how EAD files are imported into the portal, mainly thanks to the Open Archives Initiative protocols.

  • Open Access English
    Authors: 
    Van Der Eycken, Johan; Styven, Dorien; Gheldof, Tom; Depoortere, Rolande;
    Publisher: HAL CCSD
    Countries: Belgium, France

    This article shows that metadata plays a central role in our society and concludes that through collaborative work, it is possible to pool solutions and to establish relationships of cooperation, both at the level of practical tool development and with regard to sharing and creating knowledge and know-how. Part of: ABB: Archives et Bibliothèques de Belgique - Archief- en Bibliotheekwezen in België, vol. 106, pp. 135-144. Status: published.

  • Open Access English
    Authors: 
    Soyez, Sébastien;
    Publisher: HAL CCSD

