EuReCo: Not Building and Yet Using Federated Comparable Corpora for Cross-Linguistic Research
This paper gives an overview of recent developments concerning the European Reference Corpus EuReCo, an open long-term initiative aimed at providing and using virtual and dynamically definable comparable corpora based on existing national, reference...
mehr
Volltext:
|
|
Zitierfähiger Link:
|
|
This paper gives an overview of recent developments concerning the European Reference Corpus EuReCo, an open long-term initiative aimed at providing and using virtual and dynamically definable comparable corpora based on existing national, reference or other large corpora. Given the problems and shortcomings of other types of multilingual corpora – such as the shining-through effects in parallel corpora or the limitation to web material only in web-based comparable corpora – EuReCo constitutes a unique linguistic resource that offers new perspectives for fine-grained cross-linguistic research. The approach advocated here puts forward new solutions to notorious IPR and licensing issues, as well as to challenges of interoperability. It also addresses methodological questions concerning comparability and representativeness. While the focus of this paper is on EuReCo’s implementation-based approach to ensuring interoperability in a feasible and maintainable way, it also presents preliminary results of pilot comparative studies on light verb constructions in German, Romanian, Hungarian, Polish and Bulgarian, and reports on recent extensions and plans.
|
Export in Literaturverwaltung |
|
Standards Information System for CLARIN centres and beyond
The present contribution describes features of the CLARIN Standards Information System that have been designed to assist data deposition centres in CLARIN. We also show what is needed and what has been done in order to go beyond the originally...
mehr
Volltext:
|
|
Zitierfähiger Link:
|
|
The present contribution describes features of the CLARIN Standards Information System that have been designed to assist data deposition centres in CLARIN. We also show what is needed and what has been done in order to go beyond the originally designated target, so as to provide service to sibling and descendant research infrastructures, of which DARIAH and Text+ are taken as examples. This paper is aimed primarily at representatives of research infrastructure nodes, responsible for preparing and sharing data deposition information about their centres or repositories. It assumes a degree of technical knowledge or experience in using the XML format and tools, the REST API, and version control systems.
|
Standards information system for CLARIN centres and beyond.
In the present contribution, we describe the features of the CLARIN SIS (Standards Information System) that have been designed to assist data-deposition centres in CLARIN. We also show what is needed to go beyond the originally designated target, in...
mehr
Volltext:
|
|
Zitierfähiger Link:
|
|
In the present contribution, we describe the features of the CLARIN SIS (Standards Information System) that have been designed to assist data-deposition centres in CLARIN. We also show what is needed to go beyond the originally designated target, in order to provide service to sibling and descendant research infrastructures, of which DARIAH and Text+ are taken as examples.
|
Export in Literaturverwaltung |
|