
Results for *

12 results found.

Showing results 1 to 12 of 12.


  1. Text+ – Concept and benefits for empirical researchers
    Published: 2025
    Publisher: Sofia : Bulgarian Academy of Sciences ; Mannheim : Leibniz-Institut für Deutsche Sprache (IDS)

    In this contribution, we report on ongoing efforts in the German national research infrastructure consortium Text+ to make research data and services for text- and language-oriented disciplines FAIR, that is findable, accessible, interoperable, and reusable, as well as compliant with the CARE principles for language resources.

     

    Source: BASE Fachausschnitt Germanistik
    Language: German
    Media type: Journal article
    Format: Online
    DDC classification: Language (400)
    Keywords: Text analysis; Nationale Forschungsdateninfrastruktur (NFDI)
    License:

    creativecommons.org/licenses/by-nc-nd/4.0/ ; info:eu-repo/semantics/openAccess

  2. Report on the first NFDI Metadata Workshop. Version v1
    Published: 2025
    Publisher: Geneva : Zenodo ; Mannheim : Leibniz-Institut für Deutsche Sprache (IDS)

    Metadata is a topic of significant interest across all consortia, serving as a crucial link between them. When properly addressed, it enables consortia to share their specific needs and experiences, fostering collaboration and knowledge exchange. This report summarizes the outcomes of the first NFDI Metadata Workshop, which took place on January 14-15, 2025, in Dresden, and was organized by the Taskforce Metadata. The workshop marked the beginning of a series of NFDI-wide metadata discussions aimed at developing joint recommendations for metadata schemas for datasets and the use of re3data as a central registry for repositories.

     

    Source: BASE Fachausschnitt Germanistik
    Language: English
    Media type: Journal article
    Format: Online
    DDC classification: Language (400)
    Keywords: Metadata; Consortium; Electronic data interchange; Standard
    License:

    creativecommons.org/licenses/by/4.0/ ; info:eu-repo/semantics/openAccess

  3. Acquiring lexical information from multilevel temporal annotations
    Published: 2025
    Publisher: Mannheim : Leibniz-Institut für Deutsche Sprache (IDS) [secondary publication]

    The extraction of lexical information for machine-readable lexica from multilevel annotations is addressed in this paper. Relations between these levels of annotation are used for subclassification of lexical entries. A method for relating annotation units is presented, based on a temporal calculus. Relating the annotation units manually is error-prone and time-consuming and tends to be inconsistent; a method for accomplishing this task automatically is presented and evaluated using German, Japanese and Anyi (W. Africa) corpora.

     

    Source: BASE Fachausschnitt Germanistik
    Language: English
    Media type: Conference publication
    Format: Online
    DDC classification: Language (400)
    Keywords: Annotation; Corpus; Text technology; German; Japanese; Anyi language
    License:

    rightsstatements.org/page/InC/1.0/ ; info:eu-repo/semantics/openAccess

  4. A computational model of arm gestures in conversation
    Published: 2025
    Publisher: Mannheim : Leibniz-Institut für Deutsche Sprache (IDS) [secondary publication]

    Currently no standardised gesture annotation systems are available. As a contribution towards solving this problem, CoGesT, a machine processable and human usable computational model for the annotation of a subset of conversational gestures is presented, its empirical and formal properties are detailed, and application areas are discussed.

     

    Source: BASE Fachausschnitt Germanistik
    Language: German
    Media type: Conference publication
    Format: Online
    DDC classification: Language (400)
    Keywords: Text technology
    License:

    rightsstatements.org/page/InC/1.0/ ; info:eu-repo/semantics/openAccess

  5. Concordancing for parallel spoken language corpora
    Published: 2025
    Publisher: Mannheim : Leibniz-Institut für Deutsche Sprache (IDS) [secondary publication]

    Concordancing is one of the oldest corpus analysis tools, especially for written corpora. In NLP, concordancing appears in the training of speech-recognition systems. Additionally, comparative studies of different languages result in parallel corpora. Concordancing for these corpora in an NLP context is a new approach. We propose to combine these fields of interest for a multi-purpose concordance for Spoken Language Data, opening the opportunity of combining corpus-linguistic and NLP methods resulting in a broader empirical basis for NLP research. Theoretic models for audio-concordances are discussed. Principles of the structure and design of a parallel audio concordance are given, coding by means of XML to ensure reusability and flexibility, using time stamps for referencing from annotations to the signal.

     

    Source: BASE Fachausschnitt Germanistik
    Language: German
    Media type: Conference publication
    Format: Online
    DDC classification: Language (400)
    Keywords: Neuro-linguistic programming
    License:

    rightsstatements.org/page/InC/1.0/ ; info:eu-repo/semantics/openAccess

  6. Terminology principles and support for spoken language system development
    Published: 2025
    Publisher: Mannheim : Leibniz-Institut für Deutsche Sprache (IDS) [secondary publication]

    Spoken language (SL) system development is an increasingly interdisciplinary effort. Speech-to-speech system development, for example, involves speech engineers, software engineers, phoneticians, and a variety of computational linguistic subdisciplines from morphology, syntax and lexicology through semantics and pragmatics, each with their own historically motivated terminology. In our experience this ‘terminology barrier’ makes communication between the disciplines unnecessarily difficult. As a contribution to reducing the terminology barrier we propose a set of new speech specific terminological principles and a prototype term bank with an Internet interface for this specific purpose. The result is one of the outputs of the spoken language working group of the LE EAGLES Phase II project (LE3-4244 10484/0).

     

    Source: BASE Fachausschnitt Germanistik
    Language: German
    Media type: Conference publication
    Format: Online
    DDC classification: Language (400)
    Keywords: Text technology; Spoken language
    License:

    rightsstatements.org/page/InC/1.0/ ; info:eu-repo/semantics/openAccess

  7. Unlocking the corpus. Enriching metadata with state-of-the-art NLP methodology and linked data
    Published: 2025
    Publisher: Linköping : Linköping University Electronic Press ; Mannheim : Leibniz-Institut für Deutsche Sprache (IDS)

    In research data management, metadata are indispensable to describing data and are a key element in preparing data according to the FAIR principles. Metadata in catalogues and registries are usually recorded either by archivists or subject matter experts, i.e. researchers involved in the creation or assembling of the data, or provided in the data preparation workflow. Extracting metadata from textual research data is currently not part of most metadata workflows, even more so if a research data set can be subdivided into smaller parts, such as a newspaper corpus containing multiple newspaper articles. If we look at descriptive metadata from a large corpus of newspapers, the basic metadata may consist of information, for example, about the title, or year of publication. Our approach is to add semantic metadata on the text level to facilitate the search over data. We show how to enrich metadata with three methods: named entity recognition, keyword extraction, and topic modeling. The goal is to make it possible to search for texts that are about certain topics or described using certain keywords or to identify people, places, and organisations mentioned in texts without actually having to read them.

     

    Source: BASE Fachausschnitt Germanistik
    Language: English
    Media type: Article in an edited volume
    Format: Online
    DDC classification: Language (400)
    Keywords: Corpus; Metadata; Automatic language analysis; Data management; Named-entity recognition; Computational linguistics
    License:

    creativecommons.org/licenses/by/4.0/ ; info:eu-repo/semantics/openAccess

  8. Collaborative standardization of key performance indicator assessment within NFDI using NocoDB

    Germany's National Research Data Infrastructure (NFDI, Nationale Forschungsdateninfrastruktur) comprises 26 consortia, each providing data management services for a specific domain or methodology. Consortia operate as cooperative projects funded by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation). Additionally, the consortia are reflected as departments of the NFDI association. A central aim of NFDI is to cooperatively build a domain- and method-spanning research data infrastructure embodied by the vision of OneNFDI, i.e., the desired convergence of multiple infrastructures for RDM across domains into a comprehensive, flexible network of interconnected services. This vision drives extensive cross-consortial collaboration facilitating data to become a common good for excellent research. Such collaborations include sections of the NFDI association addressing overarching topics; the cross-consortia project Base4NFDI; and task forces on governance, sustainability, evaluation and reporting, and the implementation of technical tools. NFDI as a whole needs to assess its impact as required by the Scientific Senate of NFDI and the Federal Ministry of Education and Research (BMBF), and each consortium must monitor progress and performance in serving their scientific communities. Therefore, a robust monitoring and reporting strategy is needed, built on overarching KPIs collected with a standardized, yet sufficiently flexible database facilitating both internal evaluation and external reporting. In 2023, the Task Force Evaluation and Reporting (TFER) compiled and devised criteria for quantitative and, importantly, qualitative performance indicators, introducing distinct definitions and parameters for reporting on each consortium's progress. These served as a reference for the DFG's design of a data sheet template with over 40 indicators, now compulsory in reports and follow-up proposals submitted to the DFG. Several indicators pose socio-technical challenges to ensuring accurate and meaningful assessment. ...

     

    Source: BASE Fachausschnitt Germanistik
    Language: English
    Media type: Article in an edited volume
    Format: Online
    DDC classification: Language (400)
    Keywords: Reporting; Evaluation
    License:

    creativecommons.org/licenses/by/4.0/ ; info:eu-repo/semantics/openAccess

  9. Repository Engagement in the German NFDI (REGEN). Towards a harmonized collection of metadata on repositories with re3data
    Published: 2025
    Publisher: Geneva : Zenodo ; Mannheim : Leibniz-Institut für Deutsche Sprache (IDS)

    The Registry of Research Data Repositories re3data.org is a major directory [1] of research data repositories across all disciplines worldwide and makes metadata on repositories for the permanent storage and access of data sets openly available according to re3data's Metadata Schema Version 4.0. It is widely used by researchers to identify repositories to find and deposit data. Funders and publishers recommend re3data to guide investigators and authors to repositories to meet their requirements for sharing the data that support grant-funded research and publications. [2], [3] Librarians and lecturers use re3data to promote data literacy. [4], [5], [6] Other service providers such as DataCite and OpenAIRE integrate with the registry to feed information about data repositories into scholarly workflows. As such, it plays a key role in fostering data accessibility, preservation, interoperability, and reuse. The registry acts as a reference point for repository metadata, providing a comprehensive overview of the landscape. [7], [8], [9] Similarly, it allows for analysing and monitoring developments within the repository landscape. Curating the repository information is therefore crucial and contributes to high standards in data management, follows the FAIR principles and promotes a culture of open science. Due to the broad uptake by the community and participation of institutions across the German research landscape, a collaboration of the NFDI with re3data offers a unique opportunity to comprehensively map repository information that is important for researchers. The high-quality, comprehensive, and up-to-date records will inform decisions on where best to store research data, depending on the needs and requirements. Additionally, it improves the exposure of research data infrastructures of the NFDI, including the data generated, to the international research community. To organize this community effort to curate the repository data, the Taskforce Metadata from the Section (Meta)data, Terminologies, ...

     

    Source: BASE Fachausschnitt Germanistik
    Language: English
    Media type: Article in an edited volume
    Format: Online
    DDC classification: Language (400)
    Keywords: Document server; Configuration database; Research data; Metadata
    License:

    creativecommons.org/licenses/by/4.0/ ; info:eu-repo/semantics/openAccess

  10. Report on the first NFDI Metadata Workshop
    Published: 2025
    Publisher: Geneva : Zenodo ; Mannheim : Leibniz-Institut für Deutsche Sprache (IDS)

    Metadata is a topic of significant interest across all consortia, serving as a crucial link between them. When properly addressed, it enables consortia to share their specific needs and experiences, fostering collaboration and knowledge exchange. This report summarizes the outcomes of the first NFDI Metadata Workshop, which took place on January 14-15, 2025, in Dresden, and was organized by the Taskforce Metadata. The workshop marked the beginning of a series of NFDI-wide metadata discussions aimed at developing joint recommendations for metadata schemas for datasets and the use of re3data as a central registry for repositories.

     

    Source: BASE Fachausschnitt Germanistik
    Language: English
    Media type: Report
    Format: Online
    DDC classification: Language (400)
    Keywords: Nationale Forschungsdateninfrastruktur (NFDI) e.V; Metadata; Report; Repository; DataCite; Consortium
    License:

    creativecommons.org/licenses/by/4.0/ ; info:eu-repo/semantics/openAccess

  11. Federated content search for Lexical Resources (LexFCS): Specification

    The landscape of digital lexical resources is often characterized by dedicated local portals and proprietary interfaces as primary access points for scholars and the interested public. In addition, legal and technical restrictions are potential issues that can make it difficult to efficiently query and use these valuable resources. As part of the research data consortium Text+, solutions for the storage and provision of digital language resources are being developed and provided in the context of the unified cross-domain German research data infrastructure NFDI. The specific topic of accessing lexical resources in a diverse and heterogenous landscape with a variety of participating institutions and established technical solutions is met with the development of the federated search and query framework LexFCS. The LexFCS extends the established CLARIN Federated Content Search that already allows accessing spatially distributed text corpora using a common specification of technical interfaces, data formats, and query languages. This paper describes the current state of development of the LexFCS, gives an insight into its technical details, and provides an outlook on its future development.

     

    Source: BASE Fachausschnitt Germanistik
    Language: English
    Media type: Report
    Format: Online
    DDC classification: Language (400)
    Keywords: Information retrieval; Research data; Infrastructure; Corpus; Lexical analysis
    License:

    creativecommons.org/licenses/by/4.0/ ; info:eu-repo/semantics/openAccess

  12. Nationale Forschungsdateninfrastruktur (NFDI). Collaborative work in NFDI. Dataset Documentation

    The non-profit association National Research Data Infrastructure (NFDI) promotes science and research through a National Research Data Infrastructure. Its aim is to develop and establish an overarching research data management (RDM) for Germany and to increase the efficiency of the entire German science system. After a two-and-a-half year build up phase, the process of adding new consortia, each representing a different data domain, has ended in March 2023. NFDI consists of 26 disciplinary consortia and one additional basic service collaboration (Base4NFDI). The attached table of jointly documented cross-consortial collaborations is based on a White Paper ratified by the NFDI association’s consortia assembly in January 2023. It defines collaborations as “the exchange of information on or development of common approaches to managing the research data of at least one domain.” From the perspective of the consortia assembly, “A necessary condition for any collaboration is that activities are on behalf and in line with the strategic aims of a consortium and are not activities by individuals within them only.” The tabular overview of the collaborations was created as a collaborative work in which all consortia had the opportunity to enter joint activities. Nevertheless, this document does not claim to be complete. Instead, it is constantly updated to reflect the evolving state of NFDI. A first version of the document was published in 2023, a second in 2024. This is the third version and future collaboration will be published in updated versions.

     

    Source: BASE Fachausschnitt Germanistik
    Language: English
    Media type: Report
    Format: Online
    DDC classification: Library and information sciences (020)
    Keywords: Data management; Research data; Information modelling; Infrastructure; Standardization; Nationale Forschungsdateninfrastruktur (NFDI) e.V
    License:

    creativecommons.org/licenses/by/4.0/ ; info:eu-repo/semantics/openAccess