Ergebnisse für *

Es wurden 3 Ergebnisse gefunden.

Zeige Ergebnisse 1 bis 3 von 3.

Sortieren

  1. ZuMult: Neue Zugangswege zu Korpora gesprochener Sprache
  2. „Korpora in der germanistischen Sprachwissenschaft – mündlich, schriftlich, multimedial“; Bericht von der 58. Jahrestagung des Leibniz-Instituts für Deutsche Sprache (als Online-Konferenz), 15. - 17. März 2022
    Erschienen: 2022
    Verlag:  Mannheim : Leibniz-Institut für Deutsche Sprache (IDS)

  3. Querying Interaction Structure: Approaches to Overlap in Spoken Language Corpora
    Erschienen: 2022
    Verlag:  Paris : European Language Resources Association (ELRA) ; Mannheim : Leibniz-Institut für Deutsche Sprache (IDS)

    In this paper, we address two problems in indexing and querying spoken language corpora with overlapping speaker contributions. First, we look into how token distance and token precedence can be measured when multiple primary data streams are... mehr

     

    In this paper, we address two problems in indexing and querying spoken language corpora with overlapping speaker contributions. First, we look into how token distance and token precedence can be measured when multiple primary data streams are available and when transcriptions happen to be tokenized, but are not synchronized with the sound at the level of individual tokens. We propose and experiment with a speaker based search mode that enables any speaker’s transcription tier to be the basic tokenization layer whereby the contributions of other speakers are mapped to this given tier. Secondly, we address two distinct methods of how speaker overlaps can be captured in the TEI based ISO Standard for Spoken Language Transcriptions (ISO 24624:2016) and how they can be queried by MTAS – an open source Lucene-based search engine for querying text with multilevel annotations. We illustrate the problems, introduce possible solutions and discuss their benefits and drawbacks.

     

    Export in Literaturverwaltung   RIS-Format
      BibTeX-Format
    Quelle: BASE Fachausschnitt Germanistik
    Sprache: Englisch
    Medientyp: Aufsatz aus einem Sammelband
    Format: Online
    DDC Klassifikation: Sprache (400)
    Schlagworte: Deutsch; Korpus; Gesprochene Sprache; Sprecherwechsel; Token; Abfragesprache; Suchmaschine
    Lizenz:

    creativecommons.org/licenses/by-nc/4.0/deed.de ; info:eu-repo/semantics/openAccess