Word sense alignment and disambiguation for historical encyclopedias
This paper addresses the challenge of creating a knowledge graph from a corpus of historical encyclopedias, with a special focus on word sense alignment (WSA) and disambiguation (WSD). More precisely, we examine WSA and WSD approaches based on article similarity to link messy historical data, utilizing Wikipedia as a ground-truth component, since the lack of critical overlap in content, paired with the amount of variation between and within the encyclopedias, does not allow for choosing a "baseline" encyclopedia to align the others to. Additionally, we compare the disambiguation performance of classical methods like the Lesk algorithm to more recent approaches, i.e. using language models to disambiguate senses.
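As a rough illustration of the classical baseline named in the abstract, a minimal Lesk-style disambiguator picks the sense whose dictionary gloss shares the most words with the surrounding context. The sense inventory below is a made-up toy example, not data from the paper:

```python
def simplified_lesk(context_words, senses):
    """Pick the sense whose gloss shares the most words with the context."""
    context = {w.lower() for w in context_words}
    best_sense, best_overlap = None, -1
    for sense, gloss in senses.items():
        overlap = len(context & set(gloss.lower().split()))
        if overlap > best_overlap:
            best_sense, best_overlap = sense, overlap
    return best_sense

# Toy sense inventory for "bank" (illustrative glosses, not from a real lexicon)
senses = {
    "bank/finance": "an institution that accepts deposits and lends money",
    "bank/river": "the sloping land beside a body of water such as a river",
}
context = "the boat drifted toward the muddy land by the river".split()
print(simplified_lesk(context, senses))  # → bank/river
```

Language-model approaches replace the bag-of-words gloss overlap with similarity between contextual embeddings of the target word and sense descriptions.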

LRTwiki: enriching the likelihood ratio test with encyclopedic information for the extraction of relevant terms
This paper introduces LRTwiki, an improved variant of the Likelihood Ratio Test (LRT). The central idea of LRTwiki is to employ a comprehensive domain-specific knowledge source as additional "on-topic" data sets, and to modify the calculation of the LRT algorithm to take advantage of this new information. The knowledge source is created on the basis of Wikipedia articles. We evaluate on two related tasks, product feature extraction and keyphrase extraction, and find LRTwiki to yield a significant improvement over the original LRT in both tasks.
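For readers unfamiliar with the baseline, the LRT used for term extraction is commonly computed as a Dunning-style log-likelihood ratio over a term's counts in an on-topic versus a background corpus. The sketch below shows that generic baseline, not the paper's LRTwiki modification; the counts are invented:

```python
import math

def log_l(k, n, p):
    """Binomial log-likelihood, with a small epsilon guarding against log(0)."""
    eps = 1e-12
    return k * math.log(max(p, eps)) + (n - k) * math.log(max(1 - p, eps))

def llr(k1, n1, k2, n2):
    """Log-likelihood ratio: is the term over-represented in corpus 1 vs. corpus 2?"""
    p1, p2, p = k1 / n1, k2 / n2, (k1 + k2) / (n1 + n2)
    return 2 * (log_l(k1, n1, p1) + log_l(k2, n2, p2)
                - log_l(k1, n1, p) - log_l(k2, n2, p))

# Invented counts: a term seen 50 times in 10,000 on-topic tokens
# vs. 60 times in 1,000,000 background tokens scores a high LLR.
print(llr(50, 10_000, 60, 1_000_000))
```

LRTwiki's contribution is to supply the "on-topic" counts from Wikipedia-derived domain data rather than from the (often small) target corpus alone.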
|
Export in Literaturverwaltung |
|
Knowledge sources for bridging resolution in multi-party dialog
In this paper we investigate the coverage of the two knowledge sources WordNet and Wikipedia for the task of bridging resolution. We report on an annotation experiment which yielded pairs of bridging anaphors and their antecedents in spoken multi-party dialog. Manual inspection of the two knowledge sources showed that, with some interesting exceptions, Wikipedia is superior to WordNet when it comes to the coverage of information necessary to resolve the bridging anaphors in our data set. We further describe a simple procedure for the automatic extraction of the required knowledge from Wikipedia by means of an API, and discuss some of the implications of the procedure’s performance.
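The abstract does not specify which API calls the extraction procedure used; as a hypothetical sketch, the public MediaWiki Action API can return, for example, the categories of an article, from which relatedness knowledge for anaphor-antecedent pairs could be mined. The query builder and page title below are illustrative assumptions, and the parser runs against a canned response in the documented shape so it works offline:

```python
import json
from urllib.parse import urlencode

def build_category_query(title):
    """Build a MediaWiki Action API URL asking for a page's categories."""
    params = {
        "action": "query",
        "prop": "categories",
        "titles": title,
        "format": "json",
    }
    return "https://en.wikipedia.org/w/api.php?" + urlencode(params)

def extract_categories(api_json):
    """Pull category names out of an Action API response."""
    cats = []
    for page in api_json["query"]["pages"].values():
        for c in page.get("categories", []):
            cats.append(c["title"])
    return cats

# Canned response (illustrative), shaped like a real Action API reply
canned = {"query": {"pages": {"123": {"title": "Espresso",
          "categories": [{"title": "Category:Coffee drinks"}]}}}}
print(build_category_query("Espresso"))
print(extract_categories(canned))  # → ['Category:Coffee drinks']
```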

Learning from students. On the design and usability of an e-dictionary of mathematical graph theory
We created a prototype of an electronic dictionary for the mathematical domain of graph theory. We evaluate our prototype and compare its effectiveness in task-based tests with that of Wikipedia. Our dictionary is based on a corpus; the terms and their definitions were automatically extracted and annotated by experts (cf. Kruse/Heid 2020). The dictionary is bilingual, covering German and English; it gives equivalents, definitions and semantically related terms. For the implementation of the dictionary, we used LexO (Bellandi et al. 2017). The target group of the dictionary is students of mathematics who attend lectures in German and work with English resources. We carried out tests to understand which items the students search for when they work on graph-theoretical tasks. We ran the same test twice, with comparable student groups, allowing either Wikipedia or our dictionary as an information source. The dictionary seems to be especially helpful for students who already have a vague idea of a term, because they can use the resource to check whether their idea is right.

A comparable Wikipedia corpus: from wiki syntax to POS tagged XML
To build a comparable Wikipedia corpus of German, French, Italian, Norwegian, Polish and Hungarian for contrastive grammar research, we used a set of XSLT stylesheets to transform the MediaWiki annotations to XML. Furthermore, the data has been annotated with word-class information using different taggers. The outcome is a corpus with rich metadata and linguistic annotation that can be used for multilingual research on various linguistic topics.
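The real conversion is done with XSLT stylesheets; purely to illustrate the kind of mapping involved, the toy converter below rewrites two pieces of wiki syntax (bold markup and internal links) as XML elements. The element names `hi` and `ref` are invented for this sketch and are not the corpus schema:

```python
import re

def wiki_to_xml(text):
    """Toy wiki-syntax-to-XML converter: bold text and internal links only."""
    # '''bold''' → <hi rend='bold'>…</hi>
    text = re.sub(r"'''(.+?)'''", r"<hi rend='bold'>\1</hi>", text)
    # [[Target|label]] → <ref target='Target'>label</ref>
    text = re.sub(r"\[\[([^\]|]+)\|([^\]]+)\]\]", r"<ref target='\1'>\2</ref>", text)
    # [[Target]] → <ref target='Target'>Target</ref>
    text = re.sub(r"\[\[([^\]]+)\]\]", r"<ref target='\1'>\1</ref>", text)
    return f"<p>{text}</p>"

print(wiki_to_xml("'''Oslo''' is the capital of [[Norway]]."))
```

A full pipeline must additionally handle templates, tables, headings and nested markup, which is why declarative XSLT over a parsed dump is the more robust choice.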

CMC Corpora in DeReKo
We introduce three types of corpora of computer-mediated communication that have recently been compiled at the Institute for the German Language or curated from an external project and included in DeReKo, the German Reference Corpus: Wikipedia (discussion) corpora, the Usenet news corpus, and the Dortmund Chat Corpus. The data and corpora have been converted to I5, the TEI customization used to represent texts in DeReKo, and are researchable via the web-based IDS corpus research interfaces; the Wikipedia and chat data are additionally downloadable from the IDS repository and download server, respectively.

Studying the distribution of reply relations in Wikipedia talk pages
This paper presents an extended annotation and analysis of interpretative reply relations, focusing on a comparison of reply relation types and targets between conflictual and neutral pages of German Wikipedia (WP) talk pages. We briefly present the different categories identified for interpretative reply relations to analyze the relationship between WP postings, as well as linguistic cues for each category. We investigate referencing strategies of WP authors in discussion page postings, illustrated by means of reply relation types and targets, taking into account the degree of disagreement displayed on a WP talk page. We provide richly annotated data that can be used for further analyses, such as the identification of interactional relations on higher levels, or as training data for machine learning algorithms.

Investigating reply relations on Wikipedia talk pages to reconstruct interactional strategies of Wikipedia authors
This chapter presents the annotation and analysis of interpretative reply relations on Wikipedia talk pages using data from the WikiDemoCorpus (WDC). Building on an approach of annotating interpretative reply relations to analyze these relations in Wikipedia talk page posts, the chapter presents nine reply relation categories found in the German WDC. Additionally, linguistic cues for each category and the Wikipedia discussion pages overall are explained in detail, illustrated through reply relation targets. The results of the linguistic annotation are threefold: First, we provide an annotation scheme that can be used by third parties to produce more data according to their needs. Second, we shed light on and quantify the numerous ways Wikipedia authors reply to each other’s posts on talk pages. Finally, we provide richly annotated data that can be used for further analyses, such as identifying interactional relations on higher levels or training tasks in machine learning algorithms.

Investigating interaction signs across genres, modes and languages: The example of OKAY
This paper presents results of a case study that compared the usage of OKAY across genre types (Wikipedia articles vs. talk pages), across modes (spoken vs. written language), and across languages (German vs. French CMC data from Wikipedia talk pages). The cross-genre study builds on the results of Herzberg (2016), who compared the usage of OKAY in German Wikipedia articles with its usage in Wikipedia talk pages. These results also form the basis for comparing the CMC genre of Wikipedia talk pages with occurrences of OKAY in the German spoken language corpus FOLK. Finally, we compared the results on the usage of OKAY in German Wikipedia talk pages with the usage of OKAY in French Wikipedia talk pages. With our case study, we want to demonstrate that it is worthwhile to investigate interaction signs across genres and languages, and to compare their usage in written CMC with their usage in spoken interaction.

Linguistische Wikipedistik
Wikipedia is not only the world's largest online encyclopedia but also one of the most successful Web 2.0 projects: in just 16 years, roughly 48 million entries have been created across 295 language versions (Wikimedia 2018). Ranked 5th in the Alexa ranking, Wikipedia is one of the most heavily used platforms on the internet (Alexa 2018). Owing to its relevance and reach, Wikipedia is also intensively researched. The page "Wikipedistik" (WP-Wikipedistik; Wikipedia 2018) in the meta area of the German-language Wikipedia provides an overview of national and international research activities and results. The disciplines involved, the research questions, and the methodological approaches of Wikipedia studies are diverse. Hammwöhner (2007) deals with methods and results of quality assessment of Wikipedia articles from an information-science perspective. Pscheida (2010) examines Wikipedia from the perspective of the sociology of knowledge and uses the example of Wikipedia to substantiate interesting theses on the "knowledge culture of the digital age" (Pscheida 2010: 458 ff.). Stegbauer (2009) examines the social role structure and the motivation of the actors in the German Wikipedia and offers an empirically very well-supported insight into the social processes within the project. In this contribution, we give an overview of current research on Wikipedia from the perspective of language and discourse analysis. First (Sections 2.1–2.4), we illustrate the potential of Wikipedia as a research object in four thematic fields: text and interaction, discourse linguistics, multimodality, and cross-linguistic and cross-cultural comparison. The subsequent Section 2.5, "Wikipedaktik", deals with Wikipedia as a worthwhile learning object in schools and universities.
Wikipedia is not only interesting as a resource that vividly illustrates the characteristics of digital discourses, multimodal hypertexts, and collaborative writing and negotiation processes. It is also a project of free knowledge, ...

Investigating OKAY across genres, modes and languages: A corpus-based study on German and French
In our study, we used the spoken language corpus FOLK and the Wikipedia corpus family, provided by the Institute for the German Language (IDS) in Mannheim, to examine the usage of OKAY in various spelling and pronunciation variants across genre types (Wikipedia articles vs. talk pages), across modes (transcribed spoken vs. written language), and across languages (German vs. French Wikipedia talk pages). Our comparison of German Wikipedia talk and article pages made evident that OKAY is used far more frequently in the CMC-like Wikipedia talk pages than in the text-like Wikipedia articles. The comparison of the CMC data with the FOLK corpus of transcribed spoken language revealed interesting differences in the distribution of functional and topological features. The results suggest the emergence of particular functions and usage patterns for OKAY in written CMC that differ from the patterns observed in spoken interaction. The comparison of German and French Wikipedia talk pages yielded common usage patterns in both languages, e.g. the preference for "speedy" spelling variants (ok, OK, Ok) and a similar distribution of topological features, but also differences in the distribution of functional features.
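A variant-frequency comparison like the one described above starts from counting surface spellings per corpus. The toy counter below illustrates the idea; the variant list is reduced to okay/ok for the sketch and the sample sentence is invented, not corpus data:

```python
import re
from collections import Counter

# Toy pattern for two surface variants; "okay" must precede "ok" in the
# alternation so the longer form is matched in full.
VARIANTS = re.compile(r"\b(okay|ok)\b", re.IGNORECASE)

def variant_counts(text):
    """Count spelling variants of OKAY, preserving capitalization (ok vs. OK vs. Ok)."""
    return Counter(m.group(0) for m in VARIANTS.finditer(text))

sample = "Ok, das passt. OK! okay, dann machen wir das so. ok ok"
print(variant_counts(sample))  # → Counter({'ok': 2, 'Ok': 1, 'OK': 1, 'okay': 1})
```

Per-corpus counts of this kind, normalized by corpus size, are what make the talk-page vs. article and German vs. French comparisons possible.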