Suchergebnisse

Filtern nach

Letzte Suchanfragen

Ergebnisse für *

Es wurden 1 Ergebnisse gefunden.

Zeige Ergebnisse 1 bis 1 von 1.

Sortieren

Supplementing CEFR-graded vocabulary lists for language learners by leveraging information on dictionary views, corpus frequency, part-of-speech, and polysemy

Autor*in: Wolfer, Sascha ; Lew, Robert

Erschienen: 2025

Verlag: Berlin : Springer Nature ; Mannheim : Leibniz-Institut für Deutsche Sprache (IDS)

Volltext:	https://ids-pub.bsz-bw.de/frontdoor/index/index/docId/13281 https://ids-pub.bsz-bw.de/files/13281/Wolfer_Lew_Supplementing_CEFR_graded_vocabulary_lists_2025.pdf
Zitierfähiger Link:	https://nbn-resolving.org/urn:nbn:de:bsz:mh39-132818 https://doi.org/10.1057/s41599-025-05446-y

The study explores an approach to supplementing existing CEFR-graded vocabulary lists, which are often incomplete, by imputing CEFR levels for additional vocabulary items. This is achieved by analysing word-level data such as dictionary views, corpus frequency, part-of-speech, and polysemy. Using English as a test case, the study employs a variety of machine-learning models to predict CEFR levels for words not included in the initial set. The models significantly outperform a random baseline, indicating their effectiveness. The findings suggest that corpus frequency is the most influential predictor, followed by dictionary views and polysemy. The study reveals the potential of this semi-automatic approach to expand CEFR-graded word lists, making them more comprehensive and accessible for language learners. At the same time, human oversight is recommended to ensure the appropriateness of the imputed words for language learners, such as regarding the inclusion of potentially offensive terms. Future research may extend this methodology to other languages, provided that sufficient linguistic data is available.

Export in Literaturverwaltung

RIS-Format
BibTeX-Format

Quelle:	BASE Fachausschnitt Germanistik
Sprache:	Englisch
Medientyp:	Aufsatz aus einer Zeitschrift
Format:	Online
DDC Klassifikation:	Sprache (400)
Schlagworte:	Common European Framework of Reference for Languages; Wortschatz; Fremdsprachenlernen; Wörterbuch; Korpus; Worthäufigkeit; Polysemie; Englisch; Maschinelles Lernen
Lizenz:	creativecommons.org/licenses/by/4.0/ ; info:eu-repo/semantics/openAccess

Filtern nach

Aktive Filter

Kategorien:

Bereich

Quelle

Format

Beteiligt

Medientyp

Sprache

Jahr

Letzte Suchanfragen

Ergebnisse für *

Supplementing CEFR-graded vocabulary lists for language learners by leveraging information on dictionary views, corpus frequency, part-of-speech, and polysemy

Kontakt

Partner