Primerjava sistemov za označevanje jezikovnih popravkov v štirih slovenskih besedilnih korpusih
Synopsis
For Slovenian, four text corpora that contain linguistic error annotations are available or under construction: Šolar, KOST, Lektor, and STIKit. The errors and corrections in these corpora are labeled with different annotation systems, each adapted to the specific characteristics of the corpus material. This article analyses the systems, identifying similarities and differences in the annotation categories, and it explores possibilities for label-mapping and comparative analyses of the corpus material.
Downloads
Volume
Pages
11-20
Published
November 19, 2024
Series
Categories
Copyright (c) 2024 University of Ljubljana, Faculty of Arts
License
LicenseHow to Cite
Arhar Holdt, Špela, Popič, D., & Stritar Kučuk, M. (2024). Primerjava sistemov za označevanje jezikovnih popravkov v štirih slovenskih besedilnih korpusih. In S. Štumberger (Ed.), Language Rules and Norms: Vol. Obdobja 43 (pp. 11-20). University of Ljubljana Press. https://ebooks.uni-lj.si/ZalozbaUL/catalog/book/664/chapter/3901