Primerjava sistemov za označevanje jezikovnih popravkov v štirih slovenskih besedilnih korpusih

Authors

Špela Arhar Holdt
University of Ljubljana, Faculty of Arts; University of Ljubljana, Faculty of Computer and Information Science, Slovenia
Damjan Popič
University of Ljubljana, Faculty of Arts, Slovenia
Mojca Stritar Kučuk
University of Ljubljana, Faculty of Arts, Slovenia

Synopsis

For Slovenian, four text corpora that contain linguistic error annotations are available or under construction: Šolar, KOST, Lektor, and STIKit. The errors and corrections in these corpora are labeled with different annotation systems, each adapted to the specific characteristics of the corpus material. This article analyses the systems, identifying similarities and differences in the annotation categories, and it explores possibilities for label-mapping and comparative analyses of the corpus material.

Downloads

Pages

11-20

Published

November 19, 2024

Series

License

License

How to Cite

Arhar Holdt, Špela, Popič, D., & Stritar Kučuk, M. (2024). Primerjava sistemov za označevanje jezikovnih popravkov v štirih slovenskih besedilnih korpusih. In S. Štumberger (Ed.), Language Rules and Norms: Vol. Obdobja 43 (pp. 11-20). University of Ljubljana Press. https://ebooks.uni-lj.si/ZalozbaUL/catalog/book/664/chapter/3901