Zbornik konference Jezikovne tehnologije in digitalna humanistika

Darja Fišer; Andrej Pančur; Malvina Nissim; Martijn Kleppe; Mihael Arčan; Vuk Batanovi´ć; Nikola Ljubešić; Tanja Samardžić; Narvika Bovcon; Aleš Vaupotič; Petar Božović; Tomaž Erjavec; Jörg Tiedemann; Vojko Gorjanc; Nina Ditmajer; Matija Ogrin; Helena Dobrovoljc; Urška Vranjek Ošlak; Kaja Dobrovoljc; Maja Dolinar; Janez Štebe; Sonja Bezjak; Gregor Donaj; Mirjam Sepesy Maučec; Monika Kalin Golob; Jakob Lenardič; Polona Gantar; Špela Arhar Holdt; Jaka Čibej; Taja Kuzman; Teja Kavčič; Kristina Štrkalj Despot; Simon Krek; Peter Holozan; Lana Hudeček; Milica Mihaljević; Maria José Bocorny Finatto; Paulo Quaresma; Maria Filomena Gonçalves; Alenka Kavčič; Ivan Lovrić; Vera Smole; Aleksander Ključevšek; Marko Robnik-Šikonja; Iztok Kosem; Cyprian Laskowski; Aniko Kovač; Maja Marković; Milan M. van Lange; Ralf D. Futselaar; Alenka Pirman; Maruša Kocjančič; Miha Pavlovič; Rena Ito; Benedikt Perak; Filip Rodik; Dan Podjed; Ajda Pretnar; Ivanka Rajh; Siniša Runjaić; Tadej Škvorc; Senja Pollak; Tobias Weber; Jeremy Bradley; Katja Mihurko Poniž; Marie Nedregotten Sørbø; Viola Parente-Čapková; Amelia Sanz; Suzan van Dijk; Aleš Vaupotič; Damjan Popič; Sara Ries; Christof Schöch; Maciej Eder; Carolin Odebrecht; Mike Kestemont; Antonija Primorac; Justin Tonra; Catherine Kanellopoulou; Darinka Verdonik; Urška Bratoš; Isolde van Dorst; Klara Eva Kukovičič; Gabi Rolih; Željko Agić; Filip Klubička; Filip Dobranič; Nataša Logar; Tatjana Marvin; Jure Derganc; Samo Beguš; Saba Battelino; Eneja Osrajnik

doi:10.4312/9789610601111

Proceedings of the conference on Language Technologies & Digital Humanities

Authors

Darja Fišer (ed)

Faculty of Arts, University of Ljubljana; Jožef Stefan Institute

Andrej Pančur (ed)

Institute of Contemporary History

DOI: https://doi.org/10.4312/9789610601111

Keywords:

Language Technologies, Digital Humanities

Synopsis

With this year's conference we are celebrating the 20th anniversary since the first conference »Language technologies« which took place in 1998 in Cankarjev dom, Ljubljana and was organized by Tomaž Erjavec, Vojko Gorjanc, Jerneja Žganec Gros and Anica Rant. The topics of the first conference were the development and application of language technologies for Slovene and directions for the future. 26 papers were presented, dealing with speech technologies and phonology, computerassisted translation and teaching, corpora, encoding standards for language data and searching for information on the internet. Following the conference a round table discussion was held, the direct result of which was the establishment of the Slovenian language technologies society which has since been the main initiator and organizer of all the following editions of the conference. Together with the Centre of lagnuage resources and technologies of the University of Ljubljana (CJVT), Faculty of Electrical Engineering of the University in Ljubljana and research infrastructures CLARIN.SI and DARIAH-SI the Society is also organizing this year's conference, held on 20-21 September 2018 at the Faculty of Electrical Engineering. In its 11th installment and after a successful expansion of the conference programme to Digital Humanities in 2016, we have retained the focus on the integration of the two disciplines and at the same time aimed to position the conference as an important meeting hub for fellow researchers in the region.

This year, 47 papers will be presented, including 2 talks by invited lecturers, 36 regular full papers and 5 abstracts, and 4 student papers. All the papers were reviewed by 3 reviewers. 21 papers weresubmitted in Slovene and 26 in English. The total number of all authors of the accepted papers is 92. Over half of the authors of the accepted papers are Slovene, 10% are from Croatia and the rest of the authors come from as many as 19 different countries. This is why the conference programme was designed in such a way that the first day is international, with the talks in English while talks on the second day will be held in Slovene. As opposed to the previous edition of the conference we have opted for a single track programme so that all the participants can attend all the talks, aiming to promote and foster closer collaboration among the researchers in language technologies and digital humanities. In addition, we have also introduced a poster session with 9 posters.

The editors would like to thank everyone who has contributed to the success of this conference, especially the invited lecturers and the authors of the papers for co-creating an inspiring conference programme, the Programme Committee for their dedicated reviews, the Organizing Committee for all the organizational efforts, the ession Chairs for their smooth and efficient management of the conference programme, the Technical Editors for preparing the online proceedings and the loyal sponsors for their selfless support of our activities.

Chapters

Preface

PDF
Too good to be true
Current approaches to author profiling

Malvina Nissim

PDF
Bringing Digital Humanities to the wider public
libraries as incubator for DH research results

Martijn Kleppe

PDF
A Comparison of Statistical and Neural Machine Translation for Slovene, Serbian and Croatian

Mihael Arčan

PDF
SETimes.SR – A Reference Training Corpus of Serbian

Vuk Batanovi´ć, Nikola Ljubešić, Tanja Samardžić

PDF
Artistic Visualizations and Beyond
A Study of Materializations of a Digital Database

Narvika Bovcon, Aleš Vaupotič

PDF
Opus-MontenegrinSubs 1.0
First electronic corpus of the Montenegrin language

Nikola Ljubešić, Petar Božović, Tomaž Erjavec, Jörg Tiedemann, Vojko Gorjanc

PDF
Zapis in prikaz starejših pesniških besedil ter njihovih variant v TEI

Tomaž Erjavec, Nina Ditmajer, Matija Ogrin

PDF
Zakaj ne z eno poizvedbo hkrati po različnih korpusih?
Troje korpusnih preverb pod primerjalnim drobnogledom

Helena Dobrovoljc, Urška Vranjek Ošlak

PDF
Frekvenčni seznami n-gramov v korpusih slovenskega jezika

Kaja Dobrovoljc

PDF
Razvoj smernic za predajo in arhiviranje kvalitativnih podatkov v Arhivu družboslovnih podatkov

Maja Dolinar, Janez Štebe, Sonja Bezjak

PDF
Prehod iz statističnega strojnega prevajanja na prevajanje z nevronskimi omrežji za jezikovni par slovenščina-angleščina

Gregor Donaj, Mirjam Sepesy Maučec

PDF
Analiza tvitov slovenskih korporativnih uporabnikov

Darja Fišer, Monika Kalin Golob

PDF
Citiranje jezikoslovnih podatkov v slovenskih znanstvenih objavah: stanje in priporočila

Darja Fišer, Tomaž Erjavec, Jakob Lenardič

PDF
Glagolske večbesedne enote v učnem korpusu ssj500k 2.1

Polona Gantar, Špela Arhar Holdt, Jaka Čibej, Taja Kuzman, Teja Kavčič

PDF
Towards Semantic Role Labeling in Slovene and Croatian

Nikola Ljubešić, Polona Gantar, Kristina Štrkalj Despot, Simon Krek

PDF
Zbirka primerov rabe vejice Vejica 1.3

Peter Holozan

PDF
Croatian Web Dictionary Mrežnik
One year later - What is different?

Lana Hudeček, Milica Mihaljević

PDF
Portuguese Corpora of the 18th century
Old Medicine texts for teaching and research activities

Maria José Bocorny Finatto, Paulo Quaresma, Maria Filomena Gonçalves

PDF
Interaktivna karta slovenskih narečnih besedil

Alenka Kavčič, Ivan Lovrić, Vera Smole

PDF
Učinkovit izračun frekvenčnih statistik za slovenske jezikovne korpuse

Simon Krek, Aleksander Ključevšek, Marko Robnik-Šikonja

PDF
Kolokacijski slovar sodobne slovenščine

Aniko Kovač, Maja Marković
A Rule-Based Syllabifier for Serbian

Aniko Kovač, Maja Marković

PDF
Debating Evil
Using Word Embeddings to Analyze Parliamentary Debates on War Criminals in The Netherlands

Milan M. van Lange, Ralf D. Futselaar

PDF
hr500k – A Reference Training Corpus of Croatian

Vuk Batanovi´ć, Nikola Ljubešić, Tomaž Erjavec, Željko Agić, Filip Klubička

PDF
The Parlameter corpus of contemporary Slovene parliamentary proceedings

Darja Fišer, Nikola Ljubešić, Tomaž Erjavec, Filip Dobranič

PDF
KAS-term and KAS-biterm
Datasets and baselines for monolingual and bilingual terminology extraction from academic writing

Darja Fišer, Nikola Ljubešić, Tomaž Erjavec

PDF
Strokovno-znanstvena slovenščina: besednovrstne in oblikoskladenjske značilnosti

Tomaž Erjavec, Nataša Logar

PDF
Word Selection in the Slovenian Sentence Matrix Test for Speech Audiometry

Tatjana Marvin, Jure Derganc, Samo Beguš, Saba Battelino

PDF
Korpusna analiza nestandardne stave vejice po uvajalnih prislovnih zvezah

Darja Fišer, Vojko Gorjanc, Eneja Osrajnik

PDF
Trajnost digitalnih izdaj
Uporaba statičnih spletnih strani na portalu Zgodovina Slovenije - SIstory

Andrej Pančur

PDF
Spregledana kulturna dediščina in uporaba digitalne raziskovalne infrastrukture za humanistiko v raziskavi Odlivanje smrti

Andrej Pančur, Alenka Pirman, Maruša Kocjančič

PDF
Analiza slovničnih napak v korpusu spisov učencev japonščine na osnovni ravni

Miha Pavlovič, Rena Ito

PDF
Building a corpus of the Croatian parliamentary debates using UDPipe open source NLP tools and Neo4j graph database for creation of social ontology model, text classification and extraction of semantic information

Benedikt Perak, Filip Rodik

PDF
Samopromocija na Instagramu
Primer predsednikovega profila

Dan Podjed, Ajda Pretnar

PDF
Data Mining Workspace Sensors
A New Approach to Anthropology

Dan Podjed, Ajda Pretnar

PDF
Crowdsourcing terminology: harnessing the potential of translator’s glossaries

Ivanka Rajh, Siniša Runjaić

PDF
Evaluation of Statistical Readability Measures on Slovene texts

Špela Arhar Holdt, Simon Krek, Marko Robnik-Šikonja, Tadej Škvorc, Senja Pollak

PDF
Exploring Finno-Ugric linguistics through solving IT problems

Tobias Weber, Jeremy Bradley

PDF
Teaching women writers with NEWW Virtual Research Environment

Narvika Bovcon, Katja Mihurko Poniž, Marie Nedregotten Sørbø, Viola Parente-Čapková, Amelia Sanz, Suzan van Dijk, Aleš Vaupotič

PDF
Odnosi do jezika v slovenski, hrvaški in srbski računalniško posredovani komunikaciji

Darja Fišer, Damjan Popič

PDF
Online database in Research of Correspondence of Franjo Ksaver Kuhač (1834-1911)

Sara Ries

PDF
Distant Reading for European Literary History. A COST Action

Katja Mihurko Poniž, Christof Schöch, Maciej Eder, Carolin Odebrecht, Mike Kestemont, Antonija Primorac, Justin Tonra, Catherine Kanellopoulou

PDF
Korpus in baza Gos Videolectures

Darinka Verdonik

PDF
Korpus tvitov slovenskih politikov Janes TwePo

Urška Bratoš

PDF
You, thou and thee
A statistical analysis of Shakespeare’s use of pronominal address terms

Isolde van Dorst

PDF
Primerjava luščilnikov terminologije Sketch Engine in CollTerm za znanstvena besedila

Klara Eva Kukovičič

PDF
K-means Clustering for POS Tagger Improvement

Gabi Rolih

PDF

Downloads

Download data is not yet available.

Downloads

PDF

Published

July 2, 2018

License

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

Details about this monograph

ISBN-13 (15)

978-961-06-0111-1

Date of first publication (11)

2018-10-01

How to Cite

Fišer, D., & Pančur, A. (Eds.). (2018). Proceedings of the conference on Language Technologies & Digital Humanities. University of Ljubljana Press. https://doi.org/10.4312/9789610601111

Download Citation

Proceedings of the conference on Language Technologies & Digital Humanities

Authors

Keywords:

Synopsis

Chapters

Downloads

Downloads

Published

Categories

License

Details about this monograph

ISBN-13 (15)

Date of first publication (11)

How to Cite

Language

Information