Proceedings of the conference on Language Technologies & Digital Humanities
Keywords:
Language Technologies, Digital Humanities
Synopsis
With this year's conference we are celebrating the 20th anniversary of the first »Language technologies« conference, which took place in 1998 in Cankarjev dom, Ljubljana, and was organized by Tomaž Erjavec, Vojko Gorjanc, Jerneja Žganec Gros and Anica Rant. The topics of the first conference were the development and application of language technologies for Slovene and directions for the future. The 26 papers presented dealt with speech technologies and phonology, computer-assisted translation and teaching, corpora, encoding standards for language data and searching for information on the internet. The conference was followed by a round table discussion, the direct result of which was the establishment of the Slovenian Language Technologies Society, which has since been the main initiator and organizer of all subsequent editions of the conference. Together with the Centre for Language Resources and Technologies of the University of Ljubljana (CJVT), the Faculty of Electrical Engineering of the University of Ljubljana and the research infrastructures CLARIN.SI and DARIAH-SI, the Society is also organizing this year's conference, held on 20-21 September 2018 at the Faculty of Electrical Engineering. In this 11th edition, following the successful expansion of the conference programme to Digital Humanities in 2016, we have retained the focus on the integration of the two disciplines while aiming to position the conference as an important meeting hub for researchers in the region.
This year, 47 papers will be presented: 2 talks by invited lecturers, 36 regular full papers, 5 abstracts and 4 student papers. Every paper was reviewed by 3 reviewers. 21 papers were submitted in Slovene and 26 in English. The accepted papers have 92 authors in total. Over half of them are Slovene, 10% are from Croatia and the rest come from as many as 19 different countries. This is why the conference programme was designed so that the first day is international, with talks in English, while talks on the second day will be held in Slovene. Unlike the previous edition of the conference, we have opted for a single-track programme so that all participants can attend all the talks, with the aim of fostering closer collaboration among researchers in language technologies and digital humanities. In addition, we have introduced a poster session with 9 posters.
The editors would like to thank everyone who has contributed to the success of this conference, especially the invited lecturers and the authors of the papers for co-creating an inspiring conference programme, the Programme Committee for their dedicated reviews, the Organizing Committee for all their organizational efforts, the Session Chairs for their smooth and efficient management of the conference programme, the Technical Editors for preparing the online proceedings and our loyal sponsors for their selfless support of our activities.
Chapters
-
Preface
-
Too good to be true: Current approaches to author profiling
-
Bringing Digital Humanities to the wider public: libraries as incubator for DH research results
-
A Comparison of Statistical and Neural Machine Translation for Slovene, Serbian and Croatian
-
SETimes.SR – A Reference Training Corpus of Serbian
-
Artistic Visualizations and Beyond: A Study of Materializations of a Digital Database
-
Opus-MontenegrinSubs 1.0: First electronic corpus of the Montenegrin language
-
Zapis in prikaz starejših pesniških besedil ter njihovih variant v TEI
-
Zakaj ne z eno poizvedbo hkrati po različnih korpusih? Troje korpusnih preverb pod primerjalnim drobnogledom
-
Frekvenčni seznami n-gramov v korpusih slovenskega jezika
-
Razvoj smernic za predajo in arhiviranje kvalitativnih podatkov v Arhivu družboslovnih podatkov
-
Prehod iz statističnega strojnega prevajanja na prevajanje z nevronskimi omrežji za jezikovni par slovenščina-angleščina
-
Analiza tvitov slovenskih korporativnih uporabnikov
-
Citiranje jezikoslovnih podatkov v slovenskih znanstvenih objavah: stanje in priporočila
-
Glagolske večbesedne enote v učnem korpusu ssj500k 2.1
-
Towards Semantic Role Labeling in Slovene and Croatian
-
Zbirka primerov rabe vejice Vejica 1.3
-
Croatian Web Dictionary Mrežnik: One year later - What is different?
-
Portuguese Corpora of the 18th century: Old Medicine texts for teaching and research activities
-
Interaktivna karta slovenskih narečnih besedil
-
Učinkovit izračun frekvenčnih statistik za slovenske jezikovne korpuse
-
Kolokacijski slovar sodobne slovenščine
-
A Rule-Based Syllabifier for Serbian
-
Debating Evil: Using Word Embeddings to Analyze Parliamentary Debates on War Criminals in The Netherlands
-
hr500k – A Reference Training Corpus of Croatian
-
The Parlameter corpus of contemporary Slovene parliamentary proceedings
-
KAS-term and KAS-biterm: Datasets and baselines for monolingual and bilingual terminology extraction from academic writing
-
Strokovno-znanstvena slovenščina: besednovrstne in oblikoskladenjske značilnosti
-
Word Selection in the Slovenian Sentence Matrix Test for Speech Audiometry
-
Korpusna analiza nestandardne stave vejice po uvajalnih prislovnih zvezah
-
Trajnost digitalnih izdaj: Uporaba statičnih spletnih strani na portalu Zgodovina Slovenije - SIstory
-
Spregledana kulturna dediščina in uporaba digitalne raziskovalne infrastrukture za humanistiko v raziskavi Odlivanje smrti
-
Analiza slovničnih napak v korpusu spisov učencev japonščine na osnovni ravni
-
Building a corpus of the Croatian parliamentary debates using UDPipe open source NLP tools and Neo4j graph database for creation of social ontology model, text classification and extraction of semantic information
-
Samopromocija na Instagramu: Primer predsednikovega profila
-
Data Mining Workspace Sensors: A New Approach to Anthropology
-
Crowdsourcing terminology: harnessing the potential of translator’s glossaries
-
Evaluation of Statistical Readability Measures on Slovene texts
-
Exploring Finno-Ugric linguistics through solving IT problems
-
Teaching women writers with NEWW Virtual Research Environment
-
Odnosi do jezika v slovenski, hrvaški in srbski računalniško posredovani komunikaciji
-
Online database in Research of Correspondence of Franjo Ksaver Kuhač (1834-1911)
-
Distant Reading for European Literary History. A COST Action
-
Korpus in baza Gos Videolectures
-
Korpus tvitov slovenskih politikov Janes TwePo
-
You, thou and thee: A statistical analysis of Shakespeare’s use of pronominal address terms
-
Primerjava luščilnikov terminologije Sketch Engine in CollTerm za znanstvena besedila
-
K-means Clustering for POS Tagger Improvement
License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.