NLP Web Services for Slovene and English: Morphosyntactic Tagging, Lemmatisation and Definition Extraction

Anže Vavpetič , Nejc Trdin , Senja Pollak , Tomaž Erjavec
Informatica (lithuanian Academy of Sciences) 36 ( 4) 441 -449

2012
Japanese-Slovene learner's dictionary jaSlo 3.1

Irena Srdanović , Kristina Hmeljak , Tomaž Erjavec
Faculty of Arts, University of Ljubljana

2016
Slovenian parliamentary corpus SlovParl 1.0

Andrej Pančur , Mojca Šorn , Tomaž Erjavec
Institute of Contemporary History

2016
Japanese web corpus with difficulty levels jpWaC-L 1.0

Kristina Hmeljak Sangawa , Yoshiko Kawamura , Tomaž Erjavec
Jožef Stefan Institute

2008
MULTEXT-East "1984" document corpus 4.0

Tomaž Erjavec , Ştefan Bruda , Ludmila Dimitrova , Nancy Ide
Jožef Stefan Institute

2010
Tweet comma corpus Janes-Vejica 1.0

Darja Fišer , Damjan Popič , Teja Kavčič , Polona Logar
Jožef Stefan Institute

2017
CMC training corpus Janes-Syn 1.0

Darja Fišer , Špela Arhar Holdt , Tomaž Erjavec
Jožef Stefan Institute

2017
Closing a gap in the language resources landscape : Groundwork and best practices from projects on computer-mediated communication in four European countries.

Thierry Chanier , Angelika Storrer , Céline Poudat , Isabella Chiari
Selected papers from the CLARIN Annual Conference 2016, Aix-en-Provence, 26–28 October 2016, CLARIN Common Language Resources and Technology Infrastructure 136 ( 136) 1 -19

2017
CMC shortening corpus Janes-Kratko 1.0

Darja Fišer , Tomaž Erjavec , Teja Goli , Eneja Osrajnik
Jožef Stefan Institute

2017
Forum corpus Janes-Forum 1.0

Darja Fišer , Tomaž Erjavec , Nikola Ljubešić
Jožef Stefan Institute

2017
Twitter corpus Janes-Tweet 1.0

Darja Fišer , Tomaž Erjavec , Nikola Ljubešić
Jožef Stefan Institute

2017
Blog post and comment corpus Janes-Blog 1.0

Darja Fišer , Tomaž Erjavec , Nikola Ljubešić
Jožef Stefan Institute

2017
Wikipedia talk corpus Janes-Wiki 1.0

Darja Fišer , Tomaž Erjavec , Nikola Ljubešić
Jožef Stefan Institute

2017
News comment corpus Janes-News 1.0

Darja Fišer , Tomaž Erjavec , Nikola Ljubešić
Jožef Stefan Institute

2017
Tweet code-switching corpus Janes-Preklop 1.0

Darja Fišer , Tomaž Erjavec , Špela Reher
Jožef Stefan Institute

2017
MULTEXT-East Resources for Serbian

Cvetana Krstev , Duško Vitas , Tomaž Erjavec
Zbornik 7. mednarodne multikonference Informacijska druzba IS 2004 Jezikovne tehnologije 9-15 Oktober 2004, Ljubljana, Slovenija, 2004

8
2004
Dataset of normalised Slovene text KonvNormSl 1.0

Darja Fišer , Katja Zupan , Tomaž Erjavec , Nikola Ljubešić
Jožef Stefan Institute

1
2016
Bilingual terminology extraction dataset KAS-biterm 1.0

Darja Fišer , Tomaž Erjavec , Maja Bitenc , Nikola Ljubešić
Jožef Stefan Institute

2018
Training corpus SETimes.SR 1.0

Nikola Ljubešić , Vuk Batanović , Tanja Samardžić , Tomaž Erjavec
Regional Linguistic Data Initiative Centre ReLDI

1
2018