Automated collection of Japanese word usage examples from a parallel and a monolingual corpus

作者: Kristina Hmeljak Sangawa , Yoshiko Kawamura , Tomaz Erjavec

DOI:

关键词: LinguisticsWord usageComputer scienceWord lists by frequencyWord (computer architecture)Corpus linguisticsArtificial intelligenceSecond languageNatural language processing

摘要: Examples are an important source of information on word usage for language learners, but existing reference sources Japanese as a second limited. This paper describes two projects the automated collection examples. extracted from ad-hoc Japanese-Slovene parallel corpus were included into jaSlo, learners' dictionary, and examples monolingual web-harvested 400 million selected to be used supplementary Chuta, multilingualized dictionary learners language.

参考文章(9)
B. T. S. Atkins, Michael Rundell, The Oxford Guide to Practical Lexicography ,(2008)
Kazuo Otsubo, The Japanese Language Proficiency Test 日本語教育. ,vol. 86, ,(1995)
Adam Kilgarriff, Pavel Rychlý, Pavel Smrž, David Tugwell, The Sketch Engine Proceedings of the Corpus Linguistics Conference 2009 (CL2009),, 2009, pág. 177. pp. 105- 116 ,(2004)
Camelia Ignat, Bruno Pouliquen, Ralf Steinberger, Toma Erjavec, Massive multi lingual corpus compilation: Acquis Communautaire and totale Archives of Control Sciences. ,vol. 15, pp. 529- 540 ,(2005)
Irena Srdanovic Erjavec, Tomaz Erjavec, Adam Kilgarriff, A Web Corpus and Word Sketches for Japanese Journal of Information Processing. ,vol. 15, pp. 137- 159 ,(2008) , 10.5715/JNLP.15.2_137
Adam Kilgarriff, Pavel Rychlý, Milos Husak, Michael Rundell, Katy McAdam, GDEX: Automatically Finding Good Dictionary Examples in a Corpus Proceedings of the XIII EURALEX International Congress (Barcelona, 15-19 July 2008), 2008, ISBN 978-84-96742-67-3, págs. 425-432. pp. 425- 432 ,(2008)
Kristina Hmeljak Sangawa, Irena Srdanovic Erjavec, Tomaz Erjavec, JaSlo, a Japanese-Slovene Learners’ Dictionary: Methods for Dictionary Enhancement Atti del XII Congresso Internazionale di Lessicografia: Torino, 6-9 settembre 2006, Vol. 1, 2006, ISBN 88-7694-918-6, págs. 611-616. pp. 611- 616 ,(2006)
G Williams, S. Vessier, Proceedings of the Eleventh EURALEX International Congress Université de Bretagne Sud. ,(2004)