作者: Kristina Hmeljak Sangawa , Yoshiko Kawamura , Tomaz Erjavec
DOI:
关键词: Linguistics 、 Word usage 、 Computer science 、 Word lists by frequency 、 Word (computer architecture) 、 Corpus linguistics 、 Artificial intelligence 、 Second language 、 Natural language processing
摘要: Examples are an important source of information on word usage for language learners, but existing reference sources Japanese as a second limited. This paper describes two projects the automated collection examples. extracted from ad-hoc Japanese-Slovene parallel corpus were included into jaSlo, learners' dictionary, and examples monolingual web-harvested 400 million selected to be used supplementary Chuta, multilingualized dictionary learners language.