KRES corpus n-grams 1.0

作者: Kaja Dobrovoljc

DOI:

关键词:

摘要: This is a collection of n-grams extracted from the KRES corpus written Slovene. In addition to separate lists for tokens and their attributes (morphosyntacic tag, lemma), an adjusted frequency list with statistical substring reduction has also been added (as described in O'Donnell 2011). Only within sentences have counted.

参考文章(0)