Authors: Tomáš Brychcín, Pavel Král
DOI: 10.1007/978-3-319-13647-9_8
Keywords:
Abstract: … The word clusters are created using the repeated bisection algorithm. The document is then represented as a bag of clusters, and we use a tf-idf weighting scheme for each cluster to create …
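
The bag-of-clusters representation mentioned in the abstract can be illustrated with a minimal sketch. The names `WORD_TO_CLUSTER`, `bag_of_clusters`, and `tfidf_vectors` are hypothetical, the cluster assignment is hand-written rather than produced by repeated bisection, and the tf-idf variant (relative term frequency times log inverse document frequency) is an assumption, since the snippet does not state which formula the authors use.

```python
import math
from collections import Counter

# Hypothetical word-to-cluster assignment. In the paper the clusters come from
# the repeated bisection algorithm, which is not reproduced here.
WORD_TO_CLUSTER = {
    "goal": 0, "match": 0, "team": 0,
    "vote": 1, "election": 1, "party": 1,
}


def bag_of_clusters(tokens, word_to_cluster):
    """Count how often each cluster id occurs in a tokenized document."""
    return Counter(word_to_cluster[t] for t in tokens if t in word_to_cluster)


def tfidf_vectors(docs, word_to_cluster):
    """Return one {cluster_id: tf-idf weight} dict per document.

    Assumed weighting: tf = count / total cluster tokens in the document,
    idf = log(N / df), where df is the number of documents containing the cluster.
    """
    bags = [bag_of_clusters(d, word_to_cluster) for d in docs]
    n_docs = len(bags)
    # Document frequency: each bag contributes each of its cluster ids once.
    df = Counter(c for bag in bags for c in bag)
    vectors = []
    for bag in bags:
        total = sum(bag.values())
        vectors.append({
            c: (count / total) * math.log(n_docs / df[c])
            for c, count in bag.items()
        })
    return vectors


# Toy corpus, invented for illustration only.
docs = [
    ["team", "goal", "match"],
    ["party", "vote", "team"],
    ["election", "vote"],
]
vectors = tfidf_vectors(docs, WORD_TO_CLUSTER)
```

Words missing from the cluster map are simply skipped here; how the original method handles out-of-vocabulary words is not stated in the snippet.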