Authors: David L. Marvit, Yannis Labrou, John J. Sidorowich, B. Thomas Adler, Alex Gilman
DOI:
Keywords:
Abstract: In one embodiment, modeling topics includes accessing a corpus comprising documents that include words. Words of a document are selected as keywords of the document. The documents are clustered according to the keywords to yield clusters, where each cluster corresponds to a topic. A statistical distribution is generated for a cluster from the words of the documents of the cluster. A topic is modeled using the statistical distribution generated for the cluster corresponding to the topic.
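
The pipeline described in the abstract (select keywords, cluster documents by keyword, build one distribution per cluster) can be illustrated with a minimal Python sketch. The abstract does not say how keywords are chosen, how documents are clustered, or which distribution is used, so everything specific here is an assumption made for illustration: keywords are the top-k TF-IDF words, clustering is a greedy Jaccard-overlap grouping, and the distribution is a normalized word-frequency table. The names `select_keywords`, `cluster_by_keywords`, `topic_distribution`, and `model_topics` are hypothetical and do not come from the source.

```python
import math
from collections import Counter


def select_keywords(docs, k=2):
    """Pick each document's k highest TF-IDF words (assumed selection rule)."""
    n = len(docs)
    doc_freq = Counter(w for d in docs for w in set(d))
    keywords = []
    for d in docs:
        tf = Counter(d)
        score = {w: tf[w] * math.log(n / doc_freq[w]) for w in tf}
        # Sort by score, then alphabetically, so ties break deterministically.
        ranked = sorted(score, key=lambda w: (-score[w], w))
        keywords.append(set(ranked[:k]))
    return keywords


def cluster_by_keywords(keywords, threshold=0.3):
    """Greedy grouping (assumed): a document joins the first cluster whose
    pooled keywords reach the Jaccard threshold, else it starts a new one."""
    clusters = []  # each entry: (pooled keyword set, list of document indices)
    for i, kw in enumerate(keywords):
        for pooled, members in clusters:
            union = kw | pooled
            if union and len(kw & pooled) / len(union) >= threshold:
                members.append(i)
                pooled |= kw  # in-place update of the cluster's keyword pool
                break
        else:
            clusters.append((set(kw), [i]))
    return [members for _, members in clusters]


def topic_distribution(docs, members):
    """Normalized word frequencies over all documents of one cluster."""
    counts = Counter(w for i in members for w in docs[i])
    total = sum(counts.values())
    return {w: c / total for w, c in counts.items()}


def model_topics(docs, k=2, threshold=0.3):
    """Return one word distribution per cluster, i.e. one model per topic."""
    keywords = select_keywords(docs, k)
    clusters = cluster_by_keywords(keywords, threshold)
    return [topic_distribution(docs, m) for m in clusters]


if __name__ == "__main__":
    # Toy corpus: each document is a list of already-tokenized words.
    corpus = [
        ["cat", "dog", "pet", "cat"],
        ["dog", "cat", "vet", "pet"],
        ["stock", "market", "trade", "stock"],
        ["market", "stock", "bank", "trade"],
    ]
    for topic in model_topics(corpus):
        print(sorted(topic.items(), key=lambda item: -item[1]))
```

On this toy corpus the sketch groups the two pet documents into one cluster and the two finance documents into another, so it returns two word distributions. A real system would likely replace the greedy grouping with a standard clustering method and smooth the distributions; those choices are outside what the abstract specifies.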