Systems and methods for phrase clustering

Authors: Indrajit Bhattacharya, Shantanu Ravindra Godbole, Akshit Sharma

Abstract: Systems and associated methods for enhanced concept understanding in large document collections through phrase clustering are described. Embodiments take as input an initial set of phrases and estimate centroids using a clustering process. Embodiments then generate new phrases around each of the current centroids using the current phrases. These new phrases are added to the current set, and the clustering process is iterated. Upon convergence, embodiments finalize the clusters based on phrases of any given length.
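
The loop described in the abstract (cluster, generate new phrases around each centroid, add them, repeat until nothing new appears) can be sketched roughly as follows. This is only an illustrative sketch, not the patented method: the abstract does not say how phrases are represented, compared, or extended, so the bag-of-words vectors, cosine similarity, k-means-style assignment, and the rule of extending a phrase by one adjacent corpus word are all assumptions, and every function name here is hypothetical.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """All contiguous n-word tuples in a token list."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def cosine(a, b):
    """Cosine similarity between two bag-of-words Counters."""
    dot = sum(a[w] * b[w] for w in a if w in b)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0


def assign(phrases, centroids):
    """Put each phrase in the cluster of its most similar centroid.
    Ties go to the lowest cluster index."""
    clusters = [[] for _ in centroids]
    for p in phrases:
        vec = Counter(p)
        best = max(range(len(centroids)), key=lambda k: cosine(vec, centroids[k]))
        clusters[best].append(p)
    return clusters


def estimate_centroids(clusters, previous):
    """Centroid = summed word counts of the members (assumed representation).
    An empty cluster keeps its previous centroid."""
    centroids = []
    for members, prev in zip(clusters, previous):
        if members:
            c = Counter()
            for p in members:
                c.update(p)
            centroids.append(c)
        else:
            centroids.append(prev)
    return centroids


def expand(cluster, docs, max_len):
    """Assumed generation rule: corpus n-grams that extend a cluster
    phrase by one word on the left or on the right."""
    found = set()
    for p in cluster:
        n = len(p) + 1
        if n > max_len:
            continue
        for doc in docs:
            for g in ngrams(doc, n):
                if g[:-1] == p or g[1:] == p:
                    found.add(g)
    return found


def cluster_phrases(seeds, docs, k, max_len=3, max_iter=10):
    phrases = sorted(set(seeds))
    centroids = [Counter(p) for p in phrases[:k]]  # first k seeds as initial centroids
    for _ in range(max_iter):
        clusters = assign(phrases, centroids)
        centroids = estimate_centroids(clusters, centroids)
        new = set()
        for members in clusters:
            new |= expand(members, docs, max_len)
        new -= set(phrases)
        if not new:  # convergence: no unseen phrases were generated
            break
        phrases = sorted(set(phrases) | new)
    return assign(phrases, centroids)  # final clusters over phrases of all lengths


docs = [
    ["network", "connection", "error"],
    ["slow", "network", "connection"],
    ["billing", "dispute", "refund"],
    ["billing", "dispute", "form"],
]
clusters = cluster_phrases([("network",), ("billing",)], docs, k=2)
for members in clusters:
    print(members)
```

Starting from two single-word seeds, the sketch grows each cluster with longer phrases drawn from the toy corpus and stops once an iteration produces no unseen phrase. A real system would likely use richer phrase representations (for example, context vectors) and a more careful convergence test.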

References (14)
Balachander Krishnamurthy, Guy Jacobson, Divesh Srivastava, Method of clustering electronic documents in response to a search query (1997)
Matthew S. Sommer, Kevin B. Thompson, Information exploration systems and methods (2006)
Emily Pitler, Shane Bergsma, David Yarowsky, Satoshi Sekine, Kailash Patil, Kenneth Ward Church, Sushant Narsale, Rachel Lathbury, Kapil Dalwani, Heng Ji, Vikram Rao, Dekang Lin, New Tools for Web-Scale N-grams. Language Resources and Evaluation (2010)