Authors: Indrajit Bhattacharya, Shantanu Ravindra Godbole, Akshit Sharma
DOI:
Keywords:
Abstract: Systems and associated methods for enhanced concept understanding in large document collections through phrase clustering are described. Embodiments take as input an initial set of phrases and estimate centroids using a clustering process. Embodiments then generate new phrases around each of the current phrases. These new phrases are added to the current phrase set, and the clustering process is iterated. Upon convergence, embodiments finalize the clusters based on phrases of any given length.
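
The loop summarized in the abstract (cluster, generate new phrases, add them to the set, repeat until nothing changes) can be illustrated with a minimal Python sketch. Everything concrete here is an assumption made for illustration and is not taken from the described embodiments: the bag-of-words vectors, cosine similarity, the k-means-style update, the reading of "around" as "similar to a cluster centroid", and the hypothetical names `cluster_phrases`, `threshold`, and `max_len`.

```python
import math
from collections import Counter


def vectorize(phrase):
    """Bag-of-words vector for a phrase (an assumed, simplistic representation)."""
    return Counter(phrase.split())


def cosine(a, b):
    """Cosine similarity between two sparse term vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def mean_vector(phrases):
    """Centroid of a non-empty list of phrases."""
    total = Counter()
    for p in phrases:
        total.update(vectorize(p))
    return Counter({t: c / len(phrases) for t, c in total.items()})


def assign(phrases, centroids):
    """Put each phrase in the cluster of its most similar centroid."""
    clusters = [[] for _ in centroids]
    for p in phrases:
        sims = [cosine(vectorize(p), c) for c in centroids]
        clusters[sims.index(max(sims))].append(p)
    return clusters


def kmeans(phrases, centroids, rounds=10):
    """A fixed number of k-means-style updates; empty clusters keep their centroid."""
    for _ in range(rounds):
        clusters = assign(phrases, centroids)
        centroids = [mean_vector(c) if c else old
                     for c, old in zip(clusters, centroids)]
    return centroids, assign(phrases, centroids)


def ngrams(documents, max_len):
    """All word n-grams of length 1..max_len found in the documents."""
    found = set()
    for doc in documents:
        tokens = doc.split()
        for n in range(1, max_len + 1):
            for i in range(len(tokens) - n + 1):
                found.add(" ".join(tokens[i:i + n]))
    return found


def cluster_phrases(documents, seed_phrases, k=2, max_len=2,
                    threshold=0.5, max_iters=20):
    """Iterate clustering and phrase generation until no new phrase is added."""
    phrases = list(dict.fromkeys(seed_phrases))        # de-duplicate, keep order
    centroids = [vectorize(p) for p in phrases[:k]]    # assumed initialization
    candidates = ngrams(documents, max_len)
    for _ in range(max_iters):
        centroids, _ = kmeans(phrases, centroids)
        known = set(phrases)
        new = sorted(c for c in candidates - known
                     if any(cosine(vectorize(c), cen) >= threshold
                            for cen in centroids))
        if not new:                                    # convergence
            break
        phrases.extend(new)
    return kmeans(phrases, centroids)[1]               # final clusters


docs = ["battery life is short", "battery drains fast",
        "screen is cracked", "screen flickers often"]
for cluster in cluster_phrases(docs, ["battery", "screen"]):
    print(cluster)
```

The `max_len` parameter stands in for the abstract's "phrases of any given length": raising it lets longer n-grams enter the candidate pool. The similarity threshold is what stops the phrase set from growing indefinitely in this sketch; a different stopping rule would be equally plausible.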