Text mining apparatus and associated methods

作者: Yunbo Cao , Hang Li , Olivier Ribet , Benjamin Martin

DOI:

关键词:

摘要: A method for extracting key terms and associated use in text mining is provided. The includes receiving unstructured documents, such as emails over a customer service system. Term candidates are extracted based on identifying consecutive word strings satisfying context independency threshold. weighted using mutual information to generate list of terms. then recounted. Terms Chi-square values. Associated can be used retrieval. user interface personalized with individual profiles.

参考文章(16)
Martin Rajman, Romaric Besançon, Text Mining - Knowledge extraction from unstructured textual data Studies in Classification, Data Analysis, and Knowledge Organization. pp. 473- 480 ,(1998) , 10.1007/978-3-642-72253-0_64
Lee-Feng Chien, PAT-tree-based adaptive keyphrase extraction for intelligent Chinese information retrieval Information Processing and Management. ,vol. 35, pp. 501- 521 ,(1999)
Davide Turcato, Gordon Tisher, Daniel Clifford Fass, Janine Toole, James Devlan Nicholson, Ryan Yeske, Andrej Dobos, Magnus Byne, Milan Mosny, Frederick Paul Popowich, A method and system for concept generation and management ,(2004)
Hinrich Schuetze, James E. Pitkow, Jun Li, Ed H. Chi, Peter L. Pirolli, System and method for clustering data objects in a collection ,(1999)
Ullas Gargi, Francine R. Chen, Hinrich Schuetze, James E. Pitkow, Jun Li, Ed H. Chi, Peter L. Pirolli, System and method for quantitatively representing data objects in vector space ,(1999)
Lubos Pochman, Carsten Tusk, Navdeep S. Dhillon, Thien Nguyen, Krzysztof Koperski, Alejandro Murua, Jisheng Liang, Giovanni B. Marchisio, Method and system for enhanced data searching ,(2002)