A method, apparatus, and system for clustering and classification

作者: Seth Patinkin

DOI:

关键词:

摘要: The invention provides a method, apparatus and system for classification clustering electronic data streams such as email, images sound files identification, sorting efficient storage. inventive systems disclose labeling document belonging to predefined class though computer methods that comprise the steps of identifying an stream using one or more learning machines comparing outputs from determine label associate with data. method further utilizes in combination hashing schemes cluster classify documents. In embodiment hash apparatuses taxonomize clusters. yet another embodiment, clusters documents utilize geometric contain corpus without overhead search

参考文章(100)
Mark Alexander Shand, Andrei Z. Broder, Michael David Mitzenmacher, Laurent Rene Moll, Method for determining a random permutation of variables by applying a test function ,(1998)
Piotr Indyk, Nearest Neighbors in High-Dimensional Spaces Handbook of Discrete and Computational Geometry, Second Edition. pp. 877- 892 ,(2004) , 10.1201/9781420035315.CH39
D Gabor, INFORMATION THEORY IN ELECTRON MICROSCOPY. Laboratory Investigation. ,vol. 14, pp. 801- 807 ,(1965)
Stefano Paraboschi, Sabrina De Capitani di Vimercati, Pierangela Samarati, Ernesto Damiani, An Open Digest-based Technique for Spam Detection. iasted international conference on parallel and distributed computing and systems. pp. 559- 564 ,(2004)
S. K. M. Wong, Vijay V. Raghavan, Vector space model of information retrieval: a reevaluation international acm sigir conference on research and development in information retrieval. pp. 167- 185 ,(1984) , 10.5555/636805.636816