Clustering rankings in the fourier domain

作者: Stéphan Clémençon , Romaric Gaudel , Jérémie Jakubowicz

DOI: 10.1007/978-3-642-23780-5_32

关键词: AlgorithmSymmetric groupSet (abstract data type)CardinalityDiscrete mathematicsCluster analysisFourier transformRank (computer programming)MathematicsFeature selectionUnsupervised learning

摘要: It is the purpose of this paper to introduce a novel approach clustering rank data on set possibly large cardinality n ∈ N*, relying upon Fourier representation functions defined symmetric group Sn. In present setup, covering wide variety practical situations, are viewed as distributions Cluster analysis aims at segmenting into homogeneous subgroups, hopefully very dissimilar in certain sense. Whereas considering dissimilarity measures/distances between non commutative Sn, coordinate manner by viewing it embedded [0, 1]n! for instance, hardly yields interpretable results and leads face obvious computational issues, evaluating closeness groups permutations domain may be much easier contrast. Indeed, few well-chosen (matrix) coefficients permit approximate efficiently two Sn well their degree dissimilarity, while describing global properties an fashion. Following footsteps recent advances automatic feature selection context unsupervised learning, we propose cast task rankings terms optimization criterion that can expressed simple manner. The effectiveness method proposed illustrated numerical experiments based artificial real data.

参考文章(33)
Pierre-Gilles Lemarié-Rieusset, Jean-Pierre Kahane, Fourier series and wavelets ,(1995)
Robert Tibshirani, Trevor Hastie, Jerome H. Friedman, The Elements of Statistical Learning ,(2001)
Francis D. Murnaghan, The theory of group representations ,(1938)
Stéphan Clémençon, Jérémie Jakubowicz, Kantorovich distances between rankings with applications to rank aggregation european conference on machine learning. pp. 248- 263 ,(2010) , 10.1007/978-3-642-15880-3_22
Risi Kondor, Marconi S. Barbosa, Ranking with Kernels in Fourier space. conference on learning theory. pp. 451- 463 ,(2010)
Jean Pierre Serre, Algebraic Groups and Class Fields ,(1987)
Risi Kondor, Tony Jebara, Andrew G. Howard, Multi-object tracking with representations of the symmetric group. international conference on artificial intelligence and statistics. pp. 211- 218 ,(2007)
Michael A. Fligner, Joseph S. Verducci, Multistage Ranking Models Journal of the American Statistical Association. ,vol. 83, pp. 892- 901 ,(1988) , 10.1080/01621459.1988.10478679