Representing Points in Many Dimensions by Trees and Castles

Authors: B. Kleiner, J. A. Hartigan

DOI: 10.1080/01621459.1981.10477638

Keywords: Combinatorics, Mathematics, Tree (set theory), Outlier, Symbol (chemistry), Point (geometry), Discrete mathematics, Structure (category theory), Cluster analysis, Hierarchical clustering, Matching (graph theory)

Abstract: A number of points in k dimensions are displayed by associating with each point a symbol: a drawing of a tree or a castle. All symbols have the same structure, derived from a hierarchical clustering algorithm applied to the variables (dimensions) over all points, but their parts are coded according to the coordinates of each individual point. Trees and castles show general size effects, the change of whole clusters of variables from point to point, trends, and outliers. They are especially appropriate for evaluating the clustering of variables and for observing clusters of points. Their major advantage over earlier attempts to represent multivariate observations (such as profiles, stars, faces, boxes, and Andrews's curves) lies in their matching of relationships between variables to relationships between features of the representing symbol. Several examples are given, including one with 48 variables.
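
The two steps described in the abstract (one shared structure obtained by clustering the variables, then per-point coding of that structure) can be sketched roughly as follows. This is only an illustrative sketch, not the authors' procedure: the abstract does not name the clustering method or the distance, so average linkage on 1 - |correlation| is an assumption, and the function names `correlation`, `cluster_variables`, and `code_point` are hypothetical. The drawing of the tree or castle itself is not attempted.

```python
from itertools import combinations
from statistics import mean, pstdev


def correlation(xs, ys):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    mx, my = mean(xs), mean(ys)
    sx, sy = pstdev(xs), pstdev(ys)
    if sx == 0 or sy == 0:
        return 0.0
    return mean((x - mx) * (y - my) for x, y in zip(xs, ys)) / (sx * sy)


def cluster_variables(points):
    """Cluster the variables (columns) of `points` into one nested tuple of column indices.

    Assumption: average linkage with distance 1 - |correlation|;
    the abstract does not specify either choice.
    """
    columns = list(zip(*points))
    k = len(columns)
    dist = {
        frozenset((i, j)): 1.0 - abs(correlation(columns[i], columns[j]))
        for i, j in combinations(range(k), 2)
    }
    # Each cluster is a pair: (nested tree, tuple of member column indices).
    clusters = [(i, (i,)) for i in range(k)]

    def linkage(a, b):
        # Average pairwise distance between the members of two clusters.
        return mean(dist[frozenset((i, j))] for i in a[1] for j in b[1])

    while len(clusters) > 1:
        a, b = min(combinations(clusters, 2), key=lambda pair: linkage(*pair))
        clusters.remove(a)
        clusters.remove(b)
        clusters.append(((a[0], b[0]), a[1] + b[1]))
    return clusters[0][0]


def code_point(tree, point):
    """Fill the shared structure with one point's coordinates (leaf index -> value)."""
    if isinstance(tree, tuple):
        return tuple(code_point(child, point) for child in tree)
    return point[tree]


# Hypothetical data: variables 0 and 1 move together, variable 2 does not.
points = [(1, 2, 5), (2, 4, 1), (3, 6, 4), (4, 8, 2)]
tree = cluster_variables(points)                 # same structure for every point
symbols = [code_point(tree, p) for p in points]  # one coded symbol per point
```

In this sketch every point receives the same nested structure, so closely related variables sit next to each other in every symbol, which is the matching property the abstract emphasizes; only the values at the leaves differ from point to point.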
