Authors: Pradeep Ravikumar, Martin J. Wainwright, Garvesh Raskutti, Bin Yu
DOI: 10.1214/11-EJS631
Keywords: Norm (mathematics), Estimator, Multivariate random variable, Combinatorics, Multivariate normal distribution, Covariance, Operator norm, Mathematics, Estimation of covariance matrices, Covariance matrix, Statistics
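Abstract: Given i.i.d. observations of a random vector $X \in \mathbb{R}^p$, we study the problem of estimating both its covariance matrix $\Sigma^*$ and its inverse covariance or concentration matrix $\Theta^* = (\Sigma^*)^{-1}$. We estimate $\Theta^*$ by minimizing an $\ell_1$-penalized log-determinant Bregman divergence; in the multivariate Gaussian case, this approach corresponds to $\ell_1$-penalized maximum likelihood, and the structure of $\Theta^*$ is specified by the graph of an associated Gaussian Markov random field. We analyze the performance of this estimator under high-dimensional scaling, in which the number of nodes in the graph $p$, the number of edges $s$, and the maximum node degree $d$ are allowed to grow as a function of the sample size $n$. In addition to the parameters $(p, s, d)$, our analysis identifies other key quantities that control rates: (a) the $\ell_\infty$-operator norm of the true covariance matrix $\Sigma^*$; (b) the $\ell_\infty$-operator norm of the submatrix $\Gamma^*_{SS}$, where $S$ indexes the graph edges and $\Gamma^* = (\Theta^*)^{-1} \otimes (\Theta^*)^{-1}$; (c) a mutual incoherence or irrepresentability measure on the matrix $\Gamma^*$; and (d) the rate of decay $1/f(n, \delta)$ of the probabilities $\{|\hat{\Sigma}^n_{ij} - \Sigma^*_{ij}| > \delta\}$, where $\hat{\Sigma}^n$ is the sample covariance based on $n$ samples. Our first result establishes consistency of our estimate $\hat{\Theta}$ in the elementwise maximum-norm. This in turn allows us to derive convergence rates in Frobenius and spectral norms, with improvements upon existing results for graphs with maximum node degrees $d = o(\sqrt{s})$. In our second result, we show that, with probability converging to one, the estimate $\hat{\Theta}$ correctly specifies the zero pattern of the concentration matrix $\Theta^*$. We illustrate our theoretical results via simulations for various graphs and problem parameters, showing good correspondences between the theoretical predictions and behavior in simulations.

1. Introduction. The area of high-dimensional statistics deals with estimation in the "large $p$, small $n$" setting, where $p$ and $n$ correspond, respectively, to the dimensionality of the data and the sample size. Such high-dimensional problems arise in a variety of applications, among them remote sensing, computational biology and natural language processing, where the model dimension may be comparable to or substantially larger than the sample size. It is well-known that such high-dimensional scaling can lead to dramatic breakdowns in many classical procedures. In the absence of additional model assumptions, it is frequently impossible to obtain consistent procedures when $p \gg n$. Accordingly, an active line of statistical research is based on imposing various restrictions on the model (for instance, sparsity, manifold structure, or graphical structure) and then studying the scaling behavior of different estimators as a function of the sample size $n$, the ambient dimension $p$, and additional parameters related to these structural assumptions.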
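For orientation, the $\ell_1$-penalized log-determinant estimator described in the abstract can be sketched as the following convex program. The symbols $\lambda_n$ (a regularization weight) and $\|\cdot\|_{1,\mathrm{off}}$ (the off-diagonal $\ell_1$ norm) are notation assumed here for illustration; the abstract itself does not name them.

```latex
% Sketch of the l1-penalized log-determinant program (notation assumed):
%   \hat{\Sigma}^n : sample covariance from n samples
%   \lambda_n > 0  : regularization weight (assumed symbol)
\hat{\Theta} \;=\; \arg\min_{\Theta \succ 0}
  \Big\{ \operatorname{tr}\!\big(\Theta \hat{\Sigma}^n\big)
         \;-\; \log\det \Theta
         \;+\; \lambda_n \,\|\Theta\|_{1,\mathrm{off}} \Big\},
\qquad
\|\Theta\|_{1,\mathrm{off}} \;=\; \sum_{i \neq j} |\Theta_{ij}|.
```

In the Gaussian case the first two terms are, up to constants, the rescaled negative log-likelihood, which is why the program reduces to $\ell_1$-penalized maximum likelihood; the penalty encourages zeros in $\hat{\Theta}$, matching the sparsity pattern of the underlying graph.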