Information Recovery from Pairwise Measurements

作者: Andrea J. Goldsmith , Changho Suh , Yuxin Chen

DOI:

关键词: Pairwise comparisonMetric (mathematics)Minimum cutDecoding methodsCombinatoricsMathematicsStochastic block modelDiscrete mathematicsDegree (graph theory)Divergence (statistics)Binary logarithm

摘要: This paper is concerned with jointly recovering $n$ node-variables $\left\{ x_{i}\right\}_{1\leq i\leq n}$ from a collection of pairwise difference measurements. Imagine we acquire few observations taking the form $x_{i}-x_{j}$; observation pattern represented by measurement graph $\mathcal{G}$ an edge set $\mathcal{E}$ such that $x_{i}-x_{j}$ observed if and only $(i,j)\in\mathcal{E}$. To account for noisy measurements in general manner, model data acquisition process channels given input/output transition measures. Employing information-theoretic tools applied to channel decoding problems, develop \emph{unified} framework characterize fundamental recovery criterion, which accommodates structures, alphabet sizes, In particular, our results isolate family \emph{minimum} \emph{channel divergence measures} degree corruption, together size minimum cut dictates feasibility exact information recovery. For various homogeneous graphs, condition depends almost on sparsity irrespective other graphical metrics; alternatively, sample complexity required these graphs scales like \[ \text{minimum }\asymp\frac{n\log n}{\mathsf{Hel}_{1/2}^{\min}} \] certain metric $\mathsf{Hel}_{1/2}^{\min}$ defined main text, as long not super-polynomial $n$. We apply theory three concrete applications, including stochastic block model, outlier haplotype assembly problem. Our leads order-wise tight conditions all scenarios.

参考文章(25)
Zongming Ma, Yihong Wu, Computational barriers in minimax submatrix detection arXiv: Statistics Theory. ,(2013) , 10.1214/14-AOS1300
Afonso S. Bandeira, Emmanuel Abbe, Georgina Hall, Exact Recovery in the Stochastic Block Model arXiv: Social and Information Networks. ,(2014)
David Tse, Govinda M. Kamath, Eren Şaşoğlu, Optimal Haplotype Assembly from High-Throughput Mate-Pair Reads arXiv: Information Theory. ,(2015)
Tim Roughgarden, Amir Globerson, Cafer Yildirim, David Sontag, How Hard is Inference for Structured Prediction international conference on machine learning. pp. 2181- 2190 ,(2015)
Leonidas J. Guibas, Qi-Xing Huang, Yuxin Chen, Near-Optimal Joint Object Matching via Convex Relaxation arXiv: Learning. ,(2014)
Elchanan Mossel, Joe Neeman, Allan Sly, Stochastic Block Models and Reconstruction arXiv: Probability. ,(2012)
David J. Crandall, Andrew Owens, Noah Snavely, Daniel P. Huttenlocher, SfM with MRFs: Discrete-Continuous Optimization for Large-Scale Structure from Motion IEEE Transactions on Pattern Analysis and Machine Intelligence. ,vol. 35, pp. 2841- 2853 ,(2013) , 10.1109/TPAMI.2012.218
Arvind Ganesh, John Wright, Xiaodong Li, Emmanuel J. Candes, Yi Ma, Dense error correction for low-rank matrices via Principal Component Pursuit 2010 IEEE International Symposium on Information Theory. pp. 1513- 1517 ,(2010) , 10.1109/ISIT.2010.5513538
Sriram Vishwanath, Haris Vikalo, Hongbo Si, Haplotype Assembly: An Information Theoretic View arXiv: Information Theory. ,(2014)