Efficient distance metric learning by adaptive sampling and mini-batch stochastic gradient descent (SGD)

作者: Qi Qian , Rong Jin , Jinfeng Yi , Lijun Zhang , Shenghuo Zhu

DOI: 10.1007/S10994-014-5456-X

关键词: Task (computing)Matrix (mathematics)Stochastic gradient descentOnline learningMathematicsHybrid approachConstraint (information theory)Adaptive samplingMathematical optimization

摘要: Distance metric learning (DML) is an important task that has found applications in many domains. The high computational cost of DML arises from the large number variables to be determined and constraint a distance positive semi-definite (PSD) matrix. Although stochastic gradient descent (SGD) been successfully applied improve efficiency DML, it can still computationally expensive order ensure solution PSD It to, at every iteration, project updated onto cone, operation. We address this challenge by developing two strategies within SGD, i.e. mini-batch adaptive sampling, effectively reduce updates (i.e. projections cone) SGD. also develop hybrid approaches combine strength sampling with online techniques further SGD for DML. prove theoretical guarantees both based conduct extensive empirical study verify effectiveness proposed algorithms

参考文章(26)
Rong Jin, Stochastic Optimization of Smooth Loss arXiv: Learning. ,(2013)
Tong Zhang, Frank J. Oles, Text Categorization Based on Regularized Linear Classification Methods Information Retrieval. ,vol. 4, pp. 5- 31 ,(2001) , 10.1023/A:1011441423217
Samy Bengio, Uri Shalit, Varun Sharma, Gal Chechik, Large Scale Online Learning of Image Similarity Through Ranking Journal of Machine Learning Research. ,vol. 11, pp. 1109- 1135 ,(2010)
Nicolo Cesa-Bianchi, Gabor Lugosi, Prediction, learning, and games ,(2006)
Jason V. Davis, Inderjit S. Dhillon, Structured metric learning for high dimensional problems Proceeding of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining - KDD 08. pp. 195- 203 ,(2008) , 10.1145/1401890.1401918
Ron Bekkerman, Martin Scholz, Data weaving Proceeding of the 17th ACM conference on Information and knowledge mining - CIKM '08. pp. 1083- 1092 ,(2008) , 10.1145/1458082.1458226
Shai Shalev-Shwartz, Yoram Singer, Andrew Y. Ng, Online and batch learning of pseudo-metrics Twenty-first international conference on Machine learning - ICML '04. pp. 94- ,(2004) , 10.1145/1015330.1015376
Amir Globerson, Sam T. Roweis, Metric Learning by Collapsing Classes neural information processing systems. ,vol. 18, pp. 451- 458 ,(2005)
Kilian Q. Weinberger, Lawrence K. Saul, Distance Metric Learning for Large Margin Nearest Neighbor Classification Journal of Machine Learning Research. ,vol. 10, pp. 207- 244 ,(2009) , 10.5555/1577069.1577078
Nati Srebro, Ohad Shamir, Karthik Sridharan, Andrew Cotter, Better Mini-Batch Algorithms via Accelerated Gradient Methods neural information processing systems. ,vol. 24, pp. 1647- 1655 ,(2011)