Strong mixed-integer programming formulations for trained neural networks

作者: Joey Huchette , Ross Anderson , Juan Pablo Vielma , Will Ma , Christian Tjandraatmadja

DOI:

关键词:

摘要: We present strong mixed-integer programming (MIP) formulations for high-dimensional piecewise linear functions that correspond to trained neural networks. These can be used a number of important tasks, such as verifying an image classification network is robust adversarial inputs, or solving decision problems where the objective function machine learning model. generic framework, which may independent interest, provides way construct sharp ideal maximum d affine over arbitrary polyhedral input domains. apply this result derive MIP most popular nonlinear operations (e.g. ReLU and max pooling) are strictly stronger than other approaches from literature. corroborate computationally, showing our able offer substantial improvements in solve time on verification tasks

参考文章(83)
Andrea Bartolini, Michele Lombardi, Michela Milano, Luca Benini, Neuron constraints to model complex real-world problems principles and practice of constraint programming. pp. 115- 129 ,(2011) , 10.1007/978-3-642-23786-7_11
Michela Milano, Andrea Bartolini, Luca Benini, Michele Lombardi, Optimization and controlled systems: a case study on thermal awareworkload dispatching national conference on artificial intelligence. ,vol. 1, pp. 427- 433 ,(2012)
Willem K. Klein Haneveld, Robustness against dependence in PERT: An application of duality and distributions with known marginals Mathematical Programming Studies. ,vol. 27, pp. 153- 182 ,(1986) , 10.1007/BFB0121119
Babak Alipanahi, Andrew Delong, Matthew T Weirauch, Brendan J Frey, Predicting the sequence specificities of DNA- and RNA-binding proteins by deep learning Nature Biotechnology. ,vol. 33, pp. 831- 838 ,(2015) , 10.1038/NBT.3300
Diederik P. Kingma, Jimmy Ba, Adam: A Method for Stochastic Optimization arXiv: Learning. ,(2014)
José del R. Millán, Daniele Posenato, Eric Dedieu, Continuous-Action Q-Learning Machine Learning. ,vol. 49, pp. 247- 265 ,(2002) , 10.1023/A:1017988514716
Jens Vygen, Bernhard Korte, Combinatorial Optimization: Theory and Algorithms ,(2012)
Christopher M. Bishop, Pattern Recognition and Machine Learning ,(2006)
John Tsitsiklis, Dimitris Bertsimas, Introduction to linear optimization ,(1997)