Colorful Image Colorization

作者: Richard Zhang , Phillip Isola , Alexei A. Efros

DOI: 10.1007/978-3-319-46487-9_40

关键词:

摘要: Given a grayscale photograph as input, this paper attacks the problem of hallucinating plausible color version photograph. This is clearly underconstrained, so previous approaches have either relied on significant user interaction or resulted in desaturated colorizations. We propose fully automatic approach that produces vibrant and realistic embrace underlying uncertainty by posing it classification task use class-rebalancing at training time to increase diversity colors result. The system implemented feed-forward pass CNN test trained over million images. evaluate our algorithm using “colorization Turing test,” asking human participants choose between generated ground truth image. Our method successfully fools humans 32 % trials, significantly higher than methods. Moreover, we show colorization can be powerful pretext for self-supervised feature learning, acting cross-channel encoder. results state-of-the-art performance several learning benchmarks.

参考文章(40)
Xiaolong Wang, Abhinav Gupta, Unsupervised Learning of Visual Representations Using Videos 2015 IEEE International Conference on Computer Vision (ICCV). pp. 2794- 2802 ,(2015) , 10.1109/ICCV.2015.320
Carl Doersch, Abhinav Gupta, Alexei A. Efros, Unsupervised Visual Representation Learning by Context Prediction international conference on computer vision. pp. 1422- 1430 ,(2015) , 10.1109/ICCV.2015.167
Pulkit Agrawal, Joao Carreira, Jitendra Malik, Learning to See by Moving international conference on computer vision. pp. 37- 45 ,(2015) , 10.1109/ICCV.2015.13
Ross Girshick, Fast R-CNN international conference on computer vision. pp. 1440- 1448 ,(2015) , 10.1109/ICCV.2015.169
Karen Simonyan, Andrew Zisserman, Very Deep Convolutional Networks for Large-Scale Image Recognition computer vision and pattern recognition. ,(2014)
Guillaume Charpiat, Matthias Hofmann, Bernhard Schölkopf, Automatic Image Colorization Via Multimodal Predictions Lecture Notes in Computer Science. pp. 126- 139 ,(2008) , 10.1007/978-3-540-88690-7_10
Jonathan Long, Evan Shelhamer, Trevor Darrell, Fully convolutional networks for semantic segmentation computer vision and pattern recognition. pp. 3431- 3440 ,(2015) , 10.1109/CVPR.2015.7298965
Bharath Hariharan, Pablo Arbelaez, Ross Girshick, Jitendra Malik, Hypercolumns for object segmentation and fine-grained localization computer vision and pattern recognition. pp. 447- 456 ,(2015) , 10.1109/CVPR.2015.7298642
Raj Kumar Gupta, Alex Yong-Sang Chia, Deepu Rajan, Ee Sin Ng, Huang Zhiyong, Image colorization using similar images Proceedings of the 20th ACM international conference on Multimedia - MM '12. pp. 369- 378 ,(2012) , 10.1145/2393347.2393402
Clement Farabet, Camille Couprie, Laurent Najman, Yann LeCun, Learning Hierarchical Features for Scene Labeling IEEE Transactions on Pattern Analysis and Machine Intelligence. ,vol. 35, pp. 1915- 1929 ,(2013) , 10.1109/TPAMI.2012.231