Attribute-Guided Face Generation Using Conditional CycleGAN

作者: Yongyi Lu , Yu-Wing Tai , Chi-Keung Tang

DOI: 10.1007/978-3-030-01258-8_18

关键词: Image (mathematics)Identity (music)Training setTransfer (computing)SuperresolutionFace (geometry)Pattern recognitionArtificial intelligenceComputer science

摘要: We are interested in attribute-guided face generation: given a low-res input image, an attribute vector that can be extracted from high-res image (attribute image), our new method generates for the satisfies attributes. To address this problem, we condition CycleGAN and propose conditional CycleGAN, which is designed to (1) handle unpaired training data because low/high-res images may not necessarily align with each other, (2) allow easy control of appearance generated via demonstrate high-quality results on synthesize realistic easily controlled by user-supplied attributes (e.g., gender, makeup, hair color, eyeglasses). Using as identity produce corresponding incorporating verification network, network becomes identity-guided produces interesting transfer. three applications CycleGAN: identity-preserving superresolution, swapping, frontal generation, consistently show advantage method.

参考文章(23)
Chao Dong, Chen Change Loy, Kaiming He, Xiaoou Tang, Learning a Deep Convolutional Network for Image Super-Resolution european conference on computer vision. pp. 184- 199 ,(2014) , 10.1007/978-3-319-10593-2_13
Ziwei Liu, Ping Luo, Xiaogang Wang, Xiaoou Tang, Deep Learning Face Attributes in the Wild 2015 IEEE International Conference on Computer Vision (ICCV). pp. 3730- 3738 ,(2015) , 10.1109/ICCV.2015.425
Florian Schroff, Dmitry Kalenichenko, James Philbin, FaceNet: A unified embedding for face recognition and clustering computer vision and pattern recognition. pp. 815- 823 ,(2015) , 10.1109/CVPR.2015.7298682
, Generative Adversarial Nets neural information processing systems. ,vol. 27, pp. 2672- 2680 ,(2014) , 10.3156/JSOFT.29.5_177_2
Z. Wang, A.C. Bovik, H.R. Sheikh, E.P. Simoncelli, Image quality assessment: from error visibility to structural similarity IEEE Transactions on Image Processing. ,vol. 13, pp. 600- 612 ,(2004) , 10.1109/TIP.2003.819861
Zhenan Sun, Ran He, Xiang Wu, A Lightened CNN for Deep Face Representation arXiv: Computer Vision and Pattern Recognition. ,(2015)
Soumith Chintala, Alec Radford, Luke Metz, Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks arXiv: Learning. ,(2015)
Jiwon Kim, Jung Kwon Lee, Kyoung Mu Lee, Deeply-Recursive Convolutional Network for Image Super-Resolution computer vision and pattern recognition. pp. 1637- 1645 ,(2016) , 10.1109/CVPR.2016.181
Justin Johnson, Alexandre Alahi, Li Fei-Fei, Perceptual Losses for Real-Time Style Transfer and Super-Resolution Computer Vision – ECCV 2016. pp. 694- 711 ,(2016) , 10.1007/978-3-319-46475-6_43
Wenzhe Shi, Jose Caballero, Ferenc Huszar, Johannes Totz, Andrew P. Aitken, Rob Bishop, Daniel Rueckert, Zehan Wang, Real-Time Single Image and Video Super-Resolution Using an Efficient Sub-Pixel Convolutional Neural Network computer vision and pattern recognition. pp. 1874- 1883 ,(2016) , 10.1109/CVPR.2016.207