ReSeg: A Recurrent Neural Network for Object Segmentation

作者: Yoshua Bengio , Kyle Kastner , Matteo Matteucci , Aaron C. Courville , KyungHyun Cho

DOI:

关键词:

摘要: We propose a structured prediction architecture for images centered around deep recurrent neural networks. The proposed network, called ReSeg, is based on the recently introduced ReNet model object classification. modify and extend it to perform segmentation, noting that avoidance of pooling can greatly simplify pixel-wise tasks images. ReSeg layer composed four networks sweep image horizontally vertically in both directions, along with final expands back original size. combines multiple layers several possible input as well which size, making suitable variety tasks. evaluate specific task segmentation three widely-used datasets, namely Weizmann Horse, Fashionista Oxford Flower. results suggest challenge state art may have further applications at large.

参考文章(37)
Hugo Larochelle, Atousa Torabi, Li Yao, Christopher Joseph Pal, Aaron C. Courville, Kyunghyun Cho, Nicolas Ballas, Video Description Generation Incorporating Spatio-Temporal Features and a Soft-Attention Mechanism arXiv: Machine Learning. ,(2015)
Kelvin Xu, Jimmy Ba, Ryan Kiros, Kyunghyun Cho, Aaron Courville, Ruslan Salakhudinov, Rich Zemel, Yoshua Bengio, None, Show, Attend and Tell: Neural Image Caption Generation with Visual Attention international conference on machine learning. ,vol. 3, pp. 2048- 2057 ,(2015)
Yoshua Bengio, Yoshua Bengio, Yoshua Bengio, Jan Chorowski, Kyunghyun Cho, Dzmitry Bahdanau, End-to-end continuous speech recognition using attention-based recurrent nn: First results arXiv: Neural and Evolutionary Computing. ,(2014)
Anat Levin, Yair Weiss, Learning to Combine Bottom-Up and Top-Down Segmentation Computer Vision – ECCV 2006. pp. 581- 594 ,(2006) , 10.1007/11744085_45
Frédéric Bastien, Pascal Lamblin, Razvan Pascanu, James Bergstra, Ian Goodfellow, Arnaud Bergeron, Nicolas Bouchard, David Warde-Farley, Yoshua Bengio, None, Theano: new features and speed improvements arXiv: Symbolic Computation. ,(2012)
Karen Simonyan, Andrew Zisserman, Very Deep Convolutional Networks for Large-Scale Image Recognition computer vision and pattern recognition. ,(2014)
Shuicheng Yan, Qiang Chen, Min Lin, Network In Network arXiv: Neural and Evolutionary Computing. ,(2013)
Matthew D. Zeiler, Rob Fergus, Visualizing and Understanding Convolutional Networks european conference on computer vision. pp. 818- 833 ,(2014) , 10.1007/978-3-319-10590-1_53
Philipp Fischer, Thomas Brox, None, U-Net: Convolutional Networks for Biomedical Image Segmentation medical image computing and computer assisted intervention. pp. 234- 241 ,(2015) , 10.1007/978-3-319-24574-4_28