作者: Xiaodan Liang , Chunyan Xu , Xiaohui Shen , Jianchao Yang , Si Liu
DOI: 10.1109/TPAMI.2016.2537339
关键词:
摘要: In this work, we address the human parsing task with a novel Contextualized Convolutional Neural Network (Co-CNN) architecture, which well integrates the cross-layer context, global image-level context, within-super-pixel context and cross-super-pixel neighborhood context into a unified network. Given an input human image, Co-CNN produces the pixel-wise categorization in an end-to-end way. First, the cross-layer context is captured by our basic local-to-global-to-local structure, which hierarchically combines the global semantic …