作者: Jason Kuen , Federico Perazzi , Zhe Lin , Jianming Zhang , Yap-Peng Tan
关键词: Set (abstract data type) 、 Generalization 、 Feature (computer vision) 、 Object detection 、 Object (computer science) 、 Normalization (statistics) 、 Artificial intelligence 、 Scale (descriptive set theory) 、 Pattern recognition 、 Computer science
摘要: Large scale object detection datasets are constantly increasing their size in terms of the number classes and annotations count. Yet, object-level categories annotated is an order magnitude smaller than image-level classification labels. State-of-the art models trained a supervised fashion this limits they can detect. In paper, we propose novel weight transfer network (WTN) to effectively efficiently knowledge from network's weights allow without box supervision. We first introduce input feature normalization schemes curb under-fitting during training vanilla WTN. then autoencoder-WTN (AE-WTN) which uses reconstruction loss preserve information over all target latent space ensure generalization classes. Compared WTN, AE-WTN obtains absolute performance gains 6% on two Open Images evaluation sets with 500 seen 57 respectively, 25% Visual Genome set 200