作者: Zsolt Kira , Iain Melvin , Hans Peter Graf , Ghassan AlRegib , Chih-Yao Ma
DOI:
关键词:
摘要: Human actions often involve complex interactions across several inter-related objects in the scene. However, existing approaches to fine-grained video understanding or visual …