作者: Ali Jannesari , Chandan Kumar , Subrahmanyam Vaddi
DOI:
关键词:
摘要: Unmanned Aerial Vehicles (UAVs) especially drones, equipped with vision techniques have become very popular in recent years, their extensive use wide range of applications. Many these applications require computer techniques, particularly object detection from the information captured by on-board camera. In this paper, we propose an end to model running on a UAV platform which is suitable for real-time We deep feature pyramid architecture makes inherent properties features extracted Convolutional Networks capturing more generic images (such as edge, color etc.) along minute detailed specific classes contained our problem. VisDrone-18 dataset studies contain different objects such pedestrians, vehicles, bicycles etc. provide software and hardware used study. implemented both ResNet MobileNet convolutional bases. Our combined modified focal loss function, produced desirable performance 30.6 mAP inference time 14 fps. compared results RetinaNet-ResNet-50 HAL-RetinaNet shown that backend extractor gave best terms accuracy, speed memory efficiency real drones.