POLO – Point-Based, Multi-class Animal Detection
Automated wildlife surveys based on drone imagery and object detection technology are a powerful and increasingly popular tool in conservation biology. Most detectors require training images with annotated bounding boxes, which are tedious, expensive, and not always unambiguous to create. To reduce the annotation load associated with this practice, we develop POLO, a multi-class object detection model that can be trained entirely on point labels. POLO is based on simple, yet effective modifications to the YOLOv8 architecture, including alterations to the prediction process, training losses, and post-processing. We test POLO on drone recordings of waterfowl containing up to multiple thousands of individual birds in one image and compare it to a regular YOLOv8. Our experiments show that at the same annotation cost, POLO achieves improved accuracy in counting animals in aerial imagery.
2-s2.0-105007132888
École Polytechnique Fédérale de Lausanne
École Polytechnique Fédérale de Lausanne
University College London
École Polytechnique Fédérale de Lausanne
2025
978-3-031-92387-6
Part II
Lecture Notes in Computer Science; 15624
1611-3349
0302-9743
169
177
REVIEWED
EPFL
Event name | Event acronym | Event place | Event date |
ECCV 2024 | Milan, Italy | 2024-09-29 - 2024-10-04 | |