Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. Towards Motion Forecasting with Real-World Perception Inputs: Are End-to-End Approaches Competitive?
 
conference paper not in proceedings

Towards Motion Forecasting with Real-World Perception Inputs: Are End-to-End Approaches Competitive?

Xu, Yihong
•
Chambon, Loick
•
zablocki, Eloi
Show more
May 15, 2024
International Conference on Robotics and Automation (ICRA)

Motion forecasting is crucial in enabling autonomous vehicles to anticipate the future trajectories of surrounding agents. To do so, it requires solving mapping, detection, tracking, and then forecasting problems, in a multi-step pipeline. In this complex system, advances in conventional forecasting methods have been made using curated data, i.e., with the assumption of perfect maps, detection, and tracking. This paradigm, however, ignores any errors from upstream modules. Meanwhile, an emerging end-to-end paradigm, that tightly integrates the perception and forecasting architectures into joint training, promises to solve this issue. So far, however, the evaluation protocols between the two methods were incompatible and their comparison was not possible. In fact, and perhaps surprisingly, conventional forecasting methods are usually not trained nor tested in real-world pipelines (e.g., with upstream detection, tracking, and mapping modules). In this work, we aim to bring forecasting models closer to real-world deployment. First, we propose a unified evaluation pipeline for forecasting methods with real-world perception inputs, allowing us to compare the performance of conventional and end-to-end methods for the first time. Second, our in-depth study uncovers a substantial performance gap when transitioning from curated to perception-based data. In particular, we show that this gap (1) stems not only from differences in precision but also from the nature of imperfect inputs provided by perception modules, and that (2) is not trivially reduced by simply finetuning on perception outputs. Based on extensive experiments, we provide recommendations for critical areas that require improvement and guidance towards more robust motion forecasting in the real world. We will release an evaluation library to benchmark models under standardized and practical conditions.

  • Files
  • Details
  • Metrics
Type
conference paper not in proceedings
ArXiv ID

https://doi.org/10.48550/arXiv.2306.09281

Author(s)
Xu, Yihong
Chambon, Loick
zablocki, Eloi
Chen, Mickaël
Alahi, Alexandre  
Cord, Matthieu
Perez, Patrick
Date Issued

2024-05-15

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
VITA  
Event nameEvent placeEvent date
International Conference on Robotics and Automation (ICRA)

yokohama, japan

May 13- May 17, 2024

Available on Infoscience
January 31, 2024
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/203346
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés