Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. Towards Generalizable Trajectory Prediction using Dual-Level Representation Learning and Adaptive Prompting
 
conference paper

Towards Generalizable Trajectory Prediction using Dual-Level Representation Learning and Adaptive Prompting

Messaoud Ben Amor, Kaouther  
•
Matthieu Cord
•
Alahi, Alexandre  
June 11, 2025
2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition [Forthcoming publication]
The IEEE/CVF Conference on Computer Vision and Pattern Recognition 2025

Existing vehicle trajectory prediction models struggle with generalizability, prediction uncertainties, and handling complex interactions. It is often due to limitations like complex architectures customized for a specific dataset and inefficient multimodal handling. We propose Perceiver with Register queries (PerReg+), a novel trajectory prediction framework that introduces: (1) Dual-Level Representation Learning via Self-Distillation (SD) and Masked Reconstruction (MR), capturing global context and finegrained details. Additionally, our approach of reconstructing segment-level trajectories and lane segments from masked inputs with query drop, enables effective use of contextual information and improves generalization; (2) Enhanced Multimodality using register-based queries and pretraining, eliminating the need for clustering and suppression; and (3) Adaptive Prompt Tuning during fine-tuning, freezing the main architecture and optimizing a small number of prompts for efficient adaptation. PerReg+ sets a state-of-the-art performance on nuScenes [5], Argoverse 2 [42], and Waymo Open Motion Dataset (WOMD) [13]. Remarkably, our model reduces the error by 6.8% on smaller datasets, and multi-dataset training enhances generalization. In cross-domain tests, PerReg+ reduces B-FDE by 11.8% compared to its non-pretrained variant.

  • Files
  • Details
  • Metrics
Type
conference paper
Author(s)
Messaoud Ben Amor, Kaouther  

EPFL

Matthieu Cord

Sorbonne Université

Alahi, Alexandre  

EPFL

Date Issued

2025-06-11

Publisher

Institute of Electrical Engineering

Published in
2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition [Forthcoming publication]
Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
VITA  
Event nameEvent acronymEvent placeEvent date
The IEEE/CVF Conference on Computer Vision and Pattern Recognition 2025

CVPR 2025

Nashville, Tennessee, US

2025-06-11 - 2025-06-15

Available on Infoscience
May 5, 2025
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/249747
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés