Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. A Modular Multimodal Architecture for Gaze Target Prediction: Application to Privacy-Sensitive Settings
 
Loading...
Thumbnail Image
conference paper

A Modular Multimodal Architecture for Gaze Target Prediction: Application to Privacy-Sensitive Settings

Gupta, Anshul
•
Tafasca, Samy
•
Odobez, Jean-Marc  
January 1, 2022
2022 Ieee/Cvf Conference On Computer Vision And Pattern Recognition Workshops (Cvprw 2022)
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Predicting where a person is looking is a complex task, requiring to understand not only the person's gaze and scene content, but also the 3D scene structure and the person's situation (are they manipulating? interacting or observing others? attentive?) to detect obstructions in the line of sight or apply attention priors that humans typically have when observing others. In this paper, we hypothesize that identifying and leveraging such priors can be better achieved through the exploitation of explicitly derived multimodal cues such as depth and pose. We thus propose a modular multimodal architecture allowing to combine these cues using an attention mechanism. The architecture can naturally be exploited in privacy-sensitive situations such as surveillance and health, where personally identifiable information cannot be released. We perform extensive experiments on the GazeFollow and VideoAttentionTarget public datasets, obtaining state-of-the-art performance and demonstrating very competitive results in the privacy setting case. (1)

  • Details
  • Metrics
Type
conference paper
DOI
10.1109/CVPRW56347.2022.00552
Web of Science ID

WOS:000861612705015

Author(s)
Gupta, Anshul
•
Tafasca, Samy
•
Odobez, Jean-Marc  
Date Issued

2022-01-01

Publisher

IEEE

Publisher place

New York

Published in
2022 Ieee/Cvf Conference On Computer Vision And Pattern Recognition Workshops (Cvprw 2022)
ISBN of the book

978-1-6654-8739-9

Start page

5037

End page

5046

Subjects

Computer Science, Artificial Intelligence

•

Computer Science, Theory & Methods

•

Computer Science

•

attention

Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
LIDIAP  
Event nameEvent placeEvent date
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

New Orleans, LA

Jun 18-24, 2022

Available on Infoscience
December 19, 2022
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/193307
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés