Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. Self-Supervision By Prediction For Object Discovery In Videos
 
conference paper

Self-Supervision By Prediction For Object Discovery In Videos

Besbinar, Beril  
•
Frossard, Pascal  
January 1, 2021
2021 Ieee International Conference On Image Processing (Icip)
IEEE International Conference on Image Processing (ICIP)

Despite their irresistible success, deep learning algorithms still heavily rely on annotated data, and unsupervised settings pose many challenges, such as finding the right inductive bias in diverse scenarios. In this paper, we propose an object-centric model for image sequence representation that uses the prediction task for self-supervision. By disentangling object representation and motion dynamics, our novel compositional structure explicitly handles occlusion and inpaints inferred objects and background for the composition of the predicted frame. Using auxiliary losses to promote spatially and temporally consistent object representations, we train our self-supervised framework without the help of any annotation or pretrained network. Initial experiments confirm that our new pipeline is a promising step towards object-centric video prediction.

  • Details
  • Metrics
Type
conference paper
DOI
10.1109/ICIP42928.2021.9506062
Web of Science ID

WOS:000819455101125

Author(s)
Besbinar, Beril  
Frossard, Pascal  
Date Issued

2021-01-01

Publisher

IEEE

Publisher place

New York

Published in
2021 Ieee International Conference On Image Processing (Icip)
ISBN of the book

978-1-6654-4115-5

Series title/Series vol.

IEEE International Conference on Image Processing ICIP

Start page

1509

End page

1513

Subjects

Computer Science, Artificial Intelligence

•

Computer Science, Software Engineering

•

Engineering, Electrical & Electronic

•

Imaging Science & Photographic Technology

•

Computer Science

•

Engineering

•

self-supervision

•

video prediction

•

object representation

•

unsupervised scene decomposition

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
LTS4  
Event nameEvent placeEvent date
IEEE International Conference on Image Processing (ICIP)

ELECTR NETWORK

Sep 19-22, 2021

Available on Infoscience
August 15, 2022
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/189974
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés