Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. Composite Relationship Fields with Transformers for Scene Graph Generation
 
conference paper

Composite Relationship Fields with Transformers for Scene Graph Generation

Adaimi, George  
•
Mizrahi, David
•
Alahi, Alexandre  
2023
2023 IEEE/CVF Winter Conference On Applications Of Computer Vision (Wacv)
IEEE/CVF Winter Conference on Applications of Computer Vision (WACV 2023)

Scene graph generation (SGG) methods extract relationships between objects. While most methods focus on improving top-down approaches, which build a scene graph based on detected objects from an off-the-shelf object detector, there is a limited amount of work on bottom-up approaches, which jointly detect objects and their relationships in a single stage. In this work, we present a novel bottom-up SGG approach by representing relationships using Composite Relationship Fields (CoRF). CoRF turns relationship detection into a dense regression and classification task, where each cell of the output feature map identifies surrounding objects and their relationships. Furthermore, we propose a refinement head that leverages Transformers for global scene reasoning, resulting in more meaningful relationship predictions. By combining both contributions, our method outperforms previous bottom-up methods on the Visual Genome dataset by 26% while preserving real-time performance.

  • Files
  • Details
  • Metrics
Type
conference paper
DOI
10.1109/WACV56688.2023.00014
Author(s)
Adaimi, George  
Mizrahi, David
Alahi, Alexandre  
Date Issued

2023

Publisher

Los Alamitos

Publisher place

IEEE Computer Soc

Published in
2023 IEEE/CVF Winter Conference On Applications Of Computer Vision (Wacv)
ISBN of the book

978-1-6654-9346-8

Total of pages

8

Series title/Series vol.

IEEE Winter Conference on Applications of Computer Vision

Start page

52

End page

64

Subjects

Scene Graph Generation

•

Scene Understanding

•

Visual Relationship Detection

•

Object Detection

•

Computer Vision

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
VITA  
Event nameEvent placeEvent date
IEEE/CVF Winter Conference on Applications of Computer Vision (WACV 2023)

Waikoloa, Hawaii, United States

January 3-7, 2023

Available on Infoscience
October 31, 2022
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/191701
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés