Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. Learning to Refine Object Segments
 
conference paper

Learning to Refine Object Segments

Pinheiro, Pedro H. O.
•
Lin, Tsung-Yi
•
Collobert, Ronan
Show more
2016
Computer Vision - Eccv 2016, Pt I
Computer Vision - ECCV 2016

Object segmentation requires both object-level information and low-level pixel data. This presents a challenge for feedforward networks: lower layers in convolutional nets capture rich spatial information, while upper layers encode object-level knowledge but are invariant to factors such as pose and appearance. In this work we propose to augment feedforward nets for object segmentation with a novel top-down refinement approach. The resulting bottom-up/top-down architecture is capable of efficiently generating high-fidelity object masks. Similarly to skip connections, our approach leverages features at all layers of the net. Unlike skip connections, our approach does not attempt to output independent predictions at each layer. Instead, we first output a coarse ‘mask encoding’ in a feedforward pass, then refine this mask encoding in a top-down pass utilizing features at successively lower layers. The approach is simple, fast, and effective. Building on the recent DeepMask network for generating object proposals, we show accuracy improvements of 10–20% in average recall for various setups. Additionally, by optimizing the overall network architecture, our approach, which we call SharpMask, is 50 % faster than the original DeepMask network (under .8 s per image).

  • Files
  • Details
  • Metrics
Type
conference paper
DOI
10.1007/978-3-319-46448-0_5
Web of Science ID

WOS:000389382700005

Author(s)
Pinheiro, Pedro H. O.
Lin, Tsung-Yi
Collobert, Ronan
Dollar, Piotr
Date Issued

2016

Publisher

Springer

Publisher place

Cham

Published in
Computer Vision - Eccv 2016, Pt I
ISBN of the book

978-3-319-46448-0

978-3-319-46447-3

Total of pages

17

Series title/Series vol.

Lecture Notes in Computer Science

Volume

9905

Start page

75

End page

91

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
LIDIAP  
Event nameEvent place
Computer Vision - ECCV 2016

Amsterdam

Available on Infoscience
January 19, 2017
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/133069
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés