Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Journal articles
  4. Person Retrieval in Surveillance Videos Via Deep Attribute Mining and Reasoning
 
research article

Person Retrieval in Surveillance Videos Via Deep Attribute Mining and Reasoning

Shi, Yuxuan
•
Wei, Zhen  
•
Ling, Hefei
Show more
January 1, 2021
Ieee Transactions On Multimedia

Person retrieval largely relies on the appearance features of pedestrians. This task is rather more difficult in surveillance videos due to the limitations of extracting robust appearance features brought by the cross-view and cross-camera data with lower image resolution, motion blur, occlusion and other kinds of image degradation. To build up a more reliable person retrieval system, recent works introduced appearance attribute models to describe and distinguish different persons with high-level semantic concepts. Despite the progress of previous works, the value of utilizing appearance attributes is still under-explored. On one hand, existing methods lack for concise and precise attribute representations that are specific for each attribute category and, in the meantime, are able to filter noisy information in irrelevant spatial locations and useless patterns. On the other hand, correlation and reasoning between different attributes are neglected, which could generate more useful information and add more robustness to the retrieval system. In this paper, we propose an Attribute Mining and Reasoning (AMR) framework which is capable to handle the issues in question. The AMR makes better use of appearance attributes with two main components. First, the AMR disentangles the representations of different attributes by localizing their spatial positions and identifying their effective patterns in a weakly supervised manner. To achieve more reliable localization, we propose the Attribute Localization Ensemble (ALE) module that is consisted of multiple localization heads and a voting mechanism. Second, we introduce the Attribute Reasoning (AR) module to correlate different attributes together with the global appearance features and discover their latent relations to generate more comprehensive descriptions of pedestrians. Extensive experiments on DukeMTMC-ReID and Market-1501 datasets demonstrate the effectiveness of the proposed AMR framework as well as its superiority over the existing state-of-the-art methods. The AMR model also shows great generalization ability on the unseen CUHK03 dataset when it is only trained on Market-1501 dataset.

  • Details
  • Metrics
Type
research article
DOI
10.1109/TMM.2020.3042068
Web of Science ID

WOS:000720519900036

Author(s)
Shi, Yuxuan
•
Wei, Zhen  
•
Ling, Hefei
•
Wang, Ziyang
•
Shen, Jialie
•
Li, Ping
Date Issued

2021-01-01

Published in
Ieee Transactions On Multimedia
Volume

23

Start page

4376

End page

4387

Subjects

Computer Science, Information Systems

•

Computer Science, Software Engineering

•

Telecommunications

•

Computer Science

•

cognition

•

feature extraction

•

hair

•

semantics

•

training

•

robustness

•

convolution

•

person retrieval

•

person re-identification

•

human attribute

•

graph convolutional network

•

neural-network

•

reidentification

•

identification

Peer reviewed

REVIEWED

Written at

OTHER

EPFL units
CVLAB  
Available on Infoscience
December 4, 2021
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/183582
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés