Encoder-Decoder Models for Human Segmentation and Motion Analysis
Detecting people from 2D images and analyzing their motion in 3D have been long standing computer vision problems central to numerous applications such as autonomous driving and athletic training. Recently, with the availability of large amounts of training data and the advent of deep learning, the performance in human segmentation, 3D human pose prediction has improved significantly. However, these problems remain challenging due to several factors. In this thesis, we decompose the human motion analysis into three sub-tasks; 2D human segmentation, 3D human body pose estimation and 3D human motion forecasting. Our goal is to alleviate the challenges in these problems using various encoder-decoder models.
EPFL_TH8036.pdf
n/a
openaccess
Copyright
51.65 MB
Adobe PDF
415d1d7932164a70da6edc9af5858ed3