Fua, PascalSalzmann, MathieuKatircioglu, Isinsu2022-03-072022-03-072022-03-07202210.5075/epfl-thesis-8036https://infoscience.epfl.ch/handle/20.500.14299/186041Detecting people from 2D images and analyzing their motion in 3D have been long standing computer vision problems central to numerous applications such as autonomous driving and athletic training. Recently, with the availability of large amounts of training data and the advent of deep learning, the performance in human segmentation, 3D human pose prediction has improved significantly. However, these problems remain challenging due to several factors. In this thesis, we decompose the human motion analysis into three sub-tasks; 2D human segmentation, 3D human body pose estimation and 3D human motion forecasting. Our goal is to alleviate the challenges in these problems using various encoder-decoder models.enComputer visionself-supervised detection and segmentationmulti-view consistency3D human pose estimation3D motion forecastingattention mechanismdeep learning.Encoder-Decoder Models for Human Segmentation and Motion Analysisthesis::doctoral thesis