Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Journal articles
  4. Learning search polices from humans in a partially observable context
 
research article

Learning search polices from humans in a partially observable context

De Chambrier, Guillaume Pierre Luc  
•
Billard, Aude  orcid-logo
2014
Robotics and Biomimetics

Decision making and planning for which the state information is only partially available is a problem faced by all forms of intelligent entities they being either virtual, synthetic or biological. The standard approach to mathematically solve such a decisional problem is to formulate it as a partially observable decision process (POMDP) and apply the same optimisation techniques used in the Markov decision process (MDP). However, applying naively the same methodology to solve MDPs as with POMDPs makes the problem computationally intractable. To address this problem, we take a programming by demonstration approach to provide a solution to the POMDP in continuous state and action space. In this work, we model the decision making process followed by humans when searching blindly for an object on a table. We show that by representing the belief of the human’s position in the environment by a particle filter (PF) and learning a mapping from this belief to their end effector velocities with a Gaussian mixture model (GMM), we can model the human’s search process and reproduce it for any agent. We further categorize the type of behaviours demonstrated by humans as being either risk-prone or risk-averse and find that more than 70% of the human searches were considered to be risk-averse. We contrast the performance of this human-inspired search model with respect to greedy and coastal navigation search methods. Our evaluation metric is the distance taken to reach the goal and how each method minimises the uncertainty. We further analyse the control policy of the coastal navigation and GMM search models and argue that taking into account uncertainty is more efficient with respect to distance travelled to reach the goal.

  • Details
  • Metrics
Type
research article
DOI
10.1186/s40638-014-0008-1
Author(s)
De Chambrier, Guillaume Pierre Luc  
Billard, Aude  orcid-logo
Date Issued

2014

Published in
Robotics and Biomimetics
Volume

1

Issue

1

Start page

8

Subjects

Belief space planning

•

Partially observable environment

•

Imitation learning

•

Search strategies in humans

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
LASA  
Available on Infoscience
November 18, 2014
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/108915
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés