Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Journal articles
  4. Positive-Unlabeled Constraint Learning for Inferring Nonlinear Continuous Constraints Functions From Expert Demonstrations
 
research article

Positive-Unlabeled Constraint Learning for Inferring Nonlinear Continuous Constraints Functions From Expert Demonstrations

Peng, Baiyu  
•
Billard, Aude  orcid-logo
December 25, 2024
IEEE Robotics and Automation Letters

Planning for diverse real-world robotic tasks necessitates to know and write all constraints. However, instances exist where these constraints are either unknown or challenging to specify accurately. A possible solution is to infer the unknown constraints from expert demonstration. This paper presents a novel two-step Positive-Unlabeled Constraint Learning (PUCL) algorithm to infer a continuous constraint function from demonstrations, without requiring prior knowledge of the true constraint parameterization or environmental model as existing works. We treat all data in demonstrations as positive (feasible) data, and learn a control policy to generate potentially infeasible trajectories, which serve as unlabeled data. The proposed two-step learning framework first identifies reliable infeasible data using a distance metric, and secondly learns a binary feasibility classifier (i.e., constraint function) from the feasible demonstrations and reliable infeasible data. The proposed method is flexible to learn complex-shaped constraint boundary and will not mistakenly classify demonstrations as infeasible as previous methods. The effectiveness of the proposed method is verified in four constrained environments, using a networked policy or a dynamical system policy. It successfully infers the continuous nonlinear constraints and outperforms other baseline methods in terms of constraint accuracy and policy safety.

  • Details
  • Metrics
Type
research article
DOI
10.1109/LRA.2024.3522756
Scopus ID

2-s2.0-85213468235

Author(s)
Peng, Baiyu  

École Polytechnique Fédérale de Lausanne

Billard, Aude  orcid-logo

École Polytechnique Fédérale de Lausanne

Date Issued

2024-12-25

Published in
IEEE Robotics and Automation Letters
Volume

10

Issue

2

Start page

1593

End page

1600

Subjects

Learning from demonstration

•

reinforcement learning

•

transfer learning

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
LASA  
Available on Infoscience
January 10, 2025
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/242699
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés