Causal Inference for De-biasing Motion Estimation from Robotic Observational Data

2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA(2023)

Cited 0|Views16
No score
Abstract
Robot data collected in complex real-world scenarios are often biased due to safety concerns, human preferences, and mission or platform constraints. Consequently, robot learning from such observational data poses great challenges for accurate parameter estimation. We propose a principled causal inference framework for robots to learn the parameters of a stochastic motion model using observational data. Specifically, we leverage the de-biasing functionality of the potential-outcome causal inference framework, the Inverse Propensity Weighting (IPW), and the Doubly Robust (DR) methods, to obtain a better parameter estimation of the robot's stochastic motion model. The IPW is a re-weighting approach to ensure unbiased estimation, and the DR approach further combines any two estimators to strengthen the unbiased result even if one of these estimators is biased. We then develop an approximate policy iteration algorithm using the bias-eliminated estimated state transition function. We validate our framework using both simulation and real-world experiments, and the results have revealed that the proposed causal inference-based navigation and control framework can correctly and efficiently learn the parameters from biased observational data.
More
Translated text
Key words
accurate parameter estimation,bias-eliminated,biased observational data,causal inference-based navigation,control framework,de-biasing functionality,De-biasing Motion Estimation,Doubly Robust methods,DR approach,human preferences,Inverse Propensity Weighting,IPW,potential-outcome causal inference framework,principled causal inference framework,real-world experiments,robot data,robot learning,robotic observational data,state transition function,stochastic motion model,unbiased estimation,unbiased result
AI Read Science
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined