Beyond the self: Using grounded affordances to interpret and describe others' actions
IEEE Transactions on Cognitive and Developmental Systems, 2019•ieeexplore.ieee.org
In this paper, we propose a developmental approach that allows a robot to interpret and
describe the actions of human agents by reusing previous experience. The robot first learns
the association between words and object affordances by manipulating the objects in its
environment. It then uses this information to learn a mapping between its own actions and
those performed by a human in a shared environment. It finally fuses the information from
these two models to interpret and describe human actions in light of its own experience. In …
describe the actions of human agents by reusing previous experience. The robot first learns
the association between words and object affordances by manipulating the objects in its
environment. It then uses this information to learn a mapping between its own actions and
those performed by a human in a shared environment. It finally fuses the information from
these two models to interpret and describe human actions in light of its own experience. In …
In this paper, we propose a developmental approach that allows a robot to interpret and describe the actions of human agents by reusing previous experience. The robot first learns the association between words and object affordances by manipulating the objects in its environment. It then uses this information to learn a mapping between its own actions and those performed by a human in a shared environment. It finally fuses the information from these two models to interpret and describe human actions in light of its own experience. In our experiments, we show that the model can be used flexibly to do inference on different aspects of the scene. We can predict the effects of an action on the basis of object properties. We can revise the belief that a certain action occurred, given the observed effects of the human action. In an early action recognition fashion, we can anticipate the effects when the action has only been partially observed. By estimating the probability of words given the evidence and feeding them into a predefined grammar, we can generate relevant descriptions of the scene. We believe that this is a step toward providing robots with the fundamental skills to engage in social collaboration with humans.
ieeexplore.ieee.org