Research

Publications

Dynamic simulation with humanoid-robots on XANIM

Coneural > Ioana Goga > PhD research > Dynamic simulation with humanoid-robots

We implemented the computational model of the seriated cups task, on a dynamic simulation of a child-caretaker pair of humanoid-robots. We used Xanim simulator (Schaal, 2000) to model a pair of 30 degrees of freedom (Head 3, Arms 7*2, Trunk 3, Legs 3*2, Eyes 4 D.O.F). Xanim is created using the simulation package SL (Schaal, 2000). SL has a modular structure that includes a motor servo, used to read the current state of the robot/simulation and to send commands to the robot/simulation; a task servo that allows switching between different tasks; a vision servo to collect data from camera systems; and inverse dynamics and inverse kinematics servos to allow control of the robot from Cartesian states.

The implementation of the model is described in more detail in Marian & Billard (2004).

Pairing behavior corresponding to the first developmental stage

As explained in the description of the computational model, what the agent learns during the demonstration of the seriate cups task is a result of the functioning of three mechanisms: basic categorization, joint attention and localized representation. The joint attention mechanism is necessary to focus the attention on selected objects and to decrease the amount of information learned at any time moment. An assumption of our model is that during the first seriation developmental stage, the cognitive system of the learner is characterized by a low vigilance parameter. By this we mean a low capacity to make fine distinctions between the objects placed in the focus of attention. This determines basic categories to occur and leads to the formation of two internal goals: <hand grasps cup> and <place cup into cup>.

During retrieval, the internal model activates sequentially the two goals. Each active goal drives the actions of the agent towards its satisfaction. At this stagem, the simplest behavior achieved by the agent is to pair two cups.

Figure 7. Pairing of two cups. The cup from position 2 (brown cup) is grasped and carried to position 3 (blue cup).

Choice of the cups

The choice of the cups to be combined results is a function of multiple constraints satisfaction (size, saliency and economy). There are different possible settings of the constraints probabilities:

one possible setting corresponds to a model where saliency has the higher probability, which means that the first selection criterion is the object's saliency. It can lead to various behaviors, as a function of how the other two constraints are satisfied. Figure 8a shows how the most salient object in the scene is chosen as an acting cup (blue cup from position 3), while the recipient cup is chosen to minimize the path (proximity constraint). Figure 8b shows how the acting cup, (i.e., the most salient object), is brought to position 4 (green cup), in order to satisfy size constraints.

(a)
(b)

Figure 8. Saliency driven model. The most salient object becomes the acting cup and it is nested with different cups in order to satisfy (a) proximity constraints; or (b) size constraints.

another setting corresponds to an internal model which optimizes size consonance. To do this, the agent compares internally several objects located in the focus of attention and picks the acting and the recipient cups in such a way to satisfy the constraint: "place a smaller cup into a larger cup". Note that children usually satisfy size constraints, but do not maximize size consonance at this stage (i.e., infants usually embed the smallest cup with some larger cup, probably proximity issues being also considered). See behavior illustrated in Figure 9.

Figure 9. The acting and the recipient cups are chosen to increase size consonance value.

Two pairs and the transfer of one cup

When the hand with the acting cup reaches the position of the target cuip, the system must choose between two possible actions: (a) drop the acting cup and form a pair; (b) hold the acting cup and move the hand. The satisfaction of the goal <place cup into cup> leads the system into a final state (system halts, as seen above). By contrary, if from different reasons the goal <place cup into cup> is not satisfied by the current embedding, then it remains active and it drives the behavior of the system until its completion.

By keeping the goal <place cup into cup> unsatisfied, two types of behavior can result:

one pair is formed and the system continues by choosing another acting cup to form a new pair. This type of behavior is specific to the transition from the first to the second developmental stage.
the acting cup is not dropped at the target position, and the system continues by transferring the cup through several recipient positions. See the transfer of one cup through different positions in Figure 10.

Figure 10. The yellow cup from position 1 is transferred through several positions. First, it is brought to the position 4 (green cup); second, it is brought to the position 5 (red cup); finally it is brought to position 3 (blue cup). At each move, the recipient cup is chosen to maximize size consonance. This setting of the parameters favours the acting of the system towards the satisfaction of the first goal <hand grasps cup> and determines the unsatisfaction of the second goal, what leads to the transfer of the initial cup through several target positions. From simulation simplicity reasons, a previously visited cup falls down from the table, and it is not visible anylonger.

In our model, the concept of the "object as an extension of hand" is naturally emerging by setting two parameters: (1) set a higher priority for the satisfaction of the goal <hand grasps cup> compared to the satisfaction of the goal <place cup into cup>, and (2) set a high probability for the conservation/proximity constraint.

Pot strategy corresponding to the second developmental stage

The transition from the pairing to the pot strategy can take place in two ways (in our view):

increase the number of the specific goals that are learned during the demonstration of the task (i.e., a sequence of sub-goals)
preserve an internal model with only two goals, but increase the information stored within each goal representation (i.e., two fully specified goals)

The assumption of this model is that the second stage is characterized by increased vigilance and memory resources, which are used to learn more information about the goals of the imitation task. During the second stage of development, cognitive resources are employed to learn more about the goal <place cup into cup>. This leads to an increased capacity to remember that several cups (instead of two) have been embedded. Only during the third developmental stage, attentional and memory resources are employed to learn more about the <hand grasps cup> goal. These computational assumptions were inspired by experimental data showing that infants learn what to imitate, before learning how to imitate.

The behavior of the agent in the second developmental stage is oriented towards the maximization of size consonance, computed for all the objects stored in the representation of the final state goal. With each new pair formed, the total consonance increases, and makes less probable behaviors such as, the transfer of one cup through several positions. As a function of how much time the system spends to compute the consonance and how this is computed, different behaviors can result: a tower vs. a nest; a pot containing all the cups vs.a pot with two or three cups. See the nesting of three cups illustrated in Figure 11.

Figure 11. A pot with cups is formed at position 5 (red cup). First it is embedded the brown cup from position 2. Next, the yellow cup from position 1, followed by the blue cup from position 3. Size constraints are satisfied with respect to the acting cup and the recipient pot cup. A tower may result (not shown graphically), because consonance is computed between visible cups, and the size of the already nested cups is not taken into account.

Originality of our account

We would like to point out one important aspect of our account. At the first sight, the seriate nesting cups task may be regarded as a sequence learning problem. For instance, one may consider that the transfer of the cup through different positions is a result of learning a sequence of actions or positions. Similarly, one may consider that the formation of the pot in the second developmental stage is the result of learning the sequence of cups to be embedded. Our explorations also included an approach based on sequence learning (see Marian & Billard, 2004).

Today, we came up with a different solution, which is not based on sequence learning or reproduction. The internal model of the imitator consists of two interconnected goals, which can be activated each, by a set of predecessors. The sequencing like behavior results from the process of multiple constraints satisfaction that takes place during retrieval, and it is not reproduced from a stored sequence structure. We believe that by storing information in a set of constraints, instead of learning the explicit sequence, the system is capable of reproducing the variety of behavior shown by human infants. To exhibit both consistency and variety of behavior represents an important achievement for an epigenetic robot.