Representation
Kadir F. Uyanik and Asil Kaan Bozcuoglu
Kadir F. Uyanik 1444405 kadir@ceng.metu.edu.tr
Asil Kaan Bozcuoglu 1773829 asil@kovan.ceng.metu.edu.tr
Abstract: The concept of affordances was introduced by J. J. Gibson to explain how organisms perceive the “values” and “meanings” of things in the environment and how this perception is linked to the action possibilities offered to the organism. This concept has recently been used in cognitive robotics because of its emphasis on the interaction between the organism and the environment. Another theory, explaining how object recognition occurs in humans, is the theory of Recognition by Components proposed by I. Biederman. Although these concepts were introduced in psychology, they influence the study of autonomous robotics. In the vast majority of robotics studies, raw or low-level sensory data are taken as the perceptual input to the learning system, based on Gibson’s claim that the meaning of the objects in the environment is directly apparent to the agent acting in it. In this report, we first review these theories, then propose an affordance learner system that uses the relevant structural properties of objects to relate its action repertoire to these structural features and to the outcomes of the applied actions. This system may overcome the representational inadequacies seen in most affordance learner systems and yield a higher-level perceptual representation of objects. Finally, we analyze and discuss the preliminary results obtained while testing each sub-module of the system.
I. INTRODUCTION
Recent studies in robotics, especially those on autonomous systems, have mainly focused on developing systems that mimic human intelligence. With this goal, robotics research is becoming more and more interdisciplinary, making use of developments in cognitive science, psychology, ethology, neuroscience, and many other science and engineering disciplines. In this study, we are inspired by the theory of affordances from Ecological Psychology and the theory of recognition-by-parts from Cognitive Psychology to develop an autonomous robotic system that can perceive the meaning of objects to some extent if it is given a chance to interact with them.
A. Affordances Concept
The concept of affordances was introduced by J. J. Gibson [1] to explain how organisms perceive the “values” and “meanings” of things in the environment and how this perception is linked to the action possibilities offered to the organism. According to this concept, organisms do not need to recognize an object and make complex inferences about its meaning in order to determine which actions it offers before interacting with it. For instance, we do not need to recognize an object when we need to interact with it immediately. Instead, we look for a set of features of the object that provides enough information to deduce the corresponding affordance for a specific action and desired outcome. This hypothesis is also supported by recent neuroscientific findings. The ventral pathway of the brain is found to be the place that recognizes objects, whereas the dorsal pathway is responsible for the perception of possible actions. Overall, the anterior intra-parietal area is the neural basis of action affordances. In experiments, this area is highly active while the organism is trying to grasp an object [4].
In this work, we consider three main features of affordances that can be associated with robots:
• Affordances are relative. An affordance depends neither on the organism alone nor on the environment alone. Rather, it emerges through their interaction. For example, the hold-ability affordance of a stone depends not only on the physical features of the stone but also on the physical features of the organism holding it.
• Affordances provide perceptual economy. Perceptual economy is the hypothesis that the organism does not need to process all of the perceptual information in order to accomplish a simple task; it is enough to process only the relevant information. In other words, the organism does not care about all the perceptual features of an entity but filters and processes only the information it needs to perform a specific action in order to reach a desired effect. Hence, this feature of affordances provides minimality and low computational cost for perception.
• Affordances provide general information with limited interaction. While the discussion on how much interaction is needed between the organism and the environment is still continuing, it is usually assumed that the use of affordances enables one to learn whether a chair that an organism sees for the first time would afford sit-ability.
In this study, we use the affordance formalization proposed by Sahin et al. [6], as shown in Figure 1.
B. Recognition by Components Theory
Recognition by Components (RBC) theory, proposed by Irving Biederman [3], argues that we as humans recognize a novel or unfamiliar object by parsing it into its primitive components and then comparing the spatial and structural relations between these parts with those of known objects. The human visual recognition system can be likened to the speech recognition system, considering the fact that lexical access during speech perception can be successfully
Fig. 1. An affordance is an acquired relation between a certain “effect” and an (entity, behavior) tuple of an agent, such that applying the “behavior” to the “entity” generates that “effect” [6].
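The relation described in the caption of Fig. 1 can be pictured as a small data structure. The following is only a minimal sketch under our own assumptions: the class name `Affordance`, the helper `behaviors_for`, and the example relations are hypothetical and are not taken from [6] or from the implemented system.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Affordance:
    """One acquired relation: applying `behavior` to `entity` yields `effect`."""
    effect: str
    entity: str
    behavior: str


def behaviors_for(relations, entity, desired_effect):
    """Return the behaviors known to produce `desired_effect` on `entity`."""
    return sorted({r.behavior for r in relations
                   if r.entity == entity and r.effect == desired_effect})


# Hypothetical relations, written as if they had been acquired by interaction.
learned = [
    Affordance(effect="lifted", entity="cup", behavior="grasp"),
    Affordance(effect="moved", entity="cup", behavior="push"),
    Affordance(effect="moved", entity="ball", behavior="push"),
]

print(behaviors_for(learned, "cup", "lifted"))
```

Querying by entity and desired effect reflects the idea that the agent only needs the relations relevant to the action it is about to take, rather than a full description of the object.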
Fig. 2. The objects on the right are composed of the geons on the left.
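One way to picture the decomposition in Fig. 2 is as a labeled graph whose nodes are geons and whose edges are spatial relations between them. The sketch below is illustrative only: the geon and relation labels are hypothetical, and the comparison is a coarse signature match (counts of geon types and of typed relations), not a full graph-matching procedure and not necessarily what the system described here uses.

```python
from collections import Counter


def make_object(parts, relations):
    """parts: {part_name: geon_type}; relations: [(part_a, relation, part_b)]."""
    return {"parts": dict(parts), "relations": list(relations)}


def signature(obj):
    """Summarize an object independently of the names given to its parts."""
    parts = obj["parts"]
    geon_counts = Counter(parts.values())
    relation_counts = Counter(
        (parts[a], rel, parts[b]) for a, rel, b in obj["relations"]
    )
    return geon_counts, relation_counts


def same_structure(a, b):
    """True if both objects have the same geon types in the same relations."""
    return signature(a) == signature(b)


# Hypothetical examples: a handle attached to the side versus to the top.
mug = make_object({"body": "cylinder", "handle": "curved_cylinder"},
                  [("handle", "side-attached", "body")])
other_mug = make_object({"p1": "curved_cylinder", "p2": "cylinder"},
                        [("p1", "side-attached", "p2")])
pail = make_object({"body": "cylinder", "handle": "curved_cylinder"},
                   [("handle", "top-attached", "body")])

print(same_structure(mug, other_mug), same_structure(mug, pail))
```

The two mugs match even though their parts are named differently, while the pail differs only in the relation between the same two geons, which is the kind of distinction RBC attributes to structural relations.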
Fig. 6. Left: the input point cloud, representing a table with two mugs on it. Right: the small figures show the resulting object clusters from different viewpoints in the visualization tool.
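The step from an input point cloud to separate object clusters, as in Fig. 6, can be illustrated with a simple distance-based region-growing procedure. This is a sketch under stated assumptions: the text does not say which clustering method the part decomposition module uses, and the function name `euclidean_clusters`, the `radius` parameter, and the toy points are hypothetical.

```python
import math
from collections import deque


def euclidean_clusters(points, radius):
    """Group 3D points so that points within `radius` of a member join its cluster."""
    unvisited = set(range(len(points)))
    clusters = []
    while unvisited:
        seed = unvisited.pop()
        queue, members = deque([seed]), [seed]
        while queue:
            i = queue.popleft()
            # Collect the still-unassigned points near point i (brute force, O(n^2)).
            near = [j for j in unvisited
                    if math.dist(points[i], points[j]) <= radius]
            for j in near:
                unvisited.remove(j)
                queue.append(j)
                members.append(j)
        clusters.append(sorted(members))
    return sorted(clusters)


# Toy data: two well-separated groups of points standing in for two objects.
cloud = [(0.0, 0.0, 0.0), (0.1, 0.0, 0.0), (0.2, 0.0, 0.0),
         (5.0, 5.0, 5.0), (5.1, 5.0, 5.0)]
print(euclidean_clusters(cloud, radius=0.15))
```

The brute-force neighbor search keeps the sketch dependency-free; a real point cloud would need a spatial index such as a k-d tree to stay tractable.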
Fig. 7. Amplitude data of comparison objects.
Fig. 8. Amplitude data of some test cases.
The geometric feature of each component is also manually assigned.
For testing, four different cups are chosen as the comparison objects (Figure 5), in exactly the orientations shown in Figure 5. Eight different cups in four different orientations, 32 samples in total, are given to the system as test data. Some of these samples can be seen in Figure 6.
The module correctly identifies 28 of the 32 test samples when the comparison graph is the cup of case a. For case b, 19 cups are identified. Finally, 23 and 20 cups are identified for cases c and d, respectively.
Analyzing the results, we see that the module is more successful when the handle of the cup lies along the z-axis, as in case a. The reason is that the module checks the Mahalanobis distance in order to distinguish big objects from small ones. If the handle of the cup has a different orientation, as in case b, the Mahalanobis distance calculated between components becomes less reliable because the point cloud of the handle contains fewer points. This may be a problem for the current version of the system. On the other hand, we are planning to have the robot construct a generic graph for each affordance in the supervised training phase. Once this feature is in place, the comparison graph should be general enough to cover these small variations.
IV. FUTURE WORK
The system we propose here is intended as a complete solution for affordance-based learning; therefore, it has many features that are not yet implemented in the current version. In this section, we propose a roadmap for the system. After the development of the part decomposition and graphical representation modules is finalized, the very first thing to do is to connect the two currently implemented modules in an online fashion and test the overall system.
For the part decomposition module, Gaussian curvature estimation and RANSAC primitive shape extraction methods are going to be tried, and more robust point cloud clustering methods are to be implemented.
For the structural pattern recognition module, a mechanism for constructing a generic graph will be implemented. With this mechanism, after training with a sufficient number of examples, the system is expected to make significantly fewer errors in affordance perception.
V. CONCLUSION
In this report, we described psychological theories, namely the concept of affordances and RBC theory. We then proposed an intelligent system that deduces the possible set of actions for an object based on these theories. Finally, we gave the details of the current implementation of the system and analyzed the initial experimental results.
REFERENCES
[1] J. J. Gibson, The Ecological Approach to Visual Perception. Lawrence Erlbaum Associates, 1986.
[2] E. J. Gibson, The world is so full of a number of things: On specification and perceptual learning, Ecological Psychology, 15, 2003, pp. 283-288.
[3] I. Biederman, Recognition-by-components: A theory of human image understanding, Psychological Review, 94(2), 1987, pp. 115-147.
[4] J. Norman, Two visual systems and two theories, Behavioral and Brain Sciences, 25, 2002, pp. 73-144.
[5] W. Marslen-Wilson, Optimal efficiency in human speech processing, Max-Planck-Institut für Psycholinguistik, Nijmegen, The Netherlands, 1980.
[6] E. Sahin, M. Cakmak, M. R. Dogar, E. Ugur, G. Ucoluk, To Afford or Not to Afford: A New Formalization of Affordances Toward Affordance-Based Robot Control, Adaptive Behavior, 2007, pp. 447-472.