The Japanese Psychonomic Society
NII-Electronic Library Service
The JapanesePsychonomic Society
lhefopaneseJbut7!alofFig),ehonomicScienca
2005,VoL 24,No. I,I]5-116
Summary
ofAwardedPresentationIP16
The
role
of
depth
information
in
object
recognition
Ayako
SANEyosHi*,
Chikashi
MicHiMATA*,
and
Keiji
SANEyosHI**
Sophia Lbeiversity*and TohJ,o
instttute
of
Teehnol(lgy**"Geons"
(Biederman,
1987)
with differentrotatlon angles were sequentially presented througha
haploscope
and theparticipantsperformed a sequential matching task.The RT increasedas therotation angle increased, The
3D
condition,in
which thestimulus was stereoscopically presented,was performed fasterthan the2D eondition.
The
eft'ect of the rotation angle was equivalentfor
the2D and 3D conditions.
Furthermore,
thercwas a possibilitythatdepth informationis
helpful in extracting thc qualitativeinformation
of an object.These
results suggested that the qualitativeinformation
extracted frornview-dependent2D
representation plays a crucial role inconstructingtherepresentation of objects.
Key words: object representation, view-dependency, stereoscoplc vision
Tarr,Williams, Hayward, and Gauthier
(1998)
re-ported thattherccognition of geons was affected
by
achange ef viewpoint. Thus they concluded thatthc representation of un objcct would not be a
3D
view-independent
$tructural descriptionbut
2D
view-dependent images. However, thc structural
descrip-tionmodel suggested thatrhe
depth
information ofan object would be very important for
reconstruct-ingthe representation of a
3D
objcct.Because
Tarr
et aL
(1998)
employed 2Dimages
as stimuli, thereisa possibilitythat theirresult was due to a lack of
depth
iniormation
of the objects, In the presentexperiment we investigatedtheeffect ofdepth
infor-mation in recognizing geons
from
differentview-points.
We
presented thegeons stereoscopicailyus-inga haploscope.
Methods
Participants
Sixteen graduate andundergradu-ate students served as volunteer participants.
Apparatus
The
stimuli were presented on aSONY
19-inch
CRT
rnonitor connectedto
anApple
PowerBook G4 computer, A ten-key
pad
wascon-nected to the computer and served as a two-key
response console.
Through
ahaploscope
each eyeviews itsrespective halfside of themonitor
Ce.g.
the*
Department
of Psychology,Sophia
University,
1 Kioicho,
Chiyoda-ku.
Tokyo,
102-8554
** Radioisotope Research
Center,
Tokyo Instituteof Technology,
Nagatsutacho,
Midori-ku,
harna-shi,
Kanagawa
226-8501
left
eye sees theleft
ha]fof monitor).In
a2D
condi-tionboth eyes receive identical
images.
In
thepres-cnt investigationwhen the 3D condition was used
two
images
withbinocular
disparitywere prcsented.Stimulus The stimuli were
20
gray-shadedgeons
{Biederman,
1987)
which were rotated around thevcrtical axis with a step of 15 degrees. All of the
stimu!i were appreximately 6X6 degrees of visua] angle insize.
Design
Theexperirnentaldesignwasanorthogo-nal combination of "View"
(2D
vs.3D)
and "RotationAngle"
(Oe,
150,30a,45",600,750,
and900).
Both
ofthe variables were manipulated within the
partici-pants.
Task andprocedure Theparticipants were asked
to decide whether
2
successively presented stimuliwere the same geons, ignoring theirrotation. In2
scparate sessions the partjcipants observed the
geons with a haploscope in
both
the2D
and 3Dcondition. Ineach triala
fixation
cross appeared for500
ms, and thiswasfollowed
by a 200 mspresenta-tionof the
first
stimulus. A mask stimulus was thendisplayedfor500 ms, and thiswas
followed
by
a 1OOms presentation of the second stimulus.
Results
andDiscussion
The
results are shown inTable1(a).
The
reactiontime data was subjected toa
2
(View)X7
(Rotation
Angle) analysis of variance with repeated
measure-ments.
The
main effect of Rotation Angle wasThe Japanese Psychonomic Society
NII-Electronic Library Service
The JapanesePsychonomic Society
116
TheJapanese
Journal
oi PsychonomicScience
Vol.
24,
No.
1
nificant
(F(6,
90)=522,
pex',Ol).
As
can be seenin
Table 1a,
RT
linearly
increased as the RotationAn-gle
increased,
both inthe2D and 3D conditions. Themain effect of View was also significant
(F(1,
15)=4.92,
p<.05),
indicatingthat the3D
condition wasperformed with shorter
RTs
than the 2D condition.The interactionof theRotation Angle by View was
not significant. ThaL is,
the
Rotation
Angle
wasequally affected inthe 2D and 3D conditions. This
implies
that the view-dependency in recognition ofgeons, as reported by Tarr et aL
(1998),
cou]d nolbe
cxplained b},a lack of depth infoTmation for the
objects.
The
present results support the theory thattheobject representation isbased on vicw-depcndent
2D images.
On
the
otherhand,
it
shouldbe
noted thatthe3D
condition was pcrforrned with a shorter RT than the
2D
condition. Itisthereforenecessary toconsidcrt.hcroLe of depth information in constructing
2D-based representation.
Hayward
and Tarr(1997)
sug-gestedthattheeffect of rotatien was largerwhcn the
rotation clid not preserve qua]itative information
(e.g.
the main axis wu' scurved or stra{ght) than whenit
did
not preserve quantitative information(e.g.
thecurvature of the main axis s-,as changed, see Figure
1)but did preserve qualitativeinformation. Thev
concLuded that qualitative
information
would bemore
important
than quantiLative jnformation forconstruction of a representation of theobject.
XVethercfore considered the possibilitythat
qua]i-tativeinformation could
be
extracted much easierwhen depth information was added and we
re-ana]yzcd the data
to
examinethis
possibility.The
RT data was subjected to a
2
(View:
2D, 3D)X2{Cha]ge
Type: quantitative,qualitative;seeFigure
1)analysis of variance with repeated measurements,
The results are shown in Table 1(b). The main
effect ef Change Type was not statistically
sig-nificanL
(F(1,
15)==3,35,
P=.09). The interactionofView
by Change Type was also not statisticallysig-nificant
(F(1,
15)=3.23,P=.09),
but
the quantitativechange produced shorter RTs than the qualitative
information
changcin
the 2D condition, cSidakistestiP<.05). However a significant differenccwas not
Table 1. The mean RTs,
(a)
View by RotationAngular
Type
Oo150
300
45e
600
750
900 Mean2D 457 470 475 481 480 477 486 475
3D 439 456
453
460
455
475
474
458(b)
View by Change Typc
Quantitativechange
(ms)
Qualitative
change2D3D
465466
481464(ms)Figure
1.
An
example of the changed typc
(Change
Type) of stimuli,revealed inthe quantitative change and qualitativc
change in the3D condition,
(Sidak's
test:p=.69).
Consequently,
the addition of depth informationreducecl the cost of the qualitat.ivechange.
This
result
implies
that
depth
{nformatjon
is
helpful
forextracting the quaritativc[nformation of an object.
These
resultsindicated
thatthe
qua]itativeinforma-tion extracted from view-dcpendent 2D images plays a crucial role inconstructing theobject
representa-tion,
References
Biederman, I.1987 Recognition-by-components: a
theory
ofhuman
image
understanding.Rsychotogi-catReview,
94,
115-
147.Hayward, W,
G,,
& Tarr,M.J,
1997 Testingtions
for
viewpoint Lnvariance jnobjecttion.
fournal
of
iixPerimentaZPs.vchotog:y:
HPR
23,.
1511-152L
Tarr,
M.
J.
Williams,
P.
Hayward,W,
G,,
& Gauthier,