Size tuning in the absence of spatial frequency tuning in object recognition

J. Fiser, Suresh Subramaniam, Irving Biederman

Research output: Contribution to journalArticle

18 Citations (Scopus)

Abstract

How do we attend to objects at a variety of sizes as we view our visual world? Because of an advantage in identification of lowpass over highpass filtered patterns, as well as large over small images, a number of theorists have assumed that size-independent recognition is achieved by spatial frequency (SF) based coarse-to-fine tuning. We found that the advantage of large sizes or low SFs was lost when participants attempted to identify a target object (specified verbally) somewhere in the middle of a sequence of 40 images of objects, each shown for only 72 ms, as long as the target and distractors were the same size or spatial frequency (unfiltered or low or high bandpassed). When targets were of a different size or scale than the distractors, a marked advantage (pop out) was observed for large (unfiltered) and low SF targets against small (unfiltered) and high SF distractors, respectively, and a marked decrement for the complementary conditions. Importantly, this pattern of results for large and small images was unaffected by holding absolute or relative SF content constant over the different sizes and it could not be explained by simple luminance- or contrast-based pattern masking. These results suggest that size/scale tuning in object recognition was accomplished over the first several images ( <576 ms) in the sequence and that the size tuning was implemented by a mechanism sensitive to spatial extent rather than to variations in spatial frequency.

Original languageEnglish
Pages (from-to)1931-1950
Number of pages20
JournalVision Research
Volume41
Issue number15
DOIs
Publication statusPublished - 2001

Keywords

  • Coarse-to-fine tuning
  • Human object recognition
  • Rapid serial visual presentation
  • Size invariance
  • Spatial frequency

ASJC Scopus subject areas

  • Ophthalmology
  • Sensory Systems

Cite this

Size tuning in the absence of spatial frequency tuning in object recognition. / Fiser, J.; Subramaniam, Suresh; Biederman, Irving.

In: Vision Research, Vol. 41, No. 15, 2001, p. 1931-1950.

Research output: Contribution to journalArticle

Fiser, J. ; Subramaniam, Suresh ; Biederman, Irving. / Size tuning in the absence of spatial frequency tuning in object recognition. In: Vision Research. 2001 ; Vol. 41, No. 15. pp. 1931-1950.
@article{4922c21ef0a74a7a9fb687b1d0c08ae1,
title = "Size tuning in the absence of spatial frequency tuning in object recognition",
abstract = "How do we attend to objects at a variety of sizes as we view our visual world? Because of an advantage in identification of lowpass over highpass filtered patterns, as well as large over small images, a number of theorists have assumed that size-independent recognition is achieved by spatial frequency (SF) based coarse-to-fine tuning. We found that the advantage of large sizes or low SFs was lost when participants attempted to identify a target object (specified verbally) somewhere in the middle of a sequence of 40 images of objects, each shown for only 72 ms, as long as the target and distractors were the same size or spatial frequency (unfiltered or low or high bandpassed). When targets were of a different size or scale than the distractors, a marked advantage (pop out) was observed for large (unfiltered) and low SF targets against small (unfiltered) and high SF distractors, respectively, and a marked decrement for the complementary conditions. Importantly, this pattern of results for large and small images was unaffected by holding absolute or relative SF content constant over the different sizes and it could not be explained by simple luminance- or contrast-based pattern masking. These results suggest that size/scale tuning in object recognition was accomplished over the first several images ( <576 ms) in the sequence and that the size tuning was implemented by a mechanism sensitive to spatial extent rather than to variations in spatial frequency.",
keywords = "Coarse-to-fine tuning, Human object recognition, Rapid serial visual presentation, Size invariance, Spatial frequency",
author = "J. Fiser and Suresh Subramaniam and Irving Biederman",
year = "2001",
doi = "10.1016/S0042-6989(01)00062-1",
language = "English",
volume = "41",
pages = "1931--1950",
journal = "Vision Research",
issn = "0042-6989",
publisher = "Elsevier Limited",
number = "15",

}

TY - JOUR

T1 - Size tuning in the absence of spatial frequency tuning in object recognition

AU - Fiser, J.

AU - Subramaniam, Suresh

AU - Biederman, Irving

PY - 2001

Y1 - 2001

N2 - How do we attend to objects at a variety of sizes as we view our visual world? Because of an advantage in identification of lowpass over highpass filtered patterns, as well as large over small images, a number of theorists have assumed that size-independent recognition is achieved by spatial frequency (SF) based coarse-to-fine tuning. We found that the advantage of large sizes or low SFs was lost when participants attempted to identify a target object (specified verbally) somewhere in the middle of a sequence of 40 images of objects, each shown for only 72 ms, as long as the target and distractors were the same size or spatial frequency (unfiltered or low or high bandpassed). When targets were of a different size or scale than the distractors, a marked advantage (pop out) was observed for large (unfiltered) and low SF targets against small (unfiltered) and high SF distractors, respectively, and a marked decrement for the complementary conditions. Importantly, this pattern of results for large and small images was unaffected by holding absolute or relative SF content constant over the different sizes and it could not be explained by simple luminance- or contrast-based pattern masking. These results suggest that size/scale tuning in object recognition was accomplished over the first several images ( <576 ms) in the sequence and that the size tuning was implemented by a mechanism sensitive to spatial extent rather than to variations in spatial frequency.

AB - How do we attend to objects at a variety of sizes as we view our visual world? Because of an advantage in identification of lowpass over highpass filtered patterns, as well as large over small images, a number of theorists have assumed that size-independent recognition is achieved by spatial frequency (SF) based coarse-to-fine tuning. We found that the advantage of large sizes or low SFs was lost when participants attempted to identify a target object (specified verbally) somewhere in the middle of a sequence of 40 images of objects, each shown for only 72 ms, as long as the target and distractors were the same size or spatial frequency (unfiltered or low or high bandpassed). When targets were of a different size or scale than the distractors, a marked advantage (pop out) was observed for large (unfiltered) and low SF targets against small (unfiltered) and high SF distractors, respectively, and a marked decrement for the complementary conditions. Importantly, this pattern of results for large and small images was unaffected by holding absolute or relative SF content constant over the different sizes and it could not be explained by simple luminance- or contrast-based pattern masking. These results suggest that size/scale tuning in object recognition was accomplished over the first several images ( <576 ms) in the sequence and that the size tuning was implemented by a mechanism sensitive to spatial extent rather than to variations in spatial frequency.

KW - Coarse-to-fine tuning

KW - Human object recognition

KW - Rapid serial visual presentation

KW - Size invariance

KW - Spatial frequency

UR - http://www.scopus.com/inward/record.url?scp=0034966822&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0034966822&partnerID=8YFLogxK

U2 - 10.1016/S0042-6989(01)00062-1

DO - 10.1016/S0042-6989(01)00062-1

M3 - Article

C2 - 11412885

AN - SCOPUS:0034966822

VL - 41

SP - 1931

EP - 1950

JO - Vision Research

JF - Vision Research

SN - 0042-6989

IS - 15

ER -