Modeling guidance and recognition in categorical search: bridging human and computer object detection.

Abstract:

:Search is commonly described as a repeating cycle of guidance to target-like objects, followed by the recognition of these objects as targets or distractors. Are these indeed separate processes using different visual features? We addressed this question by comparing observer behavior to that of support vector machine (SVM) models trained on guidance and recognition tasks. Observers searched for a categorically defined teddy bear target in four-object arrays. Target-absent trials consisted of random category distractors rated in their visual similarity to teddy bears. Guidance, quantified as first-fixated objects during search, was strongest for targets, followed by target-similar, medium-similarity, and target-dissimilar distractors. False positive errors to first-fixated distractors also decreased with increasing dissimilarity to the target category. To model guidance, nine teddy bear detectors, using features ranging in biological plausibility, were trained on unblurred bears then tested on blurred versions of the same objects appearing in each search display. Guidance estimates were based on target probabilities obtained from these detectors. To model recognition, nine bear/nonbear classifiers, trained and tested on unblurred objects, were used to classify the object that would be fixated first (based on the detector estimates) as a teddy bear or a distractor. Patterns of categorical guidance and recognition accuracy were modeled almost perfectly by an HMAX model in combination with a color histogram feature. We conclude that guidance and recognition in the context of search are not separate processes mediated by different features, and that what the literature knows as guidance is really recognition performed on blurred objects viewed in the visual periphery.

journal_name

J Vis

journal_title

Journal of vision

authors

Zelinsky GJ,Peng Y,Berg AC,Samaras D

doi

10.1167/13.3.30

subject

Has Abstract

pub_date

2013-10-08 00:00:00

pages

30

issue

3

issn

1534-7362

pii

13.3.30

journal_volume

13

pub_type

杂志文章
  • Can attention selectively bias bistable perception? Differences between binocular rivalry and ambiguous figures.

    abstract::It is debated whether different forms of bistable perception result from common or separate neural mechanisms. Binocular rivalry involves perceptual alternations between competing monocular images, whereas ambiguous figures such as the Necker cube lead to alternations between two possible pictorial interpretations. Pr...

    journal_title:Journal of vision

    pub_type: 杂志文章

    doi:10.1167/4.7.2

    authors: Meng M,Tong F

    更新日期:2004-07-01 00:00:00

  • Induced movement: the flying bluebottle illusion.

    abstract::Two small objects (flies) followed identical circular orbits. However, a large background that circled around behind them in different phases made one orbit look twice as large as the other (size illusion) or made the circles look like very thin horizontal or vertical ellipses with aspect ratios of 7.5:1 or more (shap...

    journal_title:Journal of vision

    pub_type: 杂志文章

    doi:10.1167/6.10.8

    authors: Anstis S,Casco C

    更新日期:2006-09-22 00:00:00

  • Cross-modal effects of auditory magnitude on visually guided grasping.

    abstract::Recent research has established the role of objects' semantic properties in the planning of motor actions with respect to these objects. It has been shown that visual numerical magnitude affects visuomotor control in a similar direction to the effect of physical size: The larger the numerical value, the larger the gri...

    journal_title:Journal of vision

    pub_type: 杂志文章

    doi:10.1167/15.8.2

    authors: Namdar G,Ganel T

    更新日期:2015-01-01 00:00:00

  • Interactions between luminance and color signals: effects on shape.

    abstract::Although luminance and color are thought to be processed independently at early stages of visual processing, there is evidence that they interact at later stages. For example, chromatic information has been shown to enhance or suppress depth from luminance depending on whether chromatic edges are aligned or orthogonal...

    journal_title:Journal of vision

    pub_type: 杂志文章,评审

    doi:10.1167/13.5.16

    authors: Clery S,Bloj M,Harris JM

    更新日期:2013-04-18 00:00:00

  • Response similarity as a basis for perceptual binding.

    abstract::Detection of low-contrast Gabor patches (GPs) is improved when flanked by collinear GPs, whereas suppression is observed for high-contrast GPs. The facilitation resembles the principles of Gestalt theory of perceptual organization. We propose a model for contour integration in the context of noise that incorporates a ...

    journal_title:Journal of vision

    pub_type: 杂志文章

    doi:10.1167/8.7.17

    authors: Sterkin A,Sterkin A,Polat U

    更新日期:2008-07-07 00:00:00

  • Deep neural networks capture texture sensitivity in V2.

    abstract::Deep convolutional neural networks (CNNs) trained on visual objects have shown intriguing ability to predict some response properties of visual cortical neurons. However, the factors (e.g., if the model is trained or not, receptive field size) and computations (e.g., convolution, rectification, pooling, normalization)...

    journal_title:Journal of vision

    pub_type: 杂志文章

    doi:10.1167/jov.20.7.21

    authors: Laskar MNU,Sanchez Giraldo LG,Schwartz O

    更新日期:2020-07-01 00:00:00

  • Reaching and grasping actions and their context shape the perception of object size.

    abstract::Humans frequently estimate the size of objects to grasp them. In fact, when performing an action, our perception is focused towards the visual properties of the object that enable us to successfully execute the action. However, the motor system is also able to influence perception, but only a few studies have reported...

    journal_title:Journal of vision

    pub_type: 杂志文章

    doi:10.1167/17.12.10

    authors: Bosco A,Daniele F,Fattori P

    更新日期:2017-10-01 00:00:00

  • The effects of smooth occlusions and directions of illumination on the visual perception of 3-D shape from shading.

    abstract::Human observers made local orientation judgments of smoothly shaded surfaces illuminated from different directions by large area lights, both with and without visible smooth occlusion contours. Test-retest correlations between the first and second halves of the experiment revealed that observers' judgments were highly...

    journal_title:Journal of vision

    pub_type: 杂志文章

    doi:10.1167/15.2.24

    authors: Egan EJ,Todd JT

    更新日期:2015-02-24 00:00:00

  • Classification of apparent motion percepts based on temporal factors.

    abstract::As pointed out by M. Wertheimer (1912), a number of qualitatively different motion impressions, such as "optimal motion," "part motion," and "pure phi," may be evoked by manipulating the temporal parameters of two-element apparent motion sequences. We investigated how the transitions between the different percepts dep...

    journal_title:Journal of vision

    pub_type: 杂志文章

    doi:10.1167/8.4.31

    authors: Ekroll V,Faul F,Golz J

    更新日期:2008-04-29 00:00:00

  • Topographical representation of binocular depth in the human visual cortex using fMRI.

    abstract::We used binocular stimuli to define how the visual location of stereoscopic depth structure maps topographically onto the human visual cortex. The main stimulus consisted of a circular disk of dots, most at zero-disparity, against which a single quadrant was defined with changing disparity ('correlated' disparity), an...

    journal_title:Journal of vision

    pub_type: 杂志文章

    doi:10.1167/7.14.15

    authors: Bridge H,Parker AJ

    更新日期:2007-12-17 00:00:00

  • Dynamic information for the recognition of conversational expressions.

    abstract::Communication is critical for normal, everyday life. During a conversation, information is conveyed in a number of ways, including through body, head, and facial changes. While much research has examined these latter forms of communication, the majority of it has focused on static representations of a few, supposedly ...

    journal_title:Journal of vision

    pub_type: 杂志文章

    doi:10.1167/9.13.7

    authors: Cunningham DW,Wallraven C

    更新日期:2009-12-07 00:00:00

  • Disentangling the effects of spatial inconsistency of targets and distractors when searching in realistic scenes.

    abstract::Previous research has suggested that correctly placed objects facilitate eye guidance, but also that objects violating spatial associations within scenes may be prioritized for selection and subsequent inspection. We analyzed the respective eye guidance of spatial expectations and target template (precise picture or v...

    journal_title:Journal of vision

    pub_type: 杂志文章

    doi:10.1167/15.2.12

    authors: Spotorno S,Malcolm GL,Tatler BW

    更新日期:2015-02-10 00:00:00

  • Occlusion and the solution to visual motion ambiguity: Looking beyond the aperture problem.

    abstract::A horizontally moving grating viewed within a diamond-shaped aperture can be made to appear to move obliquely by introducing appropriate depth-ordering cues (R. O. Duncan, T. D. Albright, & G. R. Stoner, 2000). It is commonly assumed that the depth cues in such displays determine which line terminators are seen as int...

    journal_title:Journal of vision

    pub_type: 杂志文章

    doi:10.1167/8.2.4

    authors: van der Smagt MJ,Stoner GR

    更新日期:2008-02-20 00:00:00

  • Activity in visual area V4 correlates with surface perception.

    abstract::The neural mechanisms responsible for unifying noncontiguous regions of a visual image into a percept of a single surface remain largely unknown. To investigate these mechanisms, we used a novel stimulus in which local luminance was the only cue for surface segmentation. Subjects viewed an array of small adjoining ele...

    journal_title:Journal of vision

    pub_type: 杂志文章

    doi:10.1167/8.7.28

    authors: Bouvier SE,Cardinal KS,Engel SA

    更新日期:2008-11-07 00:00:00

  • The contribution of local and global motion adaptation in the repulsive direction aftereffect.

    abstract::After adapting to a certain motion direction, our perception of a similar direction will be repelled away from the adapting direction, a phenomenon known as the direction aftereffect (DAE). As the motion system consists of local and global processing stages, it remains unclear how the adaptation of the two stages cont...

    journal_title:Journal of vision

    pub_type: 杂志文章

    doi:10.1167/18.12.2

    authors: Lee ALF

    更新日期:2018-11-01 00:00:00

  • Pupil responses to high-level image content.

    abstract::The link between arousal and pupil dilation is well studied, but it is less known that other cognitive processes can trigger pupil responses. Here we present evidence that pupil responses can be induced by high-level scene processing, independent of changes in low-level features or arousal. In Experiment 1, we recorde...

    journal_title:Journal of vision

    pub_type: 杂志文章

    doi:10.1167/13.6.7

    authors: Naber M,Nakayama K

    更新日期:2013-05-17 00:00:00

  • Blur clarified: a review and synthesis of blur discrimination.

    abstract::Blur is an important attribute of human spatial vision, and sensitivity to blur has been the subject of considerable experimental research and theoretical modeling. Often, these models have invoked specialized concepts or mechanisms, such as intrinsic blur, multiple spatial frequency channels, or blur estimation units...

    journal_title:Journal of vision

    pub_type: 杂志文章,评审

    doi:10.1167/11.5.10

    authors: Watson AB,Ahumada AJ

    更新日期:2011-09-19 00:00:00

  • Global attention facilitates the planning, but not execution of goal-directed reaches.

    abstract::In daily life, humans interact with multiple objects in complex environments. A large body of literature demonstrates that target selection is biased toward recently attended features, such that reaches are faster and trajectory curvature is reduced when target features (i.e., color) are repeated (priming of pop-out)....

    journal_title:Journal of vision

    pub_type: 杂志文章

    doi:10.1167/16.9.7

    authors: McCarthy JD,Song JH

    更新日期:2016-07-01 00:00:00

  • Transfer in motion discrimination learning was no greater in double training than in single training.

    abstract::We investigated the controversy regarding double training in motion discrimination learning. We collected data from 43 participants in a motion direction discrimination learning task with either double training (i.e., training plus exposure) or single training (i.e., no exposure). By pooling these data with those in t...

    journal_title:Journal of vision

    pub_type: 杂志文章

    doi:10.1167/17.6.7

    authors: Huang J,Liang J,Zhou Y,Liu Z

    更新日期:2017-06-01 00:00:00

  • The perception of gloss depends on highlight congruence with surface shading.

    abstract::Studies have shown that displacing specular highlights from their natural locations in images reduces perceived surface gloss. Here, we assessed the extent to which perceived gloss depends on congruence in the position and orientation of specular highlights relative to surface shape and the diffuse shading from which ...

    journal_title:Journal of vision

    pub_type: 杂志文章

    doi:10.1167/11.9.4

    authors: Kim J,Marlow P,Anderson BL

    更新日期:2011-08-12 00:00:00

  • Future path and tangent point models in the visual control of locomotion in curve driving.

    abstract::Studying human behavior in the natural context of everyday visual tasks--including locomotor tasks such as driving--can reveal visual strategies or even suggest underlying visual mechanisms. This paper reviews empirical and theoretical work in the past 20 years (1994-2014) on the visual control of steering a vehicle a...

    journal_title:Journal of vision

    pub_type: 杂志文章,评审

    doi:10.1167/14.12.21

    authors: Lappi O

    更新日期:2014-10-21 00:00:00

  • Slow feature analysis yields a rich repertoire of complex cell properties.

    abstract::In this study we investigate temporal slowness as a learning principle for receptive fields using slow feature analysis, a new algorithm to determine functions that extract slowly varying signals from the input data. We find a good qualitative and quantitative match between the set of learned functions trained on imag...

    journal_title:Journal of vision

    pub_type: 杂志文章

    doi:10.1167/5.6.9

    authors: Berkes P,Wiskott L

    更新日期:2005-07-20 00:00:00

  • Dichoptic color saturation mixture: Binocular luminance contrast promotes perceptual averaging.

    abstract::We demonstrate a new type of interaction between suprathreshold color (chromatic) and luminance contrast in the context of binocular vision. When two isoluminant colored disks of identical hue but different saturations are presented to different eyes, the apparent saturation of the resulting "dichoptic" mix is close t...

    journal_title:Journal of vision

    pub_type: 杂志文章

    doi:10.1167/15.5.2

    authors: Kingdom FA,Libenson L

    更新日期:2015-01-01 00:00:00

  • Binocular contrast discrimination needs monocular multiplicative noise.

    abstract::The effects of signal and noise on contrast discrimination are difficult to separate because of a singularity in the signal-detection-theory model of two-alternative forced-choice contrast discrimination (Katkov, Tsodyks, & Sagi, 2006). In this article, we show that it is possible to eliminate the singularity by combi...

    journal_title:Journal of vision

    pub_type: 杂志文章

    doi:10.1167/16.5.12

    authors: Ding J,Levi DM

    更新日期:2016-01-01 00:00:00

  • Discrimination of locomotion direction in impoverished displays of walkers by macaque monkeys.

    abstract::A vast literature exists on human biological motion perception in impoverished displays, e.g., point-light walkers. Less is known about the perception of impoverished biological motion displays in macaques. We trained 3 macaques in the discrimination of facing direction (left versus right) and forward versus backward ...

    journal_title:Journal of vision

    pub_type: 杂志文章

    doi:10.1167/10.4.22

    authors: Vangeneugden J,Vancleef K,Jaeggli T,VanGool L,Vogels R

    更新日期:2009-04-28 00:00:00

  • Viewpoint oscillation improves the perception of distance travelled based on optic flow.

    abstract::When static observers are presented with a visual simulation of forward self-motion, they generally misestimate distance travelled relative to a previously seen distant target: It has been suggested that this finding can be accounted for by a "leaky path integration" model. In the present study, using a similar experi...

    journal_title:Journal of vision

    pub_type: 杂志文章

    doi:10.1167/16.15.4

    authors: Bossard M,Goulon C,Mestre DR

    更新日期:2016-12-01 00:00:00

  • Short-term global motion adaptation induces a compression in the subjective duration of dynamic visual events.

    abstract::Apparent duration can be manipulated in a local region of visual field by long-term adaptation to motion or flicker (Johnston, Arnold, & Nishida, 2006). These effects show narrow spatial tuning (Ayhan, Bruno, Nishida, & Johnston, 2009), as well as retinotopic position dependency (Bruno, Ayhan, & Johnston, 2010), suppo...

    journal_title:Journal of vision

    pub_type: 杂志文章

    doi:10.1167/19.5.19

    authors: Gulhan D,Ayhan I

    更新日期:2019-05-01 00:00:00

  • Sensory and cognitive influences on the training-related improvement of reading speed in peripheral vision.

    abstract::Reading speed in normal peripheral vision is slow but can be increased through training on a letter-recognition task. The aim of the present study is to investigate the sensory and cognitive factors responsible for this improvement. The visual span is hypothesized to be a sensory bottleneck limiting reading speed. Thr...

    journal_title:Journal of vision

    pub_type: 杂志文章

    doi:10.1167/13.7.14

    authors: He Y,Legge GE,Yu D

    更新日期:2013-06-24 00:00:00

  • Robust object-based encoding in visual working memory.

    abstract::Recently, researchers have begun to investigate how nonspatial perceptual information is extracted into visual working memory (VWM), focusing particularly on object-based encoding (OBE). That is, whenever even one feature-dimension is selected for entry into VWM, the others are also extracted automatically. While ther...

    journal_title:Journal of vision

    pub_type: 杂志文章

    doi:10.1167/13.2.1

    authors: Shen M,Tang N,Wu F,Shui R,Gao Z

    更新日期:2013-02-01 00:00:00

  • A role for incidental auditory learning in auditory-visual word learning among kindergarten children.

    abstract::This study focused on the potential role of incidental, auditory perceptual learning in among children learning new words. To this end, we examined how irrelevant auditory similarities across words, that provide no cues regarding their visual or conceptual attributes, influence pseudo-word learning in a name/picture m...

    journal_title:Journal of vision

    pub_type: 杂志文章

    doi:10.1167/jovi.20.3.4

    authors: Banai K,Nir B,Moav-Scheff R,Bar-Ziv N

    更新日期:2020-03-17 00:00:00