A saliency-based search mechanism for overt and covert shifts of visual attention

Vision Research - Volume 40, Issues 10-12 - Pages 1489-1506 - 2000
Laurent Itti¹, Christof Koch
¹ Computation and Neural Systems Program, Division of Biology, California Institute of Technology, Mail-Code 139-74, Pasadena, CA 91125, USA.

References

Andersen, R.A. (1997). Multimodal integration for the representation of space in the posterior parietal cortex. Philosophical Transactions of the Royal Society of London, Series B: Biological Sciences, 352, 1421–1428.

Andersen, 1990, Eye position effects on visual, memory, and saccade-related activity in areas LIP and 7A of macaque, Journal of Neuroscience, 10, 1176, 10.1523/JNEUROSCI.10-04-01176.1990

Bergen, 1983, Parallel versus serial processing in rapid pattern discrimination, Nature, 303, 696, 10.1038/303696a0

Beymer, 1996, Image representations for visual learning, Science, 272, 1905, 10.1126/science.272.5270.1905

Bijl, P., Kooi, F. L., & van Dorresteijn, M. (1997). Visual search performance for realistic target imagery from the DISSTAF field trials. Soesterberg, The Netherlands: TNO Human Factors Research Institute.

Braun, 1998, Withdrawing attention at little or no cost: detection and discrimination tasks, Perception and Psychophysics, 60, 1, 10.3758/BF03211915

Braun, 1990, Vision outside the focus of attention, Perception and Psychophysics, 48, 45, 10.3758/BF03205010

Braun, 1998, Vision and attention: the role of training (letter; comment), Nature (Comment on: Nature June 19;387(6635), 805–807), 393, 424, 10.1038/30875

Burt, 1983, The Laplacian pyramid as a compact image code, IEEE Transactions on Communications, 31, 532, 10.1109/TCOM.1983.1095851

Cannon, 1991, Spatial interactions in apparent contrast: inhibitory effects among grating patterns of different spatial frequencies, spatial positions and orientations, Vision Research, 31, 1985, 10.1016/0042-6989(91)90193-9

Colby, 1999, Space and attention in parietal cortex, Annual Review of Neuroscience, 22, 319, 10.1146/annurev.neuro.22.1.319

Corbetta, 1998, Frontoparietal cortical networks for directing attention and the eye to visual locations: identical, independent, or overlapping neural systems?, Proceedings of the National Academy of Sciences of the United States of America, 95, 831, 10.1073/pnas.95.3.831

Crick, 1998, Constraints on cortical and thalamic projections: the no-strong-loops hypothesis, Nature, 391, 245, 10.1038/34584

Desimone, 1995, Neural mechanisms of selective visual attention, Annual Review of Neuroscience, 18, 193, 10.1146/annurev.ne.18.030195.001205

DeValois, 1982, Spatial-frequency selectivity of cells in macaque visual cortex, Vision Research, 22, 545, 10.1016/0042-6989(82)90113-4

Driver, 1992, Motion coherence and conjunction search-implications for guided search theory, Perception and Psychophysics, 51, 79, 10.3758/BF03205076

Engel, 1997, Colour tuning in human visual cortex measured with functional magnetic resonance imaging, Nature, 388, 68, 10.1038/40398

Gallant, 1998, Neural activity in areas V1, V2 and V4 during free viewing of natural scenes compared to controlled viewing, Neuroreport, 9, 85, 10.1097/00001756-199801050-00017

Gilbert, 1983, Clustered intrinsic connections in cat visual cortex, Journal of Neuroscience, 3, 1116, 10.1523/JNEUROSCI.03-05-01116.1983

Gilbert, 1989, Columnar specificity of intrinsic horizontal and corticocortical connections in cat visual cortex, Journal of Neuroscience, 9, 2432, 10.1523/JNEUROSCI.09-07-02432.1989

Gilbert, 1996, Spatial integration and cortical dynamics, Proceedings of the National Academy of Sciences of the United States of America, 93, 615, 10.1073/pnas.93.2.615

Gottlieb, 1998, The representation of visual salience in monkey parietal cortex, Nature, 391, 481, 10.1038/35135

Greenspan, H., Belongie, S., Goodman, R., Perona, P., Rakshit, S., & Anderson, C. H. (1994). Overcomplete steerable pyramid filters and rotation invariance. In Proc. IEEE Computer Vision and Pattern Recognition (CVPR), Seattle, WA (June), 222–228.

Hamker, F. H. (1999). The role of feedback connections in task-driven visual search. In D. von Heinke, G. W. Humphreys, & A. Olson, Connectionist Models in Cognitive Neuroscience, Proc. of the 5th neural computation and psychology workshop (NCPW’98). London: Springer-Verlag.

Heisenberg, M., & Wolf, R. (1984). Studies of brain function, vol. 12: vision in Drosophila. Berlin: Springer-Verlag.

Hikosaka, 1996, Orienting of spatial attention: its reflexive, compensatory, and voluntary mechanisms, Brain Research and Cognitive Brain Research, 5, 1, 10.1016/S0926-6410(96)00036-5

Hillstrom, 1994, Visual-motion and attentional capture, Perception and Psychophysics, 55, 399, 10.3758/BF03205298

Horiuchi, T., Morris, T., Koch, C., & DeWeerth, S. (1997). Analog VLSI circuits for attention-based visual tracking. In M. Mozer, M. Jordan, & T. Petsche, Neural information processing systems (NIPS*9) (706–712). Cambridge, MA: MIT Press.

Horowitz, 1998, Visual search has no memory, Nature, 394, 575, 10.1038/29068

Itti, L., & Koch, C. (1999). A comparison of feature combination strategies for saliency-based visual attention systems. In SPIE human vision and electronic imaging IV (HVEI’99), San Jose, CA (pp. 473–482).

Itti, 1998, A model of saliency-based visual-attention for rapid scene analysis, IEEE Transactions on Pattern Analysis and Machine Intelligence, 20, 1254, 10.1109/34.730558

James, 1890

Knierim, 1992, Neuronal responses to static texture patterns in area V1 of the alert macaque monkey, Journal of Neurophysiology, 67, 961, 10.1152/jn.1992.67.4.961

Koch, 1985, Shifts in selective visual attention: towards the underlying neural circuitry, Human Neurobiology, 4, 219

Koch, 1998

Kustov, 1996, Shared neural control of attentional shifts and eye movements, Nature, 384, 74, 10.1038/384074a0

Kwak, 1992, Consequences of allocating attention to locations and to other attributes, Perception and Psychophysics, 51, 455, 10.3758/BF03211641

Laberge, 1990, Positron emission tomographic measurements of pulvinar activity during an attention task, Journal of Neuroscience, 10, 613, 10.1523/JNEUROSCI.10-02-00613.1990

Lee, 1999, Attention activates winner-take-all competition among visual filters, Nature Neuroscience, 2, 375, 10.1038/7286

Leventhal, A. (1991). The neural basis of visual function. In Vision and visual dysfunction, vol. 4. Boca Raton, FL: CRC Press.

Levitt, 1997, Contrast dependence of contextual effects in primate visual cortex, Nature, 387, 73, 10.1038/387073a0

Luschow, 1993, Pop-out of orientation but no pop-out of motion at isoluminance, Vision Research, 33, 91, 10.1016/0042-6989(93)90062-2

Malach, 1994, Cortical columns as devices for maximizing neuronal diversity, Trends in Neurosciences, 17, 101, 10.1016/0166-2236(94)90113-9

Malach, 1993, Relationship between intrinsic connections and functional architecture revealed by optical imaging and in vivo targeted biocytin injections in primate striate cortex, Proceedings of the National Academy of Sciences of the United States of America, 90, 10469, 10.1073/pnas.90.22.10469

Malik, 1990, Preattentive texture discrimination with early vision mechanisms, Journal of the Optical Society of America A, 7, 923, 10.1364/JOSAA.7.000923

Motter, 1998, The guidance of eye movements during active visual search, Vision Research, 38, 1805, 10.1016/S0042-6989(97)00349-0

Nakayama, 1989, Sustained and transient components of focal visual attention, Vision Research, 29, 1631, 10.1016/0042-6989(89)90144-2

Niebur, E., & Koch, C. (1996). Control of selective visual attention: modeling the ‘where’ pathway. In D. Touretzky, M. Mozer, & M. Hasselmo, Neural information processing systems (NIPS 8), (802–808). Cambridge, MA: MIT Press.

Niebur, 1998, Computational architectures for attention, 163

Niyogi, 1998, Incorporating prior information in machine learning by creating virtual examples, Proceedings of the IEEE, 86, 2196, 10.1109/5.726787

Noton, 1971, Scanpaths in eye movements during pattern perception, Science, 171, 308, 10.1126/science.171.3968.308

O’Regan, 1999, Change-blindness as a result of ‘mudsplashes’, Nature, 398, 34, 10.1038/17953

Olshausen, 1993, A neurobiological model of visual attention and invariant pattern recognition based on dynamic routing of information, Journal of Neuroscience, 13, 4700, 10.1523/JNEUROSCI.13-11-04700.1993

Poggio, 1997, Image representations for visual learning, Lecture Notes in Computer Science, 1206, 143, 10.1007/BFb0015989

Polat, 1994, The architecture of perceptual spatial interactions, Vision Research, 34, 73, 10.1016/0042-6989(94)90258-5

Polat, 1994, Spatial interactions in human vision: from near to far via experience-dependent cascades of connections, Proceedings of the National Academy of Sciences of the United States of America, 91, 1206, 10.1073/pnas.91.4.1206

Posner, 1982, Neural systems control of spatial orienting, Philosophical Transactions of the Royal Society of London, Series B: Biological Sciences, 298, 187, 10.1098/rstb.1982.0081

Rao, 1995, An active vision architecture based on iconic representations, Artificial Intelligence, 78, 461, 10.1016/0004-3702(95)00026-7

Robinson, 1992, The pulvinar and visual salience, Trends in Neurosciences, 15, 127, 10.1016/0166-2236(92)90354-B

Rockland, 1983, Intrinsic laminar lattice connections in primate visual cortex, Journal of Comparative Neurology, 216, 303, 10.1002/cne.902160307

Rockland, 1999, Single axon analysis of pulvinocortical connections to several visual areas in the macaque, Journal of Comparative Neurology, 406, 221, 10.1002/(SICI)1096-9861(19990405)406:2<221::AID-CNE7>3.0.CO;2-K

Saarinen, 1991, The speed of attentional shifts in the visual field, Proceedings of the National Academy of Sciences of the United States of America, 88, 1812, 10.1073/pnas.88.5.1812

Sheliga, 1994, Orienting of attention and eye movements, Experimental Brain Research, 98, 507

Shepherd, 1986, The relationship between eye movements and spatial attention, Quarterly Journal of Experimental Psychology, 38, 475, 10.1080/14640748608401609

Sillito, 1996, Context-dependent interactions and visual processing in V1, Journal of Physiology Paris, 90, 205, 10.1016/S0928-4257(97)81424-6

Sillito, 1995, Visual cortical mechanisms detecting focal orientation discontinuities, Nature, 378, 492, 10.1038/378492a0

Simoncelli, 1992, Shiftable multiscale transforms, IEEE Transactions on Information Theory, 38, 587, 10.1109/18.119725

Simons, 1997, Failure to detect changes to attended objects, Investigative Ophthalmology and Visual Science, 38, 3273

Toet, A., Bijl, P., Kooi, F. L., & Valeton, J. M. (1998). A high-resolution image dataset for testing search and detection models (TNO-TM-98-A020). TNO Human Factors Research Institute, Soesterberg, The Netherlands.

Tootell, 1988, Functional anatomy of macaque striate cortex. I. Ocular dominance, binocular interactions, and baseline conditions, Journal of Neuroscience, 8, 1500, 10.1523/JNEUROSCI.08-05-01500.1988

Treisman, 1980, A feature-integration theory of attention, Cognitive Psychology, 12, 97, 10.1016/0010-0285(80)90005-5

Treisman, 1988, Feature analysis in early vision: evidence from search asymmetries, Psychological Review, 95, 15, 10.1037/0033-295X.95.1.15

Treisman, 1988, Features and objects: the fourteenth Bartlett memorial lecture, Quarterly Journal of Experimental Psychology A, 40, 201, 10.1080/02724988843000104

Ts’o, 1986, Relationships between horizontal interactions and functional architecture in cat striate cortex as revealed by cross-correlation analysis, Journal of Neuroscience, 6, 1160, 10.1523/JNEUROSCI.06-04-01160.1986

Tsotsos, 1995, Modeling visual-attention via selective tuning, Artificial Intelligence, 78, 507, 10.1016/0004-3702(95)00025-9

Wagenaar, 1969, Note on the construction of digram-balanced Latin squares, Psychological Bulletin, 72, 384, 10.1037/h0028329

Weliky, 1995, Patterns of excitation and inhibition evoked by horizontal connections in visual cortex share a common relationship to orientation columns, Neuron, 15, 541, 10.1016/0896-6273(95)90143-4

Wolfe, 1994, Visual search in continuous, naturalistic stimuli, Vision Research, 34, 1187, 10.1016/0042-6989(94)90300-X

Yarbus, 1967

Yuille, 1989, A mathematical-analysis of the motion coherence theory, International Journal of Computer Vision, 3, 155, 10.1007/BF00126430

Zenger, 1996, Isolating excitatory and inhibitory nonlinear spatial interactions involved in contrast detection, Vision Research, 36, 2497, 10.1016/0042-6989(95)00303-7