INVARIANT FACE AND OBJECT RECOGNITION IN THE VISUAL SYSTEM

Progress in Neurobiology - Tập 51 Số 2 - Trang 167-194 - 1997
Guy Wallis1, Edmund T. Rolls2
1Oxford University, Department of Experimental Psychology, U.K.
2Oxford University, Department of Experimental Psychology, South Parks Road, Oxford OX1 3UDUK

Tóm tắt

Từ khóa


Tài liệu tham khảo

Abbott, 1996, Representational capacity of face coding in monkeys, Cerebral Cortex, 6, 498, 10.1093/cercor/6.3.498

Baddeley, R.J., Wakeman, E., Booth, M., Rolls, E.T. and Abbott, L.F. (1997) The distribution of firing rates of primate temporal lobe visual neurons to “natural” scenes (in preparation).

Baizer, 1991, Organization of visual inputs to the inferior temporal and posterior parietal cortex in macaques, J. Neurosci., 11, 168, 10.1523/JNEUROSCI.11-01-00168.1991

Ballard, D.H. (1990) Animate vision uses object-centred reference frames. In: Advanced Neural Computers, pp. 229–236. Ed. R. Eckmiller. North-Holland: Amsterdam.

Ballard, D.H. (1993) Subsymbolic modelling of hand-eye co-ordination. In: The Simulation of Human Intelligence, Ch. 3, pp. 71–102. Ed. D.E. Broadbent. Blackwell: Oxford.

Barlow, 1972, Single units and sensation: a neuron doctrine for perceptual psychology?, Perception, 1, 371, 10.1068/p010371

Barlow, H.B. (1985) Cerebral cortex as model builder. In: Models of the Visual Cortex, pp. 37–46. Eds D. Rose and V.G. Dobson. Wiley: Chichester.

Barlow, 1989, Finding minimum entropy codes, Neural Computat., 1, 412, 10.1162/neco.1989.1.3.412

Baylis, 1985, Selectivity between faces in the responses of a population of neurons in the cortex in the superior temporal sulcus of the monkey, Brain Res., 342, 91, 10.1016/0006-8993(85)91356-3

Baylis, 1987, Functional subdivisions of temporal lobe neocortex, J. Neurosci., 7, 330, 10.1523/JNEUROSCI.07-02-00330.1987

Baylis, 1987, Responses of neurons in the inferior temporal cortex in short term and serial recognition memory tasks, Expl Brain Res., 65, 614, 10.1007/BF00235984

Bennett, 1990, Large competitive networks, Network, 1, 449, 10.1088/0954-898X/1/4/005

Boussaoud, 1991, Visual topography of area TEO in the macaque, J. Comp. Neurol., 306, 554, 10.1002/cne.903060403

Breitmeyer, 1980, Unmasking visual masking: a look at the “why” behind the veil of the “how”, Psychol. Rev., 87, 52, 10.1037/0033-295X.87.1.52

Brown, 1990, Hebbian synapses: biological mechanisms and algorithms, Ann. Rev. Neurosci., 13, 475, 10.1146/annurev.ne.13.030190.002355

Buhmann, J., Lades, M. and von der Malsburg, C. (1990) Size and distortion invariant object recognition by hierarchical graph matching. In: International Joint Conference on Neural Networks, pp. 411–416. IEEE: New York.

Buhmann, J., Lange, J., von der Maslburg, C., Vorbrüggen, J.C. and Würtz, R.P. (1991) Object recognition in the dynamic link architecture: parallel implementation of a transputer network. In: Neural Networks for Signal Processing, pp. 121–159. Ed. B. Kosko. Prentice Hall: Englewood Cliffs, New Jersey.

Bülthoff, 1992, Psychophysical support for a two-dimensional view interpolation theory of object recognition, Proc. natn. Acad. Sci. U.S.A., 92, 60, 10.1073/pnas.89.1.60

Cavanagh, 1978, Size and location invariance in the visual system, Perception, 7, 167, 10.1068/p070167

Chakravarty, I. (1979) A generalized line and junction labelling scheme with applications to scene analysis. IEEE Transactions PAMI, April, pp. 202–205.

Engel, 1992, Temporal coding in the visual system: new vistas on integration in the nervous system, Trends Neurosci., 15, 218, 10.1016/0166-2236(92)90039-B

Feldman, 1985, Four frames suffice: a provisional model of vision and space [see p. 279], Behav. Brain Sci., 8, 265, 10.1017/S0140525X00020707

Foldiak, 1991, Learning invariance from transformation sequences, Neural Comp., 3, 193, 10.1162/neco.1991.3.2.194

Fukushima, 1980, Neocognitron: a self-organizing neural network model for a mechanism of pattern recognition unaffected by shift in position, Biol. Cybernet., 36, 193, 10.1007/BF00344251

Gross, C.G. (1973) Visual functions of the inferotemporal cortex. In: Handbook of Sensory Physiology, pp. 451–482. Springer-Verlag: Berlin.

Gross, 1985, Inferior temporal cortex and pattern recognition, Expl Brain Res. Suppl., 11, 179, 10.1007/978-3-662-09224-8_10

Hasselmo, 1989, The role of expressions and identity in the face-selective responses of neurons in the temporal visual cortex of the monkey, Behav. Brain Res., 32, 203, 10.1016/S0166-4328(89)80054-3

Hasselmo, 1989, Object-centered encoding by face-selective neurons in the cortex in the superior temporal sulcus of the monkey, Expl Brain Res., 75, 417, 10.1007/BF00247948

Hawken, M.J. and Parker, A.J. (1987) Spatial properties of the monkey striate cortex. Proc. R. Soc. London [B] 231, 251–288.

Hertz, J., Krogh, A. and Palmer, R.G. (1991) Introduction to the Theory of Neural Computation. Addison-Wesley: Wokingham, U.K.

Hinton, G.E. (1981) A parallel computation that assigns canonical object based frames of reference. In: Proceedings of the 9th International Joint Conference on Artificial Intelligence. Reviewed in Rumelhart and McClelland (1986), Ch. 4.

Hornak, 1996, Face and voice expression identification and their association with emotional and behavioural changes in patients with frontal lobe damage, Neuropsychologia, 34, 247, 10.1016/0028-3932(95)00106-9

Hummel, 1992, Dynamic binding in a neural network for shape recognition, Psychol. Rev., 99, 480, 10.1037/0033-295X.99.3.480

Humphreys, G.W. and Bruce, V. (1989) Visual Cognition. Erlbaum: Hove, U.K.

Koenderink, 1979, The internal representation of solid shape with respect to vision, Biol. Cybernet., 32, 211, 10.1007/BF00337644

Kovacs, 1995, Cortical correlate of pattern backward masking, Proc. Natn. Acad. Sci., 92, 5587, 10.1073/pnas.92.12.5587

Leonard, 1985, Neurons in the amygdala of the monkey with responses selective for faces, Behav. Brain Res., 15, 159, 10.1016/0166-4328(85)90062-2

Linsker, E. (1986) From basic network principles to neural architecture. Proc. natn. Acad. Sci. U.S.A., 83, 7508–7512, 8390–8394, 8779–8783.

Logothetis, 1994, View-dependent object recognition by monkeys, Curr. Biol., 4, 401, 10.1016/S0960-9822(00)00089-0

Marr, D. (1982) Vision. W.H. Freeman: San Francisco.

Maunsell, 1987, Visual processing in monkey extrastriate cortex, Ann. Rev. Neurosci., 10, 363, 10.1146/annurev.ne.10.030187.002051

Mel, B.W. (1996) SEEMORE: Combining color, shape, and texture histogramming in a neurally inspired approach to visual object recognition. (Unpublished manuscript.)

Miller, 1994, Parallel neuronal mechanisms for short-term memory, Science, 263, 520, 10.1126/science.8290960

Miyashita, 1993, Inferior temporal cortex: where visual perception meets memory, Ann. Rev. Neurosci., 16, 245, 10.1146/annurev.ne.16.030193.001333

Miyashita, 1988, Neuronal correlate of pictorial short-term memory in the primate temporal cortex, Nature, 331, 68, 10.1038/331068a0

Montague, 1991, Spatial signalling in the development and function of neural connections, Cerebr. Cort., 1, 199, 10.1093/cercor/1.3.199

Nass, 1975, A theory for the development of feature detecting cells in visual cortex, Biol. Cybernet., 19, 1, 10.1007/BF00319777

Oja, 1982, A simplified neuron model as a principal component analyzer, J. Math. Biol., 15, 267, 10.1007/BF00275687

Olhausen, 1993, A neurobiological model of visual attention and invariant pattern recognition based on dynamic routing of information, J. Neurosci., 13, 4700, 10.1523/JNEUROSCI.13-11-04700.1993

Perrett, 1982, Visual neurons responsive to faces in the monkey temporal cortex, Expl Brain Res., 47, 329, 10.1007/BF00239352

Perrett, 1985, Visual analysis of body movements by neurons in the temporal cortex of the macaque monkey: a preliminary report, Behav. Brain Res., 16, 153, 10.1016/0166-4328(85)90089-0

Perrett, 1985, Visual cells in temporal cortex sensitive to face view and gaze direction, Proc. R. Soc., 223B, 293, 10.1098/rspb.1985.0003

Perrett, 1987, Visual neurons responsive to faces, Trends Neurosci., 10, 358, 10.1016/0166-2236(87)90071-3

Perrett, 1992, Organisation and functions of cells responsive to faces in the temporal cortex, Phil. Trans. R. Soc. London [B], 335, 23, 10.1098/rstb.1992.0003

Poggio, 1990, A network that learns to recognize three-dimensional objects, Nature, 343, 263, 10.1038/343263a0

Poggio, 1990, Regularization algorithms for learning that are equivalent to multilayer networks, Science, 247, 978, 10.1126/science.247.4945.978

Poggio, 1990, Networks for approximation and learning, Proc. IEEE, 78, 1481, 10.1109/5.58326

Rhodes, 1992, The open time of the NMDA channel facilitates the self-organisation of invariant object responses in cortex, Soc. Neurosci. Abstr., 18, 740

Rolls, 1984, Neurons in the cortex of the temporal lobe and in the amygdala of the monkey with responses selective for faces, Human Neurobiol., 3, 209

Rolls, E.T. (1989a) Functions of neuronal networks in the hippocampus and neocortex in memory. In: Neural Models of Plasticity: Experimental and Theoretical Approaches, Ch. 13, pp. 240–265. Eds J.H. Byrne and W.O. Berry, Academic Press: San Diego.

Rolls, E.T. (1989b) The representation and storage of information in neuronal networks in the primate cerebral cortex and hippocampus. In: The Computing Neuron, Ch. 8, pp. 125–159. Eds R. Durbin, C. Miall and G. Mitchison. Addison-Wesley: Wokingham, U.K.

Rolls, E.T. (1989c) Functions of neuronal networks in the hippocampus and cerebral cortex in memory. In: Models of Brain Function, pp. 15–33. Ed. R.M.J. Coterill. Cambridge University Press: Cambridge, U.K.

Rolls, 1990, A theory of emotion, and its application to understanding the neural basis of emotion, Cogn. Emotion, 4, 161, 10.1080/02699939008410795

Rolls, 1991, Neural organisation of higher visual functions, Curr. Opin. Neurobiol., 1, 274, 10.1016/0959-4388(91)90090-T

Rolls, E.T. (1992a) Neurophysiology and functions of the primate amygdala. In: The Amygdala, Ch. 5, pp. 143–165. Ed. J.P. Aggleton. Wiley-Liss: New York.

Rolls, 1992, Neurophysiological mechanisms underlying face processing within and beyond the temporal cortical visual areas, Phil. Trans. R. Soc., 335, 11, 10.1098/rstb.1992.0002

Rolls, 1994, Brain mechanisms for invariant visual recognition and learning, Behav. Proc., 33, 113, 10.1016/0376-6357(94)90062-0

Rolls, E.T. (1995a) A theory of emotion and consciousness, and its application to understanding the neural basis of emotion. In: The Cognitive Neuroscience, Ch. 72, pp. 1091–1106. Ed. M.S. Gazzaniga. MIT Press: Cambridge, MA.

Rolls, 1995, Learning mechanisms in the temporal lobe visual cortex, Behav. Brain Res., 66, 177, 10.1016/0166-4328(94)00138-6

Rolls, E.T. (1996a) Roles of long term potentiation and long term depression in neuronal network operations in the brain. In: Cortical Plasticity: LTP and LTD, Ch. 11, pp. 223–250. Eds M.S. Fazeli and G.L. Collingridge. Bios: Oxford, U.K.

Rolls, E.T. (1996b) A neurophysiological and computational approach to the functions of the temporal lobe cortical visual areas in invariant object recognition. In: Computational and Biological Mechanisms of Visual Coding, Eds L. Harris and M. Jenkin. Cambridge University Press: Cambridge, U.K.

Rolls, 1985, Role of low and high spatial frequencies in the face-selective responses of neurons in the cortex in the superior temporal sulcus, Vision Res., 25, 1021, 10.1016/0042-6989(85)90091-4

Rolls, 1986, Size and contrast have only small effects on the responses to faces of neurons in the cortex of the superior temporal sulcus of the monkey, Expl Brain Res., 65, 38, 10.1007/BF00243828

Rolls, 1987, The responses of neurons in the cortex in the superior temporal sulcus of the monkey to band-pass spatial frequency filtered faces, Vis. Res., 27, 311, 10.1016/0042-6989(87)90081-2

Rolls, 1989, The effect of learning on the face-selective responses of neurons in the cortex in the superior temporal sulcus of the monkey, Expl Brain Res., 76, 153, 10.1007/BF00253632

Rolls, 1990, The relative advantages of sparse versus distributed encoding for associative neuronal networks in the brain, Network, 1, 407, 10.1088/0954-898X/1/4/002

Rolls, 1993, Visual learning reflected in the responses of neurons in the temporal visual cortex of the macaque, Soc. Neurosci. Abstr., 19, 27

Rolls, 1994, Processing speed in the cerebral cortex, and the neurophysiology of visual backward masking, Proc. R. Soc. B., 257, 9, 10.1098/rspb.1994.0087

Rolls, 1994, The responses of neurons in the temporal cortex of primates, and face identification and detection, Expl Brain Res., 101, 474, 10.1007/BF00227340

Rolls, 1995, Sparseness of the neuronal representation of stimuli in the primate temporal visual cortex, J. Neurophysiol., 73, 713, 10.1152/jn.1995.73.2.713

Rolls, 1995, The responses of single neurons in the temporal visual cortical areas of the macaque when more than one stimulus is present in the visual field, Expl Brain Res., 103, 409, 10.1007/BF00241500

Rolls, E.T., Treves, A. and Tovee, M.J. (1996a) The representational capacity of the distributed encoding of information provided by populations of neurons in the primate temporal visual cortex. Expl Brain Res. (in press).

Rolls, E.T., Booth, M.C.A. and Treves, A. (1996b) View-invariant representations of objects in the inferior temporal visual cortex. Soc. Neurosci. Abstr. 22.

Rolls, E.T., Tovee, M. and Treves, A. (1997) Information in the neuronal representation of individual stimuli in the primate temporal visual cortex. J. Comput. Neurosci. (in press).

Rolls, E.T. and Treves, A. (1997) Neural Networks and Brain Function. Oxford University Press: Oxford.

Seltzer, 1978, Afferent cortical connections and architectonics of the superior temporal sulcus and surrounding cortex in the rhesus monkey, Brain Res., 149, 1, 10.1016/0006-8993(78)90584-X

Simmen, M.W., Rolls, E.T. and Treves, A. (1996) On the dynamics of a network of spiking neurons. In Computations and Neuronal Systems: Proceedings of CNS95, Eds F.H. Eekman and J.M. Bower. Kluwer: Boston.

Snedecor, G.W. and Cochran, W.G. (1989) Statistical Methods, 8th edn. Iowa State University Press: Ames, IA.

Sutton, 1981, Towards a modern theory of adaptive networks: expectation and prediction, Psychol. Rev., 88, 135, 10.1037/0033-295X.88.2.135

Tanaka, K., Saito, C., Fukada, Y. and Moriya, M. (1990) Integration of form, texture, and color information in the inferotemporal cortex of the macaque. In: Vision, Memory and the Temporal Lobe, Ch. 10, pp. 101–109. Eds E. Iwai and M. Mishkin. Elsevier: New York.

Tanaka, 1991, Coding visual images of objects in the inferotemporal cortex of the macaque monkey, J. Neurophysiol., 66, 170, 10.1152/jn.1991.66.1.170

Tarr, 1989, Mental rotation and orientation-dependence in shape recognition, Cognit. Psychol., 21, 233, 10.1016/0010-0285(89)90009-1

Thorpe, S.J. and Imbert, M. (1989) Biological constraints on connectionist models. In: Connectionism in Perspective, pp. 63–92. Eds R. Pfeifer, Z. Schreter and F. Fogelman-Soulie. Elsevier: Amsterdam.

Tovee, 1993, Information encoding and the responses of single neurons in the primate temporal visual cortex, J. Neurophysiol., 70, 640, 10.1152/jn.1993.70.2.640

Tovee, 1994, Translation invariance and the responses of neurons in the temporal visual cortical areas of primates, J. Neurophysiol., 72, 1049, 10.1152/jn.1994.72.3.1049

Tovee, 1995, Information encoding in short firing rate epochs by single neurons in the primate temporal visual cortex, Vis. Cognit., 2, 35, 10.1080/13506289508401721

Tovee, 1996, Visual learning in neurons of the primate temporal visual cortex, NeuroReport, 7, 2757, 10.1097/00001756-199611040-00070

Treves, 1993, Mean-field analysis of neuronal spike dynamics, Network, 4, 259, 10.1088/0954-898X/4/3/002

Treves, 1994, A computational analysis of the role of the hippocampus in memory, Hippocampus, 4, 374, 10.1002/hipo.450040319

Turvey, 1973, On the peripheral and central processes in vision: inferences from an information processing analysis of masking with patterned stimuli, Psychol. Rev., 80, 1, 10.1037/h0033872

Ungerleider, L.G. and Mishkin, M. (1982) Two cortical visual systems. In: Analysis of Visual Behaviour, pp. 549–586. Eds D.J. Ingle, M.A. Goodale and R.J.W. Mansfield. MIT Press: Cambridge, MA.

Ungerleider, 1994, “What” and “Where” in the human brain, Curr. Opin. Neurobiol., 4, 157, 10.1016/0959-4388(94)90066-3

Van Essen, 1992, Information processing in the primate visual system: an integrated systems perspective, Science, 255, 419, 10.1126/science.1734518

von der Malsburg, 1973, Self-organization of orientation sensitive cells in the striate cortex [Reprinted in Anderson and Rosenfeld, 1988], Kybernetik, 14, 85, 10.1007/BF00288907

von der Malsburg, C. (1981) The Correlation Theory of Brain Function. Technical report 81–2, Department of Neurobiology, Max-Planck-Institute for Biophysical Chemistry, Göttingen.

von der Malsburg, 1986, A neural cocktail-party processor, Biol. Cybernet., 54, 29, 10.1007/BF00337113

von der Malsburg, C. (1990) A neural architecture for the representation of scenes. In: Brain Organization and Memory: Cells, Systems and Circuits, Ch. 18, pp. 356–372. Eds J.L. McGaugh, N.M. Weinberger and G. Lynch. Oxford University Press: New York.

Wallis, G. (1984) Neural Mechanisms Underlying Processing in the Visual Areas of the Occipital and Temporal Lobes. Doctoral thesis, Department of Experimental Psychology, Oxford University, U.K.

Wallis, G. (1996a) Optimal unsupervised learning in invariant object recognition Neural Computation (in press).

Wallis, G. (1996b) Using spatio-temporal correlations to learn invariant object recognition. Neural Networks (in press).

Wallis, G. (1996c) Temporal order in human object recognition. J. Biol. Syst. (in press).

Wallis, 1993, Learning invariant responses to the natural transformations of objects, Intl Joint Conf. Neural Networks, 2, 1087

Yamane, 1988, What facial features activate face neurons in the inferotemporal cortex of the monkey?, Expl Brain Res., 73, 209, 10.1007/BF00279674