Recent developments in human motion analysis

Pattern Recognition - Volume 36 - Pages 585-601 - 2003
Liang Wang, Weiming Hu, Tieniu Tan
National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing 100080, People’s Republic of China

References

R.T. Collins, et al., A system for video surveillance and monitoring: VSAM final report, CMU-RI-TR-00-12, Technical Report, Carnegie Mellon University, 2000.
Haritaoglu, 2000, W4: real-time surveillance of people and their activities, IEEE Trans. Pattern Anal. Mach. Intell., 22, 809, 10.1109/34.868683
Remagnino, 1998, Multi-agent visual surveillance of dynamic scenes, Image Vision Comput., 16, 529, 10.1016/S0262-8856(98)00099-7
Maggioni, 1998, Gesture computer: history, design, and applications
W. Freeman, C. Weissman, Television control by hand gestures, Proceedings of the International Conference on Automatic Face and Gesture Recognition, 1995, pp. 179–183.
Gavrila, 1999, The visual analysis of human movement, Comput. Vision Image Understanding, 73, 82, 10.1006/cviu.1998.0716
Collins, 2000, Introduction to the special section on video surveillance, IEEE Trans. Pattern Anal. Mach. Intell., 22, 745, 10.1109/TPAMI.2000.868676
Maybank, 2000, Introduction to special section on visual surveillance, Int. J. Comput. Vision, 37, 173, 10.1023/A:1008151520284
J. Steffens, E. Elagin, H. Neven, Person Spotter-fast and robust system for human detection, tracking and recognition, Proceedings of the IEEE International Conference on Automatic Face and Gesture Recognition, 1998, pp. 516–521.
J. Yang, A. Waibel, A real-time face tracker, Proceedings of the IEEE CS Workshop on Applications of Computer Vision, Sarasota, FL, 1996, pp. 142–147.
B. Moghaddam, W. Wahid, A. Pentland, Beyond eigenfaces: probabilistic matching for face recognition, Proceedings of the IEEE International Conference on Automatic Face and Gesture Recognition, 1998, pp. 30–35.
C. Wang, M.S. Brandstein, A hybrid real-time face tracking system, Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, Seattle, WA, 1998.
Little, 1998, Recognizing people by their gait, J. Comput. Vision Res., 1
J.D. Shutler, M.S. Nixon, C.J. Harris, Statistical gait recognition via velocity moments, Proceedings of the IEE Colloquium on Visual Biometrics, 2000, pp. 10/1–10/5.
Huang, 1999, Human gait recognition in canonical space using temporal templates, Proc. IEE Vision Image Signal Process., 146, 93, 10.1049/ip-vis:19990187
D. Cunado, M.S. Nixon, J.N. Carter, Automatic gait recognition via model-based evidence gathering, Proceedings of the Workshop on Automatic Identification Advanced Technologies, New Jersey, 1998, pp. 27–30.
Boghossian, 1999, Image processing system for pedestrian monitoring using neural classification of normal motion patterns, Meas. Control, 32, 261, 10.1177/002029409903200902
B.A. Boghossian, S.A. Velastin, Motion-based machine vision techniques for the management of large crowds, Proceedings of the IEEE Sixth International Conference on Electronics, Circuits and Systems, September 5–8, 1999.
Yi Li, Songde Ma, Hanqing Lu, Human posture recognition using multi-scale morphological method and Kalman motion estimation, Proceedings of the IEEE International Conference on Pattern Recognition, 1998, pp. 175–177.
J. Segen, S. Kumar, Shadow gestures: 3D hand pose estimation using a single camera, Proceedings of the IEEE CS Conference on Computer Vision and Pattern Recognition, 1999, pp. 479–485.
M-H. Yang, N. Ahuja, Recognizing hand gesture using motion trajectories, Proceedings of the IEEE CS Conference on Computer Vision and Pattern Recognition, 1999, pp. 468–472.
Y. Cui, J.J. Weng, Hand segmentation using learning-based prediction and verification for hand sign recognition, Proceedings of the IEEE CS Conference on Computer Vision and Pattern Recognition, 1997, pp. 88–93.
M. Turk, Visual interaction with lifelike characters, Proceedings of the IEEE International Conference on Automatic Face and Gesture Recognition, Killington, 1996, pp. 368–373.
H.M. Lakany, G.M. Hayes, M. Hazlewood, S.J. Hillman, Human walking: tracking and analysis, Proceedings of the IEE Colloquium on Motion Analysis and Tracking, 1999, pp. 5/1–5/14.
M. Köhle, D. Merkl, J. Kastner, Clinical gait analysis by neural networks: issues and experiences, Proceedings of the IEEE Symposium on Computer-Based Medical Systems, 1997, pp. 138–143.
D. Meyer, J. Denzler, H. Niemann, Model based extraction of articulated objects in image sequences for gait analysis, Proceedings of the IEEE International Conference on Image Processing, 1997, pp. 78–81.
W. Freeman, et al., Computer vision for computer games, Proceedings of the International Conference on Automatic Face and Gesture Recognition, 1996, pp. 100–105.
J.K. Aggarwal, Q. Cai, W. Liao, B. Sabata, Articulated and elastic non-rigid motion: a review, Proceedings of the IEEE Workshop on Motion of Non-Rigid and Articulated Objects, 1994, pp. 2–14.
Cedras, 1995, Motion-based recognition, Image Vision Comput., 13, 129, 10.1016/0262-8856(95)93154-K
J.K. Aggarwal, Q. Cai, Human motion analysis: a review, Proceedings of the IEEE Workshop on Motion of Non-Rigid and Articulated Objects, 1997, pp. 90–102.
Aggarwal, 1999, Human motion analysis, Comput. Vision Image Understanding, 73, 428, 10.1006/cviu.1998.0744
Pentland, 2000, Looking at people, IEEE Trans. Pattern Anal. Mach. Intell., 22, 107, 10.1109/34.824823
Moeslund, 2001, A survey of computer vision-based human motion capture, Comput. Vision Image Understanding, 81, 231, 10.1006/cviu.2000.0897
K.P. Karmann, A. Brandt, Moving object recognition using an adaptive background memory, in: V. Cappellini (Ed.), Time-Varying Image Processing and Moving Object Recognition, Vol. 2, Elsevier, Amsterdam, The Netherlands, 1990.
M. Kilger, A shadow handler in a video-based real-time traffic monitoring system, Proceedings of the IEEE Workshop on Applications of Computer Vision, 1992, pp. 1060–1066.
Yang, 1992, The background primal sketch, Mach. Vision Appl., 5, 17, 10.1007/BF01213527
Wren, 1997, Pfinder, IEEE Trans. Pattern Anal. Mach. Intell., 19, 780, 10.1109/34.598236
C. Stauffer, W. Grimson, Adaptive background mixture models for real-time tracking, Proceedings of the IEEE CS Conference on Computer Vision and Pattern Recognition, Vol. 2, 1999, pp. 246–252.
McKenna, 2000, Tracking groups of people, Comput. Vision Image Understanding, 80, 42, 10.1006/cviu.2000.0870
S. Arseneau, J.R. Cooperstock, Real-time image segmentation for action recognition, Proceedings of the IEEE Pacific Rim Conference on Communications, Computers and Signal Processing, 1999, pp. 86–89.
H.Z. Sun, T. Feng, T.N. Tan, Robust extraction of moving objects from image sequences, Proceedings of the Fourth Asian Conference on Computer Vision, Taiwan, 2000, pp. 961–964.
A. Elgammal, D. Harwood, L.S. Davis, Nonparametric background model for background subtraction, Proceedings of the Sixth European Conference on Computer Vision, 2000.
A.J. Lipton, H. Fujiyoshi, R.S. Patil, Moving target classification and tracking from real-time video, Proceedings of the IEEE Workshop on Applications of Computer Vision, 1998, pp. 8–14.
C. Anderson, P. Burt, G. Vander Wal, Change detection and tracking using pyramids transformation techniques, Proceedings of the SPIE-Intelligent Robots and Computer Vision, Vol. 579, 1985, pp. 72–78.
Bergen, 1992, A three frame algorithm for estimating two-component image motion, IEEE Trans. Pattern Anal. Mach. Intell., 14, 886, 10.1109/34.161348
Y. Kameda, M. Minoh, A human motion estimation method using 3-successive video frames, Proceedings of the International Conference on Virtual Systems and Multimedia, 1996.
A. Verri, S. Uras, E. DeMicheli, Motion segmentation from optical flow, Proceedings of the Fifth Alvey Vision Conference, 1989, pp. 209–214.
A. Meygret, M. Thonnat, Segmentation of optical flow and 3d data for the interpretation of mobile objects, Proceedings of the International Conference on Computer Vision, Osaka, Japan, December 1990.
Barron, 1994, Performance of optical flow techniques, Int. J. Comput. Vision, 12, 42, 10.1007/BF01420984
H.A. Rowley, J.M. Rehg, Analyzing articulated motion using expectation-maximization, Proceedings of the International Conference on Pattern Recognition, 1997, pp. 935–941.
A.M. Baumberg, D. Hogg, Learning spatio-temporal models from training examples, Technical Report of University of Leeds, September 1995.
C. Bregler, Learning and recognizing human dynamics in video sequences, Proceedings of the IEEE CS Conference on Computer Vision and Pattern Recognition, 1997, pp. 568–574.
McLachlan, 1997
N. Friedman, S. Russell, Image segmentation in video sequences: a probabilistic approach, Proceedings of the 13th Conference on Uncertainty in Artificial Intelligence, August 1–3, 1997.
E. Stringa, Morphological change detection algorithms for surveillance applications, British Machine Vision Conference, 2000, pp. 402–411.
Y. Kuno, T. Watanabe, Y. Shimosakoda, S. Nakagawa, Automated detection of human for visual surveillance system, Proceedings of the International Conference on Pattern Recognition, 1996, pp. 865–869.
Cutler, 2000, Robust real-time periodic motion detection, analysis, and applications, IEEE Trans. Pattern Anal. Mach. Intell., 22, 781, 10.1109/34.868681
A.J. Lipton, Local application of optic flow to analyse rigid versus non-rigid motion, In the website http://www.eecs.lehigh.edu/FRAME/Lipton/iccvframe.html.
A. Selinger, L. Wixson, Classifying moving objects as rigid or non-rigid without correspondences, Proceedings of the DARPA Image Understanding Workshop, Vol. 1, 1998, pp. 341–358.
M. Oren, et al., Pedestrian detection using wavelet templates, Proceedings of the IEEE CS Conference on Computer Vision and Pattern Recognition, 1997, pp. 193–199.
C. Stauffer, Automatic hierarchical classification using time-base co-occurrences, Proceedings of the IEEE CS Conference on Computer Vision and Pattern Recognition, 1999, pp. 333–339.
A. Iketani, et al., Detecting persons on changing background, Proceedings of the International Conference on Pattern Recognition, Vol. 1, 1998, pp. 74–76.
A.M. Elgammal, L.S. Davis, Probabilistic framework for segmenting people under occlusion, Proceedings of the International Conference on Computer Vision, 2001.
D. Meyer, J. Denzler, H. Niemann, Model based extraction of articulated objects in image sequences, Proceedings of the Fourth International Conference on Image Processing, 1997.
Mohan, 2001, Example-based object detection in images by components, IEEE Trans. Pattern Anal. Mach. Intell., 23, 349, 10.1109/34.917571
L. Zhao, C. Thorpe, Recursive context reasoning for human detection and parts identification, Proceedings of the IEEE Workshop on Human Modeling, Analysis and Synthesis, June 2000.
Ioffe, 2001, Probabilistic methods for finding people, Int. J. Comput. Vision, 43, 45, 10.1023/A:1011179004708
G. Welch, G. Bishop, An introduction to the Kalman filter, from http://www.cs.unc.edu, UNC-Chapel Hill, TR95-041, November 2000.
Isard, 1998, Condensation - conditional density propagation for visual tracking, Int. J. Comput. Vision, 29, 5, 10.1023/A:1008078328650
H. Sidenbladh, M.J. Black, D.J. Fleet, Stochastic tracking of 3D human figures using 2D image motion, Proceedings of the European Conference on Computer Vision, 2000.
V. Pavlović, J.M. Rehg, T.-J. Cham, K.P. Murphy, A dynamic Bayesian network approach to figure tracking using learned dynamic models, Proceedings of the International Conference on Computer Vision, 1999, pp. 94–101.
L. Goncalves, E.D. Bernardo, E. Ursella, P. Perona, Monocular tracking of the human arm in 3D, Proceedings of the Fifth International Conference on Computer Vision, Cambridge, 1995, pp. 764–770.
J. Rehg, T. Kanade, Visual tracking of high DOF articulated structures: an application to human hand tracking, Proceedings of the European Conference on Computer Vision, 1994, pp. 35–46.
D. Meyer, et al., Gait classification with HMMs for trajectories of body parts extracted by mixture densities, British Machine Vision Conference, 1998, pp. 459–468.
P. Fieguth, D. Terzopoulos, Color-based tracking of heads and other mobile objects at video frame rate, Proceedings of the IEEE CS Conference on Computer Vision and Pattern Recognition, 1997, pp. 21–27.
Jang, 2000, Active models for tracking moving objects, Pattern Recognition, 33, 1135, 10.1016/S0031-3203(99)00100-4
I.A. Karaulova, P.M. Hall, A.D. Marshall, A hierarchical model of dynamics for tracking people with a single video camera, British Machine Vision Conference, 2000, pp. 352–361.
Guo, 1994, Tracking human body motion based on a stick figure model, Visual Commun. Image Representation, 5, 1, 10.1006/jvci.1994.1001
Y. Guo, G. Xu, S. Tsuji, Understanding human motion patterns, Proceedings of the International Conference on Pattern Recognition, 1994, pp. 325–329.
Leung, 1995, First sight, IEEE Trans. Pattern Anal. Mach. Intell., 17, 359, 10.1109/34.385981
I.-C. Chang, C.-L. Huang, Ribbon-based motion analysis of human body movements, Proceedings of the International Conference on Pattern Recognition, Vienna, 1996, pp. 436–440.
S.A. Niyogi, E.H. Adelson, Analyzing and recognizing walking figures in XYT, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 1994, pp. 469–474.
S. Ju, M. Black, Y. Yacoob, Cardboard people: a parameterized model of articulated image motion, Proceedings of the IEEE International Conference on Automatic Face and Gesture Recognition, 1996, pp. 38–44.
Rohr, 1994, Towards model-based recognition of human movements in image sequences, CVGIP: Image Understanding, 59, 94, 10.1006/ciun.1994.1006
Wachter, 1999, Tracking persons in monocular image sequences, Comput. Vision Image Understanding, 74, 174, 10.1006/cviu.1999.0758
J.M. Rehg, T. Kanade, Model-based tracking of self-occluding articulated objects, Proceedings of the Fifth International Conference on Computer Vision, Cambridge, 1995, pp. 612–617.
I.A. Kakadiaris, D. Metaxas, Model-based estimation of 3-D human motion with occlusion based on active multi-viewpoint selection, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, San Francisco, 1996, pp. 81–87.
N. Goddard, Incremental model-based discrimination of articulated movement from motion features, Proceedings of the IEEE Workshop on Motion of Non-Rigid and Articulated Objects, Austin, 1994, pp. 89–94.
Badenas, 2001, Motion-based segmentation and region tracking in image sequences, Pattern Recognition, 34, 661, 10.1016/S0031-3203(00)00014-5
A. Baumberg, D. Hogg, An efficient method for contour tracking using active shape models, Proceedings of the IEEE Workshop on Motion of Non-Rigid and Articulated Objects, Austin, 1994, pp. 194–199.
M. Isard, A. Blake, Contour tracking by stochastic propagation of conditional density, Proceedings of the European Conference on Computer Vision, 1996, pp. 343–356.
Paragios, 2000, Geodesic active contours and level sets for the detection and tracking of moving objects, IEEE Trans. Pattern Anal. Mach. Intell., 22, 266, 10.1109/34.841758
Bertalmio, 2000, Morphing active contours, IEEE Trans. Pattern Anal. Mach. Intell., 22, 733, 10.1109/34.865191
Peterfreund, 2000, Robust tracking of position and velocity with Kalman snakes, IEEE Trans. Pattern Anal. Mach. Intell., 22, 564, 10.1109/34.771328
Zhong, 2001, Object tracking using deformable templates, IEEE Trans. Pattern Anal. Mach. Intell., 22, 544, 10.1109/34.857008
Baumberg, 1996, Generating spatio-temporal models from examples, Image Vision Comput., 14, 525, 10.1016/0262-8856(96)01092-X
R. Polana, R. Nelson, Low level recognition of human motion, Proceedings of the IEEE CS Workshop on Motion of Non-Rigid and Articulated Objects, Austin, TX, 1994, pp. 77–82.
Tissainayagam, 2001, Visual tracking with automatic motion model switching, Pattern Recognition, 34, 641, 10.1016/S0031-3203(00)00019-4
A. Azarbayejani, A. Pentland, Real-time self-calibrating stereo person tracking using 3D shape estimation from blob features, Proceedings of the International Conference on Pattern Recognition, 1996, pp. 627–632.
Q. Cai, A. Mitiche, J.K. Aggarwal, Tracking human motions in an indoor environment, Proceedings of the International Conference on Image Processing, Vol. 1, 1995, pp. 215–218.
J. Segen, S. Pingali, A camera-based system for tracking people in real time, Proceedings of the International Conference on Pattern Recognition, 1996, pp. 63–67.
T.-J. Cham, J.M. Rehg, A multiple hypothesis approach to figure tracking, Proceedings of the IEEE CS Conference on Computer Vision and Pattern Recognition, 1999, pp. 239–245.
Ricquebourg, 2000, Real-time tracking of moving persons by exploiting spatio-temporal image slices, IEEE Trans. Pattern Anal. Mach. Intell., 22, 797, 10.1109/34.868682
Darrell, 2000, Integrated person tracking using stereo, color, and pattern detection, Int. J. Comput. Vision, 37, 175, 10.1023/A:1008103604354
M. Rossi, A. Bozzoli, Tracking and counting people, Proceedings of the International Conference on Image Processing, Austin, 1994, pp. 212–216.
H. Fujiyoshi, A.J. Lipton, Real-time human motion analysis by image skeletonization, Proceedings of the IEEE Workshop on Applications of Computer Vision, 1998, pp. 15–21.
Q. Cai, J.K. Aggarwal, Tracking human motion using multiple cameras, Proceedings of the 13th International Conference on Pattern Recognition, 1996, pp. 68–72.
D. Gavrila, L. Davis, 3-D model-based tracking of humans in action: a multi-view approach, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, San Francisco, 1996, pp. 73–80.
A. Utsumi, H. Mori, J. Ohya, M. Yachida, Multiple-view-based tracking of multiple humans, Proceedings of the International Conference on Pattern Recognition, 1998, pp. 597–601.
T.H. Chang, S. Gong, E.J. Ong, Tracking multiple people under occlusion using multiple cameras, British Machine Vision Conference, 2000, pp. 566–575.
Boult, 1998
Zheng, 1995, Automatic feature point extraction and tracking in image sequences for arbitrary camera motion, Int. J. Comput. Vision, 15, 31, 10.1007/BF01450849
Barrón, 2001, Estimating anthropometry and pose from a single uncalibrated image, Comput. Vision Image Understanding, 81, 269, 10.1006/cviu.2000.0888
Y. Wu, T.S. Huang, A co-inference approach to robust visual tracking, Proceedings of the International Conference on Computer Vision, 2001.
H.T. Nguyen, M. Worring, R. Boomgaard, Occlusion robust adaptive template tracking, Proceedings of the International Conference on Computer Vision, 2001.
E.J. Ong, S. Gong, Tracking 2D–3D human models from multiple views, Proceedings of the International Workshop on Modeling People at ICCV, 1999.
Aggarwal, 1998, Non-Rigid motion analysis: articulated & elastic motion, Comput. Vision Image Understanding, 70, 142, 10.1006/cviu.1997.0620
C.R. Wren, B.P. Clarkson, A. Pentland, Understanding purposeful human motion, Proceedings of the International Conference on Automatic Face and Gesture Recognition, France, March 2000.
Y. Iwai, K. Ogaki, M. Yachida, Posture estimation using structure and motion models, Proceedings of the International Conference on Computer Vision, Greece, September 1999.
Luo, 1992, An automatic rotoscopy system for human motion based on a biomechanic graphical model, Comput. Graphics, 16, 355, 10.1016/0097-8493(92)90021-M
C. Yaniz, J. Rocha, F. Perales, 3D region graph for reconstruction of human motion, Proceedings of the Workshop on Perception of Human Motion at ECCV, 1998.
M. Silaghi, et al., Local and global skeleton fitting techniques for optical motion capture, Proceedings of the Workshop on Modeling and Motion Capture Techniques for Virtual Environments, Switzerland, November 1998.
S. Iwasawa, et al., Real-time estimation of human body posture from monocular thermal images, Proceedings of the IEEE CS Conference on Computer Vision and Pattern Recognition, 1997.
Y. Kameda, M. Minoh, K. Ikeda, Three-dimensional pose estimation of an articulated object from its silhouette image, Proceedings of the Asian Conference on Computer Vision, 1993.
Y. Kameda, M. Minoh, K. Ikeda, Three-dimensional pose estimation of a human body using a difference image sequence, Proceedings of the Asian Conference on Computer Vision, 1995.
C. Hu, et al., Extraction of parametric human model for posture recognition using genetic algorithm, Proceedings of the Fourth International Conference on Automatic Face and Gesture Recognition, France, March 2000.
C. Bregler, J. Malik, Tracking people with twists and exponential maps, Proceedings of the IEEE CS Conference on Computer Vision and Pattern Recognition, 1998.
O. Munkelt, et al., A model driven 3D image interpretation system applied to person detection in video images, Proceedings of the International Conference on Pattern Recognition, 1998.
Q. Delamarre, O. Faugeras, 3D articulated models and multi-view tracking with silhouettes, Proceedings of the International Conference on Computer Vision, Greece, September 1999.
J.P. Luck, D.E. Small, C.Q. Little, Real-time tracking of articulated human models using a 3d shape-from-silhouette method, Proceedings of the Robot Vision Conference, Auckland, New Zealand, 2001.
R. Rosales, S. Sclaroff, 3D trajectory recovery for tracking multiple objects and trajectory guided recognition of actions, Proceedings of the IEEE CS Conference on Computer Vision and Pattern Recognition, June 1999.
Myers, 1980, Performance tradeoffs in dynamic time warping algorithms for isolated word recognition, IEEE Trans. Acoust. Speech Signal Process., 28, 623, 10.1109/TASSP.1980.1163491
A. Bobick, A. Wilson, A state-based technique for the summarization and recognition of gesture, Proceedings of the International Conference on Computer Vision, Cambridge, 1995, pp. 382–388.
K. Takahashi, S. Seki, et al., Recognition of dexterous manipulations from time varying images, Proceedings of the IEEE Workshop on Motion of Non-Rigid and Articulated Objects, Austin, 1994, pp. 23–28.
A.B. Poritz, Hidden Markov models: a guided tour, Proceedings of the International Conference on Acoustic Speech and Signal Processing, 1988, pp. 7–13.
Rabiner, 1989, A tutorial on hidden Markov models and selected applications in speech recognition, Proc. IEEE, 77, 257, 10.1109/5.18626
T. Starner, A. Pentland, Real-time American Sign Language recognition from video using hidden Markov models, Proceedings of the International Symposium on Computer Vision, 1995, pp. 265–270.
J. Yamato, J. Ohya, K. Ishii, Recognizing human action in time-sequential images using hidden Markov model, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 1992, pp. 379–385.
M. Brand, N. Oliver, A. Pentland, Coupled hidden Markov models for complex action recognition, Proceedings of the IEEE CS Conference on Computer Vision and Pattern Recognition, 1997, pp. 994–999.
C. Vogler, D. Metaxas, ASL recognition based on a coupling between HMMs and 3D motion analysis, Proceedings of the International Conference on Computer Vision, 1998, pp. 363–369.
M. Rosenblum, Y. Yacoob, L. Davis, Human emotion recognition from motion using a radial basis function network architecture, Proceedings of the IEEE Workshop on Motion of Non-Rigid and Articulated Objects, Austin, 1994, pp. 43–49.
O. Chomat, J.L. Crowley, Recognizing motion using local appearance, International Symposium on Intelligent Robotic Systems, University of Edinburgh, 1998.
Y. Yacoob, M.J. Black, Parameterized modeling and recognition of activities, Proceedings of the International Conference on Computer Vision, India, 1998.
Galata, 2001, Learning variable-length Markov models of behavior, Comput. Vision Image Understanding, 81, 398, 10.1006/cviu.2000.0894
Lin, 1999, A space-time delay neural network for motion recognition and its application to lipreading, Int. J. Neural Systems, 9, 311, 10.1142/S0129065799000319
J.E. Boyd, J.J. Little, Global versus structured interpretation of motion: moving light displays, Proceedings of the IEEE CS Workshop on Motion of Non-Rigid and Articulated Objects, 1997, pp. 18–25.
A.F. Bobick, J. Davis, Real-time recognition of activity using temporal templates, Proceedings of the IEEE CS Workshop on Applications of Computer Vision, 1996, pp. 39–42.
J.W. Davis, A.F. Bobick, The representation and recognition of action using temporal templates, Technical Report 402, MIT Media Lab, Perceptual Computing Group, 1997.
L. Campbell, A. Bobick, Recognition of human body motion using phase space constraints, Proceedings of the International Conference on Computer Vision, Cambridge, 1995, pp. 624–630.
P. Remagnino, T. Tan, K. Baker, Agent orientated annotation in model based visual surveillance, Proceedings of the International Conference on Computer Vision, 1998, pp. 857–862.
Kojima, et al., Generating natural language description of human behaviors from video images, Proceedings of the International Conference on Pattern Recognition, 2000, pp. 728–731.
S. Intille, A. Bobick, Representation and visual recognition of complex, multi-agent actions using belief networks, Technical Report 454, Perceptual Computing Section, MIT Media Lab, 1998.
G. Herzog, K. Rohr, Integrating vision and language: towards automatic description of human movements, Proceedings of the 19th Annual German Conference on Artificial Intelligence, 1995, pp. 257–268.
Pentland, 1999, Modeling and prediction of human behaviors, Neural Comput., 11, 229, 10.1162/089976699300016890
M. Thonnat, N. Rota, Image understanding for visual surveillance applications, Proceedings of the Third International Workshop on Cooperative Distributed Vision, 1999, pp. 51–82.
N. Rota, M. Thonnat, Video sequence interpretation for visual surveillance, Proceedings of the Workshop on Visual Surveillance, Ireland, 2000, pp. 59–67.
G. Rigoll, S. Eickeler, S. Müller, Person tracking in real world scenarios using statistical methods, Proceedings of the International Conference on Automatic Face and Gesture Recognition, France, March 2000.
Kakadiaris, 1998, Three-dimensional human body model acquisition from multiple views, Int. J. Comput. Vision, 30, 191, 10.1023/A:1008071332753
H. Sidenbladh, F. Torre, M.J. Black, A framework for modeling the appearance of 3D articulated figures, Proceedings of the International Conference on Automatic Face and Gesture Recognition, France, March 2000.
N. Johnson, A. Galata, D. Hogg, The acquisition and use of interaction behavior models, Proceedings of the IEEE CS Conference on Computer Vision and Pattern Recognition, 1998, pp. 866–871.
P. Fua, et al., Human body modeling and motion analysis from video sequence, International Symposium on Real Time Imaging and Dynamic Analysis, Japan, June 1998.
R. Plänkers, P. Fua, Articulated soft object for video-based body modeling, Proceedings of the International Conference on Computer Vision, 2001.
Hilton, 2001, Foreword, Comput. Vision Image Understanding, 81, 227, 10.1006/cviu.2001.0907