A survey on vision-based human action recognition
Tài liệu tham khảo
Achard, 2008, A novel approach for recognition of human actions with semi-global features, Machine Vision and Applications, 19, 27, 10.1007/s00138-007-0074-2
Aggarwal, 1999, Human motion analysis: a review, Computer Vision and Image Understanding (CVIU), 73, 428, 10.1006/cviu.1998.0744
Ahad, 2008, Motion recognition approach to solve overwriting in complex actions, 1
Ahmad, 2008, Human action recognition using shape and CLG-motion flow from multi-view image sequences, Pattern Recognition, 41, 2237, 10.1016/j.patcog.2007.12.008
Saad Ali, Mubarak Shah, Human action recognition in videos using kinematic features and multiple instance learning, IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), in press.
Dhruv Batra, Tsuhan Chen, Rahul Sukthankar, Space–time shapelets for action recognition, in: Proceedings of the Workshop on Motion and Video Computing (WMVC’08), Copper Mountain, CO, January 2008, pp. 1–6.
Bay, 2008, SURF: Speeded up robust features, Computer Vision and Image Understanding (CVIU), 110, 346, 10.1016/j.cviu.2007.09.014
Jaron Blackburn, Eraldo Ribeiro, Human motion recognition using Isomap and dynamic time warping, in: Human Motion: Understanding, Modeling, Capture and Animation (HUMO’07), Lecture Notes in Computer Science, Rio de Janeiro, Brazil, October 2007, pp. 285–298 (Number 4814).
Moshe Blank, Lena Gorelick, Eli Shechtman, Michal Irani, Ronen Basri, Actions as space–time shapes, in: Proceedings of the International Conference On Computer Vision (ICCV’05), vol. 2, Beijing, China, October 2005, pp. 1395–1402.
Bobick, 1997, Movement, activity and action: the role of knowledge in the perception of motion, Philosophical Transactions of the Royal Society B: Biological Sciences, 352, 1257, 10.1098/rstb.1997.0108
Bobick, 2001, The recognition of human movement using temporal templates, IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), 23, 257, 10.1109/34.910878
Boiman, 2007, Detecting irregularities in images and in video, International Journal of Computer Vision (IJCV), 74, 17, 10.1007/s11263-006-0009-9
Matteo Bregonzio, Shaogang Gong, Tao Xiang, Recognising action as clouds of space–time interest points, in: Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR’09), Miami, FL, June 2009, pp. 1–8.
Caillette, 2008, Real-time 3-D human body tracking using learnt models of behaviour, Computer Vision and Image Understanding (CVIU), 109, 112, 10.1016/j.cviu.2007.05.005
Chakraborty, 2008, View-invariant human-body detection with extension to human action recognition using component-wise HMM of body parts, 1
Hsuan-Sheng Chen, Hua-Tsung Chen, Yi-Wen Chen, Suh-Yin Lee, Human action recognition using star skeleton, in: Proceedings of the International Workshop on Video Surveillance and Sensor Networks (VSSN’06), Santa Barbara, CA, October 2006, pp. 171–178.
Srikanth Cherla, Kaustubh Kulkarni, Amit Kale, Viswanathan Ramasubramanian, Towards fast, view-invariant human action recognition, in: Proceedings of the Workshop on Computer Vision and Pattern Recognition for Human Communicative Behavior Analysis (CVPR4HB’08), Anchorage, AK, June 2008, pp. 1–8.
Tat-Jun Chin, Liang Wang, Konrad Schindler, David Suter, Extrapolating learned manifolds for human activity recognition, in: Proceedings of the International Conference on Image Processing (ICIP’07), vol. 1, San Antonio, TX, September 2007, pp. 381–384.
Olivier Chomat, Jérôme Martin, James L. Crowley, A probabilistic sensor for the perception and recognition of activities, in: Proceedings of the European Conference on Computer Vision (ECCV’00), Lecture Notes in Computer Science, vol. 1, Dublin, Ireland, June 2000, pp. 487–503 (Number 1842).
Timothee Cour, Chris Jordan, Eleni Miltsakaki, Ben Taskar, Movie/script: Alignment and parsing of video and text transcription, in: Proceedings of the European Conference on Computer Vision (ECCV’08) – part 4, Lecture Notes in Computer Science, Marseille, France, October 2008, pp. 158–171 (Number 5305).
Cutler, 2000, Robust real-time periodic motion detection, analysis, and applications, IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), 22, 781, 10.1109/34.868681
Fabio Cuzzolin, Using bilinear models for view-invariant action and identity recognition, in: Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR’06), vol. 2, New York, NY, June 2006, pp. 1701–1708.
Navneet Dalal, Bill Triggs, Histograms of oriented gradients for human detection, in: Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR’05), vol. 1, San Diego, CA, June 2005, pp. 886–893.
Somayeh Danafar, Niloofar Gheissari, Action recognition for surveillance applications using optic flow and SVM, in: Proceedings of the Asian Conference on Computer Vision (ACCV’07) – part 2, Lecture Notes in Computer Science, Tokyo, Japan, November 2007, pp. 457–466 (Number 4844).
Piotr Dollár, Vincent Rabaud, Garrison Cottrell, Serge Belongie, Behavior recognition via sparse spatio-temporal features, in: Proceedings of the International Workshop on Visual Surveillance and Performance Evaluation of Tracking and Surveillance (VS-PETS’05), Beijing, China, October 2005, pp. 65–72.
Olivier Duchenne, Ivan Laptev, Josef Sivic, Francis Bach, Jean Ponce, Automatic annotation of human actions in video, in: Proceedings of the International Conference On Computer Vision (ICCV’09), Kyoto, Japan, September 2009, pp. 1–8.
Alexei A. Efros, Alexander C. Berg, Greg Mori, Jitendra Malik, Recognizing action at a distance, in: Proceedings of the International Conference on Computer Vision (ICCV’03), vol. 2, Nice, France, October 2003, pp. 726–733.
Ahmed M. Elgammal, Chan-Su Lee, Separating style and content on a nonlinear manifold, in: Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR’04), vol. 1, Washington, DC, June 2004, pp. 478–485.
Markus Enzweiler, Dariu M. Gavrila, Monocular pedestrian detection: survey and experiments, IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI) 31(12) (2009) 2179–2195.
Erol, 2007, Vision-based hand pose estimation: a review, Computer Vision and Image Understanding (CVIU), 108, 52, 10.1016/j.cviu.2006.10.012
Escobar, 2009, Action recognition using a bio-inspired feedforward spiking network, International Journal of Computer Vision (IJCV), 82, 284, 10.1007/s11263-008-0201-1
Claudio Fanti, Lihi Zelnik-Manor, Pietro Perona, Hybrid models for human motion recognition, in: Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR’05), vol. 1, San Diego, CA, June 2005, pp. 1166–1173.
Ali Farhadi, Mostafa Kamali Tabriz, Learning to recognize activities from the wrong view point, in: Proceedings of the European Conference on Computer Vision (ECCV’08) – part 1, Lecture Notes in Computer Science, Marseille, France, October 2008, pp. 154–166 (Number 5302).
Ali Farhadi, Mostafa Kamali Tabriz, Ian Endres, David A. Forsyth, A latent model of discriminative aspect, in: Proceedings of the International Conference On Computer Vision (ICCV’09), Kyoto, Japan, September 2009, pp. 1–8.
Alireza Fathi, Greg Mori, Action recognition by learning mid-level motion features, in: Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR’08), Anchorage, AK, June 2008, pp. 1–8.
Xiaolin Feng, Pietro Perona, Human action recognition by sequence of movelet codewords, in: Proceedings of the International Symposium on 3D Data Processing, Visualization, and Transmission (3DPVT’02), Padova, Italy, June 2002, pp. 717–721.
Roman Filipovych, Eraldo Ribeiro, Learning human motion models from unsegmented videos, in: Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR’08), Anchorage, AK, June 2008, pp. 1–7.
Forsyth, 2006, Computational studies of human motion part 1: tracking and motion synthesis, Foundations and Trends in Computer Graphics and Vision, 1, 77
Freund, 1997, A decision-theoretic generalization of on-line learning and an application to boosting, Journal of Computer and System Sciences, 55, 119, 10.1006/jcss.1997.1504
Adrien Gaidon, Marcin Marszałek, Cordelia Schmid, Mining visual actions from movies, in: Proceedings of the British Machine Vision Conference (BMVC’09), London, United Kingdom, in press.
Gandhi, 2007, Pedestrian protection systems: issues, survey, and challenges, IEEE Transactions On Intelligent Transportation Systems, 8, 413, 10.1109/TITS.2007.903444
Gavrila, 1999, The visual analysis of human movement: a survey, Computer Vision and Image Understanding (CVIU), 73, 82, 10.1006/cviu.1998.0716
David Gerónimo, Antonio M. López, Angel D. Sappa, Thorsten Graf, Survey of pedestrian detection for advanced driver assistance systems, IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), in press.
Andrew Gilbert, John Illingworth, Richard Bowden, Scale invariant action recognition using compound features mined from dense spatio-temporal corners, in: Proceedings of the European Conference on Computer Vision (ECCV’08) – part 1, Lecture Notes in Computer Science, Marseille, France, October 2008, pp. 222–233 (Number 5302).
Gorelick, 2007, Actions as space–time shapes, IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), 29, 2247, 10.1109/TPAMI.2007.70711
Matthias Grundmann, Franziska Meier, Irfan Essa, 3D shape context and distance transform for action recognition, in: Proceedings of the International Conference on Pattern Recognition (ICPR’08), Tampa, FL, December 2008, pp. 1–4.
Gupta, 2009, Observing human–object interactions: using spatial and functional compatibility for recognition, IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), 31, 1775, 10.1109/TPAMI.2009.83
Sonal Gupta, Raymond J. Mooney, Using closed captions to train activity recognizers that improve video retrieval, in: Proceedings of the Workshop on Visual and Contextual Learning (VCL) at the Conference on Computer Vision and Pattern Recognition (CVPR’09), Miami, FL, June 2009, pp. 1–8.
Chris Harris, Mike Stephens, A combined corner and edge detector, in: Proceedings of the Alvey Vision Conference, Manchester, United Kingdom, August 1988, pp. 147–151.
Kardelen Hatun, Pınar Duygulu, Pose sentences: a new representation for action recognition using sequence of pose words, in: Proceedings of the International Conference on Pattern Recognition (ICPR’08), Tampa, FL, December 2008, pp. 1–4.
Yuxiao Hu, Liangliang Cao, Fengjun Lv, Shuicheng Yan, Yihong Gong, Thomas S. Huang, Action detection in complex scenes with spatial and temporal ambiguities, in: Proceedings of the International Conference On Computer Vision (ICCV’09), Kyoto, Japan, September 2009, pp. 1–8.
Feiyue Huang, Guangyou Xu, Viewpoint insensitive action recognition using envelop shape, in: Proceedings of the Asian Conference on Computer Vision (ACCV’07) – part 2, Lecture Notes in Computer Science, Tokyo, Japan, November 2007, pp. 477–486 (Number 4844).
Nazlı İkizler, Ramazan G. Cinbiş, Pınar Duygulu, Human action recognition with line and flow histograms, in: Proceedings of the International Conference on Pattern Recognition (ICPR’08), Tampa, FL, December 2008, pp. 1–4.
Nazlı İkizler, Ramazan G. Cinbiş, Selen Pehlivan, Pınar Duygulu, Recognizing actions from still images, in: Proceedings of the International Conference on Pattern Recognition (ICPR’08), Tampa, FL, December 2008, pp. 1–4.
Nazlı İkizler, Ramazan G. Cinbiş, Stan Sclaroff, Learning actions from the web, in: Proceedings of the International Conference On Computer Vision (ICCV’09), Kyoto, Japan, September 2009, pp. 1–8.
İkizler, 2009, Histogram of oriented rectangles: a new pose descriptor for human action recognition, Image and Vision Computing, 27, 1515, 10.1016/j.imavis.2009.02.002
İkizler, 2008, Searching for complex human activities with no visual examples, International Journal of Computer Vision (IJCV), 30, 337, 10.1007/s11263-008-0142-8
Hueihan Jhuang, Thomas Serre, Lior Wolf, Tomaso Poggio, A biologically inspired system for action recognition, in: Proceedings of the International Conference On Computer Vision (ICCV’07), Rio de Janeiro, Brazil, October 2007, pp. 1–8.
Kui Jia, Dit-Yan Yeung, Human action recognition using local spatio-temporal discriminant embedding, in: Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR’08), Anchorage, AK, June 2008, pp. 1–8.
Hao Jiang, David R. Martin, Finding actions using shape flows, in: Proceedings of the European Conference on Computer Vision (ECCV’08) – part 2, Lecture Notes in Computer Science, Marseille, France, October 2008, pp. 278–292 (Number 5303).
Imran Junejo, Emilie Dexter, Ivan Laptev, Patrick Pérez, Cross-view action recognition from temporal self-similarities, in: Proceedings of the European Conference on Computer Vision (ECCV’08) – part 2, Lecture Notes in Computer Science, Marseille, France, October 2008, pp. 293–306 (Number 5303).
Timor Kadir, Michael Brady, Scale saliency: a novel approach to salient feature and scale selection, in: Proceedings of the International Conference on Visual Information Engineering (VIE), Guildford, United Kingdom, July 2003, pp. 25–28.
Yan Ke, Rahul Sukthankar, Martial Hebert, Efficient visual event detection using volumetric features, in: Proceedings of the International Conference On Computer Vision (ICCV’05), vol. 1, Beijing, China, October 2005, pp. 166–173.
Yan Ke, Rahul Sukthankar, Martial Hebert, Event detection in crowded videos, in: Proceedings of the International Conference On Computer Vision (ICCV’07), Rio de Janeiro, Brazil, October 2007, pp. 1–8.
Yan Ke, Rahul Sukthankar, Martial Hebert, Spatio-temporal shape and flow correlation for action recognition, in: Proceedings of the Workshop on Visual Surveillance (VS) at the Conference on Computer Vision and Pattern Recognition (CVPR’07), Minneapolis, MN, June 2007, pp. 1–8.
Vili Kellokumpu, Guoying Zhao, Matti Pietikäinen, Human activity recognition using a dynamic texture based method, in: Proceedings of the British Machine Vision Conference (BMVC’08), Leeds, United Kingdom, September 2008, pp. 885–894.
Kim, 2009, Canonical correlation analysis of video volume tensors for action categorization and detection, IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), 31, 1415, 10.1109/TPAMI.2008.167
Alexander Kläser, Marcin Marszałek, Cordelia Schmid, A spatio-temporal descriptor based on 3d-gradients, in: Proceedings of the British Machine Vision Conference (BMVC’08), Leeds, United Kingdom, September 2008, pp. 995–1004.
Krüger, 2007, The meaning of action: a review on action recognition and mapping, Advanced Robotics, 21, 1473, 10.1163/156855307782148578
John D. Lafferty, Andrew McCallum, Fernando C. Pereira, Conditional random fields: probabilistic models for segmenting and labeling sequence data, in: Proceedings of the International Conference on Machine Learning (ICML’01), Williamstown, MA, June 2001, pp. 282–289.
Laptev, 2007, Local velocity-adapted motion events for spatio-temporal recognition, Computer Vision and Image Understanding (CVIU), 108, 207, 10.1016/j.cviu.2006.11.023
Ivan Laptev, Tony Lindeberg, Space–time interest points, in: Proceedings of the International Conference on Computer Vision (ICCV’03), vol. 1, Nice, France, October 2003, pp. 432–439.
Ivan Laptev, Marcin Marszałek, Cordelia Schmid, Benjamin Rozenfeld, Learning realistic human actions from movies, in: Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR’08), Anchorage, AK, June 2008, pp. 1–8.
Ivan Laptev, Patrick Pérez, Retrieving actions in movies, in: Proceedings of the International Conference On Computer Vision (ICCV’07), Rio de Janeiro, Brazil, October 2007, pp. 1–8.
Zhe Lin, Zhuolin Jiang, Larry S. Davis, Recognizing actions by shape-motion prototype trees, in: Proceedings of the International Conference On Computer Vision (ICCV’09), Kyoto, Japan, September 2009, pp. 1–8.
Jingen Liu, Saad Ali, Mubarak Shah, Recognizing human actions using multiple features, in: Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR’08), Anchorage, AK, June 2008, pp. 1–8.
Jingen Liu, Jiebo LUO, Mubarak Shah, Recognizing realistic actions from videos “in the wild”, in: Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR’09), Miami, FL, June 2009, pp. 1–8.
Jingen Liu, Mubarak Shah, Learning human actions via information maximization, in: Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR’08), Anchorage, AK, June 2008, pp. 1–8.
Lowe, 2004, Distinctive image features from scale-invariant keypoints, International Journal of Computer Vision (IJCV), 60, 91, 10.1023/B:VISI.0000029664.99615.94
Wei-Lwun Lu, James J. Little, Simultaneous tracking and action recognition using the PCA–HOG descriptor, in: Proceedings of the Canadian Conference on Computer and Robot Vision (CRV’06), Quebec City, Canada, June 2006, pp. 6–6.
Fengjun Lv, Ram Nevatia, Recognition and segmentation of 3-D human action using HMM and multi-class adaBoost, in: Proceedings of the European Conference on Computer Vision (ECCV’06), Lecture Notes in Computer Science, vol. 4, Graz, Austria, May 2006, pp. 359–372 (Number 3953).
Fengjun Lv, Ram Nevatia, Single view human action recognition using key pose matching and Viterbi path searching, in: Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR’07), Minneapolis, MN, June 2007, pp. 1–8.
Marcin Marszałek, Ivan Laptev, Cordelia Schmid, Actions in context, in: Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR’09), Miami, FL, June 2009, pp. 1–8.
Masoud, 2003, A method for human action recognition, Image and Vision Computing, 21, 729, 10.1016/S0262-8856(03)00068-4
Pyry Matikainen, Martial Hebert, Rahul Sukthankar, Yan Ke, Fast motion consistency through matrix quantization, in: Proceedings of the British Machine Vision Conference (BMVC’08), Leeds, United Kingdom, September 2008, pp. 1055–1064.
M. Ángeles Mendoza, Nicolás Pérez de la Blanca, Applying space state models in human action recognition: a comparative study, in: International Workshop on Articulated Motion and Deformable Objects (AMDO’08), Lecture Notes in Computer Science, Port d’Andratx, Spain, July 2008, pp. 53–62 (Number 5098).
Ross Messing, Chris Pal, Henry Kautz, Activity recognition using the velocity histories of tracked keypoints, in: Proceedings of the International Conference On Computer Vision (ICCV’09), Kyoto, Japan, September 2009, pp. 1–8.
Krystian Mikolajczyk, Hirofumi Uemura, Action recognition with motion-appearance vocabulary forest, in: Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR’08), Anchorage, AK, June 2008, pp. 1–8.
Mitra, 2007, Gesture recognition: a survey, IEEE Transactions on Systems, Man, and Cybernetics (SMC) – Part C: Applications and Reviews, 37, 311, 10.1109/TSMCC.2007.893280
Moeslund, 2006, A survey of advances in vision-based human motion capture and analysis, Computer Vision and Image Understanding (CVIU), 104, 90, 10.1016/j.cviu.2006.08.002
Darnell J. Moore, Irfan A. Essa, Monson H. Hayes III, Exploiting human actions and object context for recognition tasks, in: Proceedings of the International Conference on Computer Vision (ICCV’99), vol. 1, Kerkyra, Greece, September 1999, pp. 80–86.
Pradeep Natarajan, Ram Nevatia, Online, real-time tracking and recognition of human actions, in: Proceedings of the Workshop on Motion and Video Computing (WMVC’08), Copper Mountain, CO, January 2008, pp. 1–8.
Pradeep Natarajan, Ram Nevatia, View and scale invariant action recognition using multiview shape-flow models, in: Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR’08), Anchorage, AK, June 2008, pp. 1–8.
Juan Carlos Niebles, Li Fei-Fei, A hierarchical model of shape and appearance for human action classification, in: Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR’07), Minneapolis, MN, June 2007, pp. 1–8.
Niebles, 2008, Unsupervised learning of human action categories using spatial–temporal words, International Journal of Computer Vision (IJCV), 79, 299, 10.1007/s11263-007-0122-4
Huazhong Ning, Yuxiao Hu, Thomas S. Huang, Searching human behaviors using spatial–temporal words, in: Proceedings of the International Conference on Image Processing (ICIP’07), vol. 6, San Antonio, TX, September 2007, pp. 337–340.
Huazhong Ning, Wei Xu, Yihong Gong, Thomas S. Huang, Latent pose estimator for continuous action recognition, in: Proceedings of the European Conference on Computer Vision (ECCV’08) – part 2, Lecture Notes in Computer Science, Marseille, France, October 2008, pp. 419–433 (Number 5305).
Sebastian Nowozin, Gökhan Bakır, Koji Tsuda, Discriminative subsequence mining for action classification, in: Proceedings of the International Conference On Computer Vision (ICCV’07), Rio de Janeiro, Brazil, October 2007, pp. 1–8.
Abhijit S. Ogale, Alap Karapurkar, Yiannis Aloimonos, View-invariant modeling and recognition of human actions using grammars, in: Revised Papers of the Workshops on Dynamical Vision (WDV’05 and WDV’06), Lecture Notes in Computer Science, Beijing, China, May 2007, pp. 115–126 (Number 4358).
Takehito Ogata, William Christmas, Josef Kittler, Seiji Ishikawa, Improving human activity detection by combining multi-dimensional motion descriptors with boosting, in: Proceedings of the International Conference on Pattern Recognition (ICPR’06), vol. 1, Kowloon Tong, Hong Kong, August 2006, pp. 295–298.
Antonios Oikonomopoulos, Maja Pantic, Ioannis Patras, Sparse B-spline polynomial descriptors for human activity recognition, Image and Vision Computing 27(12) (2009) 1814–1825.
Oikonomopoulos, 2006, Spatiotemporal salient points for visual recognition of human actions, IEEE Transactions On Systems Man And Cybernetics (SMC) – Part B: Cybernetics, 36, 710, 10.1109/TSMCB.2005.861864
Olusegun Oshin, Andrew Gilbert, John Illingworth, Richard Bowden, Spatio-temporal feature recognition using randomised ferns, in: Proceedings of the International Workshop on Machine Learning for Vision-based Motion Analysis (MLVMA’08), Marseille, France, October 2008, pp. 1–12.
Parameswaran, 2006, View invariance for human action recognition, International Journal of Computer Vision (IJCV), 66, 83, 10.1007/s11263-005-3671-4
Park, 2008, Understanding human interactions with track and body synergies (TBS) captured from multiple views, Computer Vision and Image Understanding (CVIU), 111, 2, 10.1016/j.cviu.2007.10.005
Alonso Patron-Perez, Ian Reid, A probabilistic framework for recognizing similar actions using spatio-temporal features, in: Proceedings of the British Machine Vision Conference (BMVC’07), Edinburgh, United Kingdom, September 2007, pp. 1–10.
Patrick Peursum, Svetha Venkatesh, Geoff West, Tracking-as-recognition for articulated full-body human motion analysis, in: Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR’07), Minneapolis, MN, June 2007, pp. 1–8.
Polana, 1997, Detection and recognition of periodic, nonrigid motion, International Journal of Computer Vision (IJCV), 23, 261, 10.1023/A:1007975200487
Poppe, 2007, Vision-based human motion analysis: an overview, Computer Vision and Image Understanding (CVIU), 108, 4, 10.1016/j.cviu.2006.10.016
Poppe, 2008, Discriminative human action recognition using pairwise CSP classifiers, 1
Quattoni, 2007, Hidden conditional random fields, IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), 29, 1848, 10.1109/TPAMI.2007.1124
Hossein Ragheb, Sergio Velastin, Paolo Remagnino, Tim Ellis, Human action recognition using robust power spectrum features, in: Proceedings of the International Conference on Image Processing (ICIP’08), San Diego, CA, October 2008, pp. 753–756.
Deva Ramanan, Learning to parse images of articulated bodies, in: Advances in Neural Information Processing Systems (NIPS), vol. 19, Vancouver, Canada, December 2006, pp. 1129–1136.
Deva Ramanan, David A. Forsyth, Automatic annotation of everyday movements, in: Advances in Neural Information Processing Systems (NIPS), vol. 16, Vancouver, Canada, 2003, pp. 1–8.
Ramanan, 2007, Tracking people by learning their appearance, IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), 29, 65, 10.1109/TPAMI.2007.250600
Rao, 2002, View-invariant representation and recognition of actions, International Journal of Computer Vision (IJCV), 50, 203, 10.1023/A:1020350100748
Konstantinos Rapantzikos, Yannis Avrithis, Stefanos Kollias, Dense saliency-based spatiotemporal feature points for action recognition, in: Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR’09), Miami, FL, June 2009, pp. 1–8.
Rapantzikos, 2007, Spatiotemporal saliency for event detection and representation in the 3D wavelet domain: potential in human action recognition, 294
Robertson, 2006, A general method for human activity recognition in video, Computer Vision and Image Understanding (CVIU), 104, 232, 10.1016/j.cviu.2006.07.006
Mikel D. Rodriguez, Javed Ahmed, Mubarak Shah, Action MACH: a spatio-temporal maximum average correlation height filter for action recognition, in: Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR’08), Anchorage, AK, June 2008, pp. 1–8.
Rómer E. Rosales, Recognition of human action using moment-based features, Technical Report BU-1998-020, Boston University, Computer Science, Boston, MA, November 1998.
Ryoo, 2009, Semantic representation and recognition of continued and recursive human activities, International Journal of Computer Vision (IJCV), 82, 1, 10.1007/s11263-008-0181-1
Silvio Savarese, Andrey DelPozo, Juan Carlos Niebles, Li Fei-Fei, Spatial–temporal correlatons for unsupervised action classification, in: Proceedings of the Workshop on Applications of Computer Vision (WACV’08), Copper Mountain, CO, January 2008, pp. 1–8.
Konrad Schindler, Luc J. van Gool, Action snippets: how many frames does human action recognition require? in: Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR’08), Anchorage, AK, June 2008, pp. 1–8.
Schüldt, 2004, Recognizing human actions: a local SVM approach, vol. 3, 32
Paul Scovanner, Saad Ali, Mubarak Shah, A 3-dimensional SIFT descriptor and its application to action recognition, in: Proceedings of the International Conference on Multimedia (MultiMedia’07), Augsburg, Germany, September 2007, pp. 357–360.
Seitz, 1997, View-invariant analysis of cyclic motion, International Journal of Computer Vision (IJCV), 25, 231, 10.1023/A:1007928103394
Hae Jong Seo, Peyman Milanfar, Detection of human actions from a single example, in: Proceedings of the International Conference On Computer Vision (ICCV’09), Kyoto, Japan, September 2009, pp. 1–8.
Eli Shechtman, Michal Irani, Matching local self-similarities across images and videos, in: Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR’07), Minneapolis, MN, June 2007, pp. 1–8.
Shechtman, 2007, Space–time behavior-based correlation-OR-How to tell if two underlying motion fields are similar without computing them?, IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), 29, 2045, 10.1109/TPAMI.2007.1119
Yaser Sheikh, Mumtaz Sheikh, Mubarak Shah, Exploring the space of a human action, in: Proceedings of the International Conference On Computer Vision (ICCV’05), vol. 1, Beijing, China, October 2005, pp. 144–149.
Shen, 2009, View-invariant action recognition from point triplets, IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), 31, 1898, 10.1109/TPAMI.2009.41
Qinfeng Shi, Li Wang, Li Cheng, Alex Smola, Discriminative human action segmentation and recognition using semi-Markov model, in: Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR’08), Anchorage, AK, June 2008, pp. 1–8.
Sminchisescu, 2006, Conditional models for contextual human motion recognition, Computer Vision and Image Understanding (CVIU), 104, 210, 10.1016/j.cviu.2006.07.014
Paul Smith, Niels da Vitoria Lobo, Mubarak Shah, TemporalBoost for event recognition, in: Proceedings of the International Conference On Computer Vision (ICCV’05), vol. 1, Beijing, China, October 2005, pp. 733–740.
Song, 2003, Unsupervised learning of human motion, IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), 25, 814, 10.1109/TPAMI.2003.1206511
Richard Souvenir, Justin Babbs, Learning the viewpoint manifold for action recognition, in: Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR’08), Anchorage, AK, June 2008, pp. 1–7.
Josephine Sullivan, Stefan Carlsson, Recognizing and tracking human action, in: Proceedings of the European Conference on Computer Vision (ECCV’02), vol. 1, Lecture Notes in Computer Science, Copenhagen, Denmark, May 2002, pp. 629–644 (Number 2350).
Evan A. Suma, Christopher W. Sinclair, Justin Babbs, Richard Souvenir, A sketch-based approach for detecting common human actions, in: Proceedings of the International Symposium on Advances in Visual Computing (ISVC’08) – part 1, Lecture Notes in Computer Science, Las Vegas, NV, December 2008, pp. 418–427 (Number 5358).
Ju Sun, Xiao Wu, Shuicheng Yan, Loong-Fah Cheong, Tat-Seng Chua, Jintao Li, Hierarchical spatio-temporal context modeling for action recognition, in: Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR’09), Miami, FL, June 2009, pp. 1–8.
Christian Thurau, Václav Hlaváč, Pose primitive based human action recognition in videos or still images, in: Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR’08), Anchorage, AK, June 2008, pp. 1–6.
Du Tran, Alexander Sorokin, David A. Forsyth, Human activity recognition with metric learning, in: Proceedings of the European Conference on Computer Vision (ECCV’08) – part 1, Lecture Notes in Computer Science, Marseille, France, October 2008, pp. 548–561 (Number 5302).
Turaga, 2008, Machine recognition of human activities: a survey, IEEE Transactions on Circuits and Systems for Video Technology, 18, 1473, 10.1109/TCSVT.2008.2005594
Turaga, 2009, Unsupervised view and rate invariant clustering of video sequences, Computer Vision and Image Understanding (CVIU), 113, 353, 10.1016/j.cviu.2008.08.009
Pavan K. Turaga, Ashok Veeraraghavan, Rama Chellappa, Statistical analysis on stiefel and grassmann manifolds with applications in computer vision, in: Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR’08), Anchorage, AK, June 2008, pp. 1–8.
Hirofumi Uemura, Seiji Ishikawa, Krystian Mikolajczyk, Feature tracking and motion compensation for action recognition, in: Proceedings of the British Machine Vision Conference (BMVC’08), Leeds, United Kingdom, September 2008, pp. 293–302.
Ashok Veeraraghavan, Rama Chellappa, Amit K. Roy-Chowdhury, The function space of an activity, in: Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR’06), vol. 1, New York, NY, June 2006, pp. 959–968.
Veeraraghavan, 2005, Matching shape sequences in video with applications in human movement analysis, IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), 27, 1896, 10.1109/TPAMI.2005.246
Paul A. Viola, Michael J. Jones, Rapid object detection using a boosted cascade of simple features, in: Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR’01), vol. 1, Kauai, HI, December 2001, pp. 511–518.
Shiv N. Vitaladevuni, Vili Kellokumpu, Larry S. Davis, Action recognition using ballistic dynamics, in: Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR’08), Anchorage, AK, June 2008, pp. 1–8.
Heng Wang, Muhammad Muneeb Ullah, Alexander Kläser, Ivan Laptev, Cordelia Schmid, Evaluation of local spatio-temporal features for action recognition, in: Proceedings of the British Machine Vision Conference (BMVC’09), London, United Kingdom, in press.
Jack M. Wang, David J. Fleet, Aaron Hertzmann, Multifactor Gaussian process models for style-content separation, in: Proceedings of the International Conference on Machine Learning (ICML’07), ACM International Conference Proceeding Series, Corvalis, OR, June 2007, pp. 975–982 (Number 227 ).
Wang, 2003, Recent developments in human motion analysis, Pattern Recognition, 36, 585, 10.1016/S0031-3203(02)00100-0
Liang Wang, David Suter, Informative shape representations for human action recognition, in: Proceedings of the International Conference on Pattern Recognition (ICPR’06), vol. 2, Kowloon Tong, Hong Kong, August 2006, pp. 1266–1269.
Wang, 2007, Learning and matching of dynamic shape manifolds for human action recognition, IEEE Transactions On Image Processing (TIP), 16, 1646, 10.1109/TIP.2007.896661
Liang Wang, David Suter, Recognizing human activities from silhouettes: motion subspace and factorial discriminative graphical model, in: Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR’07), Minneapolis, MN, June 2007, pp. 1–8.
Wang, 2008, Visual learning and recognition of sequential data manifolds with applications to human movement analysis, Computer Vision and Image Understanding (CVIU), 110, 153, 10.1016/j.cviu.2007.06.001
Yang Wang, Hao Jiang, Mark S. Drew, Ze-Nian Li, Greg Mori, Unsupervised discovery of action classes, in: Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR’06), vol. 2, New York, NY, June 2006, pp. 1654–1661.
Yang Wang, Greg Mori, Learning a discriminative hidden part model for human action recognition, in: Advances in Neural Information Processing Systems (NIPS), vol. 21, Vancouver, Canada, December 2008, pp. 1721–1728.
Wang, 2009, Human action recognition by semilatent topic models, IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), 31, 1762, 10.1109/TPAMI.2009.43
Yang Wang, Greg Mori, Max-margin hidden conditional random fields for human action recognition, in: Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR’09), Miami, FL, June 2009, pp. 1–8.
Ying Wang, Kaiqi Huang, Tieniu Tan, Human activity recognition based on R transform, in: Proceedings of the Workshop on Visual Surveillance (VS) at the Conference on Computer Vision and Pattern Recognition (CVPR’07), Minneapolis, MN, June 2007, pp. 1–8.
Daniel Weinland, Edmond Boyer, Action recognition using exemplar-based embedding, in: Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR’08), Anchorage, AK, June 2008, pp. 1–7.
Daniel Weinland, Edmond Boyer, Remi Ronfard, Action recognition from arbitrary views using 3D exemplars, in: Proceedings of the International Conference On Computer Vision (ICCV’07), Rio de Janeiro, Brazil, October 2007, pp. 1–8.
Daniel Weinland, Remi Ronfard, Edmond Boyer, Automatic discovery of action taxonomies from multiple views, in: Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR’06), vol. 2, New York, NY, June 2006, pp. 1639–1645.
Weinland, 2006, Free viewpoint action recognition using motion history volumes, Computer Vision and Image Understanding (CVIU), 104, 249, 10.1016/j.cviu.2006.07.013
Geert Willems, Tinne Tuytelaars, Luc J. Van Gool, An efficient dense and scale-invariant spatio-temporal interest point detector, in: Proceedings of the European Conference on Computer Vision (ECCV’08) – part 2, Lecture Notes in Computer Science, Marseille, France, October 2008, pp. 650–663 (Number 5303).
Shu-Fai Wong, Roberto Cipolla, Extracting spatiotemporal interest points using global information, in: Proceedings of the International Conference On Computer Vision (ICCV’07), Rio de Janeiro, Brazil, October 2007, pp. 1–8.
Shu-Fai Wong, Tae-Kyun Kim, Roberto Cipolla, Learning motion categories using both semantic and structural information, in: Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR’07), Minneapolis, MN, June 2007, pp. 1–8.
Junji Yamato, Jun Ohya, Kenichiro Ishii, Recognizing human action in time-sequential images using hidden Markov model, in: Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR’92), Champaign, IL, June 1992, pp. 379–385.
Pingkun Yan, Saad M. Khan, Mubarak Shah, Learning 4D action feature models for arbitrary view action recognition, in: Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR’08), Anchorage, AK, June 2008, pp. 1–7.
Changjiang Yang, Yanlin Guo, Harpreet S. Sawhney, Rakesh Kumar, Learning actions using robust string kernels, in: Human Motion: Understanding, Modeling, Capture and Animation (HUMO’07), Lecture Notes in Computer Science, Rio de Janeiro, Brazil, October 2007, pp. 313–327 (Number 4814).
Benjamin Yao, Song-Chun Zhu, Learning deformable action templates from cluttered videos, in: Proceedings of the International Conference On Computer Vision (ICCV’09), Kyoto, Japan, September 2009, pp. 1–8.
Yilmaz, 2006, Matching actions in presence of camera motion, Computer Vision and Image Understanding (CVIU), 104, 221, 10.1016/j.cviu.2006.07.012
Yilmaz, 2008, A differential geometric approach to representing the human actions, Computer Vision and Image Understanding (CVIU), 119, 335, 10.1016/j.cviu.2007.09.006
Junsong Yuan, Zicheng Liu, Ying Wu, Discriminative subvolume search for efficient action detection, in: Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR’09), Miami, FL, June 2009, pp. 1–8.
Zelnik-Manor, 2006, Statistical analysis of dynamic actions, IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), 28, 1530, 10.1109/TPAMI.2006.194
Zhang, 2010, Action categorization with modified hidden conditional random field, Pattern Recognition, 43, 197, 10.1016/j.patcog.2009.05.015
Ziming Zhang, Yiqun Hu, Syin Chan, Liang-Tien Chia, Motion context: a new representation for human action recognition, in: Proceedings of the European Conference on Computer Vision (ECCV’08) – part 4, Lecture Notes in Computer Science, Marseille, France, October 2008, pp. 817–829 (Number 5305).
Zhipeng Zhao, Ahmed Elgammal, Human activity recognition from frame’s spatiotemporal representation, in: Proceedings of the International Conference on Pattern Recognition (ICPR’08), Tampa, FL, December 2008, pp. 1–4.