Stabilized active camera tracking system
Tóm tắt
We present a robust and real-time stabilized active camera tracking system (ACTS), which consists of three algorithmic modules: visual tracking, pan-tilt control, and digital video stabilization. We propose an efficient correlation-based framework for visual tracking module that is designed to handle the problems which severely deteriorate the performance of a traditional tracker. The problems that it handles are template drift, noise, object fading (obscuration), background clutter, intermittent occlusion, varying illumination in the scene, high computational complexity, and varying shape, scale, and velocity of the manoeuvring object during its motion. The pan-tilt control module is a predictive open-loop car-following control strategy, which moves the camera efficiently and smoothly so that the target being tracked is always at the center of the video frame. Video stabilization module is required to eliminate the vibration in the video, when the system is mounted on a vibratory platform such as truck, helicopter, ship, etc. We present a very efficient video stabilization method that adds no extra computational overhead to the overall system. It exploits the coordinates of the target, computed by the tracker module, to sense the amount of vibration and then filters it out of the video. The proposed system works at full frame rate (30 fps), and has been successfully used in uncontrolled real-world environment. Experimental results show the efficiency, precision, and robustness of the proposed stabilized ACTS.
Tài liệu tham khảo
Kumar, P., Dick, A., Sheng T.S.: Real Time Target Tracking with Pan Tilt Zoom Camera. In: Proceedings of IEEE Conference on Digital Image Computing: Techniques and Applications (DICTA), December 2009
Dinh, T., Yu, Q., Medioni, G.: Real Time Tracking using an Active Pan-Tilt-Zoom Network Camera. In: IEEE International Conference on Intelligent Robots and Systems, October 2009
Cui, Y., Samarasekera, S., Huang, Q., Greienhagen M: Indoor Monitoring Via the Collaboration Between a Peripheral Sensor and a Foveal Sensor. In: IEEE Workshop on Visual Surveillance, pp. 2–9 (1998)
Bradski, G.R.: Computer Vision Face Tracking as a Component of a Perceptual User Interface. In: Proceedings of IEEE Workshop on Applications of Computer Vision, Princeton, pp. 214–219 (1998)
Wren, C., Azarbayejani, A., Darrell, T., Pentland, A.: PFinder: real-time tracking of the human body. IEEE Trans. Pattern Anal. Mach. Intell. 19, 780–785 (1997)
Intille, S.S., Davis, J.W., Bobick, A.F.: Real-Time Closed-World Tracking. IEEE Conference on Computer Vision and Pattern Recognition, pp. 697–703 (1997)
Ahmed, J., Shah, M., Miller, A., Harper, D., Jafri, M.N.:A Vision Based System for a UGV to Handle a Road Intersection. In: Proceedings of AAAI-07, 22nd Conference on Artificial Intelligence, 22–26 July 2007
Eleftheriadis, A., Jacquin, A.: Automatic face location detection and tracking for model-assisted coding of video teleconference sequences at low bit rates. Signal Process. Image Commun. 7(3), 231–248 (1995)
Rosales, R., Sclaro, S.: 3D Trajectory Recovery for Tracking Multiple Objects and Trajectory Guided Recognition of Actions. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, Fort Collins, vol. 2, pp. 117–123 (1999)
Doulamis, A., Doulamis, N., Ntalianis, K., Kollias, S.: An Efficient Fully Unsupervised Video Object Segmentation Scheme Using an Adaptive Neural-Network Classifier Archtecture. In: Proceedings of IEEE Transactions on Neural Networks, May 2003
Cuevas, E.V., Zaldivar, D., Rojas, R.: Intelligent Tracking. Technical Report B-03-15, Freie Universitat Berlin, Germany, 10 November 2003
Ahmed, J., Jafri, M.N., Ahmad, J., Khan, M.I.: Design and Implementation of a Neural Network for Real-Time Object Tracking. In: Proceedings of Machine Vision and Pattern Recognition Conference in 4th World Enformatika Conference, 2005
Ahmed, J., Jafri, M.N., Ahmad, J.: Target Tracking in an Image Sequence Using Wavelet Features and a Neural Network. In: Proceedings of IEEE Region 10: Tencon’05 Conference, 2005
Stauffer, C., Grimson, W.: Learning patterns of activity using real time tracking. IEEE Trans. Pattern Anal. Mach. Intell. 22(8), 747–767 (2000)
Wang, H., Suter, D., Schindler, K.: Effective Appearance Model and Similarity Measure for Particle Filtering and Visual Tracking. In: Proceedings of European Conference on Computer Vision, 2006
Perez, P., et al.: Color-Based Probabilistic Tracking. In: Proceedings of European Conference on Computer Vision, pp. 661–675 (2002)
Yilmaz, A., Li, X., Shah, M.: Contour-based object tracking with occlusion handling in video acquired using mobile cameras. IEEE Trans. Pattern Anal. Mach. Intell. 26(11), 1531 (2004)
Kass, M., Witkin, A., Terzopoulos, D.: Snakes: active contour models. Int. J. Comput. Vis. 1, 321–331 (1988)
Bleser, G., Becker, M., Stricker, D.: Real-time vision-based tracking and reconstruction. J. Real-Time Image Proc. 2, 165–171 (2007). doi:10.1007/s11554-007-0034-0
Peddigari, V., Kehtarnavaz, N.: Real-time predictive zoom tracking for digital still cameras. J. Real-Time Image Process. 2, 45–54 (2007). doi:10.1007/s11554-007-0036-y
Comaniciu, D., Visvanathan, R., Meer, P.: Kernel based object tracking. IEEE Trans. Pattern Anal. Mach. Intell. 25(5), 564–575 (2003)
Comaniciu, D., Ramesh, V., Meer, P.: Real-time Tracking of Non-Rigid Objects Using Mean Shift. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, Hilton Head, SC, vol. 1, pp. 142–149 (2000)
Porikli, F., Tuzel, O., Meer, P.: Covariance Tracking using Model Update Based on Lie Algebra. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2006
Porikli F., Tuzel, O.:Multi-kernel Object Tracking. In: Proceedings of IEEE International Conference on Multimedia and Expo, 2005
Adam, A., Rivlin, E., Shimshoni, I.: Robust Fragments-based Tracking Using the Integral Histogram. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2006
Porikli, F.: Integral Histogram: A Fast Way to Extract Histograms in Cartesian Spaces. IEEE Conference on Computer Vision and Pattern Recognition, 2005
Nixon, M., Aguado, A.: Feature Extraction and Image Processing. Newnes, Oxford (2002)
Gonzalez, R.C., Woods, R.E.: Digital Image Processing, 2nd Edn. Prentice-Hall Inc., USA (2002)
Ritter, G.X., Wilson, J.N.: Handbook of Computer Vision Algorithms in Image Algebra. CRC, Boca Raton (1996)
Kuglin, C., Hines, D.: The Phase Correlation Image Alignment Method. In: Proceedings on International Conference on Cybernetics and Society, pp. 163–165 (1975)
Wong, S.: Advanced Correlation Tracking of Objects in Cluttered Imagery. In: Proceedings of SPIE, vol. 5810 (2005)
Lewis, J.P.: Fast Normalized Cross-Correlation. Industrial Light and Magic, 1995
Ahmed, J., Jafri, M.N., Shah, M., Akbar, M.: Real-time edge-enhanced dynamic correlation and predictive open-loop car following control for robust tracking. Mach. Vis. Appl. J. 19(1), 1–25 (2008)
Kuo, B.C.: Automatic Control Systems, 7th Edn. Wiley, USA (1995)
Ross, T.J.: Fuzzy Logic with Engineering Applications, 2nd edn. Wiley, England (2004)
Nguyen, H.T., Prasad, N.R., Walker, C.L., Walker, E.A.: A First Course in Fuzzy and Neural Control. Chapman & Hall/CRC, Boca Raton (2003)
Mir-Nasiri, N.: Camera-based 3D Tracking. In: Proceedings of IEEE Region 10: Tencon’05 Conference, Melbourne, Australia, 2005
Basic Control Law for PTU to Follow a Moving Target. Application Note 01, Directed Perception Inc., 1996
E. Vermeulen, “Real-Time Video Stabilization for Moving Platforms,” 21st Aristotle UAV Systems Conference, April 2007
Tico, M., Vehvilainen, M.: Robust Method of Video Stabilization, In: EUSIPCO-07: European Signal and Image Processing Conference (2007)
Hu, R., Shi, R., Shen, I.f., Chen, W.: Video Stabilization using Scale Invariant Features. IV’07: In: Proceedings of IEEE 11th International Conference on Information Visualization (2007)
Inc., C., “Canon FAQ: What is vari-angle prism?” available at: http://www.canon.com/bctv/faq/vari.html (2011). Accessed 18 June, 2011
Battiato, S., Gallo, G., Puglisi, G.: Fuzzy-Based Motion Estimation for Video Stabilization Using SIFT Interest Points. In: Proceedings of SPIE, vol. 7250, 72500T; doi: 10.1117/12.805660 (2009)
Chang, H.-C., Lai, S.-H., Lu, K.-R.: A Robust and Efficient Video Stabilization Algorithm. ICME-’04: In: Proceedings of IEEE International Conference on Multimedia and Expo, June 2004
“Optical Flow”, available at: http://homepages.inf.ed.ac.uk/rbf/CVonline/LOCAL_COPIES/OWENS/LECT12/node4.html (2011). Accessed 18 June, 2011
Buehler, C. Bosse, M., McMillian, L.: Non-metric Image Based Rendering for Video Stabilization. In: Proceedings of CVPR-01, IEEE Conference on Computer Vision and Pattern Recognition (2001)
Auberger, S., Miro, C.: Digital Video Stabilization Architecture for Low Cost Devices. In: Proceedings of the 4th International Symposium on Image and Signal Processing and Analysis p. 474 (2005)
Jang, S.-W., Pomplun, M., Kim, G.-Y., Choi, H.-I.: Adaptive Robust Estimation of Affine Parameters from Block Motion Vectors. In: Image and Vision Computing, pp. 1250–1263, August 2005
Vella, F., Castorina, A., Mancuso, M., Messina, G.: Digital image stabilization by adaptive block motion vectors filtering. IEEE Trans. Consum. Electron. 48, 796–801 (2002)
Blackman, S., Popoli, R.: Design and Analysis of Modern Tracking Systems., pp. 309–313. Artech House, Boston (1999)
Fagiani C., Gips, J.: An Evaluation of Tracking Methods for Human-Computer Interaction, Senior Thesis, Computer Science Department, Boston College, Fulton Hall, Chestnut Hill, MA 02467 (2002)
Brookner, E.: Tracking and Kalman Filtering Made Easy. Wiley, USA (1998)
BMP Image Sequences for Elliptical Head Tracking: http://vision.stanford.edu/~birch/headtracker/seq/ (2011). Accessed 18 June, 2011
Nummiaroa, K., Koller-Meierb, E., Gool, L.V.: An adaptive color-based particle filter. Image Vis. Comput. 21, 99–110 (2003)
Isard, M., Blake, A.: CONDENSATION-conditional density propagation for visual tracking. Int. J. Comput. Vis. 29(1), 5–28 (1998)
Bradski, G., Kaehler, A., Pisarevsky, V.: Learning-based computer vision with open source computer vision library. Intel. Technol. J. 9(2) (2005)
Crow, F.: Summed-area tables for texture mapping. Comput. Graph. 18(3), 207–212 (1984)
Ahmed, J., Jafri, M.N.: Best-Match Rectangle Adjustment Algorithm for Persistent and Precise Correlation Tracking. In: Proceedings of IEEE International Conference on Machine Vision, 28–29 December 2007
Available at: http://www.fastpasses.com (2005). Accessed 19 July, 2005
Lathoud, G., Odobez, J., Gatica-Perez, D.: AV16.3: an Audio-Visual Corpus for Speaker Localization and Tracking. LNCS 3361, pp. 182–195, Springer-Verlag, Berlin (2005)
PETS 2001 dataset, publicly available at the website: http://www.research.ibm.com/peoplevision/performanceevaluation.html (2006). Accessed 11 November, 2006
CAVIAR (Context Aware Vision using Image-based Active Recognition) test video clips 2003-2004, available at http://homepages.inf.ed.ac.uk/rbf/CAVIAR (2003). Accessed 23 December, 2003