Dynamic stills and clip trailers
Tóm tắt
We propose a method for generating visual summaries of video. It reduces browsing time, minimizes screen-space utilization, while preserving the crux of the video content and the sensation of motion. The outputs are images or short clips, denoted as dynamic stills or clip trailers, respectively. The method selects informative poses out of extracted video objects. Optimal rotations and transparency supports visualization of an increased number of poses, leading to concise activity visualization. Our method addresses previously avoided scenarios, e.g., activities occurring in one place, or scenes with non-static background. We demonstrate and evaluate the method for various types of videos.
Tài liệu tham khảo
Abdi, H., Valentin, D., O’Toole, A.J., Edelman, B.: Distatis: The analysis of multiple distance matrices. In: Empirical Evaluation Methods in Computer Vision Workshop, pp. 42–47. San Diego, CA, USA (2005)
Agarwala, A., Dontcheva, M., Agrawala, M., Drucker, S., Colburn, A., Curless, B., Salesin, D., Cohen, M.: Interactive digital photomontage. ACM Trans. Graph. (SIGGRAPH) 23(3), 294–302 (2004)
Agarwala, A., Zheng, C., Pal, C., Agrawala, M., Cohen, M., Curless, B., Salesin, D., Szeliski, R.: Panoramic video textures. ACM Trans. Graph. (SIGGRAPH) 24(3), 821–827 (2005)
Assa, J., Caspi, Y., Cohen-Or, D.: Action synopsis: Pose selection and illustration. ACM Trans. Graph. (SIGGRAPH) 24(3), 667–676 (2005)
Axelrod, A.: Video previewing using pose slices. MSc thesis, Dept. of Computer Science Tel Aviv University (2006)
Axelrod, A., Caspi, Y., Gamliel, A., Matsushita, Y.: Interactive video exploration using pose slices. In: SIGGRAPH Sketches. Boston, MA, USA (2006)
Belkin, M., Niyogi, P.: Laplacian eigenmaps for dimensionality reduction and data representation. Neural Comput. 15(6), 1373–1396 (2003)
BenAbdelkader, C., Cutler, R., Davis, L.: Gait recognition using image self-similarity. EURASIP J. Appl. Signal Process. 15(4), 572–585 (2004)
Cassinelli, A., Ito, T., Ishikawa, M.: Khronos projector. In: SIGGRAPH Emerging Technologies. Los Angeles, CA, USA (2005)
Chiu, P., Girgensohn, A., Liu, Q.: Stained-glass visualization for highly condensed video summaries. In: Proceedings of the IEEE International Conference on Multimedia and Expo (ICME), pp. 2059–2062. IEEE, Taipei, Taiwan (2004)
CMU: Motion capture database, cmu graphics lab. http://mocap.cs.cmu.edu/ (2002)
CNN: Cable news network. http://www.cnn.com/video/ (2004)
Criminisi, A., Cross, G., Blake, A., Kolmogorov, V.: Bilayer segmentation of live video. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 53–61. New York, NY, USA (2006)
Google: Google video sharing service. http://video.google.com/ (2005)
Irani, M., Anandan, P., Hsu, S.: Mosaic based representations of video sequences and their applications. In: International Conference on Computer Vision, pp. 605–611. Washington, DC, USA (1995)
Kruskal, J.: Multidimensional scaling by optimizing goodness of fit to a nonmetric hypothesis. Psychometrika 29, 1–27 (1966)
Li, Y., Sun, J., Shum, H.Y.: Video object cut and paste. ACM Trans. Graph. (SIGGRAPH) 24(3), 595–600 (2005)
Liu, F., Zhuang, Y., Wu, F., Pan, Y.: 3D motion retrieval with motion index tree. Comput. Vis. Image Underst. 92(2–3), 265–284 (2003)
Loy, G., Sullivan, J., Carlsson, S.: Pose-based clustering in action sequences. In: Workshop on Higher-Level Knowledge in 3D Modeling & Motion Analysis, pp. 66–72. Nice, France (2003)
Massey, M., Bender, W.: Salient stills: Process and practice. IBM Syst. J. 35(3/4), 557–573 (1996)
Ng, A.Y., Jordan, M.I., Weiss, Y.: On spectral clustering: Analysis and an algorithm. In: Advances in Neural Information Processing Systems, pp. 849–856. MIT Press, Cambridge, MA (2001)
Rav-Acha, A., Pritch, Y., Lischinski, D., Peleg, S.: Dynamosaicing: Video mosaics with non-chronological time. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 58–65. San Diego, CA, USA (2005)
Roweis, S., Saul, L.: Nonlinear dimensionality reduction by locally linear embedding. Science 290(5500), 2323–2326 (2000)
Sun, J., Jia, J., Tang, C.K., Shum, H.Y.: Poisson matting. ACM Trans. Graph. (SIGGRAPH) 23(3), 315–321 (2004)
Sun, J., Zhang, W., Tang, X., Shum, H.Y.: Background cut. In: European Conference on Computer Vision, pp. 628–641. Graz, Austria (2006)
Szeliski, R., Shum, H.Y.: Creating full view panoramic image mosaics and environment maps. In: SIGGRAPH ’97: Proceedings of the 24th annual conference on Computer graphics and interactive techniques, pp. 251–258. New York, NY, USA (1997)
Taniguchi, Y., Akutsu, A., Tonomura, Y.: Panoramaexcerpts: extracting and packing panoramas for video browsing. In: MULTIMEDIA: Proceedings of the fifth ACM international conference on Multimedia, pp. 427–436. ACM Press, New York, NY, USA (1997)
Wang, J., Bhat, P., Colburn, R.A., Agrawala, M., Cohen, M.F.: Interactive video cutout. ACM Trans. Graph. (SIGGRAPH) 24(3), 585–594 (2005)