A review on multimodal video indexing

C.G.M. Snoek¹, M. Worring¹
¹ Intelligent Sensory Information Systems, University of Amsterdam, Amsterdam, Netherlands

Abstract

Efficient and effective handling of video documents depends on the availability of indexes. Manual indexing is infeasible for large video collections. Efficient video indexing methods based on a single modality have appeared in the literature. Effective indexing, however, requires a multimodal approach in which either the most appropriate modality is selected or the different modalities are used in a collaborative fashion. We present a framework for multimodal video indexing that views a video document from the perspective of its author. The framework serves as a blueprint for a generic and flexible multimodal video indexing system, and it generalizes different state-of-the-art video indexing methods. It also forms the basis for categorizing these methods.

Keywords

#Indexing #Video sharing #Intelligent systems #Intelligent sensors #Information systems #Document handling #Collaboration #Software libraries #Digital filters #Filtering
