Adaptive classification-based articulation and tracking of video objects employing neural network retraining
2002 14th International Conference on Digital Signal Processing Proceedings. DSP 2002 (Cat. No.02TH8628) - Tập 2 - Trang 575-578 vol.2
Tóm tắt
An adaptive neural network architecture is proposed for efficient video object segmentation and tracking of stereoscopic video sequences. The scheme includes (a) a retraining algorithm for adapting network weights to current conditions; (b) a semantically meaningful object extraction module for creating a retraining set; (c) a decision mechanism, which detects the time instances of a new network retraining. The retraining algorithm optimally adapts network weights by exploiting information of the current conditions and simultaneously minimally degrading the obtained network knowledge. The algorithm results in the minimization of a convex function subject to linear constraints, thus, one minimum exists. Furthermore, a decision mechanism is included to detect the time instances that a new network retraining is required. A description of the current conditions is provided by a segmentation fusion algorithm, which appropriately combines color and depth information.
Từ khóa
#Neural networks #Video sequences #MPEG 4 Standard #Standards development #Layout #Adaptive systems #Electronic mail #Computer architecture #Object segmentation #Data miningTài liệu tham khảo
kim, 1999, A VOP Generation Tool: Automatic Segmentation of Moving Objects in Image Sequences Based on Spatio-Temporal Information, IEEE Transactions on Cicuits and Systems for Video Technology, 9, 1216, 10.1109/76.809157
10.1109/76.718503
gu, 1998, Semiautomatic Segmentation and Tracking of Semantic Video Objects, IEEE Trans Cicuits Syst Video Techol, 8, 572, 10.1109/76.718504
10.1109/76.718505
10.1109/72.822517
10.1109/76.844996
yco, 1995, Rapid Scene Analysis on Compressed Videos, IEEE Trans Circuits and Systems for Video Technology, 5, 533, 10.1109/76.475896
10.1109/TSMC.1981.4308619
10.1016/1047-3203(90)90014-M
wang, 1994, Representing Moving Images with Layers, IEEE Trans Image Processing, 3, 625, 10.1109/83.334981
10.1049/ip-f-1.1986.0025
10.1109/76.718501
avid, 1985, Determining Three-dimensional Motion and Structure from Optical Flow Generated by Several Moving Objects, IEEE Trans Pattern Anal Machine Intel, pami 7, 384, 10.1109/TPAMI.1985.4767678
10.1109/76.736718
10.1109/76.554415
10.1109/76.809155
