Adaptive classification-based articulation and tracking of video objects employing neural network retraining

N.D. Doulamis, A.D. Doulamis, K. Ntalianis
Department of Electrical and Computer Engineering, National Technical University of Athens, Zografou, Greece

Abstract

An adaptive neural network architecture is proposed for efficient video object segmentation and tracking in stereoscopic video sequences. The scheme includes (a) a retraining algorithm that adapts the network weights to the current conditions; (b) a semantically meaningful object extraction module that creates the retraining set; and (c) a decision mechanism that detects the time instances at which a new network retraining is required. The retraining algorithm optimally adapts the network weights by exploiting information about the current conditions while minimally degrading the knowledge the network has already acquired. It reduces to the minimization of a convex function subject to linear constraints, so a single minimum exists. A description of the current conditions is provided by a segmentation fusion algorithm, which combines color and depth information.
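The constrained minimization mentioned in the abstract can be illustrated with a small sketch. The abstract does not give the objective or the constraints, so everything below is an assumption made for illustration: a quadratic (hence convex) objective `||A dw - b||^2` standing in for the degradation of previous knowledge under a small weight update `dw`, and linear equality constraints `C dw = d` standing in for the requirement that the network fit the current retraining set. The names `adapt_weights`, `A`, `b`, `C`, `d` and `ridge` are hypothetical and do not come from the paper.

```python
import numpy as np


def adapt_weights(A, b, C, d, ridge=1e-8):
    """Minimize ||A dw - b||^2 + (ridge/2) ||dw||^2 subject to C dw = d.

    Illustrative assumption, not the paper's formulation:
      A, b : linearized effect of the update dw on the old training data
      C, d : linear constraints imposed by the current retraining set
    The small ridge term makes the objective strictly convex, so the
    minimizer is unique when C has full row rank.
    """
    n = A.shape[1]
    m = C.shape[0]
    H = 2.0 * (A.T @ A) + ridge * np.eye(n)  # Hessian of the objective
    g = 2.0 * (A.T @ b)                      # linear term of the objective
    # KKT conditions: H dw + C^T lam = g  and  C dw = d
    K = np.block([[H, C.T], [C, np.zeros((m, m))]])
    rhs = np.concatenate([g, d])
    solution = np.linalg.solve(K, rhs)
    return solution[:n]                      # drop the Lagrange multipliers


# Toy usage: keep the update as small as possible (A = I, b = 0)
# while forcing the two weight changes to sum to 2.
A = np.eye(2)
b = np.zeros(2)
C = np.array([[1.0, 1.0]])
d = np.array([2.0])
dw = adapt_weights(A, b, C, d)
print(dw)  # approximately [1, 1]
```

Solving the KKT system directly is only one way to handle a problem of this shape; it is chosen here because, for a quadratic objective with equality constraints, it makes the single-minimum property visible as the solvability of one linear system.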

Keywords

#Neural networks #Video sequences #MPEG 4 Standard #Standards development #Layout #Adaptive systems #Electronic mail #Computer architecture #Object segmentation #Data mining
