Combining 2D and 3D deep models for action recognition with depth informationSignal, Image and Video Processing - Tập 12 - Trang 1197-1205 - 2018
Ali Seydi Keçeli, Aydın Kaya, Ahmet Burak Can
In activity recognition, usage of depth data is a rapidly growing research area. This paper presents a method for recognizing single-person activities and dyadic interactions by using deep features extracted from both 3D and 2D representations, which are constructed from depth sequences. First, a 3D volume representation is generated by considering spatiotemporal information in depth frames of an ...... hiện toàn bộ
Optimizing nonlinear activation function for convolutional neural networksSignal, Image and Video Processing - Tập 15 - Trang 1323-1330 - 2021
Munender Varshney, Pravendra Singh
Activation functions play a critical role in the training and performance of the deep convolutional neural networks. Currently, the rectified linear unit (ReLU) is the most commonly used activation function for the deep CNNs. ReLU is a piecewise linear function that will output the input directly if it is positive, otherwise, it will output zero. In this work, we propose a novel approach to genera...... hiện toàn bộ
VEDesc: vertex-edge constraint on local learned descriptorsSignal, Image and Video Processing - Tập 17 - Trang 865-872 - 2021
Jianhua Yin, Longzhen Zhu, Yang Bai, Zhenyu He
To improve the performance of local learned descriptors, many researchers pay primary attention to the triplet loss network. As expected, it is useful to achieve state-of-the-art performance on various datasets. However, these local learned descriptors suffer from the inconsistency problem without considering the relationship between two descriptors in a patch. Consequently, the problem causes the...... hiện toàn bộ
PET–MRI image fusion using adaptive filter based on spectral and spatial discrepancySignal, Image and Video Processing - Tập 13 - Trang 135-143 - 2018
Arash Saboori, Javad Birjandtalab
Recently, medical imaging equipment has undergone major developments. They play an important role in healthcare industry since they provide visual interpretation of human organs. Magnetic resonance imaging (MRI) and positron emission tomography (PET) are two well-known technologies which capture the structural and functional characteristics of the body organs, respectively. Fusing such functional ...... hiện toàn bộ
DWT-based joint antenna selection for correlated MIMO channelsSignal, Image and Video Processing - Tập 3 - Trang 35-45 - 2008
Ehab Farouk Badran
This paper proposes a new discrete wavelet transform (DWT)-based joint antenna selection scheme for spatially correlated multiple-input multiple output (MIMO) channels. To reduce the severe performance degradation of the traditional antenna selection schemes in correlated channels, a new scheme which employ joint antenna selection (JAS) at both link ends algorithm and embed DWT operations in the r...... hiện toàn bộ
A focus fusion attention mechanism integrated with image captions for knowledge graph-based visual question answeringSignal, Image and Video Processing - - Trang 1-12 - 2024
Mingyang Ma, Turdi Tohti, Yi Liang, Zicheng Zuo, Askar Hamdulla
Visual question answering tasks based on the knowledge graph are dedicated to integrating rich information in the knowledge graph to deal with complex questions that cannot be solved by image features alone while focusing on improving the performance of fundamental visual question answering tasks. The core of this task is to achieve effective cross-modal information fusion and resolve the semantic...... hiện toàn bộ
Comments on “The fractional Laplace transform”Signal, Image and Video Processing - Tập 8 - Trang 489-490 - 2012
Manuel D. Ortigueira
In Sharma (SIViP 4:377–379, 2010) a fractional Laplace transform assumed to generalize the fractional Fourier transform was proposed. Here, it is shown that its region of convergence degenerates to the imaginary axis. So it is not a generalization of the fractional Fourier transform.