Time domain blind source separation of non-stationary convolved signals by utilizing geometric beamforming
Tóm tắt
We propose a time-domain blind source separation (BSS) algorithm that utilizes geometric information such as sensor positions and assumed locations of sources. The algorithm tackles the problem of convolved mixtures by explicitly exploiting the non-stationarity of the acoustic sources. The learning rule is based on second-order statistics and is derived by natural gradient minimization. The proposed initialization of the algorithm is based on the null beamforming principle. This method leads to improved separation performance, and the algorithm is able to estimate long unmixing FIR filters in the time domain due to the geometric initialization. We also propose a post-filtering method for dewhitening which is based on the scaling technique in frequency-domain BSS. The validity of the proposed method is shown by computer simulations. Our experimental results confirm that the algorithm is capable of separating real-world speech mixtures and can be applied to short learning data sets down to a few seconds. Our results also confirm that the proposed dewhitening post-filtering method maintains the spectral content of the original speech in the separated output.
Từ khóa
#Blind source separation #Source separation #Finite impulse response filter #Speech #Frequency domain analysis #Array signal processing #Acoustic sensors #Laboratories #Time domain analysis #StatisticsTài liệu tham khảo
lee, 1998, Independent Component Analysis
nishikawa, 2002, Blind source separation based on multistage ICA using frequency-domain ICA and time-domain ICA, Proc of ICF
10.1109/NNSP.2001.943132
sun, 2001, A natural gradient convolutive Blind Source Separation algorithm for speech mixtures, Proc ICASSP 01
10.1109/5.720250
10.1109/ICASSP.2001.940212
hyvaerinen, 2001, Independent Component Analysis
haykin, 2000, Unsupervised Adaptive Filtering, Volume 1 Blind Source Separation
10.1016/S0925-2312(98)00055-1
ikeda, 1999, A Method of ICA in time-frequency domain, Proc ICA '99, 365
araki, 2001, Equivalence between Frequency Domain Blind Source Separation and Frequency Domain Adaptive Null Beamformers, Proc Eurospeech'01, 2595
10.1162/089976698300017746
kobayashi, 1992, ASJ continuous speech corpus for research, J Acoust Soc Jpn, 48, 888