Time domain blind source separation of non-stationary convolved signals by utilizing geometric beamforming

R. Aichner1,2,3, S. Araki1, S. Makino1, T. Nishikawa4, H. Saruwatari4
1NTT Communication Science Laboratories, NTT Corporation, Kyoto, Japan
2University of Applied Sciences, Regensburg, Germany
3Telecommunications Laboratory, University of Erlangen, Nuremberg, Germany
4Nara Institute of Science and Technology, Ikoma, Japan

Tóm tắt

We propose a time-domain blind source separation (BSS) algorithm that utilizes geometric information such as sensor positions and assumed locations of sources. The algorithm tackles the problem of convolved mixtures by explicitly exploiting the non-stationarity of the acoustic sources. The learning rule is based on second-order statistics and is derived by natural gradient minimization. The proposed initialization of the algorithm is based on the null beamforming principle. This method leads to improved separation performance, and the algorithm is able to estimate long unmixing FIR filters in the time domain due to the geometric initialization. We also propose a post-filtering method for dewhitening which is based on the scaling technique in frequency-domain BSS. The validity of the proposed method is shown by computer simulations. Our experimental results confirm that the algorithm is capable of separating real-world speech mixtures and can be applied to short learning data sets down to a few seconds. Our results also confirm that the proposed dewhitening post-filtering method maintains the spectral content of the original speech in the separated output.

Từ khóa

#Blind source separation #Source separation #Finite impulse response filter #Speech #Frequency domain analysis #Array signal processing #Acoustic sensors #Laboratories #Time domain analysis #Statistics

Tài liệu tham khảo

lee, 1998, Independent Component Analysis nishikawa, 2002, Blind source separation based on multistage ICA using frequency-domain ICA and time-domain ICA, Proc of ICF 10.1109/NNSP.2001.943132 sun, 2001, A natural gradient convolutive Blind Source Separation algorithm for speech mixtures, Proc ICASSP 01 10.1109/5.720250 10.1109/ICASSP.2001.940212 hyvaerinen, 2001, Independent Component Analysis haykin, 2000, Unsupervised Adaptive Filtering, Volume 1 Blind Source Separation 10.1016/S0925-2312(98)00055-1 ikeda, 1999, A Method of ICA in time-frequency domain, Proc ICA '99, 365 araki, 2001, Equivalence between Frequency Domain Blind Source Separation and Frequency Domain Adaptive Null Beamformers, Proc Eurospeech'01, 2595 10.1162/089976698300017746 kobayashi, 1992, ASJ continuous speech corpus for research, J Acoust Soc Jpn, 48, 888