Time domain blind source separation of non-stationary convolved signals by utilizing geometric beamforming

Proceedings of the 12th IEEE Workshop on Neural Networks for Signal Processing - pp. 445-454

R. Aichner¹,²,³, S. Araki¹, S. Makino¹, T. Nishikawa⁴, H. Saruwatari⁴

¹NTT Communication Science Laboratories, NTT Corporation, Kyoto, Japan

²University of Applied Sciences, Regensburg, Germany

³Telecommunications Laboratory, University of Erlangen, Nuremberg, Germany

⁴Nara Institute of Science and Technology, Ikoma, Japan

Abstract

We propose a time-domain blind source separation (BSS) algorithm that utilizes geometric information such as sensor positions and assumed locations of sources. The algorithm tackles the problem of convolved mixtures by explicitly exploiting the non-stationarity of the acoustic sources. The learning rule is based on second-order statistics and is derived by natural gradient minimization. The proposed initialization of the algorithm is based on the null beamforming principle. This method leads to improved separation performance, and the algorithm is able to estimate long unmixing FIR filters in the time domain due to the geometric initialization. We also propose a post-filtering method for dewhitening which is based on the scaling technique in frequency-domain BSS. The validity of the proposed method is shown by computer simulations. Our experimental results confirm that the algorithm is capable of separating real-world speech mixtures and can be applied to short learning data sets down to a few seconds. Our results also confirm that the proposed dewhitening post-filtering method maintains the spectral content of the original speech in the separated output.
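The null beamforming initialization mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a two-microphone, two-source far-field setup, and the function name `null_beamformer_init`, the microphone spacing, sampling rate, filter length, and speed of sound are all hypothetical values chosen for illustration. For each frequency bin, the unmixing matrix is taken as the (pseudo-)inverse of the matrix of assumed steering vectors, so each output passes one assumed source direction and places a spatial null on the other. An inverse FFT then gives time-domain FIR filters that could serve as a starting point for a time-domain BSS update.

```python
import numpy as np

def null_beamformer_init(theta1_deg, theta2_deg, mic_spacing=0.04,
                         filter_len=256, fs=8000, c=343.0):
    """Sketch of a 2x2 null-beamformer initialization (all defaults are assumptions).

    Returns the per-bin unmixing matrices W (bins, 2, 2), the real FIR
    filters w_time (filter_len, 2, 2), and the bin frequencies in Hz.
    """
    freqs = np.fft.rfftfreq(filter_len, d=1.0 / fs)
    thetas = np.deg2rad([theta1_deg, theta2_deg])
    # Far-field delay of each source at microphone 2 relative to microphone 1.
    delays = mic_spacing * np.sin(thetas) / c

    W = np.zeros((len(freqs), 2, 2), dtype=complex)
    for i, f in enumerate(freqs):
        # Columns are the assumed steering vectors of source 1 and source 2.
        H = np.array([
            [1.0, 1.0],
            [np.exp(-2j * np.pi * f * delays[0]),
             np.exp(-2j * np.pi * f * delays[1])],
        ])
        # The pseudo-inverse copes with bins where H is singular (e.g. DC).
        W[i] = np.linalg.pinv(H)

    # Back to the time domain; the circular shift centers the filters so
    # that they become causal at the cost of a bulk delay.
    w_time = np.fft.irfft(W, n=filter_len, axis=0)
    w_time = np.roll(w_time, filter_len // 2, axis=0)
    return W, w_time, freqs

if __name__ == "__main__":
    W, w_time, freqs = null_beamformer_init(-30.0, 40.0)
    print(w_time.shape)
```

In this sketch, `w_time[:, p, q]` is the FIR filter from microphone `q` to output `p`. The assumed source angles only need to be rough, since an adaptive update would refine the filters afterwards.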

Keywords

#Blind source separation #Source separation #Finite impulse response filter #Speech #Frequency domain analysis #Array signal processing #Acoustic sensors #Laboratories #Time domain analysis #Statistics

