Home
Scholarly Works
A frequency domain method for blind source...
Journal article

A frequency domain method for blind source separation of convolutive audio mixtures

Abstract

In this paper, we propose a new frequency domain approach to blind source separation (BSS) of audio signals mixed in a reverberant environment. We propose a joint diagonalization procedure on the cross power spectral density matrices of the signals at the output of the mixing system to identify the mixing system at each frequency bin up to a scale and permutation ambiguity. The frequency domain joint diagonalization is performed using a new and quickly converging algorithm which uses an alternating least-squares (ALS) optimization method. The inverse of the mixing system is then used to separate the sources. An efficient dyadic algorithm to resolve the frequency dependent permutation ambiguities that exploits the inherent nonstationarity of the sources is presented. The effect of the unknown scaling ambiguities is partially resolved using an initialization procedure for the ALS algorithm. The performance of the proposed algorithm is demonstrated by experiments conducted in real reverberant rooms. Performance comparisons are made with previous methods.

Authors

Rahbar K; Reilly JP

Journal

IEEE Transactions on Audio Speech and Language Processing, Vol. 13, No. 5, pp. 832–844

Publisher

Institute of Electrical and Electronics Engineers (IEEE)

Publication Date

September 1, 2005

DOI

10.1109/tsa.2005.851925

ISSN

2329-9290

Contact the Experts team