Direction of Arrival Estimation for Speech Sources using Fourth Order Cross Cumulants
Blekinge Institute of Technology, School of Engineering, Department of Signal Processing2008 (English)Conference paper (Refereed) Published
In many applications where speech separation and enhancement is of interest, e.g. conferencing systems, mobile phones and hearing aids, accurate speaker localization is important. This paper presents an alternative criteria for the well known Steered Response Power with Phase Transform (SRP-PHAT) algorithm, in which the steered response relates to peaks in the fourth order cross cumulant, rather than peaks in the second order cross cumulant, i.e. the cross power spectrum. Since speech sources have a Probability Density Function (PDF) close to the Laplacian distribution and noise are generally closer to the Gaussian distribution, the fourth order cumulant becomes a good alternative for the steered response search for speech sources. The proposed method is evaluated and compared to the original SRP-PHAT algorithm and shows significant improvements in localization performance for speech sources.
Place, publisher, year, edition, pages
Seattle: IEEE , 2008.
Localization, Delay estimation, Higher order statistics
IdentifiersURN: urn:nbn:se:bth-8499ISI: 000258532101155Local ID: oai:bth.se:forskinfo4F23DB790184F372C12574A40059C144OAI: oai:DiVA.org:bth-8499DiVA: diva2:836225
International Symposium on Circuits and Systems