Change search
ReferencesLink to record
Permanent link

Direct link
Speaker Separation Investigation
Blekinge Institute of Technology, School of Engineering, Department of Telecommunication Systems.
2007 (English)Independent thesis Advanced level (degree of Master (One Year))Student thesisAlternative title
Högtalareavskiljandeutredning (Swedish)
Abstract [en]

This report describes two important investigations which formed part of an overall project aimed at separating overlapping speech signals. The first investigation uses chirp signals to measure the acoustic transfer functions which would typically be found in the speaker separation project. It explains the behaviour of chirps in acoustic environments that can be further used to find the room reverberations as well, besides their relevance to measuring the transfer functions in conjunction with speaker separation. Chirps that have been used in this part are logarithmic and linear chirps. They have different lengths and are analysed in two different acoustic environments. Major findings are obtained in comparative analysis of different chirps in terms of their cross-correlations, specgrams and power spectrum magnitude. The second investigation deals with using automatic speech recognition (ASR) system to test the performance of the speaker separation algorithm with respect to word accuracy of different speakers. Speakers were speaking in two different scenarios and these were nonoverlapping and overlapping scenarios. In non-overlapping scenario speakers were speaking alone and in overlapping scenario two speakers were speaking simultaneously. To improve the performance of speaker separation in the overlapping scenario, I was working very close with my fellow colleague Mr. Holfeld who was improving the existing speech separation algorithm. After cross-examining our findings, we improved the existing speech separation algorithm. This further led to improvement in word accuracy of the speech recognition software in overlapping scenario.

Place, publisher, year, edition, pages
2007. , 121 p.
Keyword [en]
Speaker Separation, Acoustics, Cross-correlation, Automatic Speech Recognition, Room Impulse Response, Maximal Length Sequences, Chirps
National Category
Signal Processing Telecommunications
URN: urn:nbn:se:bth-5794Local ID: diva2:833197
Email Contact: Mobile: +46708290539Available from: 2015-04-22 Created: 2007-08-20 Last updated: 2015-06-30Bibliographically approved

Open Access in DiVA

fulltext(1001 kB)44 downloads
File information
File name FULLTEXT01.pdfFile size 1001 kBChecksum SHA-512
Type fulltextMimetype application/pdf

By organisation
Department of Telecommunication Systems
Signal ProcessingTelecommunications

Search outside of DiVA

GoogleGoogle Scholar
Total: 44 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

Total: 53 hits
ReferencesLink to record
Permanent link

Direct link