Change search
ReferencesLink to record
Permanent link

Direct link
iVector Based Language Recognition
Norwegian University of Science and Technology, Faculty of Information Technology, Mathematics and Electrical Engineering, Department of Electronics and Telecommunications.
2012 (English)MasteroppgaveStudent thesis
Abstract [en]

The focus of this thesis is an fairly new approach to phonotactic language recognition, i.e. identifying a language from the sounds in an spoken utterance, known as iVector subspace modeling. The goal of the iVector is to compactly represent the discriminative information in a utterance so that further processing of the utterance is less computationally intensive. This might enable the system to be trained with more data, and thereby reach an higher performance. We present both the theory behind iVectors and experiments to better fit the iVector space to our development data. The final system got comparable result to our baseline PRLM system on the NIST LRE03 30 second evaluation set.

Place, publisher, year, edition, pages
Institutt for elektronikk og telekommunikasjon , 2012. , 83 p.
Keyword [no]
ntnudaim:8174, MTKOM kommunikasjonsteknologi, Lyd- og bildebehandling
URN: urn:nbn:no:ntnu:diva-19079Local ID: ntnudaim:8174OAI: diva2:566457
Available from: 2012-11-08 Created: 2012-11-08

Open Access in DiVA

fulltext(840 kB)1497 downloads
File information
File name FULLTEXT01.pdfFile size 840 kBChecksum SHA-512
Type fulltextMimetype application/pdf
cover(184 kB)39 downloads
File information
File name COVER01.pdfFile size 184 kBChecksum SHA-512
Type coverMimetype application/pdf

By organisation
Department of Electronics and Telecommunications

Search outside of DiVA

GoogleGoogle Scholar
Total: 1497 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

Total: 117 hits
ReferencesLink to record
Permanent link

Direct link