Enhancing AMR-WB+ with a Conversational Mode
Independent thesis Advanced level (professional degree), 20 credits / 30 HE creditsStudent thesis
With the arrival of the 2.5G and 3G mobile networks, demands on the quality of mobile services have increased. The AMR-WB+ hybrid speech and generic audio codec standardized by the 3GPP has been shown to perform very well on both speech and generic audio signals, but it does however exhibit a longer algorithmic delay than other low bit rate speech coders. This makes the codec in its current form unsuitable for conversational use.This thesis investigates ways in which the algorithmic delay in the AMR-WB+ codec can be lowered, and specically targets the elimination of large codec frame sizes. The reason for these large frame sizes is due to the variable frame size transform-based coding method present in the internal TCX coder. Two new low delay transform modes are therefore presented, implemented and evaluated in this thesis. These are based on perceptually warped filter banks that aim to rectify the usual shortcomings of shorter transforms. The new encoding modes are finally evaluated in simulations and a small formal listening test is conducted. Listening tests show that the quality of the new encoding modes is noticeably inferior to the original codec on broad spectral signals without any compensation in bit rates.
Place, publisher, year, edition, pages
2012. , 66 p.
Technology, amr-wb+, audio coding, warped dft, codec, low delay, transform coding
IdentifiersURN: urn:nbn:se:ltu:diva-50510Local ID: 7c262c71-7038-4f14-bcf1-8055ac01dfa6OAI: oai:DiVA.org:ltu-50510DiVA: diva2:1023869
Subject / course
Student thesis, at least 30 credits
Media Engineering, master's level
Carlson, JohanSehlstedt, Martin
Validerat; 20120305 (anonymous)2016-10-042016-10-04Bibliographically approved