Parametric Coding for Spatial Audio
KTH, School of Engineering Sciences (SCI), Aeronautical and Vehicle Engineering, Marcus Wallenberg Laboratory MWL.
2016 (English). Independent thesis, Advanced level (degree of Master (Two Years)), 20 credits / 30 HE credits. Student thesis
Abstract [en]

This thesis presents a stereo coding technique designed as an extension to the Enhanced Voice Services (EVS) codec [10] [8]. EVS is an audio codec recently standardized by the 3rd Generation Partnership Project (3GPP) for compressing mono signals at selected rates from 7.2 to 128 kbit/s (fixed bit rate) and at around 5.9 kbit/s (variable bit rate). The main goal of the thesis is to present the architecture of a parametric stereo codec and to show how a stereo extension of EVS may be built. Parametric stereo coding relies on transmitting a downmixed signal, the sum of the left and right channels, together with the audible cues needed to resynthesize the stereo image from it at the decoder. The codec has been implemented in MATLAB using the existing EVS codec. A substantial part of the thesis describes the implementation of a robust downmixing technique. The remaining parts present the parametric coding architecture that was adapted and used to develop the EVS stereo extension at 24.4 and 32 kbit/s, along with further exploratory research conducted for more specific situations such as spatial coding for stereo or binaural applications. Whereas the quality of the downmixing algorithm has been evaluated in subjective tests and proven to be more efficient than any other existing technique, the stereo extension has been tested less extensively. Nevertheless, the quality reached with the proposed reconstruction algorithms tends to highlight the potential of the codec, which future work could realize.

Place, publisher, year, edition, pages
2016. , p. 70
Series
TRITA-AVE, ISSN 1651-7660 ; 2016:17
National Category
Engineering and Technology
Identifiers
URN: urn:nbn:se:kth:diva-198520
OAI: oai:DiVA.org:kth-198520
DiVA, id: diva2:1057217
External cooperation
Orange
Supervisors
Examiners
Available from: 2016-12-16. Created: 2016-12-16. Last updated: 2016-12-16. Bibliographically approved.

Open Access in DiVA

fulltext (6226 kB), 205 downloads
File information
File name: FULLTEXT01.pdf. File size: 6226 kB. Checksum (SHA-512):
3792d730433fc4d4e3bd2e09aa9cd6a0f3748b7fb1b6a103d0be1318518e0c0dd58fc6d6c6dd22c70da980dc747319df5643d63f5330a3060ade61db0ad7e81b
Type: fulltext. Mimetype: application/pdf
