Change search
ReferencesLink to record
Permanent link

Direct link
Improving the Visual Experience When Coding Computer Generated Content Using the H.264 Standard
Norwegian University of Science and Technology, Faculty of Information Technology, Mathematics and Electrical Engineering, Department of Telematics.
2011 (English)MasteroppgaveStudent thesis
Abstract [en]

The purpose of this Master thesis was to improve the visual experience when coding computer generated content (CGC) using the H.264 standard. As H.264 is designed primarily to code natural video, it exhibits weaknesses when coding CGC at low bit rates. The thesis has focused on identifying and modifying the components in the H.264 algorithm responsible for the occurrence of unwanted noise artifacts. The research method was based on performing quantitative research to confirm or deny the hypothesis claiming that the H.264 algorithm performs sub-optimally when coding CGC. Experiments were conducted using coders written specically for the thesis. The results from these experiments were then analyzed, and conclusions were drawn based on empirical observations. An implementation of H.264 was used to identify the noise artifacts resulting from coding CGC at low rates. The results indicated that H.264 indeed performs sub-optimally when coding CGC. We learned that the reason for this was that the characteristics of CGC led to the signal being more compactly represented in the spatial domain than in the transform domain. We therefore proposed to omit the component transform and quantize the residual signal directly. This method, called residual scalar quantization (RSQ), was shown to outperform traditional H.264 coding for certain CGC in terms of quantified visual quality and bit rate. However, even when outperformed, the RSQ coder did not exhibit any of the noise artifacts present when coding with the traditional coder. We also introduced Rate-Distortion optimization, which allowed the coder to adaptively choose between traditional and RSQ coding, ensuring that each block is coded optimally, independent of the source content. This scheme was shown to outperform both stand-alone coders for all sample content. A quantizer with representation levels tailored specifically for the characteristics of CGC was also presented, and experiments showed that it outperformed uniform quantization when coding CGC. The results in this thesis were produced by simplified versions of the actual coders, and may not be completely accurate. However, the accumulated results indicate that RSQ may indeed outperform traditional H.264 coding for CGC. To confirm the theories that have been presented, the proposed techniques should be implemented in a full-scale implementation of H.264 and the experiments repeated.

Place, publisher, year, edition, pages
Institutt for elektronikk og telekommunikasjon , 2011. , 112 p.
Keyword [no]
ntnudaim:6284, MTKOM kommunikasjonsteknologi, Lyd- og bildebehandling
URN: urn:nbn:no:ntnu:diva-14081Local ID: ntnudaim:6284OAI: diva2:446122
Available from: 2011-10-06 Created: 2011-10-06 Last updated: 2013-06-23Bibliographically approved

Open Access in DiVA

fulltext(2743 kB)488 downloads
File information
File name FULLTEXT01.pdfFile size 2743 kBChecksum SHA-512
Type fulltextMimetype application/pdf
cover(47 kB)28 downloads
File information
File name COVER01.pdfFile size 47 kBChecksum SHA-512
Type coverMimetype application/pdf
attachment(92784 kB)1381 downloads
File information
File name ATTACHMENT01.zipFile size 92784 kBChecksum SHA-512
Type attachmentMimetype application/zip

By organisation
Department of Telematics

Search outside of DiVA

GoogleGoogle Scholar
Total: 488 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

Total: 70 hits
ReferencesLink to record
Permanent link

Direct link