Change search
ReferencesLink to record
Permanent link

Direct link
A Scalable Coding Approach for High Quality Depth Image Compression
Mid Sweden University, Faculty of Science, Technology and Media, Department of Information Technology and Media. (Realistic 3D)
Mid Sweden University, Faculty of Science, Technology and Media, Department of Information Technology and Media. (Realistic 3D)
Mid Sweden University, Faculty of Science, Technology and Media, Department of Information Technology and Media. (Realistic 3D)
Mid Sweden University, Faculty of Science, Technology and Media, Department of Information Technology and Media. (Realistic 3D)
2012 (English)In: 3DTV-Conference, IEEE conference proceedings, 2012, Art. no. 6365469- p.Conference paper, Abstract (Refereed)
Abstract [en]

The distortion by using traditional video encoders (e.g. H.264) on the depth discontinuity can introduce disturbing effects on the synthesized view. The proposed scheme aims at preserving the most significantdepth transition for a better view synthesis. Furthermore, it has a scalable structure. The scheme extracts edge contours from a depth image and represents them by chain code. The chain code and the sampleddepth values on each side of the edge contour are encoded by differential and arithmetic coding. The depthimage is reconstructed by diffusion of edge samples and uniform sub-samples from the low quality depthimage. At low bit rates, the proposed scheme outperforms HEVC intra at the edges in the synthesized views, which correspond to the significant discontinuities in the depth image. The overall quality is also better with the proposed scheme at low bit rates for contents with distinct depth transition. © 2012 IEEE.

Place, publisher, year, edition, pages
IEEE conference proceedings, 2012. Art. no. 6365469- p.
Keyword [en]
3DTV; Depth image coding; View synthesis
National Category
Computer Science
URN: urn:nbn:se:miun:diva-17433DOI: 10.1109/3DTV.2012.6365469ScopusID: 2-s2.0-84872086592Local ID: STCISBN: 978-1-4673-4903-1OAI: diva2:572047
3DTV-CONFERENCE 2012 The True Vision Capture, Transmission and Display of 3D Video, ETH Zurich, October 15-17, 2012
Available from: 2013-01-16 Created: 2012-11-26 Last updated: 2016-10-20Bibliographically approved
In thesis
1. Coding of three-dimensional video content: Depth image coding by diffusion
Open this publication in new window or tab >>Coding of three-dimensional video content: Depth image coding by diffusion
2013 (English)Licentiate thesis, comprehensive summary (Other academic)
Abstract [en]

Three-dimensional (3D) movies in theaters have become a massive commercial success during recent years, and it is likely that, with the advancement of display technologies and the production of 3D contents, TV broadcasting in 3D will play an important role in home entertainments in the not too distant future. 3D video contents contain at least two views from different perspectives for the left and the right eye of viewers. The amount of coded information is doubled if these views are encoded separately. Moreover, for multi-view displays (i.e. different perspectives of a scene in 3D are presented to the viewer at the same time through different angles), either video streams of all the required views must be transmitted to the receiver, or the displays must synthesize the missing views with a subset of the views. The latter approach has been widely proposed to reduce the amount of data being transmitted. The virtual views can be synthesized by the Depth Image Based Rendering (DIBR) approach from textures and associated depth images. However it is still the case that the amount of information for the textures plus the depths presents a significant challenge for the network transmission capacity. An efficient compression will, therefore, increase the availability of content access and provide a better video quality under the same network capacity constraints.

In this thesis, the compression of depth images is addressed. These depth images can be assumed as being piece-wise smooth. Starting from the properties of depth images, a novel depth image model based on edges and sparse samples is presented, which may also be utilized for depth image post-processing. Based on this model, a depth image coding scheme that explicitly encodes the locations of depth edges is proposed, and the coding scheme has a scalable structure. Furthermore, a compression scheme for block-based 3D-HEVC is also devised, in which diffusion is used for intra prediction. In addition to the proposed schemes, the thesis illustrates several evaluation methodologies, especially, the subjective test of the stimulus-comparison method. It is suitable for evaluating the quality of two impaired images, as the objective metrics are inaccurate with respect to synthesized views.

The MPEG test sequences were used for the evaluation. The results showed that virtual views synthesized from post-processed depth images by using the proposed model are better than those synthesized from original depth images. More importantly, the proposed coding schemes using such a model produced better synthesized views than the state of the art schemes. As a result, the outcome of the thesis can lead to a better quality of 3DTV experience.

Place, publisher, year, edition, pages
Sundsvall: Mid Sweden University, 2013. 36 p.
Mid Sweden University licentiate thesis, ISSN 1652-8948
National Category
Engineering and Technology Signal Processing
urn:nbn:se:miun:diva-19087 (URN)STC (Local ID)978-91-87103-76-6 (ISBN)STC (Archive number)STC (OAI)
Available from: 2013-06-11 Created: 2013-06-06 Last updated: 2016-10-20Bibliographically approved

Open Access in DiVA

3DCON(454 kB)258 downloads
File information
File name FULLTEXT01.pdfFile size 454 kBChecksum SHA-512
Type fulltextMimetype application/pdf

Other links

Publisher's full textScopus

Search in DiVA

By author/editor
Li, YunSjöström, MårtenJennehag, UlfOlsson, Roger
By organisation
Department of Information Technology and Media
Computer Science

Search outside of DiVA

GoogleGoogle Scholar
Total: 258 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

Altmetric score

Total: 725 hits
ReferencesLink to record
Permanent link

Direct link