The JPEG Pleno Light Field Coding Standard 4D-Transform Mode: How to Design an Efficient 4D-Native Codec
Alves, G.
; Carvalho, M.
; Pagliari, C.L.P.
; Garcia, P.
; Seidel, I.
; Pereira, M.
; Schueler, C.
; Testoni, V.
;
Pereira, F.
; Silva, E.
IEEE Access Vol. NA, Nº NA, pp. 1 - 23, January, 2020.
ISSN (print):
ISSN (online): 2169-3536
Scimago Journal Ranking: 0,59 (in 2020)
Digital Object Identifier: 10.1109/ACCESS.2020.3024844
Abstract
The increasing demand for highly realistic and immersive visual experiences has led to the
emergence of richer 3D visual representation models such as light fields, point clouds and meshes. Light
fields may be modelled as a 2D array of 2D views, corresponding to a large amount of data, which demands
for highly efficient coding solutions. Although static light fields are inherently 4D structures, the coding
solutions in the literature mostly employ traditional 2D coding tools associated with techniques such as
depth-based image rendering to generate a residual, again to be coded using available 2D coding tools. To
address the market needs, JPEG has launched the so-called JPEG Pleno standard, which Part 2 is dedicated
to light field coding. The JPEG Pleno Light Field Coding standard includes two coding modes, one based
on the 4D-DCT, so-called 4D-Transform mode, and another based on depth-based synthesis, so-called 4DPrediction.
The 4D-Transform coding mode standardizes for the first time a 4D-native light field coding
solution where the full light field redundancy across the four dimensions is comprehensively exploited,
somehow extending to 4D the 2D coding framework adopted decades ago by the popular JPEG Baseline
standard. In this context, this paper describes and analyzes in detail the conceptual and algorithmic design
process which has led to the creation of the JPEG Pleno Light Field Coding standard 4D-Transform coding
mode. This has happened through a sequence of steps involving technical innovation design and integration
where increasingly sophisticated coding tools have been combined and improved to maximize the final rate
distortion (RD) performance.