JPEG AI Compressed Domain Face Detection

Alkhateeb, A. ; Gnutti, A. ; Guerrini, F. ; Leonardi, R. ; Ascenso, J. ; Pereira, F.

JPEG AI Compressed Domain Face Detection, Proc. IEEE Workshop on MultiMedia Signal Processing - MMSP, West Lafayette, United States, October 2024.

Abstract
Learning-based image coding has achieved competitive compression efficiency while also offering a key advantage: computer vision tasks can be carried out directly in the compressed domain. The latent representation generated by deep learning techniques may natively encapsulate all the visual features needed for processing tasks, eliminating the need to run the expensive synthesis transform at the decoder side. This paper proposes performing face detection on the latent code of the JPEG AI architecture. First, experiments show that decoded images can be efficiently processed for face detection without retraining, albeit with some performance degradation. Then, for the first time, a compressed-domain RetinaFace-based detector applied to JPEG AI latent representations is proposed. Its performance is comparable to that of the original RetinaFace applied to reconstructed JPEG AI images, while computational complexity is reduced because the image decoding process is bypassed. It is expected that this approach could be extended to other vision tasks, since the JPEG AI representation format is not tailored to any specific computer vision task.