CATRF: Codec-Adaptive TriPlane Radiance Fields for Volumetric Content Delivery

University of Massachusetts Amherst
CVPR Findings 2026
Teaser image

Overview of different codec-integrated NeRF compression pipelines. Prior methods either optimize plane-factorized NeRFs without exposure to the real codec-induced distortions (Codec-Agnostic), or they insert a differentiable, custome codec into the training loop to achieve end-to-end optimization (Learned-Codec-in-the-Loop). Our SCL approach combines the practicality of CA with the remarkable RD trade-off of LCL by embedding real JPEG/VP9/HEVC/AV1 codec compression into the training loop.

Abstract

Volumetric media promises next-generation content delivery applications, but its bandwidth demand remains a key bottleneck. Implicit and hybrid volumetric representations reduce model sizes, yet still require careful coding to reach 2D video-like bitrates. We present CATRF, a standard-codec-in-the-loop compression framework for plane-factorized radiance fields. During training, we quantize and pack 2D feature planes into codec-friendly canvases, run a standard codec round trip (JPEG/VP9/HEVC/AV1), then unpack and dequantize the decoded features before volume rendering. We use a straight-through estimator (STE) to insert the non-differentiable, standard codec pipeline into the training loop, allowing radiance-field features to adapt directly to the real, client-side codec distortions without introducing any learned codec parameters. On both static and dynamic benchmarks, CATRF consistently achieve a better rate-distortion trade-off over codec-agnostic and learned-codec-in-the-loop baselines, and also outperforms recent compressed 3DGS methods in both compression efficiency and decoding speed. These results highlight a practical path toward low-bitrate, compression-resilient volumetric representations for free-viewpoint video streaming.

Quantitative and Qualitative Results