Yayınlanmış 1 Ocak 2023 | Sürüm v1
Konferans bildirisi Açık

MULTI-SCALE DEFORMABLE ALIGNMENT AND CONTENT-ADAPTIVE INFERENCE FOR FLEXIBLE-RATE BI-DIRECTIONAL VIDEO COMPRESSION

  • 1. Koc Univ, Dept Elect & Elect Engn, Istanbul, Turkiye

Açıklama

The lack of ability to adapt the motion compensation model to video content is an important limitation of current end-to-end learned video compression models. This paper advances the state-of-the-art by proposing an adaptive motion-compensation model for end-to-end rate-distortion optimized hierarchical bi-directional video compression. In particular, we propose two novelties: i) a multi-scale deformable alignment scheme at the feature level combined with multi-scale conditional coding, ii) motion-content adaptive inference. In addition, we employ a gain unit, which enables a single model to operate at multiple rate-distortion operating points. We also exploit the gain unit to control bit allocation among intra-coded vs. bi-directionally coded frames by fine tuning corresponding models for truly flexible-rate learned video coding. Experimental results demonstrate state-of-the-art rate-distortion performance exceeding those of all prior art in learned video coding(1).

Dosyalar

bib-2223317a-a2f5-448a-8a47-0d3567768682.txt

Dosyalar (220 Bytes)

Ad Boyut Hepisini indir
md5:897361f6f20b7ca27d8ec09810b07280
220 Bytes Ön İzleme İndir