Day 1
Wednesday, June 12, 2024
08:50-09:00 | Opening Remarks |
09:00-10:00 | Keynote I Recent Developments and Perspectives in Video Coding and Its Standardization Speaker: Prof. Dr.-Ing. Jens-Rainer Ohm, RWTH Aachen University, Germany Moderator: Prof. C.-C. Jay Kuo, Univ. of Southern California, USA |
10:00-10:30 | Coffee Break |
10:30-12:00 | (Special) Oral Session W-1 Advances in Next Generation Video Coding Paper ID: 72, 70, 69, 51, 17 Chair: Dr. Wang-Q Lim, Fraunhofer HHI, Germany |
12:00-13:30 | Lunch & Industry Talk Speaker: Prof. Homer H. Chen, National Taiwan University, Taiwan Chair: Prof. Rachel Chiang, National Chung Cheng University, Taiwan |
13:30-15:00 | Oral Session W-2 Emerging Image/Video Coding and Optimization Paper ID: 85, 56, 24, 53, 74 Chair: Dr. Charles Bonnineau, InterDigital, Canada |
15:00-15:30 | Coffee Break |
15:30-16:30 | Online Session I Standards, Video Coding, and Quality Assessment Paper ID: 77, 58, 21, 101, 97, 45, 105 Chair: PD Mathias Wien, RWTH Aachen Univ., Germany |
16:30-18:00 | Oral Session W-3 Learned Image Coding Paper ID: 83, 9, 5, 14, 108 Chair: Prof. Seishi Takamura, Hosei University, Japan |
18:30-20:00 | Cocktail Reception |
Day 2
Thursday, June 13, 2024
09:00-10:00 | Keynote II Progress and Opportunities in Video Coding from Chip Design Perspective Speaker: Dr. Kevin Jou, MediaTek, Taiwan Moderator: Prof. Chia-Wen Lin, National Tsing Hua University, Taiwan |
10:00-10:30 | Coffee Break |
10:30-12:00 | Oral Session T-1 Energy and Complexity Management in Video Coding Paper ID: 98, 6, 94, 18, 28 Chair: Dr. Angeliki Katsenou, Univ. of Bristol, UK |
12:00-13:30 | Lunch |
13:30-15:00 | (Special) Oral Session T-2 Semantic Visual Compression towards Machine and Human Vision Paper ID: 50, 86, 67, 15, 36 Chair: Prof. Ivan Bajic, Simon Fraser University, Canada |
15:00-15:30 | Coffee Break |
15:30-16:30 | Panel Learned Image and Video Coding: Hype or Hope? Moderator: Prof. Dr.-Ing. Joern Ostermann, Leibniz Universität Hannover, Germany |
16:30-18:00 | Oral Session T-3 Learned Video Coding Paper ID: 20, 29, 47, 106, 27 Chair: Prof. Wen-Hsiao Peng, National Yang Ming Chiao Tung University, Taiwan |
19:00-21:00 | Dinner at National Taichung Theater |
Day 3
Friday, June 14, 2024
09:00-10:00 | Keynote III Video Compression in the Wild: Learnings and Opportunities from a Video Streaming Company Speaker: Dr. Anne Aaron, Netflix, USA Moderator: Prof. Byeungwoo Jeon, Sungkyunkwan University, Korea |
10:00-10:30 | Coffee Break |
10:30-12:00 | (Special) Oral Session F-1 Realistic 3D Graphics Representations and Compression Paper ID: 22, 52, 75, 12, 54 Chair: Prof. Dr.-Ing. Joern Ostermann, Leibniz Universität Hannover, Germany |
12:00-13:30 | AOMedia Social Event – sponsored by Meta and Netflix |
13:30-15:00 | Oral Session F-2 Objective and Subjective Quality Assessment Paper ID: 31, 30, 93, 62, 80 Chair: Prof. Tsung-Jung Liu, National Chung Hsing University, Taiwan |
15:00-15:30 | Coffee Break |
15:30-16:30 | Online Session II Image Coding and Processing Paper ID: 48, 99, 111, 41, 112, 39 Chair: Prof. Heming Sun, Yokohama National University, Japan |
16:30-18:00 | Oral Session F-3 Visual Data Processing, Coding, and Applications Paper ID: 8, 60, 71, 33, 59 Chair: Christopher Rosewarne, Canon, Australia |
18:00-18:30 | Closing and Best Paper Award Ceremony |
Day 1
Wednesday, June 12, 2024
10:30 – 12:00 | |
(Special) Oral Session W-1 Advances in Next Generation Video Coding |
|
Chair: Dr. Wang-Q Lim, Fraunhofer HHI, Germany | |
* indicates the presenter |
P-ID 072 | Overview of Intra Template Matching Tools in ECM Po-Han Lin*, Jian-Liang Lin, Vadim Seregin, Marta Karczewicz |
P-ID 070 | Deep video compression with conditional feature coding Sophie Pientka*, Jonathan Pfaff, Heiko Schwarz, Detlev Marpe, Thomas Wiegand |
P-ID 069 | ELIM: Extremely Low-complexity Implicit Neural Model for Super Resolution-based Coding Wenyu Wang, Junjie Wang, Dandan Ding, Urvang Joshi*, Debargha Mukherjee |
P-ID 051 | Simplified CNN In-Loop Filter with fixed Classifications Wang-Q Lim*, Björn Stallenberger, Jonathan Pfaff, Heiko Schwarz, Detlev Marpe, Thomas Wiegand |
P-ID 017 | Encoder-Quantization-Motion-based Video Quality Metrics Yixu Chen*, Zaixi Shang, Hai Wei, Yongjun Wu, Sriram Sethuraman |
13:30 – 15:00 | |
Oral Session W-2 Emerging Image/Video Coding and Optimization |
|
Chair: Dr. Charles Bonnineau, InterDigital, Canada | |
* indicates the presenter |
P-ID 085 | Adaptive Online Learning of Separable Path Graph Transforms for Intra-prediction Wen-Yang Lu*, Eduardo Pavez, Antonio Ortega, Xin Zhao, Shan Liu |
P-ID 056 | Nonlinear Transform Coding for VVC Intra Coding Michael Schäfer*, Heiko Schwarz, Jonathan Pfaff, Detlev Marpe, Thomas Wiegand |
P-ID 024 | Low-Complexity Transform Design Using Hybrid Intra MTS Charles Bonnineau*, Saurabh Puri, Karam Naser, Tangi Poirier, Fabrice Le Leannec |
P-ID 053 | Bitrate Ladder Construction using Visual Information Fidelity Krishna Srikar Durbha*, Hassene Tmar, Cosmin Stejerean, Ioannis Katsavounidis, Alan Bovik |
P-ID 074 | Fast First Pass in Two-Pass Video Encoding Using Sub-Sampling Anastasia Henkel, Christian R. Helmrich, Tobias Hinz, Jens Brandenburg, Adam Wieckowski*, Benjamin Bross, Detlev Marpe, Thomas Wiegand |
15:30 – 16:30 | |
Online Session I Standards, Video Coding, and Quality Assessment |
|
Chair: PD Mathias Wien, RWTH Aachen Univ., Germany | |
* indicates the presenter |
P-ID 077 | Standardization Status of MPEG Geometry-based Point Cloud Compression (G-PCC) Edition 2 Wei Zhang, FuZheng Yang, Yingzhan Xu, Marius Preda (Presented by Zikun Yuan*) |
P-ID 058 | Spatial Neighbor Information Assisted Motion Compensated Temporal Filter for Video Coding Zikun Yuan*, Weijia Zhu, Yuwen He, Li Zhang, Xiaohu Tang |
P-ID 021 | Template Matching-Based Subblock Motion Refinement Towards Next Generation Video Coding Lei Zhao, Kai Zhang, Li Zhang (Presented by Zikun Yuan*) |
P-ID 101 | Temporal Enhanced Hybrid Neural Representation for Video Compression Jinxiang Wang*, Yangdong Liu, Shiping Zhu, Cheng Feng |
P-ID 097 | Low-Complexity 3D-Vision Conferencing System based on Accelerated RIFE Model Hongyue Huang*, Xilong Zhou, Hongbo Ning, Haopeng Lu, Qi Zhang, Yanpeng Liang, Wanjun Lyu, Chuanmin Jia, Xinfeng Zhang, Liuxin Zhang, Siwei Ma |
P-ID 045 | Multi-Agent Reinforcement Learning based Bit Allocation for Gaming Video Coding Guangjie Ren*, Zizheng Liu, Zhenzhong Chen, Shan Liu |
P-ID 105 | A Transformer-based Intra Luma Enhancement for H.266/VVC Wenrui Lv, Hui Yuan, Congrui Fu*, Shiqi Jiang, Junyan Huo |
16:30 – 18:00 | |
Oral Session W-3 Learned Image Coding |
|
Chair: Prof. Seishi Takamura, Hosei University, Japan | |
* indicates the presenter |
P-ID 083 | Fourier Basis Density Model Alfredo De la Fuente*, Saurabh Singh, Johannes Ballé |
P-ID 009 | CoCliCo: Extremely low bitrate image compression based on CLIP semantic and tiny color map Tom Bachard*, Tom Bordin, Thomas Maugey |
P-ID 005 | Transformer-based Learned Image Compression for Joint Decoding and Denoising Yi-Hsin Chen*, Kuan-Wei Ho, Shiau-Rung Tsai, Guan-Hsun Lin, Alessandro Gnutti, Wen-Hsiao Peng, Riccardo Leonardi |
P-ID 014 | Bit Rate Matching Algorithm Optimization in JPEG-AI Verification Model Panqi Jia, Ahmet Burakhan Koyuncu, Jue Mao, Ze Cui, Yi Ma, Timofey Solovyev, Alexander Karabutov, Yin Zhao, Jing Wang, Elena Alshina*, Andre Kaup, Ti Guoansheng |
P-ID 108 | Practical Learned Image Compression with Online Encoder Optimization Haotian Zhang*, Feihong Mei, Junqi Liao, Li Li, Houqiang Li, Dong Liu |
Day 2
Thursday, June 13, 2024
10:30 – 12:00 | |
Oral Session T-1 Energy and Complexity Management in Video Coding |
|
Chair: Dr. Angeliki Katsenou, Univ. of Bristol, UK | |
* indicates the presenter |
P-ID 098 | A Comprehensive Review of Software and Hardware Energy Efficiency of Video Decoders Matthias Kränzler*, Christian Herglotz, Andre Kaup |
P-ID 006 | Complexity Metrics for VVC Decoder Power Reduction in Green Metadata Christian Herglotz, Matthias Kränzler*, Rui Dai, Andre Kaup |
P-ID 094 | Comparative Study of Hardware and Software Power Measurements in Video Compression Angeliki Katsenou*, Xinyi Wang, Daniel Schien, David Bull |
P-ID 018 | Video Super-Resolution for Optimized Bitrate and Green Online Streaming Vignesh V Menon*, Prajit T Rajendran, Amritha Premkumar, Benjamin Bross, Detlev Marpe |
P-ID 028 | Balancing Complexity of Template Matching-based Reference Picture Padding for Video Coding Nicolas Horst*, Priyanka Das, Tim Classen, Mathias Wien |
13:30 – 15:00 | |
(Special) Oral Session T-2 Semantic Visual Compression towards Machine and Human Vision |
|
Chair: Prof. Ivan Bajic, Simon Fraser University, Canada | |
* indicates the presenter |
P-ID 050 | An Effective Entropy Model for Semantic Feature Compression Tianma Shen, Ying Liu (Presented by Ivan Bajic*) |
P-ID 086 | Scalable Human-Machine Point Cloud Compression Mateen Ulhaq, Ivan Bajic* |
P-ID 067 | Improvements of the BD-rate Metrics using Monotonic Curve-fitting Methods Haiqiang Wang, Xin Zhao, Ding Ding, Xiang Pan, Zizheng Liu, Xiaozhong Xu, Shan Liu (Presented by Fan Zhang*) |
P-ID 015 | Probing Image Compression For Class-Incremental Learning Justin Yang*, Zhihao Duan, Andrew Peng, Yuning Huang, Jiangpeng He, Fengqing Maggie Zhu |
P-ID 036 | DMOFC: Discrimination Metric-Optimized Feature Compression Changsheng Gao*, Yiheng Jiang, Li Li, Dong Liu, Feng Wu |
16:30 – 18:00 | |
Oral Session T-3 Learned Video Coding |
|
Chair: Prof. Wen-Hsiao Peng, National Yang Ming Chiao Tung University, Taiwan | |
* indicates the presenter |
P-ID 020 | Adaptive Variance-Threshold-Based Skip Modes for Learned Video Compression Using a Motion Complexity Criterion Fabian Brand*, Jürgen Seiler, Johannes Sauer, Elena Alshina, Andre Kaup |
P-ID 029 | Analysis of Neural Video Compression Networks for 360-Degree Video Coding Andy Regensky*, Fabian Brand, Andre Kaup |
P-ID 047 | Accelerating Learnt Video Codecs with Gradient Decay and Layer-wise Distillation Tianhao Peng*, Ge Gao, Heming Sun, Fan Zhang, David Bull |
P-ID 106 | GOP-based Deep Preprocessing for Video Coding Daichi Arai*, Shunsuke Iwamura, Kazuhisa Iguchi, Atsuro Ichigaya |
P-ID 027 | BVI-Artefact: An Artefact Detection Benchmark Dataset for Streamed Videos Chen Feng*, Duolikun Danier, Fan Zhang, Alex Mackin, Andrew Collins, David Bull |
Day 3
Friday, June 14, 2024
10:30 – 12:00 | |
(Special) Oral Session F-1 Realistic 3D Graphics Representations and Compression |
|
Chair: Prof. Dr.-Ing. Joern Ostermann, Leibniz Universität Hannover, Germany | |
* indicates the presenter |
P-ID 022 | Lossy video coding of V-DMC displacements Aleksei Martemianov*, Patrice Rondao Alface |
P-ID 052 | Dynamic Mesh Coding using Orthogonal Atlas Projection Danillo Graziosi, Kao Hayashi* |
P-ID 075 | BMT-PCGC: Point Cloud Geometry Compression with Bidirectional Mask Transformer Entropy Model Monyneath Yim*, Bing-Han Wu, Jui-Chiu Chiang |
P-ID 012 | Tool Space Exploration of the V-PCC Patch Generation for Practical Point Cloud Encoding Louis Fréneau*, Alexandre Mercat, Guillaume Gautier, Joose Sainio, Jarno Vanne |
P-ID 054 | Immersive Video Compression using Implicit Neural Representations Ho Man Kwan*, Fan Zhang, Andrew Gower, David Bull |
13:30 – 15:00 | |
Oral Session F-2 Objective and Subjective Quality Assessment |
|
Chair: Prof. Tsung-Jung Liu, National Chung Hsing University, Taiwan | |
* indicates the presenter |
P-ID 031 | Full-reference Video Quality Assessment for User Generated Content Transcoding Zihao Qi*, Chen Feng, Duolikun Danier, Fan Zhang, Xiaozhong Xu, Shan Liu, David Bull |
P-ID 030 | RankDVQA-mini: Knowledge Distillation-Driven Deep Video Quality Assessment Chen Feng*, Duolikun Danier, Haoran Wang, Fan Zhang, Benoit Quentin Arthur Vallade, Alex Mackin, David Bull |
P-ID 093 | Beyond Curves and Thresholds – Introducing Uncertainty Estimation to Satisfied User Ratios for Compressed Video Jingwen Zhu, Hadi Amirpour, Raimund Schatz, Patrick Le Callet*, Christian Timmerer |
P-ID 062 | “Discriminability–Experimental Cost” tradeoff in subjective video quality assessment of codec: DCR with EVP rating scale versus ACR–HR Andreas Pastor, Pierre David, Ioannis Katsavounidis, Lukas Krasula, Andrey Norkin, Hassene Tmar, Patrick Le Callet* |
P-ID 080 | A FUNQUE Approach to the Quality Assessment of Compressed HDR Videos Abhinau K Venkataramanan, Cosmin Stejerean*, Ioannis Katsavounidis, Alan Bovik |
15:30 – 16:30 | |
Online Session II Image Coding and Processing |
|
Chair: Prof. Heming Sun, Yokohama National University, Japan | |
* indicates the presenter |
P-ID 048 | Wavelet-like Transform with Subbands Fusion in Decoupled structure for Deep Image Compression Ke Ma*, Yaojun Wu, Zhaobin Zhang, Semih Esenlik, Xiaoyan Sun, Kai Zhang, Li Zhang |
P-ID 099 | Image Encryption and Compression Based on Reversed Diffusion Model Yilin Guo*, Jianhui Chang |
P-ID 111 | A Quantization Loss Compensation Network for Remote Sensing Image Compression Shao Xiang*, Jing Xiao, Mi Wang |
P-ID 041 | Mutual Guidance Distillation for Joint Demosaicking and Denoising of Raw Images Jingyun Liu, Han Zhu*, Zhenzhong Chen, Shan Liu |
P-ID 112 | Lossless JPEG Recompression for Similar Images via Frequency Domain Block Matching Hongwei Sha*, Ming Lu, Zhan Ma |
P-ID 039 | Swin Transformer-based In-Loop Filter for VVC Intra Coding Tong Ouyang, Xin Chen, Huairui Wang, Han Zhu*, Zhenzhong Chen |
16:30 – 18:00 | |
Oral Session F-3 Visual Data Processing, Coding, and Applications |
|
Chair: Christopher Rosewarne, Canon, Australia | |
* indicates the presenter |
P-ID 008 | Talking Head Generation Based on 3D Morphable Facial Model Wen-Jiin Tsai* |
P-ID 060 | A Novel Region-Dependent Packing Method for Stereoscopic 360° Videos Using Horizontal Downsampling of Equirectangular Projection Hossein Pejman*, Stephane Coulombe, Carlos Vazquez, Mohammadreza Jamali, Ahmad Vakili |
P-ID 071 | Light Field View Synthesis using Deformable Convolutional Neural Networks Muhammad Zubair*, Paulo Nunes, Caroline Conti, Luis Ducla Soares |
P-ID 033 | Compressing Deep Image Super-resolution Models Yuxuan Jiang*, Jakub Nawała, Fan Zhang, David Bull |
P-ID 059 | Evaluation of Low Complexity Enhancement Video Codec (LCEVC) with HEVC and VVC on 4K Content Olena Chubach*, Ching-Yeh Chen |