2020 IEEE International Conference on Visual Communications and Image Processing (VCIP)

December 1 – 4, 2020, Virtual Conference

VCIP2020: Technical Program


Standard Hong Kong Time
(GMT + 8)

December 1, 2020 (Tuesday)

09:00-10:15

Tutorial 1

Tutorial 2

Tutorial 3

10:15-10:45

Break

10:45-12:00

Tutorial 1 (cont)

Tutorial 2 (cont)

Tutorial 3 (cont)

12:00-14:00

Lunch

14:00-15:15

Tutorial 4

Tutorial 5

15:15-15:45

Break

15:45-17:00

Tutorial 4 (cont)

Tutorial 5 (cont)

 

Tutorial

Tutorial 1 From Low to Super Resolution and Beyond

Chi-Wah Kok (CVX Semiconductor Pty Ltd.); Wing-Shan Tam (CVX Semiconductor Pty Ltd.)

 

Tutorial 2 Screen Content Coding in Recently Developed Video Coding Standards

Xiaozhong Xu (Tencent Media Lab.); Shan Liu (Tencent Media Lab.)

 

Tutorial 3 Recent Advances in End-to-End Learned Image and Video Compression

Wen-Hsiao Peng (National Chiao Tung University); Hsueh-Ming Hang (National Chiao Tung University)

 

Tutorial 4 Versatile Video Coding - Algorithms and Specification

Mathias Wien (RWTH Aachen University); Benjamin Bross (Fraunhofer Heinrich Hertz Institute)

 

Tutorial 5 Learned Image and Video Compression with Deep Neural Networks

Dong Xu (The University of Sydney); Guo Lu (Beijing Institute of Technology); Ren Yang (ETH Zurich); Radu Timofte (ETH Zurich)

 

 

Standard Hong Kong Time
(GMT + 8)

December 2, 2020 (Wednesday)

08:45-09:00

Opening Ceremony

09:00-10:00

Keynote I

10:00-10:30

Break

10:30-12:30

Oral 1.1: Best Paper Competition

Oral 1.2: Deep Network and Learning I

Oral 1.3: Point Cloud Analysis and Compression

12:30-14:00

Lunch

14:00-16:00

Oral 1.4: Multimedia Analysis, Representation and Understanding I

Poster 1.1: Segmentation / Light Field Analysis and Coding

16:00-16:30

Break

16:30-18:30

SS1: Image Feature Extraction Using Advanced Transform and Learning-based Approaches

Oral 1.5: Video Analysis and Coding I

 

 

Oral 1.1: Best Paper Competition

O1.1.1 Learning Graph Topology Representation with Attention Networks

Qi Yuanyuan (Beijing University of Posts and Telecommunications); Jiayue Zhangjia (Beijing University of Technology); Weiran Xu (Beijing University of Posts and Telecommunications); Jun Guo (Beijing University of Posts and Telecommunications)

 

O1.1.2 Unsupervised Point Cloud Registration via Salient Points Analysis (SPA)

Pranav A Kadam (University of Southern California); Min Zhang (University of Southern California); Shan Liu (Tencent America); C.-C. Jay Kuo (University of Southern California)

 

O1.1.3 Multi-Scale Video Inverse Tone Mapping with Deformable Alignment

Jiaqi Zou (Beijing University of Posts and Telecommunications); Ke Mei (Beijing University of Posts and Telecommunications); Songlin Sun (Beijing University of Posts and Telecommunications)

 

O1.1.4 APL: Adaptive Preloading of Short Video with Lyapunov Optimization

Haodan Zhang (Peking University); Yixuan Ban (Peking University); Zhimin Xu (Beijing Bytedance Technology Co., Ltd.); Xinggong Zhang (PKU); Zongming Guo (Peking University); Shengbin Meng (Beijing Bytedance Network Technology Co., Ltd); Junlin Li (ByteDance Inc.); Yue Wang (Beijing ByteDance Technology Co., Ltd.)

 

O1.1.5 Improving Robustness of DNNs against Common Corruptions via Gaussian Adversarial Training

Chenyu Yi (NTU); Haoliang Li (NTU, Singapore); Renjie Wan (Nanyang Technological University); Alex Kot (Nanyang Technological University)

 

O1.1.6 Fully Neural Network Mode Based Intra Prediction of Variable Block Size

Heming Sun (Waseda University, Japan); Lu Yu (Zhejiang University); Jiro Katto (Waseda University)

 

 

Oral 1.2: Deep Network and Learning I

O1.2.1 HDR Image Compression with Convolutional Autoencoder

Fei Han (Beijing University of Technology); Jin Wang (Beijing University of Technology); Ruiqin Xiong (Peking University); Qing Zhu (Beijing University of Technology); Baocai Yin (Beijing University of Technology)

 

O1.2.2 Mining Larger Class Activation Map with Common Attribute Labels

Runtong ZHANG (University of Electronic Science and Technology of China); Fanman Meng (University of Electronic Science and Technology of China); Hongliang Li (University of Electronic Science and Technology of China); Qingbo Wu (University of Electronic Science and Technology of China); King Ngi Ngan (University of Electronic Science and Technology of China)

 

O1.2.3 Orthogonal Features Fusion Network for Anomaly Detection

Teli Ma (Beihang University); Yizhi Wang (Beijing University of Posts and Telecommunications); Jinxin Shao (Beihang University); Baochang Zhang (Beihang University); David Doermann (University at Buffalo)

 

O1.2.4 Network Update Compression for Federated Learning

Birendra Kathariya (University of Missouri-Kansas City); Li Li (University of Science and Technology of China); Zhu Li (University of Missouri-Kansas City); Lingyu Duan (Peking University); Shan Liu (Tencent America)

 

O1.2.5 Efficient Light Deep Network for Street Scene Parsing

ZheHui Wang (Beijing Institute of Technology); Sanyuan Zhao (Beijing Institute of Technology); Jianbing Shen (University of California, Los Angeles); Zhengchao Lei (National Computer Network Emergency Response Technical Team/Coordination Center of China)

 

O1.2.6 Compressing Facial Makeup Transfer Networks by Collaborative Distillation and Kernel Decomposition

Bianjiang Yang (Zhejiang University); Zi Hui (Zhejiang University); Haoji Hu (Zhejiang University, China); Xinyi Hu (Zhejiang University); Lu Yu (Zhejiang University)

 

 

Oral 1.3: Point Cloud Analysis and Compression

O1.3.1 Fast Recolor Prediction Scheme in Point Cloud Attribute Compression

Chuang Ma (Peking University Shenzhen Graduate School); Ge Li (SECE, Shenzhen Graduate School, Peking University); Qi Zhang (Peking University Shenzhen Graduate School); Yiting Shao (Peking University Shenzhen Graduate School); Jing Wang (Artificial Intelligence Research Center Peng Cheng Laboratory); Shan Liu (Tencent America)

 

O1.3.2 Spatiotemporal Guided Self-Supervised Depth Completion from LiDAR and Monocular Camera

Zhifeng Chen (Fuzhou University); Hantao Wang (Fuzhou University); Lijun Wu (Fuzhou University); Yanlin Zhou (University of Florida); Dapeng Wu (University of Florida)

 

O1.3.3 A Semantic Labeling Framework for ALS Point Clouds Based on Discretization and CNN

Xingtao Wang (Harbin Institute of Technology); Xiaopeng Fan (Harbin Institute of Technology); Debin Zhao (Harbin Institute of Technology)

 

O1.3.4 A Point Cloud Compression Framework via Spherical Projection

Yingshen He (Peking University Shenzhen Graduate School); Ge Li (SECE, Shenzhen Graduate School, Peking University); Yiting Shao (Peking University Shenzhen Graduate School); Jing Wang (Artificial Intelligence Research Center Peng Cheng Laboratory); Yueru Chen (Artificial Intelligence Research Center Peng Cheng Laboratory); Shan Liu (Tencent America)

 

O1.3.5 Point Cloud Attribute Compression via Successive Subspace Graph Transform

Yueru Chen (Artificial Intelligence Research Center Peng Cheng Laboratory); Yiting Shao (Peking University Shenzhen Graduate School); Jing Wang (Artificial Intelligence Research Center Peng Cheng Laboratory); Ge Li (SECE, Shenzhen Graduate School, Peking University); C.-C. Jay Kuo (USC)

 

O1.3.6 Point Cloud Geometry Prediction Across Spatial Scale using Deep Learning

Anique Akhtar (University of Missouri-Kansas City); Wen Gao (Tencent America); Xiang Zhang (Tencent America); Li Li (University of Science and Technology of China); Zhu Li (University of Missouri-Kansas City); Shan Liu (Tencent America)

 

 

Oral 1.4: Multimedia Analysis, Representation and Understanding I

O1.4.1 Fast Video Saliency Detection based on Feature Competition

Hang Yan (Shanghai Jiao Tong University); Yiling Xu (Shanghai Jiao Tong University); Jun Sun (SJTU); Le Yang (University of Canterbury); Yunfei Zhang (Tencent); Wei Huang (Tencent)

 

O1.4.2 Power/QoS-Adaptive HEVC FME Hardware using Machine Learning-Based Approximation Control

Wagner Penny (Federal University of Pelotas); Daniel Palomino (Federal University of Pelotas); Marcelo S Porto (Universidade Federal de Pelotas); Bruno Zatt (Federal University of Pelotas)

 

O1.4.3 A Discrete Cosine Model of Light Field Sampling for Improving Rendering Quality of Views

Ying Wei (Guangxi Normal University); Changjian Zhu (Guangxi Normal University); Yan Liu (Guangxi Normal University)

 

O1.4.4 FaME-ML: Fast Multirate Encoding for HTTP Adaptive Streaming Using Machine Learning

Ekrem Çetinkaya (Alpen-Adria-Universität Klagenfurt); Hadi Amirpourazarian (Alpen-Adria-Universität Klagenfurt); Christian Timmerer (Alpen-Adria-Universität Klagenfurt); Mohammad Ghanbari (University of Essex)

 

O1.4.5 Enhanced Saliency Prediction via Orientation Selectivity

Peng Ye (Shanghai University); Yongfang Wang (Shanghai University); Yumeng Xia (Shanghai University)

 

O1.4.6 Subjective and Objective Quality Assessment of the SoftCast Video Transmission Scheme

Anthony TRIOUX (Univ. Polytechnique Hauts-de-France); Giuseppe Valenzise (CNRS); Marco Cagnazzo (LTCI, Télécom ParisTech, Institut Polytechnique de Paris); Michel KIEFFER (CentraleSupélec); François-Xavier COUDOUX (Univ. Polytechnique Hauts-de-France); Patrick CORLAY (Univ. Polytechnique Hauts-de-France); Mohamed GHARBI (Univ. Polytechnique Hauts-de-France)

 

 

Poster 1.1: Segmentation / Light Field Analysis and Coding

P1.1.1 A New Bounding Box based Pseudo Annotation Generation Method for Semantic Segmentation

Xiaolong Xu (University of Electronic Science and Technology of China); Fanman Meng (University of Electronic Science and Technology of China); Hongliang Li (University of Electronic Science and Technology of China); Qingbo Wu (University of Electronic Science and Technology of China); King Ngi Ngan (University of Electronic Science and Technology of China); Shuai Chen (University of Electronic Science and Technology of China)

 

P1.1.2 A Dense-Gated U-Net for Brain Lesion Segmentation

Zhongyi Ji (Peking University); Xiao Han (Peking University); Tong Lin (Peking University); Wenmin Wang (Macau University of Science and Technology)

 

P1.1.3 On Segmentation of Maxillary Sinus Membrane using Automatic Vertex Screening

Kang-Rong Li (Guangdong University of Technology); Tai-Chiu Hsung (Guangdong University of Technology); Andy W.K. Yeung (University of Hong Kong); Michael M. Bornstein (University of Hong Kong)

 

P1.1.4 Disparity compensation of light fields for improved efficiency in 4D transform-based encoders

João M. Santos (Instituto de Telecomunicações); Lucas Thomaz (Instituto de Telecomunicações); Pedro Amado Assuncao (IT, Instituto Politécnico de Leiria); Luís Cruz (IT, Universidade de Coimbra); Luís Távora (Instituto Politécnico de Leiria); Sergio Faria (Instituto de Telecomunicações)

 

P1.1.5 GRNet: Deep Convolutional Neural Networks based on Graph Reasoning for Semantic Segmentation

Yang Wu (Hohai University); Aimin Jiang (Hohai University); Yibin Tang (Hohai University); H. K. Kwan (University of Windsor)

 

P1.1.6 Mono is Enough: Instance Segmentation from Single Annotated Sample

Yang Longrong (University of Electronic Science and Technology of China); Hongliang Li (University of Electronic Science and Technology of China); Qingbo Wu (University of Electronic Science and Technology of China); Fanman Meng (University of Electronic Science and Technology of China); King Ngi Ngan (University of Electronic Science and Technology of China)

 

P1.1.7 Fast Geometry Estimation for Phase-coding Structured Light Field

Sen Xiang (Wuhan University of Science and Technology)

 

P1.1.8 Random-access-aware Light Field Video Coding using Tree Pruning Method

Thuc Huu Nguyen (SKKU); Vinh Van Duong (Sungkyunkwan University); Byeungwoo Jeon (SKKU)

 

P1.1.9 4D-DCT Hardware Architecture for JPEG Pleno Light Field Coding

Matheus Jahnke (UFPEL); Jones W. Goebel (Federal University of Pelotas); Daniel Palomino (Federal University of Pelotas); Guilherme Correa (Federal University of Pelotas); Luciano Volcan Agostini (UFPel - Federal University of Pelotas); Marcelo S Porto (Universidade Federal de Pelotas); Bruno Zatt (Federal University of Pelotas)

 

 

Special Session 1: Image Feature Extraction Using Advanced Transform and Learning-based Approaches

SS1.1 Lightweight Color Image Demosaicking with Multi-Core Feature Extraction

Yufei Tan (Guangxi University); Kan Chang (Guangxi University); Hengxin Li (Guangxi University); Zhenhua Tang (Guangxi University); Tuanfa Qin (Guangxi University)

 

SS1.2 On 2D-3D Image Feature Detections for Image-To-Geometry Registration in Virtual Dental Model

Hui Tang (Guangdong University of Technology); Tai-Chiu Hsung (Guangdong University of Technology); Walter Y.H. Lam (University of Hong Kong); Leo Y.Y. Cheng (University of Hong Kong); Edmond H.N. Pow (University of Hong Kong)

 

SS1.3 Unsupervised Feedforward Feature (UFF) Learning for Point Cloud Classification and Segmentation

Min Zhang (University of Southern California); Pranav A Kadam (University of Southern California); C.-C. Jay Kuo (USC); Shan Liu (Tencent America)

 

SS1.4 Stereoscopic image reflection removal based on Wasserstein Generative Adversarial Network

Daniel P.K. Lun (The Hong Kong Polytechnic University); Xiuyuan Wang (The Hong Kong Polytechnic University); Yikun Pan (The Hong Kong Polytechnic University)

 

SS1.5 Graph Grouping Loss for Metric Learning of Face Image Representations

Nakamasa Inoue (Tokyo Institute of Technology)

 

SS1.6 Deep Blind Video Quality Assessment for User Generated Videos

Jiapeng Tang (Institute of Image Communication and Network Engineering, Shanghai Jiao Tong University); Yu Dong (Institute of Image Communication and Network Engineering, Shanghai Jiao Tong University); Rong Xie (Shanghai Jiao Tong University); Xiao Gu (Institute of Image Communication and Network Engineering, Shanghai Jiao Tong University); Li Song (Shanghai Jiao Tong University); Lin Li (MIGU Co., Ltd); Bing Zhou (MIGU Co., Ltd)

 

 

Oral 1.5: Video Analysis and Coding I

O1.5.1 Introducing Latent Space Correlation to Conditional Autoencoders for Intra Prediction

Fabian Brand (Friedrich-Alexander University Erlangen-Nürnberg (FAU)); Jürgen Seiler (Friedrich-Alexander University Erlangen-Nürnberg); Andre Kaup (Friedrich-Alexander University Erlangen-Nürnberg)

 

O1.5.2 Sparse Representation-Based Intra Prediction for Lossless/Near Lossless Video Coding

Linwei Zhu (Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences); Yun Zhang (Shenzhen Institutes of Advanced Technology); Na Li (Shenzhen Institutes of Advanced Technology); Jinyong Pi (Shenzhen Institutes of Advanced Technology); Xinju Wu (Shenzhen Institutes of Advanced Technology)

 

O1.5.3 Content-aware Hybrid Equi-angular Cubemap Projection for Omnidirectional Video Coding

Jinyong Pi (Shenzhen Institutes of Advanced Technology); Yun Zhang (Shenzhen Institutes of Advanced Technology); Linwei Zhu (Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences); Xinju Wu (Shenzhen Institutes of Advanced Technology); Xuemei Zhou (Shenzhen Institutes of Advanced Technology)

 

O1.5.4 A Progressive Fast CU Split Decision Scheme for AVS3

Yuyuan Chen (Beijing University of Posts and Telecommunications); Songlin Sun (Beijing University of Posts and Telecommunications); Jiaqi Zhang (Institute of Computing Technology Chinese Academy of Sciences); Shanshe Wang (Beijing Boya RealScene Technologies Co., Ltd)

 

O1.5.5 FastSCCNet: Fast Mode Decision in VVC Screen Content Coding via Fully Convolutional Network

Sik-Ho Tsang (The Hong Kong Polytechnic University); Ngai-Wing Kwong (The Hong Kong Polytechnic University); Yui-Lam Chan (The Hong Kong Polytechnic University)

 

 

Standard Hong Kong Time
(GMT + 8)

December 3, 2020 (Thursday)

09:00-10:00

Keynote II

10:00-10:30

Break

10:30-12:30

SS2: Hyperspectral and Multichannel Image Processing

Oral 2.1: Image Analysis and Compression I

Oral 2.2: Recognition and Classification

12:30-14:00

Lunch

14:00-16:00

Demonstration

Poster 2.1: Multimedia Analysis, Representation and Understanding II

16:00-16:30

Break

16:30-18:30

Tencent Industrial Session (TC): Video Analysis and Coding II

Poster 2.2: Image/Video Quality Assessment

 

 

Special Session 2: Hyperspectral and Multichannel Image Processing

SS2.1 Orthogonal Coded Multi-view Structured Light for Inter-view Interference Elimination

Bing Yan (Yunnan Electricity Research Institute); Zaichao Sun (Yunnan Electricity Research Institute); Zhaoyu Peng (Yunnan Electricity Research Institute); Weiju Dai (Yunnan Electricity Research Institute); Songjun Sun (Yunnan Electricity Research Institute); Gongyuan Zhang (Yunnan Electricity Research Institute); Nongtao Zhang (Yunnan Electricity Research Institute); Jun Xu (Yunnan Electricity Research Institute); Ren Wang (Yunnan Electricity Research Institute); Chunlin Li (Yunnan Electricity Research Institute)

 

SS2.2 Attention-Guided Fusion Network of Point Cloud and Multiple Views for 3D Shape Recognition

Bo Peng (Tianjin University); Zengrui Yu (Tianjin University); Jianjun Lei (Tianjin University); Jiahui Song (Tianjin University)

 

SS2.3 CNN-Based Anomaly Detection For Face Presentation Attack Detection With Multi-Channel Images

Yuge Zhang (Northwestern Polytechnical University); Min Zhao (Northwestern Polytechnical University); Longbin Yan (Northwestern Polytechnical University); Tiande Gao (Northwestern Polytechnical University); Jie Chen (Northwestern Polytechnical University)

 

SS2.4 A Robust Multilinear Mixing Model with l2,1 norm for Unmixing Hyperspectral Images

Minglei Li (Tianjin University); Fei Zhu (Tianjin University); Alan J.X. Guo (Tianjin University)

 

SS2.5 Sparse Spectral Unmixing of Hyperspectral Images using Expectation-Propagation

Zeng Li (Heriot-Watt University); Yoann Altmann (School of Engineering, Heriot-Watt University); Jie Chen (Northwestern Polytechnical University); Stephen McLaughlin (School of Engineering, Heriot-Watt University); Susanto Rahardja (Northwestern Polytechnical University)

 

SS2.6 The Hough-Based Multibeamlet Transform

Agnieszka Lisowska (University of Silesia)

 

 

Oral 2.1: Image Analysis and Compression I

O2.1.1 A Unified Single Image De-raining Model via Region Adaptive Coupled Network

Qingbo Wu (University of Electronic Science and Technology of China); Li Chen (University of Electronic Science and Technology of China); King Ngi Ngan (University of Electronic Science and Technology of China); Hongliang Li (University of Electronic Science and Technology of China); Fanman Meng (University of Electronic Science and Technology of China); LinFeng Xu (University of Electronic Science and Technology of China)

 

O2.1.2 3D-CNN Autoencoder for Plenoptic Image Compression

Tingting Zhong (Tsinghua University); Xin Jin (Tsinghua University); Kedeng Tong (Tsinghua University)

 

O2.1.3 Deep Convolutional Neural Network Based on Multi-Scale Feature Extraction for Image Denoising

Jing Zhang (Xidian University); Liu Sang (Xidian University); Zekang Wan (Xidian University); Yuchen Wang (Xidian University); Yunsong Li (Xidian University)

 

O2.1.4 Deep Temporal Color Constancy for AC Light Sources

Jun-Sang Yoo (Korea University); Chan-Ho Lee (Korea University); Jong-Ok Kim (Korea University)

 

O2.1.5 Spatial-Channel Context-based Entropy Modeling for End-to-end Optimized Image Compression

Chongxin Li (Shanghai Jiao Tong University); Jixiang Luo (Shanghai Jiao Tong University); Wenrui Dai (Shanghai Jiao Tong University); Chenglin Li (Shanghai Jiao Tong University); Junni Zou (Shanghai Jiao Tong University); Hongkai Xiong (Shanghai Jiao Tong University)

 

O2.1.6 Deep Learning Based EBCOT Source Symbol Prediction Technique for JPEG2000 Image Compression Architecture

I-Hsiang Wang (National Taiwan University); Jian-Jiun Ding (National Taiwan University)

 

 

Oral 2.2: Recognition and Classification

O2.2.1 Learning the Connectivity: Situational Graph Convolution Network for Facial Expression Recognition

Jinzhao Zhou (South China University of Technology); Xingming Zhang (South China University of Technology); Yang Liu (South China University of Technology)

 

O2.2.2 News Image Steganography: A Novel Architecture Facilitates the Fake News Identification

Jizhe Zhou (University of Macau); Chi-Man Pun (University of Macau); Yu Tong (University of Macau)

 

O2.2.3 Drone-Based Car Counting via Density Map Learning

Jingxian Huang (Wuhan University); Guanchen Ding (Wuhan University); Yujia Guo (Wuhan University); Daiqin Yang (Wuhan University); Sihan Wang (Tencent); Tao Wang (Tencent); Yunfei Zhang (Tencent)

 

O2.2.4 Attribute Mix: Semantic Data Augmentation for Fine Grained Recognition

Hao Li (Shanghai Jiao Tong University); Xiaopeng Zhang (Huawei Noah's Ark Lab); Qi Tian (Huawei Cloud & AI); Hongkai Xiong (Shanghai Jiao Tong University)

 

O2.2.5 Unsupervised Embedding Learning by Noisy Similarity Label Optimization

Koki Tsubota (The University of Tokyo); Kiyoharu Aizawa (The University of Tokyo)

 

O2.2.6 Enriching Optical Flow with Appearance Information for Action Recognition

Yijun Pan (University of Science and Technology of China); Xiaoyan Sun (University of Science and Technology of China); Feng Wu (University of Science and Technology of China)

 

 

Demonstration

D.1 Special Cane with Visual Odometry for Real-time Indoor Navigation of Blind People

Jingyu Tang (East China Normal University); Menghan Hu (East China Normal University); Guodong Li (East China Normal University); Qingli Li (East China Normal University); Jian Zhang (East China Normal University); Xiaofeng Zhou (East China Normal University); Guangtao Zhai (Shanghai Jiao Tong University)

 

D.2 FishUI: Interactive Fisheye Distortion Visualization

Andy Regensky (Friedrich-Alexander-University Erlangen-Nürnberg (FAU)); Christian Herglotz (Friedrich-Alexander-University Erlangen-Nürnberg); Andre Kaup (Friedrich-Alexander University Erlangen-Nürnberg)

 

D.3 Automatic Sheep Counting by Multi-object Tracking

JingSong Xu (University of Technology Sydney); Litao Yu (University of Technology Sydney); Jian Zhang (University of Technology Sydney); Qiang Wu (University of Technology Sydney)

 

D.4 Wearable Visually Assistive Device for Blind People to Appreciate Real-world Scene and Screen Image

Jin Ai (University of Shanghai for Science and Technology); Menghan Hu (East China Normal University); Guodong Li (East China Normal University); Guangtao Zhai (Shanghai Jiao Tong University); Jian Zhang (East China Normal University); Qingli Li (East China Normal University); Wenquan Sun (University of Shanghai for Science and Technology)

 

D.5 DENESTO: A Tool for Video Decoding Energy Estimation and Visualization

Matthias Kränzler (Friedrich-Alexander-University Erlangen-Nürnberg); Christian Herglotz (Friedrich-Alexander-University Erlangen-Nürnberg); Andre Kaup (Friedrich-Alexander University Erlangen-Nürnberg)

 

D.6 A Vision Based Fish Processing System

Zongjian Zhang (University of Technology Sydney); Litao Yu (University of Technology Sydney); Jian Zhang (University of Technology Sydney); Qiang Wu (University of Technology Sydney)

 

 

Poster 2.1: Multimedia Analysis, Representation and Understanding II

P2.1.1 Adaptive Resolution Change for Versatile Video Coding

Tsui-Shan Chang (Alibaba Group); Yu-Chen Sun (Alibaba Group); Ling Zhu (Alibaba Group); Jian Lou (Alibaba Group)

 

P2.1.2 Text-to-Image Generation via Semi-Supervised Training

Zhongyi Ji (Peking University); Wenmin Wang (Macau University of Science and Technology); Baoyang Chen (Peking University); Xiao Han (Peking University)

 

P2.1.3 A Hybrid Model for Natural Face De-Identification with Adjustable Privacy

Yunqian Wen (Shanghai Jiao Tong University); Bo Liu (University of Technology Sydney (UTS)); Rong Xie (Shanghai Jiao Tong University); Yunhui Zhu (Shanghai Jiao Tong University); Jingyi Cao (Shanghai Jiao Tong University); Li Song (Shanghai Jiao Tong University)

 

P2.1.4 Absolute 3D Human Pose Estimation via Weakly-supervised Learning

Yiru Guo (Beihang University); Zerui Chen (Chinese Academy of Sciences)

 

P2.1.5 A Novel Quality Enhanced Low Complexity Rate Control Algorithm for HEVC

Fei Zhao (Peking University); ChungWen Ku (Peking University); Guoqing Xiang (Peking University); Yuan Li (Peking University); Huizhu Jia (Peking University); Yuehua Cui (Hulun Buir College); Xiaodong Xie (Peking University)

 

P2.1.6 A Review of Data Preprocessing Modules in Digital Image Forensics Methods Using Deep Learning

Alexandre Berthet (EURECOM); Jean-Luc Dugelay (EURECOM, Campus SophiaTech)

 

P2.1.7 Quality of Experience Evaluation for Streaming Video Using CGNN

Zhiming Zhou (Institute of Image Communication and Network Engineering, Shanghai Jiao Tong University); Yu Dong (Institute of Image Communication and Network Engineering, Shanghai Jiao Tong University); Li Song (Shanghai Jiao Tong University); Rong Xie (Shanghai Jiao Tong University); Lin Li (MIGU Co., Ltd); Bing Zhou (MIGU Co., Ltd)

 

P2.1.8 Geometric-visual descriptor for improved image based localization

Achref OUNI (Institut Pascal); Eric Royer (Institut Pascal); Marc CHEVALDONNE (Institut Pascal); Michel Dhome (Institut Pascal)

 

P2.1.9 Machine Learning for Photometric Redshift Estimation of Quasars with Different Samples

Yanxia Zhang (National Astronomical Observatories, Chinese Academy of Sciences)

 

 

TC: Tencent Industrial Session: Video Analysis and Coding II

TC.1 Video Super Resolution Using Temporal Encoding ConvLSTM and Multi-Stage Fusion

Yuhang Zhang (Wuhan University); Zhenzhong Chen (Wuhan University); Shan Liu (Tencent America)

 

TC.2 Deep Inter Coding with Interpolated Reference Frame for Hierarchical Coding Structure

Yu Guo (Wuhan University); Zizheng Liu (Wuhan University); Zhenzhong Chen (Wuhan University); Shan Liu (Tencent America)

 

TC.3 An Optimized Video Encoder Implementation with Screen Content Coding Tools

Xiaozhong Xu (Tencent America); Shitao Wang (Tencent); Yu Chen (Tencent); Yiming Li (Tencent); Qing Zhang (Tencent); Yushan Zheng (Tencent); Shan Liu (Tencent America)

 

TC.4 Prediction-aware Quality Enhancement of VVC Using CNN

Fatemeh NASIRI (bcom); Wassim Hamidouche (INSA Rennes); Luce Morin (INSA Rennes); Nicolas Dhollande (Aviwest); Gilda Cochorel (Aviwest)

 

TC.5 Adaptive Color Transform in VVC Standard

Hong-Jheng Jhu (Kwai Inc.); Xiaoyu Xiu (Kwai Inc.); Yi-Wen Chen (Kwai Inc.); Tsung-Chuan Ma (Kwai Inc.); Xianglin Wang (Kwai Inc.)

 

 

Poster 2.2: Image/Video Quality Assessment

P2.2.1 No-Reference Stereoscopic Image Quality Assessment Based on Convolutional Neural Network with A Long-Term Feature Fusion

Sumei Li (Tianjin University); Mingyi Wang (Tianjin University)

 

P2.2.2 No-Reference Objective Quality Assessment Method of Display Products

Huiqing Zhang (Beijing University of Technology); Donghao Li (Beijing University of Technology); Lifang Wu (Beijing University of Technology); Zhifang Xia (Beijing University of Technology; The State Information Center of China)

 

P2.2.3 No-reference Stereoscopic Image Quality Assessment Based on Visual Attention

Sumei Li (Tianjin University); Ping Zhao (Tianjin University); Yongli Chang (Tianjin University)

 

P2.2.4 A Weighted Mean Absolute Error Metric for Image Quality Assessment

Sihan Hao (Tianjin University); Sumei Li (Tianjin University)

 

P2.2.5 No-Reference Stereoscopic Image Quality Assessment Considering Multi-loss Constraints

Yongtian Han (Tianjin University); Sumei Li (Tianjin University); Guanghui Yue (Shenzhen University); Yongli Chang (Tianjin University)

 

P2.2.6 Deep Local and Global Spatiotemporal Feature Aggregation for Blind Video Quality Assessment

Wei Zhou (University of Science and Technology of China); Zhibo Chen (University of Science and Technology of China)

 

P2.2.7 Stereo Image Quality Assessment Considering the Asymmetry of Statistical Information in Early Visual Pathway

Yongli Chang (Tianjin University); Sumei Li (Tianjin University); Li Ma (Tianjin University); Jie Jin (Tianjin University)

 

 

Standard Hong Kong Time
(GMT + 8)

December 4, 2020 (Friday)

09:00-10:00

Keynote III

10:00-10:30

Break

10:30-12:30

SS3: Presence and Experience for Immersive Virtual Reality

Oral 3.1: Video Analysis and Coding III

Oral 3.2: Deep Learning II

12:30-14:00

Lunch

14:00-16:00

Poster 3.1: Image Analysis and Compression III

Grand Challenge

16:00-16:30

Break

16:30-18:30

SS4: Task-oriented Image/Video Processing and Coding

Oral 3.3: Image Analysis and Compression II

Oral 3.4: Object Detection and Tracking

18:30-18:45

Closing Ceremony

 

 

Special Session 3: Presence and Experience for Immersive Virtual Reality

SS3.1 A Theory of Occlusion for Improving Rendering Quality of Views

Yijun Zeng (Guangxi Normal University); Weiyan Chen (Guangxi Normal University); Mengqin Bai (Guangxi Normal University); Yangdong Zeng (Guilin Medical University); Changjian Zhu (Guangxi Normal University)

 

SS3.2 Application of Brain-Computer Interface and Virtual Reality in Advancing Cultural Experience

Hao-Lun Fu (NCKU); Po-Hsiang Fang (NCKU); Chan-Yu Chi (STUST); Chung-ting Kuo (STUST); Meng-Hsuan Liu (NCKU); Howard Muchen Hsu (NCKU); Cheng-Hsun Hsieh (STUST); Sheng-Fu Liang (NCKU); Shulan Hsieh (NCKU); Cheng-Ta Yang (NCKU)

 

SS3.3 Towards Audio-Visual Saliency Prediction for Omnidirectional Video with Spatial Audio

Fang-Yi Chao (Univ Rennes, INSA Rennes, CNRS, IETR (Institut d’Electronique et de Télécommunication de Rennes) - UMR 6164, F-35000 Rennes, France); Cagri Ozcinar (Samsung R&D Institute UK); Lu Zhang (INSA Rennes); Wassim Hamidouche (INSA Rennes); Prof. Olivier Deforges (IETR, Rennes); Aljosa Smolic (Trinity College Dublin)

 

SS3.4 ERP-Based CTU Splitting Early Termination for Intra Prediction of 360 videos

Bernardo R Beling (Federal University of Pelotas); Iago C Storch (Federal University of Rio Grande do Sul); Luciano Volcan Agostini (UFPel - Federal University of Pelotas); Bruno Zatt (Federal University of Pelotas); Sergio Bampi (Nil); Daniel Palomino (Federal University of Pelotas)

 

SS3.5 Cloud-assisted Augmented Reality Streaming Service System: Architecture Design and Implementation

Hyunmin Noh (POSTECH); Hwangjun Song (POSTECH)

 

SS3.6 Geodesic Disparity Compensation for Inter-View Prediction in VR180

Bharath Vishwanath (UCSB); Kruthika Koratti Sivakumar (UCSB); Kenneth Rose (UCSB)

 

 

Oral 3.1: Video Analysis and Coding III

O3.1.1 Motion Estimation for Spike Camera Data Sequence via Spike Interval Analysis

Jing Zhao (Peking University); Ruiqin Xiong (Peking University); Rui Zhao (Peking University); Jin Wang (Beijing University of Technology); Siwei Ma (Peking University, China); Tiejun Huang (Peking University)

 

O3.1.2 DOVE: Decomposition Oriented Video super-rEsolution

Huairui Wang (Wuhan University); Wanjie Sun (WHU); Daiqin Yang (Wuhan University); Zhenzhong Chen (Wuhan University)

 

O3.1.3 Fast Intra Coding Algorithm for Depth Map with End-to-End Edge Detection Network

Chang Liu (Beijing University of Technology); Kebin Jia (Beijing University of Technology); Pengyu Liu (Beijing University of Technology)

 

O3.1.4 Learning to encode user-generated short videos with lower bitrate and the same perceptual quality

Shengbin Meng (ByteDance Inc.); Yang Li (ByteDance Inc.); Yiting Liao (ByteDance Inc.); Junlin Li (ByteDance Inc.); Shiqi Wang (CityU)

 

 

Oral 3.2: Deep Learning II

O3.2.1 Deep Learning-Based Nonlinear Transform for HEVC Intra Coding

Kun Yang (University of Science and Technology of China); Dong Liu (University of Science and Technology of China); Feng Wu (University of Science and Technology of China)

 

O3.2.2 Icon Colorization Based On Triple Conditional Generative Adversarial Networks

Qinru Han (Beijing University of Technology); Wenzhe Zhu (Beijing University of Technology); Qing Zhu (Beijing University of Technology)

 

O3.2.3 Video Anomaly Detection Using Open Data Filter and Domain Adaptation

Chen Zhang (University of Chinese Academy of Sciences); Guorong Li (University of Chinese Academy of Sciences); Li Su (University of Chinese Academy of Sciences); Weigang Zhang (Harbin Institute of Technology, Weihai); Qingming Huang (University of Chinese Academy of Sciences)

 

O3.2.4 Temporal Action Proposal Generation via Multi-Task Feature Learning

Handong Ma (University of Electronic Science and Technology of China); Lixin Duan (University of Electronic Science and Technology of China)

 

O3.2.5 Improving Compression Artifact Reduction via End-to-End Learning of Side Information

Haichuan Ma (USTC); Dong Liu (University of Science and Technology of China); Feng Wu (University of Science and Technology of China)

 

O3.2.6 An Empirical Study of Emotion Recognition from Thermal Video Based on Deep Neural Networks

Herman Prawiro (National Tsing Hua University); Tse-Yu Pan (National Tsing Hua University); Min-Chun Hu (National Tsing Hua University)

 

 

Special Session 4: Task-oriented Image/Video Processing and Coding

SS4.1 IBC-Mirror Mode for Screen Content Coding for the Next Generation Video Coding Standards

Jian Cao (Sun Yat-sen University); Zhen Qiu (Sun Yat-sen University); Zhengren Li (Sun Yat-sen University); Fan Liang (Sun Yat-sen University); Jun WANG (Sun Yat-Sen University)

 

SS4.2 Sensitivity-Aware Bit Allocation for Intermediate Deep Feature Compression

Hu Yuzhang (Peking University); Sifeng Xia (Peking University); Wenhan Yang (Peking University); Jiaying Liu (Peking University)

 

SS4.3 Chain Code-Based Occupancy Map Coding for Video-Based Point Cloud Compression

Runyu Yang (University of Science and Technology of China); Ning Yan (University of Science and Technology of China); Li Li (University of Science and Technology of China); Dong Liu (University of Science and Technology of China); Feng Wu (University of Science and Technology of China)

 

SS4.4 Equirectangular Projection Oriented Intra Prediction for 360-Degree Video Coding

Yingbin Wang (Tencent); Zhenzhong Chen (Wuhan University); Shan Liu (Tencent America)

 

SS4.5 A Mixed Appearance-based and Coding Distortion-based CNN Fusion Approach for In-loop Filtering in Video

Jian Yue (University of Electronic Science & Technology of China); Yanbo Gao (University of Electronic Science & Technology of China); Shuai Li (Shandong University); Menghu Jia (University of Electronic Science & Technology of China)

 

SS4.6 A Novel Visual Analysis Oriented Rate Control Scheme for HEVC

Qi Zhang (Peking University); Shanshe Wang (Peking University); Siwei Ma (Peking University, China)

 

 

Oral 3.3: Image Analysis and Compression II

O3.3.1 The enhancement of underexposed images with blurred reflectance

Jinchao Zhou (University of Electronic Science and Technology of China); Renjie Wan (Nanyang Technological University); Haoliang Li (NTU, Singapore); Alex Kot (Nanyang Technological University)

 

O3.3.2 HDR Deghosting Using Motion-Registration-Free Fusion in the Luminance Gradient Domain

Cheng-Yeh Liou (National Taiwan University); Cheng-Yen Chuang (National Taiwan University); Chia-Han Huang (National Taiwan University); Yi-Chang Lu (National Taiwan University)

 

O3.3.3 Volumetric End-to-End Optimized Compression for Brain Images

Shuo Gao (University of Science and Technology of China); Yueyi Zhang (University of Science and Technology of China); Dong Liu (University of Science and Technology of China); Zhiwei Xiong (University of Science and Technology of China)

 

O3.3.4 A night-time outdoor data set for low-light enhancement

Yudong Zhou (Peking University); Ronggang Wang (Peking University); Yang Zhao (Hefei University of Technology)

 

O3.3.5 A Marked Point Process Model For Visual Perceptual Groups Extraction

Amal Mbarki (Faculty of Science of Tunis); Mohamed Naouai (Faculty of Science of Tunis)

 

O3.3.6 Towards Quantized DCT Coefficients Restoration for Compressed Images

Tong Ouyang (Wuhan University); Zhenzhong Chen (Wuhan University); Shan Liu (Tencent America)

 

 

Oral 3.4: Object Detection and Tracking

O3.4.1 Real-time Detection and Tracking Network with Feature Sharing

Guo Ente (Fuzhou University); Zhifeng Chen (Fuzhou University); Zhenjia Fan (Fuzhou University); XiuJun Yang (Fuzhou University)

 

O3.4.2 Learning Matching Behavior Differences for Compressing Vehicle Re-identification Models

Yi Xie (Huaqiao University); Jianqing Zhu (Huaqiao University); Huanqiang Zeng (Huaqiao University); Canhui Cai (Huaqiao University); Lixin Zheng (Huaqiao University)

 

O3.4.3 Bidirectional Consistency Constrained Template Update Learning for Siamese Trackers

Kexin Chen (University of Electronic Science and Technology of China); Xue Zhou (University of Electronic Science and Technology of China); Chao Liang (University of Electronic Science and Technology of China); Jianxiao Zou (University of Electronic Science and Technology of China)

 

O3.4.4 Robust Visual Tracking Via An Imbalance-Elimination Mechanism

Jin Feng (Beijing University of Posts and Telecommunications); Kaili Zhao (Beijing University of Posts and Telecommunications); Xiaolin Song (Beijing University of Posts and Telecommunications); Anxin Li (DOCOMO Beijing Labs); Honggang Zhang (Beijing University of Posts and Telecommunications)

 

O3.4.5 CSCNet: A Shallow Single Column Network for Crowd Counting

Zhida Zhou (University of Chinese Academy of Sciences); Li Su (University of Chinese Academy of Sciences); Guorong Li (University of Chinese Academy of Sciences); Yifang Yang (University of Chinese Academy of Sciences); Qingming Huang (University of Chinese Academy of Sciences)

 

O3.4.6 Learning Convolution Feature Aggregation via Edge Attention Convolution Network for Person Re-Identification

Chaoqun Lin (Beijing University of Posts and Telecommunications); Ruopei Guo (Beijing University of Posts & Telecommunications); MingKun Li (Beijing University of Posts and Telecommunications); Xianbiao Qi (Shenzhen Research Institute of Big Data); Chun-Guang Li (Beijing University of Posts & Telecommunications)

 

 

Poster 3.1: Image Analysis and Compression III

P3.1.1 Learning Photometric stereo via Manifold-based Mapping

Yakun Ju (Ocean University of China); Muwei Jian (Shandong University of Finance and Economics); Junyu Dong (Ocean University of China); Kin-Man Lam (The Hong Kong Polytechnic University)

 

P3.1.2 UGNet: Underexposed Images Enhancement Network based on Global Illumination Estimation

Yuan Fang (Beijing University of Technology); Wenzhe Zhu (Beijing University of Technology); Qing Zhu (Beijing University of Technology)

 

P3.1.3 DEN: Disentanglement and Enhancement Networks for Low Illumination Images

Nelson Chong (ViPr); Vu-Hoang Tran (Ho Chi Minh University of Technology and Education); Punchok Kerdsiri (Thammasat University); Yuen Peng Loh (Multimedia University); Ching-Chun Huang (National Chiao Tung University)

 

P3.1.4 Optimization of Sliding-DCT based Gaussian Filtering for Hardware Accelerator

Tomoki Otsuka (Nagoya Institute of Technology); Norishige Fukushima (Nagoya Institute of Technology); Yoshihiro Maeda (Tokyo University of Science); Kenjiro Sugimoto (Waseda University); Sei-ichiro Kamata (Waseda University)

 

P3.1.5 Optical Flow Estimation Between Images of Different Resolutions via Variational Method

Rui Zhao (Peking University); Ruiqin Xiong (Peking University); Shuyuan Zhu (University of Electronic Science and Technology of China); Bing Zeng (University of Electronic Science and Technology of China); Tiejun Huang (Peking University); Wen Gao (PKU)

 

P3.1.6 Low Resolution Facial Manipulation Detection

Xiao Han (School of Electronic and Computer Engineering, Peking University); Zhongyi Ji (School of Electronic and Computer Engineering, Peking University); Wenmin Wang (Macau University of Science and Technology)

 

P3.1.7 Extending CCSDS 123.0-B-1 for Lossless 4D Image Compression

Panpan Zhang (Northwestern Polytechnical University); Xiuheng Wang (Northwestern Polytechnical University); Tiande Gao (Northwestern Polytechnical University); Zhenfu Feng (Xi'an University of Posts and Communications); Jie Chen (Northwestern Polytechnical University)

 

P3.1.8 Learning Redundant Sparsifying Transform based on Equi-Angular Frame

Min Zhang (Beijing University of Technology); Yunhui Shi (Beijing University of Technology); Xiaoyan Sun (University of Science and Technology of China); Nam Ling (Department of Computer Engineering, Santa Clara University); Na Qi (Beijing University of Technology)

 

P3.1.9 Noise-Aware Texture-Preserving Low-Light Enhancement

Zohreh Azizi (University of Southern California); Xuejing Lei (University of Southern California); C.-C. Jay Kuo (University of Southern California)

 

P3.1.10 Fast compressed sensing recovery using generative models and sparse deviations modeling

Lei Cai (South China University of Technology); Yuli Fu (South China University of Technology); Tao Zhu (South China University of Technology); Huanqiang Zeng (Huaqiao University); Youjun Xiang (South China University of Technology); Xianfeng Li (South China University of Technology)

 

 

Grand Challenge

GC.1 NIR image colorization with graph-convolutional neural networks

Diego Valsesia (Politecnico di Torino); Giulia Fracastoro (Polito); Enrico Magli (POLITO)

 

GC.2 Deep Near Infrared Colorization with Semantic Segmentation and Transfer Learning

Fengqiao Wang (Xidian University); Lu Liu (Xidian University); Cheolkon Jung (Xidian University)

 

GC.3 A Multi-model Fusion Framework for NIR-to-RGB Translation

Longbin Yan (Northwestern Polytechnical University); Xiuheng Wang (Northwestern Polytechnical University); Min Zhao (Northwestern Polytechnical University); Shumin Liu (Northwestern Polytechnical University); Jie Chen (Northwestern Polytechnical University)

 

GC.4 NIR Image Colorization Using SPADE Generator and Grayscale Approximated Self-Reconstruction

Tian Sun (Xi'an University of Posts & Telecommunications); Cheolkon Jung (Xidian University)

 

GC.5 Learning From Paired and Unpaired Data: Alternately Trained CycleGAN for NIR Image Colorization

Zaifeng Yang (A-STAR Singapore); Zhenghua Chen (Institute for Infocomm Research, A*STAR)