MediaX
MediaX
Home
News
People
Publications
Demo
Sponsors
Publications
Type
Conference paper
Paper-Journal
Date
2025
2024
2023
2022
2021
2020
2019
2018
2017
2016
2014
Ziying Zhang
,
Xiang Gao
,
Zhixin Wang
,
Qiang Hu
,
Xiaoyun Zhang
(2025).
TD-BFR: Truncated Diffusion Model for Efficient Blind Face Restoration
. In
IEEE International Conference on Multimedia and Expo (ICME), 2025
.
Cite
Source Document
Yuan Tian
,
Shuo Wang
,
Rongzhao Zhang
,
Zijian Chen
,
Yankai Jiang
,
Chunyi Li
,
Xiangyang Zhu
,
Fang Yan
,
Qiang Hu
,
Xiaosong Wang
,
Guangtao Zhai
(2025).
Semantic versus Identity: A Divide-and-Conquer Approach towards Adjustable Medical Image De-Identification
. In
IEEE/CVF International Conference on Computer Vision (ICCV), 2025
.
Source Document
Huiyu Duan
,
Qiang Hu
,
Wang Jiarui
,
Liu Yang
,
Zitong Xu
,
Lu Liu
,
Xiongkuo Min
,
Chunlei Cai
,
Tianxiao Ye
,
Xiaoyun Zhang
,
Guangtao Zhai
(2025).
FineVQ: Fine-Grained User Generated Content Video Quality Assessment (Highlight)
. In
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2025
.
Code
Source Document
Lu Liu
,
Huiyu Duan
,
Qiang Hu
,
Liu Yang
,
Chunlei Cai
,
Tianxiao Ye
,
Huayu Liu
,
Xiaoyun Zhang
,
Guangtao Zhai
(2025).
F-Bench: Rethinking Human Preference Evaluation Metrics for Benchmarking Face Generation, Customization, and Restoration
. In
IEEE/CVF International Conference on Computer Vision (ICCV), 2025
.
Source Document
Haoyun Jiang
,
Haolin Li
,
Jianwei Zhang
,
Fei Huang
,
Qiang Hu
,
Minmin Sun
,
Shuai Xiao
,
Yong Li
,
Junyang Lin
,
Jiangchao Yao
(2025).
CateKV: On Sequential Consistency for Long-Context LLM Inference Acceleration
. In
Forty-Second International Conference on Machine Learning (ICML), 2025
.
Source Document
Qiang Hu
,
Zihan Zheng
,
Houqiang Zhong
,
Sihua Fu
,
Li Song
,
Xiaoyun Zhang
,
Guangtao Zhai
,
Yanfeng Wang.
(2025).
4DGC: Rate-Aware 4D Gaussian Compression for Efficient Streamable Free-Viewpoint Video
. In
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2025
.
Cite
Code
Project
Source Document
Houqiang Zhong
,
Shaocheng Shen
,
Ke Cai
,
Zhenglong Wu
,
Jiangchao Yao
,
Yuan Cheng
,
Xuefei Li
,
Xiaoyun Zhang
,
Li Song
,
Qiang Hu
(2025).
Serial Low-rank Adaptation of Vision Transformer
. In
IEEE International Conference on Multimedia and Expo (ICME), 2025
.
Cite
Source Document
Qiang Hu
,
Houqiang Zhong
,
Zihan Zheng
,
Xiaoyun Zhang
,
Zhengxue Cheng
,
Li Song
,
Guangtao Zhai
,
YanFeng Wang
(2025).
VRVVC: Variable-Rate NeRF-Based Volumetric Video Compression
. In
The Association for the Advancement of Artificial Intelligence (AAAI), 2025
.
Cite
Source Document
Qiang Hu
,
Qihan He
,
Houqiang Zhong
,
GuoLu
,
Xiaoyun Zhang
,
Guangtao Zhai
,
YanFeng Wang
(2025).
VARFVV: View-Adaptive Real-Time Interactive Free-View Video Streaming with Edge Computing
. In
IEEE Journal on Selected Areas in Communications (JSAC), 2025
.
Code
Source Document
Guo Lu
,
Xingtong Ge
,
Tianxiong Zhong
,
Qiang Hu
,
Jing Geng
(2025).
Preprocessing Enhanced Image Compression for Machine Vision
. In
IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2024
.
Source Document
Zihan Zheng
,
Houqiang Zhong
,
Qiang Hu
,
Xiaoyun Zhang
,
Li Song
,
Ya Zhang
,
YanFeng Wang
(2024).
HPC: Hierarchical Progressive Coding Framework for Volumetric Video
. In
Proceedings of the ACM International Conference on Multimedia(MM), 2024
.
Cite
Source Document
Haoning Wu
,
Shaocheng Shen
,
Qiang Hu
,
Xiaoyun Zhang
,
Ya Zhang
,
YanFeng Wang
(2024).
MegaFusion: Extend Diffusion Models towards Higher-resolution Image Generation without Further Tuning
. In
Winter Conference on Applications of Computer Vision (WACV), 2025
.
Cite
Code
Project
Source Document
Zihan Zheng
,
Houqiang Zhong
,
Qiang Hu
,
Xiaoyun Zhang
,
Li Song
,
Ya Zhang
,
YanFeng Wang
(2024).
JointRF: End-to-End Joint Optimization for Dynamic Neural Radiance Field Representation and Compression Video
. In
IEEE International Conference on Image Processing (ICIP), 2024
.
Cite
Source Document
Liao Wang
,
Kaixin Yao
,
Chengcheng Guo
,
Zhirui Zhang
,
Qiang Hu
,
Jingyi Yu
,
Lan Xu
,
Minye Wu
(2024).
VideoRF: Rendering Dynamic Radiance Fields as 2D Feature Video Streams
. In
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2024
.
Source Document
Zhiyu Zhang
,
Guo Lu
,
Huanxiong Liang
,
Anni Tang
,
Qiang Hu
,
Li Song
(2024).
Efficient Dynamic-NeRF Based Volumetric Video Coding with Rate Distortion Optimization
. In
IEEE International Conference on Multimedia and Expo (ICME), 2024
.
Source Document
Yuteng Ye
,
Hang Zhou
,
Junqing Yu
,
Qiang Hu
,
Wei Yang
(2024).
Dynamic Feature Pruning and Consolidation for Occluded Person Re-Identification
. In
AAAI
.
Source Document
Zhixin Wang
,
Xiaoyun Zhang
(2023).
DR2: Diffusion-based Robust Degradation Remover for Blind Face Restoration
. In
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2023
.
Cite
Code
Project
Source Document
Liao Wang
,
Qiang Hu
,
Qihan He
,
Ziyu Wang
,
Jingyi Yu
,
Tinne Tuytelaars
,
Lan Xu
,
Minye Wu
(2023).
Neural Residual Radiance Fields for Streamably Free-Viewpoint Videos
. In
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2023
.
Code
Source Document
Ziyu Wang
,
Wei Yang
,
Junming Cao
,
Qiang Hu
,
Lan Xu
,
Junqing Yu
,
Jingyi Yu
(2023).
NeReF: Neural Refractive Field for Fluid Surface Reconstruction and Rendering
. In
IEEE International Conference on Computational Photography (ICCP), 2023
.
Source Document
Yangyi Dong
,
Xiaoyun Zhang
,
Zhixin Wang
,
Ya Zhang
,
Siheng Chen
,
YanFeng Wang
(2022).
Unpaired Face Restoration via Learnable Cross-Quality Shift
. In
IEEE Conference on Computer Vision and Pattern Recognition (CVPR) NTIRE, 2022
.
Yixuan Huang
,
Xiaoyun Zhang
,
Yu Fu
,
Siheng Chen
,
Ya Zhang
,
YanFeng Wang
,
Dazhi He
(2022).
Task Decoupled Framework for Reference-Based Super-Resolution
. In
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2022
.
Baisong Guo
,
Xiaoyun Zhang
,
Haoning Wu
,
Yu Wang
,
Ya Zhang
,
Yan-Feng Wang
(2022).
LAR-SR: A Local Autoregressive Model for Image Super-Resolution
. In
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2022
.
Qinye Zhou
,
Ziyi Li
,
Weidi Xie
,
Xiaoyun Zhang
,
Ya Zhang
,
YanFeng Wang
(2022).
A simple plugin for transforming images to arbitrary scales
. In
British Machine Vision Virtual Conference 2022
.
Chen Ju
,
Peisen Zhao
,
Siheng Chen
,
Ya Zhang
,
Xiaoyun Zhang
,
Qi Tian
(2022).
Adaptive Mutual Supervision for Weakly-Supervised Temporal Action Localization
. In
IEEE Transactions on Multimedia
.
Guo Lu
,
Tianxiong Zhong
,
Jing Geng
,
Qiang Hu
,
Dong Xu
(2022).
Learning Based Multi-Modality Image and Video Compression
. In
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2022
.
Source Document
Lianyu Du
,
Liwei Hu
,
Xiaoyun Zhang
,
Yumin Zhong
,
Ya Zhang
,
YanFeng Wang
(2021).
Unsupervised Segmentation Framework with Active Contour Models for Cine Cardiac MRI
. In
IEEE International Conference on Image Processing (ICIP), 2021
.
Tianyue Cao
,
Lianyu Du
,
Xiaoyun Zhang
,
Siheng Chen
,
Ya Zhang
,
YanFeng Wang
(2021).
CaT: Weakly Supervised Object Detection With Category Transfer
. In
Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2021
.
Shixiang Feng
,
Beibei Liu
,
Ya Zhang
,
Xiaoyun Zhang
,
Yuehua Li
(2021).
Two-Stream Compare and Contrast Network for Vertebral Compression Fracture Diagnosis
. In
IEEE Transactions on Medical Imaging
.
Wenbo Bao
,
Wei-Sheng Lai
,
Xiaoyun Zhang
,
Zhiyong Gao
,
Ming-Hsuan Yang
(2021).
MEMC-Net: Motion Estimation and Motion Compensation Driven Neural Network for Video Interpolation and Enhancement
. In
IEEE Transactions on Pattern Analysis and Artificial Intelligence
.
Xingyue Pu
,
Tianyue Cao
,
Xiaoyun Zhang
,
Xiaowen Dong
,
Siheng Chen
(2021).
Learning to Learn Graph Topologies
. In
Proceedings of the IEEE/CVF Neural Information Processing Systems (NIPS), 2021
.
Guo Lu
,
Xiaoyun Zhang
,
Wanli Ouyang
,
Li Chen
,
Zhiyong Gao
,
Dong Xu,
(2021).
An End-to-End Learning Framework for Video Compression
. In
IEEE Transactions on Pattern Analysis and Machine Intelligence
.
Xuan Liao
,
Wenhao Li
,
Qisen Xu
,
Xiangfeng Wang
,
Bo Jin
,
Xiaoyun Zhang
,
YanFeng Wang
,
Ya Zhang
(2020).
Iteratively-Refined Interactive 3D Medical Image Segmentation with Multi-Agent Reinforcement Learning
. In
CVPR
.
Qiang Hu
,
Jun Zhou
,
Xiaoyun Zhang
,
Zhiyong Gao
,
Ming-Ting Sun
(2020).
In-loop perceptual model-based rate-distortion optimization for HEVC real-time encoder
. In
Journal of Real-Time Image Processing
.
Chunlei Cai
,
Li Chen
,
Xiaoyun Zhang
,
Zhiyong Gao
(2020).
End-to-End Optimized ROI Image Compression
. In
IEEE Transactions on Image Processing
.
Yingying Xue
,
Shixiang Feng
,
Ya Zhang
,
Xiaoyun Zhang
,
YanFeng Wang
(2020).
Dual-task Self-supervision for Cross-Modality Domain Adaptation
. In
MICCAI
.
Guo Lu
,
Xiaoyun Zhang
,
Wanli Ouyang
,
Dong Xu
,
Li Chen
,
Zhiyong Gao
(2020).
Deep Non-local Kalman Network for Video Compression Artifact Reduction
. In
IEEE Transaction on Image Processing
.
Guo Lu
,
Chunlei Cai
,
Xiaoyun Zhang
,
Li Chen
,
Wanli Ouyang
,
Dong Xu
,
Zhiyong Gao
(2020).
Content adaptive and error propagation aware deep video compression
. In
European Conference on Computer Vision (ECCV)
.
Minye Wu
,
Haibin Ling
,
Ning Bi
,
Shenghua Gao
,
Qiang Hu
,
Hao Sheng
,
Jingyi Yu
(2020).
Visual Tracking With Multiview Trajectory Prediction
. In
IEEE Transactions on Image Processing (TIP), 2020
.
Code
Source Document
Xin Suo
,
Minye Wu
,
Yanshun Zhang
,
Yingliang Zhang
,
Lan Xu
,
Qiang Hu
,
Jingyi Yu
(2020).
Neural3D: Light-weight Neural Portrait Scanning via Context-aware Correspondence Learning
. In
Proceedings of the 28th ACM International Conference on Multimedia(MM), 2020
.
Source Document
Minye Wu
,
Yuehao Wang
,
Qiang Hu
,
Jingyi Yu
(2020).
Multi-view neural human rendering
. In
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2020
.
Code
Source Document
Quan Meng
,
Jiakai Zhang
,
Qiang Hu
,
Xuming He
,
Jingyi Yu
(2020).
LGNN: A Context-aware Line Segment Detector
. In
Proceedings of the 28th ACM International Conference on Multimedia(MM), 2020
.
Source Document
Shiqi Peng
,
Bolin Lai
,
Guangyu Yao
,
Ya Zhang
,
Xiaoyun Zhang
,
YanFeng Wang
,
Hui Zhao
(2019).
Weakly Supervised Segmentation of Vertebral Bodies with Iterative Slice-Propagation
. In
Domain Adaptation and Representation Transfer and Medical Image Learning with Less Labels and Imperfect Data
.
Qiang Hu
,
Jun Zhou
,
Xiaoyun Zhang
,
Zhiru Shi
,
Zhiyong Gao
(2019).
Viewport-Adaptive 360-degree Video Coding
. In
Multimedia Tools and Applications
.
Yuan Tian
,
Xiongkuo Min
,
Guangtao Zhai
,
Zhiyong Gao
(2019).
Video-based early Autism Detection via Temporal Pyramid Networks,
. In
IEEE International Conference on Multimedia and Expo (ICME)
.
Bolin Lai
,
Shiqi Peng
,
Guangyu Yao
,
Ya Zhang
,
Xiaoyun Zhang
,
YanFeng Wang
,
Hui Zhao
(2019).
Spatial Regularized Classification Network for Spinal Dislocation Diagnosis
. In
International Workshop on Machine Learning in Medical Imaging
.
Shiqi Peng
,
Bolin Lai
,
Guangyu Yao
,
Ya Zhang
,
Xiaoyun Zhang
,
YanFeng Wang
,
Hui Zhao
(2019).
Learning-Based Bone Quality Classification Method for Spinal Metastasis
. In
International Workshop on Machine Learning in Medical Imaging
.
Shangpeng Yan
,
Wenbo Bao
,
Xiaoyun Zhang
,
Zhiyong Gao
,
Li Chen
(2019).
Large Scale Near-duplicate Image Retrieval via Patch Embedding
. In
4th International Workshop on Compact and Efficient Feature Representation and Learning in Computer Vision
.
Chunlei Cai
,
Li Chen
,
Xiaoyun Zhang
,
Zhiyong Gao
(2019).
Efficient Variable Rate Image Compression with Multi-scale Decomposition Network
. In
IEEE Transactions on Circuits and Systems for Video Technology
.
Guo Lu
,
Wanli Ouyang
,
Dong Xu
,
Xiaoyun Zhang
,
Chunlei Cai
,
Zhiyong Gao
(2019).
DVC: An End-to-End Deep Video Compression Framework
. In
IEEE International Conference on Computer Vision and Pattern Recognition (CVPR)
.
Wenbo Bao
,
Chao Ma
,
Wei-Sheng Lai
,
Xiaoyun Zhang
,
Zhiyong Gao
,
Ming-Hsuan Yang
(2019).
Depth-Aware Video Frame Interpolation
. In
IEEE International Conference on Computer Vision and Pattern Recognition (CVPR)
.
Shengyang Li
,
Xiaoyun Zhang
,
Xiaoxia Wang
,
Yumin Zhong
,
Xiaofen Yao
,
Ya Zhang
,
YanFeng Wang
(2019).
Children’s Neuroblastoma Segmentation Using Morphological Features
. In
International Workshop on Machine Learning in Medical Imaging
.
Chunlei Cai
,
Li Chen
,
Xiaoyun Zhang
,
Zhiyong Gao
(2019).
A Novel Deep Progressive Image Compression Framework
. In * Picture Coding Symposium (PCS)*.
Chunmei Xie
,
Xiaoyun Zhang
,
Hua Yang
,
Li Chen
,
Zhiyong Gao
(2018).
Video Stitching Based on Optical Flow
. In
IEEE International Symposium on Broadband Multimedia Systems and Broadcasting (BMSB)
.
Chunlei Cai
,
Li Chen
,
Lei Zhou
,
Xiaoyun Zhang
,
Zhiyong Gao
(2018).
Rcdfnn: Robust Change Detection Based on Convolutional Fusion Neural Network
. In
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
.
Guo Lu
,
Xiaoyun Zhang
,
Li Chen
,
Zhiyong Gao
(2018).
Novel Integration of Frame Rate Up Conversion and HEVC Coding Based on Rate-Distortion Optimization
. In
IEEE Transaction on Image Processing
.
Wenbo Bao
,
Xiaoyun Zhang
,
Li Chen
,
Zhiyong Gao
(2018).
KalmanFlow: Efficient Kalman Filtering for Video Optical Flow
. In
IEEE International Conference on Image Processing (ICIP)
.
Wenbo Bao
,
Xiaoyun Zhang
,
Li Chen
,
Lianghui Ding
,
Zhiyong Gao
(2018).
KalmanFlow 2.0: Efficient Video Optical Flow Estimation via Context-Aware Kalman Filtering
. In
IEEE Transaction on Image Processing
.
Wenbo Bao
,
Xiaoyun Zhang
,
Li Chen
,
Zhiyong Gao
(2018).
High-Order Model and Dynamic Filtering for Frame Rate Up-Conversion
. In
IEEE Transaction on Image Processing
.
Guo Lu
,
Wanli Ouyang
,
Dong Xu
,
Xiaoyun Zhang
,
Zhiyong Gao
,
Ming-Ting Sun
(2018).
Deep Kalman Filtering Network for Video Compression Artifact Reduction
. In
European Conference on Computer Vision (ECCV)
.
Yuan Tian
,
Zhaohui Che
,
Guangtao Zhai
,
Zhiyong Gao
(2018).
BAN, A Barcode Accurate Detection Network
. In
IEEE International Conference on Visual Communications and Image Processing (VCIP)
.
Cong Geng
,
Li Chen
,
Xiaoyun Zhang
,
Peng Zhou
,
Zhiyong Gao
(2018).
A Wavelet-based Learning for Face Hallucination with Loop Architecture
. In
IEEE International Conference on Visual Communications and Image Processing (VCIP)
.
Xiaoyi He
,
Qiang Hu
,
Xiaoyun Zhang
,
Chongyang Zhang
,
Weiyao Lin
,
Xintong Han
(2018).
LGNN: A Context-aware Line Segment Detector
. In
IEEE International Conference on Image Processing (ICIP), 2018
.
Source Document
Bing Yang
,
Xiaoyun Zhang
,
Li Chen
,
Zhiyong Gao
(2017).
Spatiotemporal salient object detection based on distance transform and energy optimization
. In
Neurocomputing
.
Chunlei Cai
,
Li Chen
,
Xiaoyun Zhang
,
Zhiyong Gao
,
Lonsn Liao
,
Jack Yu
(2017).
Moving segmentation in HEVC compressed domain based on logistic regression
. In * IEEE International Symposium on Broadband Multimedia Systems and Broadcasting (BMSB)*.
Wenbo Bao
,
Xiaoyun Zhang
,
Shangpeng Yan
,
Zhiyong Gao
(2017).
Iterative convolutional neural network for noisy image super-resolution
. In
IEEE International Conference on Image Processing (ICIP)
.
Wenbo Bao
,
Xiaoyun Zhang
,
Li Chen
,
Zhiyong Gao
(2017).
High-quality and real-time frame interpolation on heterogeneous computing system
. In * IEEE International Symposium on Broadband Multimedia Systems and Broadcasting (BMSB)*.
Bing Yang
,
Xiaoyun Zhang
,
Li Chen
,
Hua Yang
,
Zhiyong Gao
(2017).
Edge Guided Salient Object Detection
. In
Neurocomputing
.
Lin Chen
,
Hua Yang
,
Ji Zhu
,
Qin Zhou
,
Shuang Wu
,
Zhiyong Gao
(2017).
Deep spatial-temporal fusion network for video-based person re-identification
. In
IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)
.
Lin Chen
,
Hua Yang
,
Shuang Wu
,
Zhiyong Gao
(2017).
Data generation for improving person re-identification
. In
ACM on Multimedia Conference (MM)
.
Chen Gang
,
Yang Bing
,
Zhang Xiaoyun
,
Gao Zhiyong
(2017).
Complexity control algorithm based on adaptive mode selection for interframe coding in high efficiency video coding
. In
Journal of Electronic Imaging
.
Guo Lu
,
Xiaoyun Zhang
,
Li Chen
,
Zhiyong Gao
(2017).
A Novel Frame Rate Up Conversion Using Iterative Non-Local Means Interpolation
. In * IEEE International Symposium on Broadband Multimedia Systems and Broadcasting (BMSB)*.
Bing Yang
,
Xiaoyun Zhang
,
Li Chen
,
Zhiyong Gao
(2016).
Principal Component Analysis-based Visual Saliency Detection
. In
IEEE Transactions on Multimedia
.
Qiang Hu, Xiaoyun Zhang, Zhiru Shi, Zhiyong Gao
(2016).
Neyman-Pearson Based Early Mode Decision for HEVC Encoding
. In
IEEE Transactions on Multimedia
.
Yong Guo
,
Li Chen
,
Zhiyong Gao
,
Xiaoyun Zhang
(2016).
Frame Rate Up-Conversion Using Linear Quadratic Motion Estimation and Trilateral Filtering Motion Smoothing
. In
Journal of Display Technology
.
Yong Guo
,
Li Chen
,
Zhiyong Gao
,
Xiaoyun Zhang
(2014).
Frame Rate Up-Conversion Method for Video Processing Applications
. In
IEEE Transactions on Broadcasting
.
Cite
×