A low-latency inpainting method for unstably transmitted videos

doi:10.52396/JUST-2020-0032

Journal of University of Science and Technology of China ›› 2021, Vol. 51 ›› Issue (10): 717-724. DOI: 10.52396/JUST-2020-0032

• Information Science •

WEI Yutong¹, BAO Bingkun², ZHANG Ziqi³, ZHU Jin¹*

1. Department of Automation, University of Science and Technology of China, Hefei 230027, China;
2. College of Telecommunications and Information Engineering, Nanjing University of Posts and Telecommunications, Nanjing 210003, China;
3. Agora, Inc., Shanghai 200082, China

Received: 2020-12-24 Revised: 2021-02-05 Online: 2021-10-31 Published: 2022-01-11
Corresponding author: *E-mail: jinzhu@ustc.edu.cn

Abstract

Video traffic has gradually come to account for the majority of mobile traffic, yet video damage caused by unstable transmission remains a common and pressing problem. Such damage is hard to inpaint with both low latency and high accuracy, because the holes appear at random locations in random frames. We are the first to study video inpainting under unstable transmission, and we propose a low-latency video inpainting method that consists of two stages. In the coarse inpainting stage, we extract the damaged two-dimensional optical flow from the reference frames and build a linear prediction model that coarsely inpaints the damaged frames by exploiting the temporal consistency of motion. In the fine inpainting stage, we propose a Partial Convolutional Frame Completion network (PCFC-Net) that synthesizes all reference information and computes the fine inpainting result. Compared with state-of-the-art baselines, the method greatly reduces the waiting time for reference frames while improving PSNR and SSIM by 4.0% to 12.7% on the DAVIS dataset.

Key words: video inpainting, unstable transmission, partial CNN, linear prediction
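
The coarse stage described in the abstract (predict motion linearly over time, then fill the holes from a reference frame) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation. The function names `predict_flow_linear` and `coarse_inpaint`, the per-pixel least-squares line fit, the nearest-neighbour warp, and the flow layout (channel 0 horizontal, channel 1 vertical, pointing from a hole pixel to its source in the reference frame) are all assumptions made for illustration.

```python
import numpy as np


def predict_flow_linear(prev_flows):
    """Extrapolate the next flow field with a per-pixel least-squares line fit.

    prev_flows: array of shape (T, H, W, 2) holding flows at times 0..T-1 (T >= 2).
    Returns the predicted flow at time T, shape (H, W, 2).
    """
    flows = np.asarray(prev_flows, dtype=np.float64)
    t_count = flows.shape[0]
    if t_count < 2:
        raise ValueError("need at least two past flow fields")
    t = np.arange(t_count, dtype=np.float64)
    dt = t - t.mean()
    mean_flow = flows.mean(axis=0)
    # Closed-form slope and intercept of the fitted line, per pixel and channel.
    slope = np.tensordot(dt, flows - mean_flow, axes=(0, 0)) / np.sum(dt ** 2)
    intercept = mean_flow - slope * t.mean()
    return intercept + slope * t_count


def coarse_inpaint(damaged, mask, reference, flow):
    """Fill hole pixels (mask == True) with reference pixels displaced by the flow."""
    h, w = mask.shape
    ys, xs = np.nonzero(mask)
    # Assumed convention: hole pixel (y, x) is copied from reference (y + v, x + u).
    src_x = np.clip(np.rint(xs + flow[ys, xs, 0]).astype(int), 0, w - 1)
    src_y = np.clip(np.rint(ys + flow[ys, xs, 1]).astype(int), 0, h - 1)
    out = damaged.copy()
    out[ys, xs] = reference[src_y, src_x]
    return out
```

Because the prediction only uses flow fields from frames that have already arrived, a sketch like this does not need to wait for future reference frames, which is the low-latency property the abstract emphasizes. The fine stage (PCFC-Net) is a learned network and is not reproduced here.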

CLC number: TP273

WEI Yutong, BAO Bingkun, ZHANG Ziqi, ZHU Jin. A low-latency inpainting method for unstably transmitted videos[J]. Journal of University of Science and Technology of China, 2021, 51(10): 717-724.

