应用科学学报 ›› 2021, Vol. 39 ›› Issue (3): 433-432.doi: 10.3969/j.issn.0255-8297.2021.03.009

• 信号与信息处理 • 上一篇    

基于FSCD-CNN的深度图像快速帧内预测模式选择算法

崔鹏涛1, 张倩1, 刘敬怀1, 周超1, 王斌1, 司文2   

  1. 1. 上海师范大学 信息与机电工程学院, 上海 200234;
    2. 上海商学院 商务信息学院, 上海 201400
  • 收稿日期:2019-12-18 发布日期:2021-06-08
  • 通信作者: 张倩,博士,副教授,研究方向为三维视频信息处理。E-mail:qianzhang@shnu.edu.cn E-mail:qianzhang@shnu.edu.cn

FSCD-CNN Based Fast Mode Decision Algorithm for Intra-prediction in Depth Map Coding

CUI Pengtao1, ZHANG Qian1, LIU Jinghuai1, ZHOU Chao1, WANG Bin1, SI Wen2   

  1. 1. School of Information, Mechanical and Electrical Engineering, Shanghai Normal University, Shanghai 200234, China;
    2. Faculty of Business Information, Shanghai Business School, Shanghai 201400, China
  • Received:2019-12-18 Published:2021-06-08

摘要: 针对3D-HEVC的多视点视频加深度图的编码格式和四叉树编码结构所带来的编码复杂度问题,提出了一种深度图像快速帧内预测模式选择算法。首先,从深度视频序列中以最优的深度图最大编码单元(largest coding unit,LCU)划分深度为标签获取训练集;其次,构建了适用于LCU的Cu深度快速选择卷积神经网络(fast selecting Cu’s depth-convolutionalneural network,FSCD-CNN);最后,对深度图LCU进行划分深度预测,跳过部分编码模式决策,实现最佳LCU划分。实验结果表明,与相关文献对比,所提算法在保持了编码性能的同时平均减少了15%的编码时间,实验验证了其有效性和可靠性。

关键词: 3D-HEVC, 深度图, 最大编码单元, 卷积神经网络, 编码复杂度

Abstract: In view of the coding complexity caused by the encoding format of multiview video plus depth map and the quadtree coding structure in 3D-HEVC, a fast intra prediction mode selection algorithm for depth images based on FSCD-CNN (fast selecting cu’s depth-convolutional neural network) is proposed. First, a training set is obtained by dividing the depth of the optimal depth map LCU (largest coding unit) of a depth video sequence as labels. Second, a FSCD-CNN network is constructed, which is suitable for deep decision-making of LCU. At last, the optimal division of LCU is achieved by carrying out the depth-division prediction of depth map LCU and skipping some coding mode decisions. Experimental results show that the proposed algorithm could reduce the coding time by 15% on average while maintaining the same coding performance as other relevant literatures, and verify the effectiveness and reliability of this method.

Key words: 3D-HEVC, depth map, largest coding unit (LCU), convolutional neural network (CNN), coding complexity

中图分类号: