Journal of Shanghai University (Natural Science Edition) ›› 2024, Vol. 30 ›› Issue (3): 466-475. doi: 10.12066/j.issn.1007-2861.2467

Lightweight image compression algorithm based on deep learning

FAN Shenwei, LI Guoping, WANG Guozhong

  1. School of Electronic and Electrical Engineering, Shanghai University of Engineering Science, Shanghai 201620, China
  • Online: 2024-06-30  Published: 2024-07-09
  • Corresponding author: LI Guoping (b. 1974), male, senior engineer, Ph.D.; research interests include audio and video coding and intelligent media processing. E-mail: liguoping@sues.edu.cn
  • Supported by:
    National Key Research and Development Program of China (2019YFB1802700)

Abstract: The transform modules of deep-learning-based image compression algorithms suffer from complex architectures and a heavy computational load. To speed up encoding and decoding, a method was proposed that uses knowledge distillation to reduce the number of parameters and multiply-accumulate operations (MACs) of the original network while preserving the quality of the compressed images as much as possible. The original network and a lightweight network were trained simultaneously, and the performance of the lightweight network was improved by transferring feature information from the original network to it. In the design of the lightweight network, group convolution was introduced alongside a reduction in the number of channels, so as to retain more feature information while cutting the parameter count and MACs as far as possible. Experiments on the Kodak and DIV2K test datasets showed that, compared with the original network, the lightweight network obtained by knowledge distillation reduced the parameters and MACs to approximately one-sixteenth of the original while still maintaining good image quality.
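To make the two techniques in the abstract concrete, below is a minimal PyTorch sketch (not the authors' code) of feature-level knowledge distillation from a heavy transform to a lightweight one built with fewer channels and group convolution. All layer widths, kernel sizes, the 1x1 projection used to match feature widths, and the frozen teacher are illustrative assumptions (the paper trains both networks simultaneously); the parameter arithmetic only shows one plausible way a roughly 1/16 reduction can arise, namely halving the channel count and using four groups.

```python
# Minimal sketch (not the authors' code): feature-level knowledge distillation
# from a heavy analysis transform (teacher) to a lightweight one (student),
# plus parameter/MAC arithmetic showing how halving the channels and using
# 4 groups shrinks a conv layer by 16x. All sizes are illustrative assumptions.
import torch
import torch.nn as nn
import torch.nn.functional as F

def conv_params_and_macs(c_in, c_out, k, out_h, out_w, groups=1):
    """Parameters and MACs of one conv layer with an out_h x out_w output."""
    params = (c_in // groups) * c_out * k * k
    return params, params * out_h * out_w

# Standard conv vs. a lightweight conv with half the channels and 4 groups:
# params scale as c_in * c_out / groups, so (1/2) * (1/2) * (1/4) = 1/16.
p_t, _ = conv_params_and_macs(192, 192, 5, 64, 64)
p_s, _ = conv_params_and_macs(96, 96, 5, 64, 64, groups=4)
print(f"conv params: teacher {p_t}, student {p_s}, ratio {p_t / p_s:.0f}x")

teacher = nn.Sequential(            # toy stand-in for the original transform
    nn.Conv2d(3, 192, 5, stride=2, padding=2),
    nn.ReLU(inplace=True),
    nn.Conv2d(192, 192, 5, stride=2, padding=2),
)

class Student(nn.Module):
    """Toy lightweight transform: fewer channels plus group convolution."""
    def __init__(self):
        super().__init__()
        self.body = nn.Sequential(
            nn.Conv2d(3, 96, 5, stride=2, padding=2),
            nn.ReLU(inplace=True),
            nn.Conv2d(96, 96, 5, stride=2, padding=2, groups=4),
        )
        # 1x1 projection so student features match the teacher's 192 channels.
        self.proj = nn.Conv2d(96, 192, 1)

    def forward(self, x):
        return self.proj(self.body(x))

student = Student()
x = torch.randn(2, 3, 256, 256)
with torch.no_grad():               # frozen here only to keep the sketch short;
    f_teacher = teacher(x)          # the paper trains both networks jointly
f_student = student(x)

# Distillation term: pull student features toward teacher features. In a full
# codec this would be added to the rate-distortion training objective.
distill_loss = F.mse_loss(f_student, f_teacher)
distill_loss.backward()
```

A grouped convolution with g groups connects each output channel to only 1/g of the input channels, so its parameter count and MACs scale as (c_in * c_out * k^2) / g; combined with halving both channel counts, this yields the 4 x 4 = 16-fold reduction computed above.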

Key words: image compression, deep learning, knowledge distillation
