基于Takenaka-Malmquist系的语音信号压缩与降噪方法

雷娅, 方勇, 张立明

doi:10.12066/j.issn.1007-2861.1996

上海大学学报(自然科学版) >

2020 , Vol. 26 >Issue 1: 33 - 46

DOI: https://doi.org/10.12066/j.issn.1007-2861.1996

研究论文

基于Takenaka-Malmquist系的语音信号压缩与降噪方法

展开

^1. 上海先进通信与数据科学研究院, 上海 200444
^2. 上海大学特种光纤与光接入网重点实验室, 上海 200444
^3. 澳门大学科学技术学院, 澳门 999078

收稿日期: 2018-01-11

网络出版日期: 2020-03-22

基金资助

国家自然科学基金资助项目(61271213);国家自然科学基金资助项目(61673253);上海市科委重点支撑项目(16010500100);澳门大学科技局资助项目(MYRG 2014-00009-FST);澳门大学科技局资助项目(2016-00053-FST)

收起

Research on compression and denoising of speech signal based on the Takenaka-Malmquist system

Expand

^1. Shanghai Institute for Advanced Communication and Data Science, Shanghai 200444, China
^2. Key Laboratory of Specialty Fiber Optics and Optical Access Networks, Shanghai University, Shanghai 200444, China
^3. Faculty of Technology and Science, University of Macau, Macau 999078, China

Received date: 2018-01-11

Online published: 2020-03-22

Fold

摘要

语音信号的稀疏表示是语音压缩与降噪等语音处理的关键技术之一. 在匹配追踪(matching pursuit, MP)、正交匹配追踪(orthogonal matching pursuit, OMP)等算法的基础上, 提出了一种基于Takenaka-Malmquist系的贪婪权值算法(a greedy weight algorithm based on the Takenaka-Malmquist system, TMGW). 采用TMGW对语音信号进行重构时只需要较少的分解项数, 从而达到语音压缩的目的. 同时, 根据稀疏分解后信号与噪声在时频面上能量分布不同的特点, 该算法可实现对含噪语音的降噪. 实验结果表明, TMGW比基于自适应Gabor子字典的匹配追踪算法(matching pursuit algorithm based on the adaptive Gabor sub-dictionary, GMP)更适用于语音信号的稀疏表示.

关键词： 基于Takenaka-Malmquist系的贪婪权值算法; 稀疏表示; 语音压缩; 语音降噪

本文引用格式

雷娅, 方勇, 张立明 . 基于Takenaka-Malmquist系的语音信号压缩与降噪方法[J]. 上海大学学报(自然科学版), 2020 , 26(1) : 33 -46 . DOI: 10.12066/j.issn.1007-2861.1996

Abstract

The sparse representation of speech signal is one of the important research directions in speech compression, denoising and other speech processing. On the basis of matching pursuit (MP), orthogonal matching pursuit (OMP) and other greedy algorithms, this paper proposes a greedy weight algorithm based on Takenaka-Malmquist system (TMGW) for the compression of speech signal. This algorithm has the advantage of requiring only fewer decomposition numbers when reconstructing the speech signal, and it does well for achieving the goal of speech compression. Besides, in view of the fact that energy distribution between the signal and noise at time-frequency surface after sparse decomposition is different, this algorithm can realize the purpose of denoising. The experiment results show that the TMGW algorithm is more effective for the sparse representation of speech signal than the matching pursuit algorithm based on the adaptive Gabor sub-dictionary (GMP).

Key words： greedy weight algorithm based on the Takenaka-Malmquist system (TMGW); sparse representation; voice compression; voice denoising

参考文献

[1]	曹洁 . 基于DSP的语音信号滤波技术研究[D]. 兰州: 兰州理工大学, 2011: 1-3.
[1]	Cao J . Research on DSP based voice signal filtering technology[D]. Lanzhou: Lanzhou University of Technology, 2011: 1-3.
[2]	郭金库 . 信号稀疏表示理论及其应用 [M]. 北京: 科学出版社, 2013: 20-53.
[3]	Rudin W . Function theory in the unit ball of ${C}^N$[M]. Berlin: Springer, 1980: 1-50.
[4]	Martínez-Avenda?o R A, Rosenthal P . An introduction to operators on the Hardy-Hilbert space[M]. Berlin: Springer, 2006: 1-34.
[5]	吴勃英, 林迎珍 . 应用型再生核空间 [M]. 北京: 科学出版社, 2012: 50-100.
[6]	Qian T, Wang Y B . Adaptive Fourier series: a variation of greedy algorithm[J]. Advances in Computational Mathematics, 2011,34(3):279-293.
[7]	Qian T . Adaptive Fourier decompositions and rational approximations, part Ⅰ: theory[J]. International Journal of Wavelets Multiresolution and Information Processing, 2014,12(5):54-67.
[8]	钱涛 . 自适应Fourier变换: 一个贯穿复几何, 调和分析及信号分析的数学方法 [M]. 北京: 科学出版社, 2015: 1-200.
[9]	Zhang L M, Hong W, Mai W X , et al. Adaptive Fourier decomposition and rational approximation, part Ⅱ: software system design and development[J]. International Journal of Wavelets Multiresolution and Information Processing, 2014,12(5):413-423.
[10]	Mallat S G, Zhang Z . Matching pursuits with time-frequency dictionaries[J]. IEEE Transactions on Signal Processing, 1993,41(12):3397-3415.
[11]	Chang L H, Wu J Y . An improved RIP-based performance guarantee for sparse signal recovery via orthogonal matching pursuit[J]. IEEE Transactions on Information Theory, 2014,60(9):405-408.
[12]	Qian T, Zhang L M, Li Z X . Algorithm of adaptive Fourier decomposition[J]. IEEE Transactions on Signal Processing, 2011,59(12):5899-5906.
[13]	He C, Zhang L M, He X J , et al. A new image decomposition and reconstruction approach: adaptive Fourier decomposition [C]// International Conference on Multimedia Modeling. 2015: 227-236.
[14]	Wang Z, Wan F, Wong C M , et al. Adaptive Fourier decomposition based ECG denoising[J]. Computers in Biology and Medicine, 2016,77:195-205.
[15]	Zhang L M . Adaptive Fourier decomposition based time-frequency analysis[J]. Journal of Electronic Science and Technology, 2014,12(2):201-205.
[16]	陈丹艳 . 自适应啸叫抑制算法的研究与DSP实现[D]. 成都: 电子科技大学, 2014: 18-19.
[16]	Chen D Y . Research on adaptive squeal suppression algorithm and DSP implementation[D]. Chengdu: University of Electronic Science and Technology, 2014: 18-19.
[17]	Kobayashi Y, Kondo K . Speech intelligibility estimation using support vector regression and critical band segmental SNR in noisy condition[J]. IEEJ Transactions on Electronics, Information and Systems, 2013,133(8):1556-1564.
[18]	Gao Y, Ku M, Qian T , et al. FFT formulations of adaptive Fourier decomposition[J]. Journal of Computational and Applied Mathematics, 2017,324:204-215.
[19]	尹忠科, 邵君, Vandergheynst P . 利用FFT实现基于MP的信号稀疏分解[J]. 电子与信息学报, 2006,28(4):614-618.
[19]	Yin Z K, Shao J, Vandergheynst P . Signal sparse decomposition based on MP using FFT[J]. Journal of Electronics and Information, 2006,28(4):614-618.
[20]	尹忠科, 王建英, 邵君 . 基于原子库结构特性的信号稀疏分解[J]. 西南交通大学学报, 2005,40(2):173-178.
[20]	Yin Z K, Wang J Y, Shao J . Signal sparse decomposition based on the structural characteristics of atomic database[J]. Journal of Southwest Jiaotong University, 2005,40(2):173-178.
[21]	Cen Y G, Wang F F, Zhao R Z , et al. Tree-based backtracking orthogonal matching pursuit for sparse signal reconstruction[J]. Journal of Applied Mathematics, 2013,2013:1-8.
[22]	王军华, 方勇 . 基于Curvelet域自适应数学形态学降噪的含噪图像盲分离方法[J]. 上海大学学报(自然科学版), 2010,16(4):336-341.
[22]	Wang J H, Fang Y . A blind image separation method based on adaptive mathematical morphology noise reduction in curvelet domain[J]. Journal of Shanghai University (Natural Science Edition), 2010,16(4):336-341.

Options

文章导航

模态框（Modal）标题

摘要

本文引用格式

Abstract

参考文献