留言板

尊敬的读者、作者、审稿人, 关于本刊的投稿、审稿、编辑和出版的任何问题, 您可以本页添加留言。我们将尽快给您答复。谢谢您的支持!

姓名
邮箱
手机号码
标题
留言内容
验证码

基于三次样条插值的扩展谱减语音增强算法

周坤 陈文杰 陈伟海 林岩 孙先涛

周坤,陈文杰,陈伟海,等. 基于三次样条插值的扩展谱减语音增强算法[J]. 北京航空航天大学学报,2023,49(10):2826-2834 doi: 10.13700/j.bh.1001-5965.2021.0744
引用本文: 周坤,陈文杰,陈伟海,等. 基于三次样条插值的扩展谱减语音增强算法[J]. 北京航空航天大学学报,2023,49(10):2826-2834 doi: 10.13700/j.bh.1001-5965.2021.0744
ZHOU K,CHEN W J,CHEN W H,et al. Extended subtraction speech enhancement based on cubic spline interpolation[J]. Journal of Beijing University of Aeronautics and Astronautics,2023,49(10):2826-2834 (in Chinese) doi: 10.13700/j.bh.1001-5965.2021.0744
Citation: ZHOU K,CHEN W J,CHEN W H,et al. Extended subtraction speech enhancement based on cubic spline interpolation[J]. Journal of Beijing University of Aeronautics and Astronautics,2023,49(10):2826-2834 (in Chinese) doi: 10.13700/j.bh.1001-5965.2021.0744

基于三次样条插值的扩展谱减语音增强算法

doi: 10.13700/j.bh.1001-5965.2021.0744
基金项目: 国家自然科学基金(51975002)
详细信息
    通讯作者:

    E-mail:wjchen@ahu.edu.cn

  • 中图分类号: TN912.35

Extended subtraction speech enhancement based on cubic spline interpolation

Funds: National Natural Science Foundation of China (51975002)
More Information
  • 摘要:

    语音识别系统易受噪声影响,采用谱减法等传统语音增强算法滤除噪声,但存在“音乐噪声”的困扰。针对该类问题,提出一种基于分数阶傅里叶与三次样条插值的语音增强算法。对语音进行分数阶傅里叶变换,并采用谱减法对含噪语音进行预处理,通过基于维纳滤波的噪声估计器来估计噪声,实现噪声的迭代更新;采用三次样条插值将含噪语音与估计噪声进行函数化;将函数化后的含噪语音与估计噪声进行几何谱减法处理,获得纯净语音。仿真结果显示:与传统的语音增强算法相比,所提算法在改善“音乐噪声”的问题上效果明显,同时也改善了语音可懂性和语音质量。

     

  • 图 1  插值方法比较

    Figure 1.  Comparison of interpolation algorithms

    图 2  本文算法框图

    Figure 2.  The proposed algorithm block diagram

    图 3  不同阶数与最优阶数的比较

    Figure 3.  Comparison between different order and optimal order

    图 4  语音信号时域波形与语谱(SNR=−5 dB,噪声:white)

    Figure 4.  Time domain waveform and speech spectrum of speech signal (SNR=−5dB, noise: white)

    图 5  ESS算法的时域波形与语谱图

    Figure 5.  Time domain waveform and speech spectrum of ESS algorithm

    图 6  不同噪声环境下的STOI值

    Figure 6.  STOI values in different noise environments

    表  1  语音加权权值

    Table  1.   Speech weighted weight

    噪声$ {\sigma _Y} $$ {\mu _Y} $$ {\sigma _D} $$ {\mu _D} $
    white0.990.180.750.22
    pink0.970.440.770.16
    f160.980.590.730.32
    volvo1.000.200.860.40
    下载: 导出CSV

    表  2  不同噪声环境下的PESQ值

    Table  2.   PESQ values in different noise environments

    算法PESQ值(f16)PESQ值(pink)
    −5 dB0 dB5 dB10 dB20 dB−5 dB0 dB5 dB10 dB20 dB
    GA1.01611.63961.88082.20172.89650.89781.49332.03192.43682.9058
    PSC1.38411.59202.02512.41863.14841.15541.46382.00662.48523.1574
    SS1.15671.52681.98742.44153.14961.13051.59662.03462.47933.1836
    CSI-SS1.49281.73022.24032.41253.00831.37641.72082.11422.324 3.0432
    算法PESQ值(volvo) PESQ值(white)
    −5 dB0 dB5 dB10 dB20 dB−5 dB0 dB5 dB10 dB20 dB
    GA2.37852.75593.11693.32983.66430.369 0.98241.467 2.06012.6876
    PSC2.70693.12173.36433.56173.99951.13921.32861.70162.14852.8672
    SS2.63852.926 3.20083.66074.03431.10991.35141.79212.284 2.8803
    CSI-SS2.86243.24123.41023.55423.80111.29851.59021.99992.05272.8752
    注:−5,0,5,10,20 dB表示信噪比。
    下载: 导出CSV
  • [1] WANG L L, HU X, HU J, et al. Research on control system of an exoskeleton upper-limb rehabilitation robot[J]. Journal of Biomedical Engineering, 2016, 33(6): 1168-1175.
    [2] LAVANYA T, NAGARAJAN T, VIJAYALAKSHMI P. Multi-level single-channel speech enhancement using a unified framework for estimating magnitude and phase spectra[J]. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2020, 28: 1315-1327. doi: 10.1109/TASLP.2020.2986877
    [3] DROPPO J. Single channel enhancement for speech recognition [C]//Proceedings of the 2008 Hands-Free Speech Communication and Microphone Arrays. Piscataway: IEEE Press, 2008: 93-97.
    [4] WANG D X, YIN F L, ZHANG H C. Experiment evaluation of microphone array placement for speech enhancement[C]//Proceedings of the 6th International Symposium on Test and Measurement. Dalian: ISTM, 2005: 1583-1586.
    [5] BOLL S F, FPROCESSING S. Suppression of acoustic noise in speech using spectral subtraction[J]. IEEE Transactions on Acoustics, Speech, and Signal Processing, 1979, 27(2): 113-120. doi: 10.1109/TASSP.1979.1163209
    [6] JAMIESON D G, BRENNAN R L, CORNELISSE L E. Evaluation of a speech enhancement strategy with normal-hearing and hearing-impaired listeners[J]. Ear and Hearing, 1995, 16(3): 274-286. doi: 10.1097/00003446-199506000-00004
    [7] FLANAGAN J L, JOHNSTON J D, ZAHN R, et al. Computer-steered microphone arrays for sound transduction in large rooms[J]. The Journal of the Acoustical Society of America, 1985, 78(5): 1508-1518. doi: 10.1121/1.392786
    [8] ZELINSKI R. A microphone array with adaptive post-filtering for noise reduction in reverberant rooms[C]//Proceedings of the International Conference on Acoustics, Speech, and Signal Processing. Piscataway: IEEE Press, 2002: 2578-2581.
    [9] GNANAMANICKAM J, NATARAJAN Y, SRI PREETHAA K R. A hybrid speech enhancement algorithm for voice assistance application[J]. Sensors, 2021, 21(21): 7025. doi: 10.3390/s21217025
    [10] 张晓艳, 张天骐, 葛宛营, 等. 联合深度神经网络和凸优化的单通道语音增强算法[J]. 声学学报, 2021, 46(3): 471-480.

    ZHANG X Y, ZHANG T Q, GE W Y, et al. Monaural speech enhancement combining deep neural network and convex optimation[J]. Acta Acustica, 2021, 46(3): 471-480(in Chinese).
    [11] BEROUTI M, SCHWARTZ R, MAKHOUL J. Enhancement of speech corrupted by acoustic noise[C]//Proceedings of the International Conference on Acoustics, Speech, and Signal Processing. Piscataway: IEEE Press, 2003: 208-211.
    [12] LU Y, LOIZOU P C. A geometric approach to spectral subtraction[J]. Speech Communication, 2008, 50(6): 453-466. doi: 10.1016/j.specom.2008.01.003
    [13] WEI Y, ZENG Y M, LI C. Single-channel speech enhancement based on subband spectral entropy[J]. Journal of the Audio Engineering Society, 2018, 66(3): 100-113. doi: 10.17743/jaes.2018.0003
    [14] SIM B L, TONG Y C, CHANG J S, et al. A parametric formulation of the generalized spectral subtraction method[J]. IEEE Transactions on Speech and Audio Processing, 1998, 6(4): 328-337. doi: 10.1109/89.701361
    [15] MURAKAMI T, ISHIDA Y A. Adaptive filtering for attenuating musical noise caused by spectral subtraction[C]//Proceedings of the 9th International Conference on Spoken Language Processing. Baixas: ISCA, 2006: 1443.
    [16] 宋智威, 熊成林, 黄路, 等. 基于牛顿插值的单相整流器功率前馈无差拍控制[J]. 电网技术, 2018, 42(11): 3623-3629.

    SONG Z W, XIONG C L, HUANG L, et al. Power feedback-forward and deadbeat control of single-phase rectifier based on Newton interpolation[J]. Power System Technology, 2018, 42(11): 3623-3629(in Chinese).
    [17] 牛少彰, 钮心忻, 杨义先, 等. 基于拉格朗日插值公式的数字水印分存算法[J]. 北京邮电大学学报, 2003, 26(3): 8-11.

    NIU S Z, NIU X X, YANG Y X, et al. Digital watermarking sharing algorithm based on Lagrange interpolation formula[J]. Journal of Beijing University of Posts and Telecommunications, 2003, 26(3): 8-11(in Chinese).
    [18] PHUNG V M, NGUYEN V M, PHAN T H. Hermite interpolation on algebraic curves in C2[J]. Indagationes Mathematicae, 2019, 30(5): 874-890. doi: 10.1016/j.indag.2019.07.001
    [19] HUSSAIN M Z, IRSHAD M, SARFRAZ M, et al. Interpolation of discrete time signals using cubic spline function[C]//Processings of the 19th International Conference on Information Visualisation. Piscataway: IEEE Press, 2015: 454-459.
    [20] HWANG S, BYUN J, PARK Y C. Performance comparison evaluation of speech enhancement using various loss functions[J]. The Journal of the Acoustical Society of Korea, 2021, 40(2): 176-182.
    [21] KOLBAEK M, TAN Z H, JENSEN J. On the relationship between short-time objective intelligibility and short-time spectral-amplitude mean-square error for speech enhancement[J]. IEEE-ACM Transactions on Audio, Speech, and Language Processing, 2019, 27(2): 283-295. doi: 10.1109/TASLP.2018.2877909
    [22] SALEEM N, KHATTAK M I, NAWAZ A, et al. Perceptually weighted β-order spectral amplitude Bayesian estimator for phase compensated speech enhancement[J]. Applied Acoustics, 2021, 178: 108007. doi: 10.1016/j.apacoust.2021.108007
  • 加载中
图(6) / 表(2)
计量
  • 文章访问数:  123
  • HTML全文浏览量:  43
  • PDF下载量:  9
  • 被引次数: 0
出版历程
  • 收稿日期:  2021-12-12
  • 录用日期:  2022-04-05
  • 网络出版日期:  2022-04-21
  • 整期出版日期:  2023-10-31

目录

    /

    返回文章
    返回
    常见问答