



周坤 陈文杰 陈伟海 林岩 孙先涛

Citation: ZHOU K, CHEN W J, CHEN W H, et al. Extended subtraction speech enhancement based on cubic spline interpolation[J]. Journal of Beijing University of Aeronautics and Astronautics, 2023, 49(10): 2826-2834 (in Chinese). doi: 10.13700/j.bh.1001-5965.2021.0744


doi: 10.13700/j.bh.1001-5965.2021.0744
Funds: National Natural Science Foundation of China (51975002)


  • CLC number: TN912.35

Extended subtraction speech enhancement based on cubic spline interpolation

  • Abstract:



  • Figure 1.  Comparison of interpolation methods

    Figure 2.  Block diagram of the proposed algorithm

    Figure 3.  Comparison between different orders and the optimal order

    Figure 4.  Time-domain waveform and spectrogram of the speech signal (SNR = −5 dB, noise: white)

    Figure 5.  Time-domain waveform and spectrogram of the ESS algorithm

    Figure 6.  STOI values in different noise environments

    Table 1.  Speech weighting values

    Noise    $ {\sigma _Y} $    $ {\mu _Y} $    $ {\sigma _D} $    $ {\mu _D} $

    Table 2.  PESQ values in different noise environments

                −5 dB    0 dB     5 dB     10 dB    20 dB    −5 dB    0 dB     5 dB     10 dB    20 dB
    CSI-SS      1.4928   1.7302   2.2403   2.4125   3.0083   1.3764   1.7208   2.1142   2.324    3.0432

    Algorithm   PESQ (volvo)                                 PESQ (white)
                −5 dB    0 dB     5 dB     10 dB    20 dB    −5 dB    0 dB     5 dB     10 dB    20 dB
    GA          2.3785   2.7559   3.1169   3.3298   3.6643   0.369    0.9824   1.467    2.0601   2.6876
    SS          2.6385   2.926    3.2008   3.6607   4.0343   1.1099   1.3514   1.7921   2.284    2.8803

    Note: −5, 0, 5, 10, and 20 dB are signal-to-noise ratios (SNR).
  • Received:  2021-12-12
  • Accepted:  2022-04-05
  • Published online:  2022-04-21
  • Issue published:  2023-10-31


