Li Chao, Xiong Zhang, Xue Ling, et al. Adaptive threshold method for real-time audio segmentation[J]. Journal of Beijing University of Aeronautics and Astronautics, 2005, 31(12): 1317-1321. (in Chinese)

Adaptive threshold method for real-time audio segmentation

  • Received Date: 22 Sep 2004
  • Publish Date: 31 Dec 2005
  • Content-based audio analysis has attracted considerable research interest, and this paper examines audio signal segmentation in depth. Conventionally, automatic segmentation computes audio features such as short-term energy, amplitude, or fundamental frequency, in the time or frequency domain, and compares them against several constant thresholds fixed in advance. Such methods prove unreliable in practice because real-time audio signals are complex, the environment changes unpredictably, and acquisition devices vary. An adaptive threshold adjustment method based on background learning is introduced. In a real-time environment, a so-called environment factor is computed iteratively through background learning and then used to control how the actual thresholds fluctuate. To balance efficiency and precision, a state table is introduced to help determine the type of each audio clip. A set of experiments demonstrates the validity of the method.
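
The abstract gives no formulas, so the following is only a minimal sketch of the general idea, not the authors' algorithm. It assumes short-term energy as the feature, an exponential moving average of silent-frame energy standing in for the "environment factor", a threshold set to a fixed multiple of that estimate, and a four-state transition table. The names `segment`, `STATE_TABLE`, `ratio`, and `alpha`, and their default values, are hypothetical.

```python
from enum import Enum


class State(Enum):
    SILENCE = 0
    ONSET = 1    # energy rose above the threshold, not yet confirmed
    ACTIVE = 2
    OFFSET = 3   # energy fell below the threshold, not yet confirmed


# Hypothetical state table: (current state, frame is loud?) -> next state.
STATE_TABLE = {
    (State.SILENCE, False): State.SILENCE,
    (State.SILENCE, True): State.ONSET,
    (State.ONSET, False): State.SILENCE,
    (State.ONSET, True): State.ACTIVE,
    (State.ACTIVE, False): State.OFFSET,
    (State.ACTIVE, True): State.ACTIVE,
    (State.OFFSET, False): State.SILENCE,
    (State.OFFSET, True): State.ACTIVE,
}


def frame_energy(frame):
    """Short-term energy: mean of squared samples in one frame."""
    return sum(s * s for s in frame) / len(frame)


def segment(frames, ratio=4.0, alpha=0.1):
    """Label each frame with a State using a threshold that tracks the background.

    Assumptions (not from the paper): the first frame is background noise,
    the threshold is `ratio` times the background estimate, and the estimate
    is updated only while the state is SILENCE.
    """
    frames = list(frames)
    if not frames:
        return []
    background = frame_energy(frames[0])
    state = State.SILENCE
    labels = []
    for frame in frames:
        energy = frame_energy(frame)
        # Small floor so an all-zero background does not give a zero threshold.
        threshold = ratio * max(background, 1e-12)
        state = STATE_TABLE[(state, energy > threshold)]
        if state is State.SILENCE:
            # Background learning: exponential moving average of silent energy.
            background = (1 - alpha) * background + alpha * energy
        labels.append(state)
    return labels
```

In this sketch the intermediate ONSET and OFFSET states mean a single loud or quiet frame does not flip the segment label immediately, which is one plausible way a state table could trade a frame of latency for fewer spurious boundaries.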

     

