Adaptive threshold method for real-time audio segmentation

Li Chao; Xiong Zhang; Xue Ling; Liu Yun

Volume 31 Issue 12

Dec. 2005

Turn off MathJax

Article Contents

Journal of Beijing University of Aeronautics and Astronautics > 2005 > 31(12): 1317-1321.

Li Chao, Xiong Zhang, Xue Ling, et al. Adaptive threshold method for real-time audio segmentation[J]. Journal of Beijing University of Aeronautics and Astronautics, 2005, 31(12): 1317-1321. (in Chinese)

Citation:

Li Chao, Xiong Zhang, Xue Ling, et al. Adaptive threshold method for real-time audio segmentation[J]. Journal of Beijing University of Aeronautics and Astronautics, 2005, 31(12): 1317-1321. (in Chinese)

Citation:

PDF( 344 KB)

Adaptive threshold method for real-time audio segmentation

School of Computer Science and Technology, Beijing University of Aeronautics and Astronautics, Beijing 100083, China

Received Date: 22 Sep 2004
Publish Date: 31 Dec 2005

Abstract

Abstract

Content-based audio analysis has become an interesting direction for many researchers. Deep analysis on audio signal segmentation was reviewed. Conventionally, automatic segmentation can be implemented by calculating some audio features like short-term energy, amplitude, fundamental frequency or others, in time-domain or frequency-domain, via referencing to several constant thresholds established in advance. But these methods were found lack of reliability in such applications, because of the complexity of real-time audio signals, together with the fluky changing of environment and various models of acquiring devices. An adaptive threshold adjusting method based on background learning was introduced. On condition of real-time environment, a so-called environment factor was computed iteratively through background learning, and then it was used as a measure to control the fluctuating of real thresholds. To make a balance between efficiency and precision, a state table was introduced to help judging on the types of audio clips. Validity of the methods was proved by a group of experiments.
- real-time,
- adaptivity,
- audio,
- segmentation,
- background

FullText(HTML)

References(1)

References

[1] Subramanya S, Abdou Y. Segmentation of audio data based on the binary images of the audio samples . Proc of Inter Conference on Intelligent Systems . Denver:IEEE, 1999 [2] Thomas K, Michael S, Martin W, et al. Strategies for automatic segmentation of audio data . Proc of ICASSP . Istanbul:IEEE, 2000 [3] Foote J. Automatic audio segmentation using a measure of audio novelty . Proc of ICME 2000 . NY:IEEE, 2000. 452~455 [4] 孙文彦,熊璋,李超,等. 语音信号实时传输中的动态变长分帧算法 . 通信学报,2001,22(7):80~86 Sun Wenyan, Xiong Zhang, Li Chao, et al. An dynamic variable length packetization algorithm in real-time speech transmission[J]. Journal of China Institute of Communications, 2001, 22(7):80~86(in Chinese) [5] 卢坚,毛兵,孙正兴,等.一种改进的基于说话者的语音分割算法[J].软件学报,2002,13(2):274~279 Lu Jian, Mao Bing, Sun Zhengxing, et al. An improved speaker based speech segmentation algorithm[J]. Journal of Software, 2002, 13(2):274~279(in Chinese) [6] Robert T, Alan J, Takeo K. A system for video surveillance and monitoring . CMU-RI-TR-00-12, 2000 [7] George T, Perry C. Multi-feature audio segmentation for browsing and annotation . Proc of IEEE Workshop on Applications of Signal Processing to Audio and Acoustics . Mohonk:IEEE, 1999

Relative Articles

Supplements(0)

Cited By

Proportional views

Proportional views

通讯作者: 陈斌, bchen63@163.com

1.
沈阳化工大学材料科学与工程学院沈阳 110142

Get Citation

PDF

XML

Article Metrics

Article views(3574) PDF downloads(22)

Adaptive threshold method for real-time audio segmentation

Abstract

References

Proportional views

Catalog

通讯作者: 陈斌, bchen63@163.com

Article Metrics

Proportional views

Related

Adaptive threshold method for real-time audio segmentation

Abstract

References

Proportional views

Catalog

通讯作者: 陈斌, bchen63@163.com

Article Metrics

Proportional views

Related

Export File

Citation

Format

Content