Volume 26 Issue 2
Feb.  2000
HAN Jiang, YIN Bao-lin. Model for Speech Recognition Based on Multiple Time Scale Features[J]. Journal of Beijing University of Aeronautics and Astronautics, 2000, 26(2): 201-205. (in Chinese)

Model for Speech Recognition Based on Multiple Time Scale Features

  • Received Date: 28 Oct 1998
  • Publish Date: 29 Feb 2000
  • The proposed model explicitly captures the correlation among successive frames of the speech signal at the segment scale by using segmental features that represent the contours of spectral parameters. Through a non-stationary time series model that depends on these segmental features, it models the correlation between features at different time scales and, via a parametric mean trajectory function, implicitly models the correlation among neighboring frames at the frame scale. A modified Viterbi algorithm based on the joint statistical distance of the multiple time scale features is proposed, together with a training algorithm that estimates the model parameters under the maximum likelihood criterion. Experimental results show that the new model achieves better performance than both the standard HMM and the trended HMM.
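
The idea of a parametric mean trajectory can be illustrated with a small sketch. The abstract does not give the paper's actual trajectory function or its dependence on segmental features, so the code below assumes the simplest related form: a scalar feature whose Gaussian mean is a polynomial in the time elapsed since state entry, as in a trended HMM state. The names `trajectory_mean` and `segment_log_likelihood`, the scalar feature, and the fixed variance are all illustrative assumptions, not the authors' formulation.

```python
import math


def trajectory_mean(coeffs, t):
    """Polynomial mean trajectory: sum over k of coeffs[k] * t**k.

    Assumption: scalar feature, polynomial in frame index t.
    """
    return sum(c * t ** k for k, c in enumerate(coeffs))


def segment_log_likelihood(frames, coeffs, variance):
    """Log-likelihood of a run of scalar frames under a Gaussian whose
    mean follows the trajectory; t counts frames since state entry."""
    ll = 0.0
    for t, x in enumerate(frames):
        resid = x - trajectory_mean(coeffs, t)
        ll += -0.5 * (math.log(2 * math.pi * variance) + resid * resid / variance)
    return ll


# A rising contour is scored under a linear trend and under a constant mean.
frames = [1.0, 1.5, 2.0, 2.5]
ll_trend = segment_log_likelihood(frames, [1.0, 0.5], 1.0)  # mean = 1.0 + 0.5 * t
ll_const = segment_log_likelihood(frames, [1.75], 1.0)      # stationary mean
```

Because the mean moves with time, neighboring frames are tied together through the shared trajectory coefficients. A stationary state has a zeroth-order trajectory, so on a rising contour like this one it scores lower than the linear trend.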

     
