语音识别中基于最小描述长度准则的决策树动态剪枝算法Decision tree dynamic pruning method based on minimum description length in speech recognition
徐向华;朱杰;郭强;
摘要:
在基于语音学决策树状态聚类时,包含不同数量捆绑状态的决策树对应不同的复杂度。通过研究模型的复杂度对系统性能和说话人自适应的影响,提出一种决策树剪枝方法——基于最小描述长度(Minimum Description Length:MDL)准则的决策树动态剪枝。该方法利用训练充分的决策树作为初始模型,根据自适应语料的数量动态地选择不同复杂度的模型,决策树剪枝时初始模型的合理选择,自适应语料的充分应用以及MDL准则对随机模型和确定性模型的集成,使得所提出的方法与说话人自适应相结合后取得了系统性能明显提高。
关键词:
基金项目:
通讯作者:
Email:
参考文献:
- 1 高升,徐波,黄泰翼.基于决策树的汉语三音子模型.声学 学报,2000;25(6):504-509
- 2 Reichl W, Chou W. Robust decision tree state tying for continuous speech recognition. IEEE Trans. Speech and Audio Processing, 2000; 8(5): 555-566
- 3 Mehta M, Rissanen J, Agrawal R. MDL-based decision tree pruning. KDD: Montreal, Canada. 1995: 216-221
- 4 Zhang Z P, Furui S. MDL-based cluster number decision methods for speaker clustering and MLLR adaptation. ITRW on Adaptation methods for speech recognition, 2001: 41-44
- 5 Shinoda K, Watanabe T. Speaker adaptation with autonomous model complexity control by MDL principle. ICASSP, 1996(2): 717-720
- 6 王作英,李健.汉语连续语音识别的语速自适应算法.声学学 报,2003;28(3):229-234
- 7 Shinoda K, Watanabe T. MDL-based context-dependent sub-word modeling for speech recognition. J. Acoust. Soc. Jpn. (E), 2000; 21(2): 79-86
- 8 Rissanen J. Universal coding, information, prediction, and estimation. IEEE Trans. Inform... Theory, 1984; 30(4): 629-636
- 9 Rabiner L, Juang B H. Fundamentals of speech recognition. Prentice Hall, Englewood Cliffs, 1993
- 10 Yong S J, Odell J J, Woodland P C. Tree-based state tying for high accuracy acoustic modeling. Proc. Hum. Lang. Technol, 1994: 307-312
- 11 Xu X H, Zhu J, Guo Q. FCM BP based parameter clustering method in speech recognition. ICMLC 2004: 3717-3720