作者其他论文
文献详情
Multi-scale TextTiling for automatic story segmentation in Chinese broadcast news
文献类型:会议
作者:Xie, Lei[1]  Zeng, Jia[2]  Feng, Wei[3]  
机构:NW Polytech Univ, Audio Speech & Language Proc Grp, Sch Comp Sci, Xian 710072, Peoples R China.
年:2008
通讯作者:Xie, L (reprint author), NW Polytech Univ, Audio Speech & Language Proc Grp, Sch Comp Sci, Xian 710072, Peoples R China.
会议名称:INFORMATION RETRIEVAL TECHNOLOGY
页码范围:345-355
收录情况:CPCI-S(WOS:000256869500033)  
所属部门:计算机学院
人气指数:761
浏览次数:742
被引频次:10
语言:外文
关键词:story segmentation; topic segmentation; spoken document segmentation; TextTiling; multi-scale fusion; spoken document retrieval; multimedia retrieval
摘要:
This paper applies Chinese subword representations, namely character and syllable n-grams, into the TextTiling-based automatic story segmentation of Chinese broadcast news. We show the robustness of Chinese subwords against speech recognition errors, out-of-vocabulary (OOV) words and versatility in word segmentation in lexical matching on errorful Chinese speech recognition transcripts. We propose a multi-scale TextTiling approach that integrates both the specificity of words and the robustness ...More
0
评论(0 条评论)
登录