作者其他论文
文献详情
Broadcast news story segmentation using conditional random fields and multimodal features
文献类型:会议
作者:Wang, Xiaoxuan[1]  Xie, Lei[2]  Lu, Mimi[3]  Ma, Bin[4]  Chng, Eng Siong[5]  Li, Haizhou[6]  
机构:[1]School of Computer Science, Northwestern Polytechnical University, China
[2]School of Computer Science, Northwestern Polytechnical University, China
[3]Institute for Infocomm Research, Singapore
[4]Institute for Infocomm Research, Singapore
[5]School of Computer Engineering, Nanyang Technological University, Singapore
[6]Institute for Infocomm Research, Singapore
年:2012
通讯作者:Wang, X.(xwang@nwpu-aslp.org)
页码范围:1206-1215
收录情况:EI(20121915006535)  
所属部门:计算机学院
人气指数:1630
浏览次数:1610
语言:外文
摘要:
In this paper, we propose integration of multimodal features using conditional random fields (CRFs) for the segmentation of broadcast news stories. We study story boundary cues from lexical, audio and video modalities, where lexical features consist of lexical similarity, chain strength and overall cohesiveness; acoustic features involve pause duration, pitch, speaker change and audio event type; and visual features contain shot boundaries, anchor faces and news title captions. These features ar ...More
0
评论(0 条评论)
登录