...
首页> 外文期刊>IEEE multimedia >Unsupervised Speaker Identification for TV News
【24h】

Unsupervised Speaker Identification for TV News

机译:电视新闻的无监督说话人识别

获取原文
获取原文并翻译 | 示例
           

摘要

Identifying the speakers in TV news would help listeners analyze and understand news content, but doing so in news videos is challenging because new faces often appear. Previous research has identified speakers on pretrained faces for TV shows and movies. Using an unsupervised method, this article proposes labeling speakers using just the available information in the news video without external information. The proposed framework segments the audio by speaker, parses closed captions for speaker names, identifies who is speaking, and performs optical character recognition for speaker names. The framework uses face recognition, face clustering, face landmarking, natural language processing tools, and speaker diarization. Results indicate 63.6 percent accuracy for identifying speakers for CNN News.
机译:识别电视新闻中的讲话者将有助于听众分析和理解新闻内容,但是在新闻视频中这样做是具有挑战性的,因为经常会出现新面孔。以前的研究已经确定了电视节目和电影的受过训练的脸上的说话者。本文建议采用一种无监督的方法,仅使用新闻视频中的可用信息为演讲者添加标签,而无需外部信息。拟议的框架按讲话者对音频进行分段,分析讲话者姓名的隐藏字幕,识别正在讲话的人,并对讲话者姓名进行光学字符识别。该框架使用人脸识别,人脸聚类,人脸地标,自然语言处理工具和说话人区分。结果表明,为CNN新闻识别发言人的准确性为63.6%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号