European Conference on IR Research

StyleExplorer: A Toolkit for Textual Writing Style Visualization


Abstract

The analysis of textual writing styles is a well-studied problem with ongoing, active research in fields such as authorship attribution, author profiling, text segmentation, and plagiarism detection. While many features have been proposed and shown to be effective for characterizing authors or document types in terms of high-dimensional feature vectors, an intuitive, human-friendly view of the computed data is often lacking. For example, machine learning algorithms can use those features to attribute previously unseen documents to a set of known authors, but a visualization of the most discriminating features is usually not provided. To this end, we present StyleExplorer, a freely available web tool that extracts textual features from documents and visualizes them in multiple variants. Besides analyzing single documents intrinsically, it can also visually compare multiple documents in a single view with respect to selected metrics, making it a valuable analysis tool for various tasks in natural language processing as well as for areas of the humanities that work with and analyze textual data.
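To make the idea of describing documents by style features and comparing them on selected metrics concrete, here is a minimal Python sketch. It is not StyleExplorer's implementation or API: the function names `style_features` and `compare`, the three features chosen (average word length, average sentence length, type-token ratio), and the sample texts are all assumptions made for illustration, and a plain text table stands in for a real visualization.

```python
import re


def style_features(text: str) -> dict[str, float]:
    """Compute three simple style features for one document.

    The feature set is an illustrative assumption, not the set used by StyleExplorer.
    """
    # Naive sentence split on runs of ., ! or ?; empty fragments are dropped.
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    # Naive word tokenization: runs of ASCII letters and apostrophes, lowercased.
    words = re.findall(r"[A-Za-z']+", text.lower())
    if not words or not sentences:
        return {
            "avg_word_length": 0.0,
            "avg_sentence_length": 0.0,
            "type_token_ratio": 0.0,
        }
    return {
        "avg_word_length": sum(len(w) for w in words) / len(words),
        "avg_sentence_length": len(words) / len(sentences),
        "type_token_ratio": len(set(words)) / len(words),
    }


def compare(docs: dict[str, str]) -> dict[str, dict[str, float]]:
    """Map each document name to its feature dictionary."""
    return {name: style_features(text) for name, text in docs.items()}


if __name__ == "__main__":
    # Hypothetical sample documents.
    docs = {
        "doc_a": "The cat sat. The dog ran.",
        "doc_b": "Stylometric analysis characterizes authors through measurable habits.",
    }
    table = compare(docs)
    # Print one row per feature so the documents can be compared side by side.
    for feature in ("avg_word_length", "avg_sentence_length", "type_token_ratio"):
        row = "  ".join(f"{name}={values[feature]:.2f}" for name, values in table.items())
        print(f"{feature:<20} {row}")
```

A real analysis would use a far richer feature set and a proper tokenizer; the sketch only shows the shape of the pipeline, from raw text to per-document metrics that can be placed side by side.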
