Vector space explorations of literary language

van Cranenburgh Andreas; Van Dalen-Oskam Karina; van Zundert Joris

首页> 外文期刊>Language Resources and Evaluation >Vector space explorations of literary language

【24h】

Vector space explorations of literary language

机译：文学语言的向量空间探索

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Literary novels are said to distinguish themselves from other novels through conventions associated with literariness. We investigate the task of predicting the literariness of novels as perceived by readers, based on a large reader survey of contemporary Dutch novels. Previous research showed that ratings of literariness are predictable from texts to a substantial extent using machine learning, suggesting that it may be possible to explain the consensus among readers on which novels are literary as a consensus on the kind of writing style that characterizes literature. Although we have not yet collected human judgments to establish the influence of writing style directly (we use a survey with judgments based on the titles of novels), we can try to analyze the behavior of machine learning models on particular text fragments as a proxy for human judgments. In order to explore aspects of the texts associated with literariness, we divide the texts of the novels in chunks of 2-3 pages and create vector space representations using topic models (Latent Dirichlet Allocation) and neural document embeddings (Distributed Bag-of-Words Paragraph Vectors). We analyze the semantic complexity of the novels using distance measures, supporting the notion that literariness can be partly explained as a deviation from the norm. Furthermore, we build predictive models and identify specific keywords and stylistic markers related to literariness. While genre plays a role, we find that the greater part of factors affecting judgments of literariness are explicable in bag-of-words terms,even in short text fragments and among novels with higher literary ratings. The code and notebook used to produce the results in this paper are available at https://github.com/andreasvc/litvecspace..

机译：据说文学小说通过与文学性相关的惯例将自己与其他小说区分开。我们根据对当代荷兰小说的大型读者调查，调查了预测读者感知小说文学性的任务。先前的研究表明，使用机器学习可以从文本上很大程度上预测文学水平，这表明有可能将读者对于哪些小说是文学的共识解释为对代表文学特征的写作风格的共识。尽管我们尚未收集人的判断来直接确定写作风格的影响（我们使用基于小说标题的判断进行调查），但我们可以尝试分析特定文本片段上的机器学习模型的行为作为代理。人的判断。为了探索与文学相关的文本方面，我们将小说的文本分成2-3页，并使用主题模型（潜在狄利克雷分配）和神经文档嵌入（分布式词袋）创建矢量空间表示形式段落向量）。我们使用距离量度来分析小说的语义复杂性，支持以下观点：文学性可以部分解释为与规范的偏离。此外，我们建立了预测模型，并确定了与识字相关的特定关键字和风格标记。尽管体裁发挥了作用，但我们发现影响文学水平判断的大部分因素都可以用词袋解释，即使是短文本片段和文学评价较高的小说也是如此。 https://github.com/andreasvc/litvecspace上提供了用于产生本文结果的代码和笔记本。

著录项

来源
《Language Resources and Evaluation》 |2019年第4期|625-650|共26页
作者
van Cranenburgh Andreas; Van Dalen-Oskam Karina; van Zundert Joris;
展开▼
作者单位

Univ Groningen Informat Sci Groningen Netherlands;

Royal Netherlands Acad Arts & Sci Huygens ING Amsterdam Netherlands|Univ Amsterdam Amsterdam Netherlands;

Royal Netherlands Acad Arts & Sci Huygens ING Amsterdam Netherlands;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Literature; Literariness; Document embeddings; Topic models;

机译：文献;文学性文件嵌入;主题模型;

相似文献

外文文献
中文文献
专利

1. An Exploration of the Canon of Hausa Prose Fiction in Hausa Language and Translation: The Literary Contest of 1933 as a Historical Reference [J] . Chaibou Elhadji Oumarou Advances in Literary Study . 2017,第1期

机译：豪萨语和翻译中豪萨散文小说佳能的探索：以1933年文学大赛为历史参照
2. REFORMING URBAN SPATIAL MORPHOLOGY WITHIN SOCIAL SUSTAINABILITY: AN EXPLORATION OF PATTERN LANGUAGE OF MIDDLE EASTERN OPEN SPACES: SPACE OF BEIN EL-QASREEN, CASE STUDY IN CAIRO [J] . G. MOHAMMED, K. THWAITES International journal of sustainable development and planning . 2011,第4期

机译：在社会可持续性范围内改革城市空间形态：探索中东开放空间的模式语言：贝因·卡塞琳空间，在开罗进行案例研究
3. Contra-Thermodynamic, Photocatalytic E - Z Isomerization of Styrenyl Boron Species: Vectors to Facilitate Exploration of Two-Dimensional Chemical Space [J] . Molloy John J., Metternich Jan B., Daniliuc Constantin G., Angewandte Chemie . 2018,第12期

机译：反对热力学，光催化E - ＆苯乙烯基硼种类的Z异构化：促进探索二维化学空间的载体
4. SPACE EXPLORATION SYMPOSIUM (A3) Mars Exploration - Part 3 (3C):CONCEPTUALIZATION OF DESIGN MODIFICATIONS IN RE-ENTRY VEHICLES - VECTORING FOR REDIRECTION OF PLASMA [C] . Srikanth Raviprasad, Chrishma Singh-Derewa, Poonampreet Kaur Josan International Astronautical Congress . 2014

机译：太空勘探研讨会（A3）火星勘探 - 第3部分（3C）：重新入境车辆设计修改的概念化 - 血浆重定向矢量
5. An exploration of the word2vec algorithm: Creating a vector representation of a language vocabulary that encodes meaning and usage patterns in the vector space structure [D] . Le, Thu Anh. 2016

机译：word2vec算法的探索：创建语言词汇的矢量表示，该矢量表示编码矢量空间结构中的含义和用法模式
6. The Variable Vector Countermeasure Suit (V2Suit) for space habitation and exploration [O] . Kevin R. Duda, Rebecca A. Vasquez, Akil J. Middleton, 2015

机译：用于空间居住和探索的可变矢量对抗服（V2Suit）
7. Vector space explorations of literary language [O] . Andreas van Cranenburgh, Karina van Dalen-Oskam, Joris van Zundert 2019

机译：矢量空间探索文学语言

Vector space explorations of literary language

摘要

著录项

相似文献

相关主题

期刊订阅