Including category information as supplements in latent semantic analysis of Hindi documents

Karthik Krishnamurthi; Vijayapal Reddy Panuganti; Vishnu Vardhan Bulusu

首页> 外文期刊>International Journal of Computational Science and Engineering >Including category information as supplements in latent semantic analysis of Hindi documents

【24h】

Including category information as supplements in latent semantic analysis of Hindi documents

机译：包括类别信息作为印地文文件潜在语义分析的补充

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Latent semantic analysis (LSA) is a mathematical model that is used to capture the semantic structure of documents by using the correlations between the textual elements in them. LSA captures the semantic structure very well being independent of external sources of semantics. However, the model's performance increases when it is supplemented with extra information. The work presented in this paper is to modify the model to analyse word correlations in documents by considering the document category information as supplements in the process. This enhancement is called supplemented latent semantic analysis (SLSA). SLSA's performance is empirically evaluated in a document classification application by comparing the accuracies of classification against plain LSA for various term weighting schemes. An increment of 1.14%, 1.30% and 1.63% is observed in the classification accuracies when SLSA is compared with plain LSA for tf, idf and tfidf respectively in the initial term-by-document matrix.

机译：None

著录项

来源
《International Journal of Computational Science and Engineering》 |2017年第2期|共8页
作者
Karthik Krishnamurthi; Vijayapal Reddy Panuganti; Vishnu Vardhan Bulusu;
展开▼
作者单位

Department of Computer Science Christ University;

Department of Computer Science and Engineering Gokaraju Rangaraju Institute of Engineering and Technology;

Department of Computer Science and Engineering JNTUHCEJ;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类计算技术、计算机技术;
关键词
Dimensionality reduction; Document classification; Latent semantic analysis; LSA; Semantic structure; Singular value decomposition;

机译：减少维度分类;潜在语义分析;LSA;语义结构;奇异值分解;

相似文献

外文文献
中文文献
专利

1. Research on multi-document summarization based on latent semantic indexing [J] . QIN Bing, LIU Ting, ZHANG Yu, 哈尔滨工业大学学报（英文版） . 2005,第001期
2. A Two-Stage Feature Selection Method for Text Categorization by Using Category Correlation Degree and Latent Semantic Indexing [J] . WANG Fei, LI Cai-hong, WANG Jing-shan, 上海交通大学学报（英文版） . 2015,第001期
3. Metaphor Analysis Method Based on Latent Semantic Analysis [J] . TAO Ran, WEI Yaping, YANG Tangfeng 东华大学学报（英文版） . 2021,第001期
4. Metaphor Analysis Method Based on Latent Semantic Analysis [J] . 陶然, 卫亚萍, 杨唐峰东华大学学报：英文版 . 2021,第001期
5. Including category information as supplements in latent semantic analysis of Hindi documents [J] . Karthik Krishnamurthi, Vijayapal Reddy Panuganti, Vishnu Vardhan Bulusu International Journal of Computational Science and Engineering . 2017,第1a2期

机译：包括类别信息作为印地文文件潜在语义分析的补充
6. Capturing the semantic structure of documents using summaries in Supplemented Latent Semantic Analysis [J] . KARTHIK KRISHNAMURTHI, VIJAYAPAL REDDY PANUGANTI GRIET, VISHNU VARDHAN BULUSU WSEAS Transactions on Computers . 2015,第Null期

机译：使用补充潜在语义分析中的摘要捕获文档的语义结构
7. Comparison of Latent Semantic Analysis and Probabilistic Latent Semantic Analysis for Documents Clustering [J] . Kuta, Marcin, Kitowski, Computing and informatics . 2015,第3期

机译：文档聚类的潜在语义分析与概率潜在语义分析的比较
8. An Empirical Evaluation of Dimensionality Reduction Using Latent Semantic Analysis on Hindi Text [C] . Krishnamurthi Karthik, Sudi Ravi Kumar, Panuganti Vijayapal Reddy, International Conference on Asian Language Processing . 2013

机译：基于印度语文本潜在语义分析的降维效果实证评估
9. Generalized latent semantic analysis for document representation [D] . Matveeva, Irina 2008

机译：用于文档表示的广义潜在语义分析
10. MOWDOC: A Dataset of Documents From Taking the Measure of Work for Building a Latent Semantic Analysis Space [O] . Kim F. Nimon 2020

机译：mowdoc：从衡量建立潜在语义分析空间的工作的文件数据集
11. Development of a computer system for generating semantic template of a group of documents by using latent semantic analysis [O] . Yuriy Taranenko, Maryna Kabanova 2016

机译：开发用于通过使用潜在语义分析生成一组文档的语义模板的计算机系统
12. Comparison of Human and Latent Semantic Analysis (LSA) Judgements of Pairwise Document Similarities for a News Corpus [R] . Pincombe, B. 2004

机译：新闻语料库中两两文档相似度的人类和潜在语义分析（Lsa）判断的比较

Including category information as supplements in latent semantic analysis of Hindi documents

摘要

著录项

相似文献

相关主题

期刊订阅