首页>
外国专利>
System and Method for Using an Exemplar Document to Retrieve Relevant Documents from an Inverted Index of a Large Corpus
System and Method for Using an Exemplar Document to Retrieve Relevant Documents from an Inverted Index of a Large Corpus
展开▼
机译:使用示例文档从大语料库的倒排索引中检索相关文档的系统和方法
展开▼
页面导航
摘要
著录项
相似文献
摘要
A system and method for using an exemplar document or search query to retrieve relevant documents from an inverted index of a large corpus of documents. The system and method groups words by synonym and calculates term frequency (TF) and inverse document frequency (IDF) scores for the respective word groups. A composite term frequency-inverse document frequency (TF-IDF) score is calculated for each word group and the documents of the corpus are ranked based on the TF-IDF scores, utilizing a vector space model incorporating a cosine similarity function.
展开▼