首页>
外国专利>
Ranking importance of symbols in underlying grouped and differentiated files based on content
Ranking importance of symbols in underlying grouped and differentiated files based on content
展开▼
机译:根据内容对基础分组和区分文件中符号的重要性进行排名
展开▼
页面导航
摘要
著录项
相似文献
摘要
Methods and apparatus identify groups of files based on symbols corresponding to an underlying data stream of original bits of data that are determined to be informationally important. The resulting symbols of a selected group are ordered according to how effectively each symbol characterizes the selected group of interest. The subset of symbols is used to find similar files from a general population of files to the files in the group of interest. Additionally, groups of common files can be identified from a general population of files and a group selected therefrom for use in identifying a subset of symbols which characterize the selected group for use as a filter to identify further like files.
展开▼