Recognition of the Script in Serbian Documents Using Frequency Occurrence and Co-Occurrence Analysis

机译：使用频率出现和共现分析来识别塞尔维亚文档中的脚本

代理获取

本网站仅为用户提供外文OA文献查询和代理获取服务，本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文，但由于OA文献来源多样且变更频繁，仍可能出现获取不到、文献不完整或与标题不符等情况，如果获取不到我们将提供退款服务。请知悉。

页面导航

摘要
著录项
相似文献
相关主题

摘要

Any document in Serbian language can be written in two different scripts: Latin or Cyrillic. Although characteristics of these scripts are similar, some of their statistical measures are quite different. The paper proposed a method for the extraction of certain script from document according to the occurrence and co-occurrence of the script types. First, each letter is modeled with the certain script type according to characteristics concerning its position in baseline area. Then, the frequency analysis of the script types occurrence is performed. Due to diversity of Latin and Cyrillic script, the occurrence of modeled letters shows substantial statistics dissimilarity. Furthermore, the co-occurrence matrix is computed. The analysis of the co-occurrence matrix draws a strong margin as a criteria to distinguish and recognize the certain script. The proposed method is analyzed on the case of a database which includes different types of printed and web documents. The experiments gave encouraging results.

机译：塞尔维亚语的任何文档都可以用两种不同的脚本编写：拉丁文或西里尔文。尽管这些脚本的特征相似，但是它们的某些统计量却大不相同。提出了一种根据脚本类型的发生和共现从文档中提取特定脚本的方法。首先，根据字母在基线区域中的位置特征，使用特定的脚本类型对每个字母建模。然后，执行脚本类型发生的频率分析。由于拉丁文和西里尔文文字的多样性，模型字母的出现显示出统计学上的巨大差异。此外，计算共现矩阵。对共现矩阵的分析得出了很大的余量作为区分和识别特定脚本的标准。在包含不同类型的打印文档和Web文档的数据库的情况下分析了所提出的方法。实验给出了令人鼓舞的结果。

著录项

期刊名称 The Scientific World Journal
作者
Darko Brodić; Zoran N. Milivojević; Čedomir A. Maluckov;
展开▼
作者单位

展开▼
年(卷),期 2013(2013),-1
年度 2013
页码 896328
总页数 14
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Recognition of the Script in Serbian Documents Using Frequency Occurrence and Co-Occurrence Analysis [J] . DarkoBrodi?, Zoran N.Milivojevi?, ?edomir A.Maluckov ScientificWorldJournal . 2013,第3期

机译：使用频率发生和共同发生分析识别塞尔维亚文档中的脚本
2. Wavelet based co-occurrence histogram features for texture classification with an application to script identification in a document image [J] . P.S. Hiremath S. Shivashankar Pattern recognition letters . 2008,第9期

机译：基于小波的共现直方图特征，用于纹理分类以及在文档图像中脚本识别的应用
3. Analysis of pathogen co-occurrence in host-seeking adult hard ticks from Serbia. [J] . Tomanovic S., Chochlakis D., Radulovic Z., Experimental & applied acarology . 2013,第3期

机译：分析来自塞尔维亚的寄主寻求成年硬tick的病原体同时存在。
4. Enriching Document Representation with the Deviations of Word Co-occurrence Frequencies [C] . Yang Wei, Jinmao Wei, Zhenglu Yang International conference on algorithms and architectures for parallel processing . 2015

机译：利用单词共现频率的偏差丰富文档表示
5. PATTERNS OF OCCURRENCE AND CO-OCCURRENCE FOR SWIFT FOX (VULPES VELOX), WESTERN BURROWING OWL (ATHENE CUNICULARIA HYPUGAEA), AND MOUNTAIN PLOVER (CHARADRIUS MONTANUS) ON BLACK-TAILED PRAIRIE DOG (CYNOMYS LUDOVICIANUS) COLONIES: A TREND DATA SUMMARY AND A HIERARCHICAL OCCUPANCY ANALYSIS [D] . Parker, Ryan Andrew. 2019

机译：Swift Fox（Vulpees Velox），西部挖洞猫头鹰（八烯丝般的vervaea）和山地珩科鸟（Charadrius Montanus）上的山峰（Cynomys Ludovicianus）殖民地的殖民地概况和分层占用分析
6. Leveraging output term co-occurrence frequencies and latent associations in predicting medical subject headings [O] . Ramakanth Kavuluru, Yuan Lu -1

机译：利用输出项的共现频率和潜在关联来预测医学主题
7. Analysis of the Co-occurrence of Accupoints and Pathologies Documented in the Classical Acupuncture Literature [O] . Oh Junho 2015

机译：古典针灸文献中记载的准确点和病理并存的分析

Recognition of the Script in Serbian Documents Using Frequency Occurrence and Co-Occurrence Analysis

摘要

著录项

相似文献

相关主题

期刊订阅