首页> 外文期刊>Methodology and computing in applied probability >An Iterative Approach to Determining the Length of the Longest Common Subsequence of Two Strings
【24h】

An Iterative Approach to Determining the Length of the Longest Common Subsequence of Two Strings

机译:确定两个字符串最长公共子序列长度的迭代方法

获取原文
获取原文并翻译 | 示例
           

摘要

This paper concerns the longest common subsequence (LCS) shared by two sequences (or strings) of length N, whose elements are chosen at random from a finite alphabet. The exact distribution and the expected value of the length of the LCS, k say, between two random sequences is still an open problem in applied probability. While the expected value E(N) of the length of the LCS of two random strings is known to lie within certain limits, the exact value of E(N) and the exact distribution are unknown. In this paper, we calculate the length of the LCS for all possible pairs of binary sequences from N = 1 to 14. The length of the LCS and the Hamming distance are represented in color on two all-against-all arrays. An iterative approach is then introduced in which we determine the pairs of sequences whose LCS lengths increased by one upon the addition of one letter to each sequence. The pairs whose score did increase are shown in black and white on an array, which has an interesting fractal-like structure. As the sequence length increases, R(N) (the proportion of sequences whose score increased) approaches the Chvatal-Sankoff constant a_c (the proportionality constant for the linear growth of the expected length of the LCS with sequence length). We show that R(N) is converging more rapidly to a_c than E(N)/N.
机译:本文涉及由长度为N的两个序列(或字符串)共享的最长公共子序列(LCS),其元素是从有限字母中随机选择的。在两个随机序列之间,LCS长度的精确分布和期望值(例如k)在应用概率上仍然是一个未解决的问题。虽然已知两个随机字符串的LCS长度的期望值E(N)在一定范围内,但E(N)的确切值和确切分布是未知的。在本文中,我们计算了从N = 1到14的所有可能的二进制序列对的LCS长度。LCS的长度和汉明距离在两个相对于所有数组上以彩色表示。然后引入一种迭代方法,其中我们确定在每个序列上增加一个字母后,其LCS长度增加一的序列对。分数确实增加的线对在阵列上以黑白显示,具有有趣的分形结构。随着序列长度的增加,R(N)(分数增加的序列比例)接近Chvatal-Sankoff常数a_c(LCS预期长度与序列长度线性增长的比例常数)。我们证明,R(N)比E(N)/ N收敛到a_c更快。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号