🤖 AI Summary
This paper investigates the asymptotic behavior of the expected length of the longest common subsequence (LCS) of $d$ independent, uniformly random strings of length $n$ over an alphabet of size $k$, focusing on the generalized Chvátal–Sankoff constant $gamma_{k,d} = lim_{n oinfty} mathbb{E}[mathrm{LCS}(X^{(1)},dots,X^{(d)})]/n$. Using probabilistic analysis, extremal combinatorics, concentration inequalities, and asymptotic expansions, we derive the first tight asymptotic formula for the binary case: $gamma_{2,d} = frac{1}{2} + Theta(d^{-1/2})$. Furthermore, when $d = Omega(log k)$, we establish nearly optimal upper and lower bounds on $gamma_{k,d}$. These results substantially improve upon prior work—which only established existence and coarse order-of-magnitude estimates—by providing precise, fine-grained asymptotics for the expected LCS length in the multi-string setting.
📝 Abstract
We study the generalized Chv'atal-Sankoff constant $gamma_{k,d}$, which represents the normalized expected length of the longest common subsequence (LCS) of $d$ independent uniformly random strings over an alphabet of size $k$. We derive asymptotically tight bounds for $gamma_{2,d}$, establishing that $gamma_{2,d} = frac{1}{2} + Thetaleft(frac{1}{sqrt{d}}
ight)$. We also derive asymptotically near-optimal bounds on $gamma_{k,d}$ for $dge Omega(log k)$.