The number of distinct adjacent pairs in geometrically distributed wordsArticleAuthors: Margaret Archibald

; Aubrey Blecher

; Charlotte Brennan ; Arnold Knopfmacher ; Stephan Wagner ; Mark Ward
0000-0001-5635-6733##0000-0003-2487-3220##NULL##NULL##NULL##NULL
Margaret Archibald;Aubrey Blecher;Charlotte Brennan;Arnold Knopfmacher;Stephan Wagner;Mark Ward
A sequence of geometric random variables of length $n$ is a sequence of $n$ independent and identically distributed geometric random variables ($\Gamma_1, \Gamma_2, \dots, \Gamma_n$) where $\mathbb{P}(\Gamma_j=i)=pq^{i-1}$ for $1~\leq~j~\leq~n$ with $p+q=1.$ We study the number of distinct adjacent two letter patterns in such sequences. Initially we directly count the number of distinct pairs in words of short length. Because of the rapid growth of the number of word patterns we change our approach to this problem by obtaining an expression for the expected number of distinct pairs in words of length $n$. We also obtain the asymptotics for the expected number as $n \to \infty$.
Comment: 18 pages, 1 table, 1 figure. This is the final version for publication
Volume: vol. 22 no. 4
Section: Analysis of Algorithms
Published on: January 28, 2021
Accepted on: November 10, 2020
Submitted on: August 12, 2019
Keywords: Mathematics - Combinatorics, 05A15, 05A05
Funding:
Source : OpenAIRE Graph- MCTP: Sophomore Transitions: Bridges into a Statistics Major and Big Data Research Experiences via Learning Communities; Funder: National Science Foundation; Code: 1246818
- Emerging Frontiers of Science of Information; Funder: National Science Foundation; Code: 0939370
- Category I: Anvil - A National Composable Advanced Computational Resource for the Future of Science and Engineering; Funder: National Science Foundation; Code: 2005632