Another view of sequential sampling in the birth process with immigration

da Silva, Poly H.; Jamshidpey, Arash (ORCID:0000000239391884); Tavaré, Simon

doi:10.1007/s00285-023-02041-0

Abstract

We explore properties of the family sizes arising in a linear birth process with immigration (BI). In particular, we study the correlation of the number of families observed during consecutive disjoint intervals of time. LettingS(a, b) be the number of families observed in (a, b), we study the expected sample variance and its asymptotics forpconsecutive sequential samples$$S_p =(S(t_0,t_1),\dots , S(t_{p-1},t_p))$$ $S_{p} = (S (t_{0}, t_{1}), \dots, S (t_{p - 1}, t_{p}))$ , for$$0=t_0 $0 = t_{0} < t_{1} < \dots < t_{p}$ . By conditioning on the sizes of the samples, we provide a connection between$$S_p$$ $S_{p}$ andpsequential samples of sizes$$n_1,n_2,\dots ,n_p$$ $n_{1}, n_{2}, \dots, n_{p}$ , drawn from a single run of a Chinese Restaurant Process. Properties of the latter were studied in da Silva et al. (Bernoulli 29:1166–1194, 2023.https://doi.org/10.3150/22-BEJ1494). We show how the continuous-time framework helps to make asymptotic calculations easier than its discrete-time counterpart. As an application, for a specific choice of$$t_1,t_2,\dots , t_p$$ $t_{1}, t_{2}, \dots, t_{p}$ , where the lengths of intervals are logarithmically equal, we revisit Fisher’s 1943 multi-sampling problem and give another explanation of what Fisher’s model could have meant in the world of sequential samples drawn from a BI process.

More Like this