Models for, loosely, the total population size arising from all generations of the offspring of some progenitor. Alternatively, a type of count model for a Markov stochastic pure-birth branching process.¹
Let us suppose that each individual
The Galton-Watson version of this model considers this by generation; We write $(k)=_{i k} _i $ for the number of people infected in the
The distribution of subcritical processes is sometimes tedious to calculate, although we can get a nice form for the generating function of a geometric offspring distribution cascade process.
Set
This can get us a formula for the first two factorial moments, and hence the vanilla moments and thus mean and variance etc.
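A quick sanity check on the cascade picture, as a minimal simulation sketch. It assumes a single progenitor and geometric offspring on {0, 1, 2, …} with success probability `p`, so the mean offspring count is m = (1 − p)/p, which is subcritical when p > 1/2. The function names are made up for illustration. For a subcritical process the expected total progeny from one progenitor is 1/(1 − m), which the simulation should roughly reproduce.

```python
import math
import random


def sample_geometric(p, rng):
    """Number of failures before the first success; support {0, 1, 2, ...}, mean (1 - p) / p."""
    u = 1.0 - rng.random()  # lies in (0, 1], which avoids log(0)
    return int(math.log(u) / math.log(1.0 - p))


def total_progeny(p, rng, initial=1, max_size=10**6):
    """Total population over all generations, counting the initial individuals.

    max_size is a safety cap so a (super)critical choice of p cannot loop forever.
    """
    total = alive = initial
    while alive > 0 and total < max_size:
        children = sum(sample_geometric(p, rng) for _ in range(alive))
        total += children
        alive = children
    return total


if __name__ == "__main__":
    rng = random.Random(0)
    p = 2.0 / 3.0                      # mean offspring m = 0.5
    m = (1.0 - p) / p
    sizes = [total_progeny(p, rng) for _ in range(20000)]
    print("simulated mean cascade size:", sum(sizes) / len(sizes))
    print("theoretical mean 1 / (1 - m):", 1.0 / (1.0 - m))
```

The simulation proceeds generation by generation, in the Galton-Watson style; only the running total is kept, since that is the quantity these distributions describe.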
More generally the machinery of Lagrangian distributions is all we need to analyse these.
Maybe I should use (Dwass 1969) to get the moments? Dominic Yeo explains beautifully as always.
🏗 🏗 🏗
1 Lagrangian distributions
A tractable (sub-?)clade. Specifically, if I have a given initial population and a given offspring distribution for some population of… things… a Lagrangian distribution gives me a model for the size of the total population. There are other interpretations (queueing seems popular), but this one is extremely useful for me.
See (P. C. Consul and Shoukri 1988; P. C. Consul and Famoye 2006 Ch 6.2) for a deep dive. They introduce various families via the pgf, which is powerful and general, although we generally prefer it if we can recover the pmf.
Terminology: the total cascade size of a subcritical branching process has a “delta Lagrangian” or “general Lagrangian” distribution, depending on whether the cluster has, respectively, a deterministic or random starting population. We define the offspring distribution of such a branching process as
1.1 Borel-Tanner distribution
A delta Lagrangian distribution, the Borel distribution is the distribution of a cascade size starting from a population size of
- Spelled
-
- Pmf
-
- Mean
-
- Variance
-
Note to self: Wikipedia suggests an intriguing correspondence with random walks, which I should follow up (Pitman 1998; Dwass 1969).
The only R implementation I could find for this variate is in VGAM, although it is not so complicated.
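To back up the "not so complicated" claim, here is a minimal Python sketch of the pmf. The parametrisation is an assumption on my part (the standard textbook one, not necessarily the notation used above): Poisson(μ) offspring with 0 < μ < 1 and a deterministic starting population n ≥ 1, giving P(X = x) = (n/x) e^{−μx} (μx)^{x−n} / (x−n)! for x = n, n+1, …, with mean n/(1−μ) and variance nμ/(1−μ)³. The function names are hypothetical.

```python
import math


def borel_tanner_logpmf(x, mu, n=1):
    """Log pmf of the total cascade size x, for Poisson(mu) offspring and n progenitors.

    Assumes integer x, integer n >= 1 and 0 < mu < 1.
    """
    if x < n:
        return -math.inf
    return (
        math.log(n) - math.log(x)
        - mu * x
        + (x - n) * math.log(mu * x)
        - math.lgamma(x - n + 1)  # log((x - n)!)
    )


def borel_tanner_pmf(x, mu, n=1):
    return math.exp(borel_tanner_logpmf(x, mu, n))


def borel_tanner_mean(mu, n=1):
    return n / (1.0 - mu)


def borel_tanner_variance(mu, n=1):
    return n * mu / (1.0 - mu) ** 3
```

Working in log space avoids overflow in the factorial and power terms for large cascades. Setting `n=1` gives the Borel distribution.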
1.2 Poisson-Poisson Lagrangian
See (P. C. Consul and Famoye 2006 Ch 9.3). Also known as the Generalised Poisson, although there are many things called that.
- Spelled
-
- Pmf
-
- Mean
-
- Variance
-
Returning to the cascade interpretation: Suppose we have
- an initial population that is Poisson distributed,
- and everyone in the population has a Poisson-distributed number of offspring.
Then the total population follows the Poisson-Poisson Lagrangian (Generalised Poisson) distribution.
Notice that the Poisson-Poisson can produce long tails, in the sense that it can have a large variance with finite mean, but not heavy tails, in the sense of the variance becoming infinite while retaining a finite mean; both variance and expectation go to infinity together.
Here, I implemented the GPD for you in Python. There are versions for R, presumably; a quick search turned up RMKDiscrete and LaplacesDemon.
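For orientation, a minimal independent sketch of the GPD pmf in plain Python. This is not the implementation referred to above. The parametrisation is an assumption: initial population Poisson(μ) with μ > 0 and offspring Poisson(η) with 0 ≤ η < 1, giving P(X = x) = μ(μ + ηx)^{x−1} e^{−(μ + ηx)} / x! for x = 0, 1, 2, …, with mean μ/(1−η) and variance μ/(1−η)³. The function names are hypothetical.

```python
import math


def gpd_logpmf(x, mu, eta):
    """Log pmf of the Generalised Poisson (Poisson-Poisson Lagrangian) at integer x >= 0.

    Assumes mu > 0 (initial-population rate) and 0 <= eta < 1 (offspring rate).
    """
    if x < 0:
        return -math.inf
    rate = mu + eta * x
    return math.log(mu) + (x - 1) * math.log(rate) - rate - math.lgamma(x + 1)


def gpd_pmf(x, mu, eta):
    return math.exp(gpd_logpmf(x, mu, eta))


def gpd_mean(mu, eta):
    return mu / (1.0 - eta)


def gpd_variance(mu, eta):
    return mu / (1.0 - eta) ** 3
```

Under this parametrisation the mean and variance both diverge as η approaches 1, which matches the remark above that the two go to infinity together. With η = 0 the pmf reduces to an ordinary Poisson(μ).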
1.3 General Lagrangian distribution
A larger (the largest?) family of Lagrangian distributions is summarised in (P. Consul and Shenton 1972), in an unintuitive (for me) way.
One “parameter”: a differentiable (infinitely differentiable?) function, not necessarily a pgf,
How does this fall out as an actual distribution?
🏗
- Spelled
- ?
- Pmf
- ?
- Mean
- ?
- Variance
- ?
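One partial answer to the "how does this fall out" question is the Lagrange expansion: P(X = 0) = f(0), and for x ≥ 1, P(X = x) = (1/x) [z^{x−1}] g(z)^x f′(z), where [z^k] picks out the coefficient of z^k. The following is a rough numerical sketch of that formula only. It makes the simplifying assumption that both f (initial population) and g (offspring) are pgfs supplied as lists of pmf values, which is narrower than the general case where the function need not be a pgf. All names are hypothetical.

```python
import math


def poly_mul(a, b, order):
    """Multiply two power series (coefficient lists), keeping only terms below z**order."""
    out = [0.0] * order
    for i, ai in enumerate(a[:order]):
        if ai == 0.0:
            continue
        for j, bj in enumerate(b[:order - i]):
            out[i + j] += ai * bj
    return out


def lagrangian_pmf(initial_pmf, offspring_pmf, x_max):
    """P(X = x) for x = 0..x_max via P(X = x) = [z^(x-1)] g(z)^x f'(z) / x.

    initial_pmf holds the coefficients of f, offspring_pmf those of g;
    each should cover degrees 0..x_max for the result to be exact.
    """
    order = x_max + 1
    # Coefficients of f'(z): k * f_k, shifted down one degree.
    f_prime = [k * fk for k, fk in enumerate(initial_pmf)][1:]
    pmf = [initial_pmf[0]]          # P(X = 0) = f(0)
    g_power = [1.0]                 # g(z)^0
    for x in range(1, x_max + 1):
        g_power = poly_mul(g_power, offspring_pmf, order)   # now g(z)^x
        h = poly_mul(g_power, f_prime, order)
        pmf.append(h[x - 1] / x)
    return pmf


def poisson_coeffs(rate, n_terms):
    """First n_terms values of the Poisson(rate) pmf, i.e. coefficients of its pgf."""
    return [math.exp(-rate + k * math.log(rate) - math.lgamma(k + 1)) for k in range(n_terms)]


if __name__ == "__main__":
    x_max = 10
    f = poisson_coeffs(2.0, x_max + 1)   # random initial population
    g = poisson_coeffs(0.4, x_max + 1)   # offspring distribution
    print(lagrangian_pmf(f, g, x_max))
```

Truncating the series is harmless here because the coefficient of z^{x−1} in a product depends only on coefficients of degree at most x−1. Passing `initial_pmf=[0.0, 1.0]` (exactly one progenitor) gives the delta Lagrangian case.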
2 References
Footnotes
I assumed a count model, but it turns out there are continuous-state generalisations. See, e.g. (Burridge 2013a, 2013b).