Multivariate Gamma distributions

October 14, 2019 — March 14, 2022

algebra
geometry
high d
Lévy processes
measure
probability
signal processing
sparser than thou
spheres
stochastic processes

Various multivariate distributions that are marginally Gamma distributed but have correlations. No unifying insight here, this is a grab-bag right now.

How general can the joint distribution of a Gamma vector be? Much effort has been spent on this question (; ; ; ; ; ; ; ) and several more below. There are lots of slightly different options, is the short version.

Figure 1

So here is the simplest multivariate case:

1 Bivariate thinned Gamma RVs

There are many possible ways of generating two correlated Gamma variates, but as far as I am concerned the default is the Beta thinning method.

For this and throughout, we use the fact that gGamma(a,λ)E[g]=aλVar[g]=aλ2E[g2]=a2+aλ2.

Suppose we have g1Gamma(a1+a2,λ) and bBeta(a1,a2) and gGamma(a2,λ) with all rvs jointly independent. Now we generate g2=bg1+g. Then E[g2]=E[bg1]+E[g]=a1+a2λ and E[g1g2]=E[g1(bg1+g)]=E[g1bg1+g1g)]=E[g12b]+E[g1g)]=E[g12]E[b]+E[g1]E[g)]=(a1+a2)2+(a1+a2)λ2a1a1+a2+a1+a2λa2λ=a1(a1+a2)+a1λ2+a2(a1+a2)λ2=a1(a1+a2)+a1+a2(a1+a2)λ2=a12+2a1a2+a1+a22λ2 and thus Var[g1,g2]=E[g1g2]E[g1]E[g2]=a12+2a1a2+a1+a22(a1+a2)2λ2=a12+2a1a2+a1+a22a122a1a2a22λ2=a1λ2 Thence Corr[g1,g2]=Var[g1,g2]Var[g1]Var[g2]=a1λ2a1+a2λ2=a1a1+a2.

We can use this to generate more than two correlated gamma variates by successively thinning them, as long as we take care to remember that we can only add independent Gamma variates together to produce a new Gamma variate in general (unlike, say, Gaussian processes.)

2 Multivariate latent thinned Gamma RVs

A different construction.

Suppose we have K independent latent Gamma variates. ζkGamma(A,λ),k=1,2,,K. Let us also suppose we have a RD×K latent mixing matrix, R=[rd,k] whose entries are all non-negative and whose rows sum to A. Now, let us define new random variates gd=k=1Kbd,kζk,d=1,2,,D. where bd,kBeta(rd,k,Ard,k). By construction and the Beta thinning rule, we know that gdGamma(A,λ),d=1,2,,D. What can we say about the correlation between them?

For this, we need the fact that bBeta(α,β)E[b]=αα+βVar[b]=αβ(α+β)2(α+β+1)E[b2]=α2(α+β+1)+αβ(α+β)2(α+β+1)

Expanding out the gs, E[g1g2]=E[(k=1Kb1,kζk)(k=1Kb2,kζk)]=E[k=1Kj=1Kb1,kζkb2,jζj]=k=1Kj=1KE[b1,kζkb2,jζj]=k=1KE[b1,kb2,kζk2]+2k=1Kj=k+1KE[b1,kζkb2,jζj]=k=1KE[b1,k]E[b2,k]E[ζk2]+2k=1Kj=k+1KE[b1,k]E[ζk]E[b2,j]E[ζj]=A2+Aλ2A2k=1Kr1,kr2,k+2k=1Kj=k+1Kr1,kAAλr2,jAAλ=A+1λ2Ak=1Kr1,kr2,k+2k=1Kj=k+1Kr1,kr2,jλ2=1λ2(A+1Ak=1Kr1,kr2,k+2k=1Kj=k+1Kr1,kr2,j)=1λ2(1Ak=1Kr1,kr2,k+k=1Kj=1Kr1,kr2,j)=1Ar1r2+A2λ2

So Var[g1,g2]=E[g1g2]Eg1Eg2=r1r2Aλ2. and Corr[g1,g2]=Var[g1,g2]Var[g1]Var[g2]=r1r2Aλ2Aλ2=r1r2A2.

After all that work, we find pretty much what we would have expected for the correlation of rvs constructed by weighting of latent Gaussian factors. Was there a shortcut that could have gotten us there quicker?

3 Kingman constructions

Figure 2

Kingman (), Lijoi and Prünster (). TBD. See

3.1 Random correlated weights

Ranganath and Blei () creates random measures that seem to induce Gamma weights by imposing a Gaussian correlation structure on the random weights of a random selection of Poisson atoms. This seems to still be a Kingman construction?

4 Generalized Multivariate Gamma

Das and Dey () discusses a lineage of Generalized Multivariate Gammas that sound interesting. For them, multivariate appears to mean square-matrix-variate, but in the papers they cite, (; ) it seems to be vector-valued. What is going on here? Am I being boneheaded? Lukacs and Laha () is also referenced; what does that say?

5 Dussauchoy Multivariate Gamma

Figure 3

Defined in terms of Characteristic Function, to have analogous joint independence properties to the multivariate Gaussian ().

6 Gaver Multivariate Gamma

Some kind of negative-binomial mixture? ()

7 Maximally general multivariate Gamma

The paper to rule them all is Pérez-Abreu and Stelzer (), wherein we get a (maximally?) general d-variate Gamma distribution in terms of two functions α and λ on the unit sphere SRd, with respect to the norm . In fact α is a Borel measure, so we more properly say it is a function on the Borel subsets of the unit sphere, and λ must be Borel-measurable. Enough of that; I play loose with measure-theoretic niceties from here on.

Now, let’s match the notation of the original paper. We use the convention that the Fourier transform μ^ of a measure μ on M=Rd is given in terms of inner product (ω,x)xω. μ^(ω)=Mei(ω,x)μ(dx)ωM. The following theorem then characterises multivariate Gamma distributions in terms of these Fourier transforms.

Let μ be an infinitely divisible probability distribution on Rd. If there exists a finite measure α on the unit sphere SRd, … and a Borel-measurable function λ:SRd,R+ such that the Fourier transform of the [probability] measure is given μ^(ω)=exp(SRd,R+(eirvω1)eλ(v)rr drα(dv)) for all ωRd, then μ is called a d-dimensional Gamma distribution with parameters α and λ, abbreviated Γd(α,λ) distribution. If λ is constant, we call μ a -homogeneous Γd(α,λ)-distribution.

Intriguingly the choice of which norm doesn’t seem to matter much; changing the norm just perturbs λ in a simple fashion. I assume that L2 is easiest in practice; integral of polynomials on those are straightforward at least.

It is not clear to me what families of λs and αs are interesting from that description. If we could get that inner integral R+(eirvω1)eλ(v)rr dr to look polynomial we might do OK.

We do know which ones are admissible, at least:

Let α be a finite measure on SRd, and λ:SRd,R+ a measurable function. Then [the previous construction] defines a Lévy measure vμ and thus there exists a Γd(α,λ) probability distribution μ if and only if SRd,ln(1+1λ(v))α(dv)<. Moreover, Rd(x1)vμ(dx)< holds true. The condition is trivially satisfied, if λ is bounded away from zero α-almost everywhere.

Let us fix v=vv and consider that homogeneous case where λ is constant on the v=1 sphere? Then the term with λ simplifies to a weird complex logarithm on that same sphere μ^(ω)=exp(SRd,R+(eirvω1)erλr drα(dv))=exp(SRd,(lnλλivω)α(dv))

This is a weird creature. Note that everything cancels out if ω=0. How flexible is that? If we fix some d, we can get a d-point gamma correlation by choosing different measures on the sphere in Rd.

8 References

Barndorff-Nielsen, Maejima, and Sato. 2006. Some Classes of Multivariate Infinitely Divisible Distributions Admitting Stochastic Integral Representations.” Bernoulli.
Barndorff-Nielsen, Pedersen, and Sato. 2001. Multivariate Subordination, Self-Decomposability and Stability.” Advances in Applied Probability.
Bladt, and Nielsen. 2010. Multivariate Matrix-Exponential Distributions.” Stochastic Models.
Buchmann, Kaehler, Maller, et al. 2015. Multivariate Subordination Using Generalised Gamma Convolutions with Applications to V.G. Processes and Option Pricing.” arXiv:1502.03901 [Math, q-Fin].
Collins, Dasgupta, and Schapire. 2001. A Generalization of Principal Components Analysis to the Exponential Family.” In Advances in Neural Information Processing Systems.
Das, and Dey. 2010. On Bayesian Inference for Generalized Multivariate Gamma Distribution.” Statistics & Probability Letters.
Dussauchoy, and Berland. 1975. A Multivariate Gamma Type Distribution Whose Marginal Laws Are Gamma, and Which Has a Property Similar to a Characteristic Property of the Normal Case.” In A Modern Course on Statistical Distributions in Scientific Work. NATO Advanced Study Institutes Series.
Foti, Futoma, Rockmore, et al. 2013. A Unifying Representation for a Class of Dependent Random Measures.” In Artificial Intelligence and Statistics.
Gaver. 1970. Multivariate Gamma Distributions Generated by Mixture.” Sankhyā: The Indian Journal of Statistics, Series A (1961-2002).
Grunwald, Hyndman, and Tedesco. 1996. “A Unified View of Linear AR(1) Models.”
Jun Li, and Dacheng Tao. 2013. Simple Exponential Family PCA.” IEEE Transactions on Neural Networks and Learning Systems.
Kingman. 1967. Completely Random Measures.” Pacific Journal of Mathematics.
Kirch, Edwards, Meier, et al. 2019. Beyond Whittle: Nonparametric Correction of a Parametric Likelihood with a Focus on Bayesian Time Series Analysis.” Bayesian Analysis.
Krishnaiah, and Rao. 1961. Remarks on a Multivariate Gamma Distribution.” The American Mathematical Monthly.
Krishnamoorthy, and Parthasarathy. 1951. A Multivariate Gamma-Type Distribution.” The Annals of Mathematical Statistics.
Laverny, Masiello, Maume-Deschamps, et al. 2021. Estimation of Multivariate Generalized Gamma Convolutions Through Laguerre Expansions.” arXiv:2103.03200 [Math, Stat].
Lijoi, and Prünster. 2010. Models Beyond the Dirichlet Process.” In Bayesian Nonparametrics.
Liou, Su, Chiang, et al. 2011. Gamma Random Field Simulation by a Covariance Matrix Transformation Method.” Stochastic Environmental Research and Risk Assessment.
Liu, Dobriban, and Singer. 2017. E-PCA: High Dimensional Exponential Family PCA.”
Lukacs, and Laha. 1964. Applications of Characteristic Functions.
Mathai, and Moschopoulos. 1991. On a Multivariate Gamma.” Journal of Multivariate Analysis.
Mathai, and Provost. 2005. Some Complex Matrix-Variate Statistical Distributions on Rectangular Matrices.” Linear Algebra and Its Applications, Tenth Special Issue (Part 2) on Linear Algebra and Statistics,.
Mathal, and Moschopoulos. 1992. A Form of Multivariate Gamma Distribution.” Annals of the Institute of Statistical Mathematics.
Meier. 2018. A matrix Gamma process and applications to Bayesian analysis of multivariate time series.”
Meier, Kirch, Edwards, et al. 2019. beyondWhittle: Bayesian Spectral Inference for Stationary Time Series.”
Mohamed, Ghahramani, and Heller. 2008. Bayesian Exponential Family PCA.” In Advances in Neural Information Processing Systems.
Pérez-Abreu, and Stelzer. 2014. Infinitely Divisible Multivariate and Matrix Gamma Distributions.” Journal of Multivariate Analysis.
Ranganath, and Blei. 2018. Correlated Random Measures.” Journal of the American Statistical Association.
Sato. 1999. Lévy Processes and Infinitely Divisible Distributions.
Semeraro. 2008. A Multivariate Variance Gamma Model for Financial Applications.” International Journal of Theoretical and Applied Finance.
Sim. 1993. Generation of Poisson and Gamma Random Vectors with Given Marginals and Covariance Matrix.” Journal of Statistical Computation and Simulation.
Singpurwalla, and Youngren. 1993. Multivariate Distributions Induced by Dynamic Environments.” Scandinavian Journal of Statistics.
Walker. 2021. On Infinitely Divisible Multivariate Gamma Distributions.” Communications in Statistics - Theory and Methods.
Warren, David. 1986. Outflow Skewness in Non-Seasonal Linear Reservoirs with Gamma-Distributed Markovian Inflows.” Journal of Hydrology.
Warren, D. 1992. A Multivariate Gamma Distribution Arising from a Markov Model.” Stochastic Hydrology and Hydraulics.
Wolpert, Robert L. 2021. Lecture Notes on Stationary Gamma Processes.” arXiv:2106.00087 [Math].
Wolpert, R., and Ickstadt. 1998. Poisson/Gamma Random Field Models for Spatial Statistics.” Biometrika.