Pólya-Gamma augmentation trick

2017-02-20 — 2022-04-01

Suspiciously similar content

An infinite weird RV useful in Bayesian Binomial regression (and maybe other things?) (Polson, Scott, and Windle 2013). C&C the optimization-driven approach to a similar problem in Gumbel-max tricks.

See also my former colleague, Louis Tiao, A Primer on Pólya-gamma Random Variables - Part II: Bayesian Logistic Regression.

Gregory Gunderson, in Pólya-Gamma Augmentation, explains the problem we are trying to solve.

…in logistic regression, the dependent variables are assumed to be i.i.d. from a Bernoulli distribution with parameter $p$ , and therefore the likelihood function is $L (p) \propto \prod_{n = 1}^{N} p^{y_{n}} (1 - p)^{1 - y_{n}} = p^{\sum y_{n}} (1 - p)^{N - \sum y_{n}}$ The observations interact with the response through a linear relationship with the log-odds, $\log (\frac{p}{1 - p}) = β_{0} + x_{1} β_{1} + x_{2} β_{2} + \dots + x_{D} β_{D} = β^{⊤} x$ If we solve for $p$ in (2), we get $p = \frac{\exp (β^{⊤} x_{n})}{1 + \exp (β^{⊤} x_{n})}$ and a likelihood of $L (β) \propto \frac{{[\exp (β^{⊤} x)]}^{\sum y_{n}}}{{[1 + \exp (β^{⊤} x)]}^{N}}$ Due to this functional form, Bayesian inference for logistic regression is intractable.

Using Pólya-Gamma RVs we devise an auxiliary variable sample.

1 References

Polson, Scott, and Windle. 2013. “Bayesian Inference for Logistic Models Using Pólya–Gamma Latent Variables.” Journal of the American Statistical Association.