Normal-inverse-Wishart distribution

normal-inverse-Wishart
Notation: (μ, Σ) ∼ NIW(μ₀, λ, Ψ, ν)
Parameters: μ₀ ∈ ℝ^D location (real vector)
            λ > 0 (real)
            Ψ ∈ ℝ^(D×D) inverse scale matrix (pos. def.)
            ν > D − 1 (real)
Support: μ ∈ ℝ^D; Σ ∈ ℝ^(D×D) covariance matrix (pos. def.)
PDF: f(μ, Σ | μ₀, λ, Ψ, ν) = 𝒩(μ | μ₀, (1/λ)Σ) · 𝒲⁻¹(Σ | Ψ, ν)

In probability theory and statistics, the normal-inverse-Wishart distribution (or Gaussian-inverse-Wishart distribution) is a multivariate four-parameter family of continuous probability distributions. It is the conjugate prior of a multivariate normal distribution with unknown mean and covariance matrix (the inverse of the precision matrix).[1]

Definition

Suppose

μ | μ₀, λ, Σ ∼ 𝒩(μ | μ₀, (1/λ)Σ)

has a multivariate normal distribution with mean μ₀ and covariance matrix (1/λ)Σ, where

Σ | Ψ, ν ∼ 𝒲⁻¹(Σ | Ψ, ν)

has an inverse Wishart distribution. Then (μ,Σ) has a normal-inverse-Wishart distribution, denoted as

(μ, Σ) ∼ NIW(μ₀, λ, Ψ, ν).

Characterization

Probability density function

f(μ, Σ | μ₀, λ, Ψ, ν) = 𝒩(μ | μ₀, (1/λ)Σ) · 𝒲⁻¹(Σ | Ψ, ν)

The full version of the PDF is as follows:[2]

f(μ, Σ | μ₀, λ, Ψ, ν) = [λ^(D/2) |Ψ|^(ν/2) |Σ|^(−(ν+D+2)/2) / ((2π)^(D/2) 2^(νD/2) Γ_D(ν/2))] · exp{ −(1/2) Tr(ΨΣ⁻¹) − (λ/2)(μ − μ₀)ᵀ Σ⁻¹ (μ − μ₀) }

Here Γ_D(·) is the multivariate gamma function and Tr(·) denotes the trace of a matrix.
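
As a sanity check, the closed-form density above can be transcribed directly into code. The following is a minimal sketch, assuming NumPy and SciPy are available; the function name `niw_logpdf` is illustrative, not a library API.

```python
import numpy as np
from scipy.special import multigammaln  # log of the multivariate gamma function


def niw_logpdf(mu, Sigma, mu0, lam, Psi, nu):
    """Log-density of NIW(mu0, lam, Psi, nu) at (mu, Sigma),
    transcribed term by term from the closed-form PDF above."""
    D = mu0.shape[0]
    Sigma_inv = np.linalg.inv(Sigma)
    _, logdet_Sigma = np.linalg.slogdet(Sigma)
    _, logdet_Psi = np.linalg.slogdet(Psi)
    diff = mu - mu0
    # Normalizing constant: lambda^(D/2) |Psi|^(nu/2) / ((2*pi)^(D/2) 2^(nu*D/2) Gamma_D(nu/2))
    log_const = (0.5 * D * np.log(lam)
                 + 0.5 * nu * logdet_Psi
                 - 0.5 * D * np.log(2.0 * np.pi)
                 - 0.5 * nu * D * np.log(2.0)
                 - multigammaln(nu / 2.0, D))
    # |Sigma|^(-(nu+D+2)/2) times the exponential kernel
    log_kernel = (-0.5 * (nu + D + 2) * logdet_Sigma
                  - 0.5 * np.trace(Psi @ Sigma_inv)
                  - 0.5 * lam * diff @ Sigma_inv @ diff)
    return log_const + log_kernel
```

For fixed Σ, the density is Gaussian in μ, so it peaks at μ = μ₀; that property gives a quick check of the implementation.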

Properties

Marginal distributions

By construction, the marginal distribution over Σ is an inverse Wishart distribution, and the conditional distribution over μ given Σ is a multivariate normal distribution. The marginal distribution over μ is a multivariate t-distribution.
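
Concretely, integrating Σ out of the joint density yields, in the notation above (this closed form is given in Murphy's note cited in the notes below):

```latex
\mu \sim t_{\nu - D + 1}\!\left(\mu_0,\; \frac{\Psi}{\lambda(\nu - D + 1)}\right)
```

i.e. a multivariate t-distribution with ν − D + 1 degrees of freedom, location μ₀, and scale matrix Ψ/(λ(ν − D + 1)).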

Posterior distribution of the parameters

Suppose the sampling density is a multivariate normal distribution

yᵢ | μ, Σ ∼ 𝒩ₚ(μ, Σ)

where y is an n×p matrix and yᵢ (of length p) is row i of the matrix y.

Since the mean and covariance matrix of the sampling distribution are unknown, we can place a normal-inverse-Wishart prior on the mean and covariance parameters jointly:

(μ, Σ) ∼ NIW(μ₀, λ, Ψ, ν).

The resulting posterior distribution for the mean and covariance matrix will also be a Normal-Inverse-Wishart

(μ, Σ | y) ∼ NIW(μₙ, λₙ, Ψₙ, νₙ),

where

μₙ = (λμ₀ + n·ȳ) / (λ + n)
λₙ = λ + n
νₙ = ν + n
Ψₙ = Ψ + S + (λn / (λ + n)) (ȳ − μ₀)(ȳ − μ₀)ᵀ,  where  S = ∑ᵢ₌₁ⁿ (yᵢ − ȳ)(yᵢ − ȳ)ᵀ.
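
The update formulas above can be sketched in code as follows, assuming NumPy; `niw_posterior` is an illustrative name, not a library function.

```python
import numpy as np


def niw_posterior(y, mu0, lam, Psi, nu):
    """Conjugate NIW update for an (n, p) data matrix y with rows y_i,
    following the posterior formulas above."""
    n, p = y.shape
    ybar = y.mean(axis=0)
    centered = y - ybar
    S = centered.T @ centered                      # scatter matrix about the sample mean
    mu_n = (lam * mu0 + n * ybar) / (lam + n)      # precision-weighted mean
    lam_n = lam + n
    nu_n = nu + n
    diff = ybar - mu0
    Psi_n = Psi + S + (lam * n / (lam + n)) * np.outer(diff, diff)
    return mu_n, lam_n, Psi_n, nu_n
```

With a single observation (n = 1) the scatter matrix S vanishes and the update reduces to a simple weighted average, which makes the formulas easy to verify by hand.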


To sample from the joint posterior of (μ, Σ), one first draws Σ | y ∼ 𝒲⁻¹(Ψₙ, νₙ) and then draws μ | Σ, y ∼ 𝒩ₚ(μₙ, Σ/λₙ). To draw from the posterior predictive distribution of a new observation, draw ỹ | μ, Σ, y ∼ 𝒩ₚ(μ, Σ), given the already drawn values of μ and Σ.[3]
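
The three-step sampling scheme just described might be sketched as below, using SciPy's `invwishart` for the inverse Wishart draw (an assumption that SciPy is available; `sample_posterior` is an illustrative name).

```python
import numpy as np
from scipy.stats import invwishart


def sample_posterior(mu_n, lam_n, Psi_n, nu_n, rng):
    """One draw from the NIW posterior followed by one posterior-predictive
    draw, following the scheme described above."""
    # Step 1: Sigma | y ~ IW(Psi_n, nu_n)
    Sigma = np.atleast_2d(invwishart.rvs(df=nu_n, scale=Psi_n, random_state=rng))
    # Step 2: mu | Sigma, y ~ N(mu_n, Sigma / lam_n)
    mu = rng.multivariate_normal(mu_n, Sigma / lam_n)
    # Step 3: predictive draw, y_tilde | mu, Sigma ~ N(mu, Sigma)
    y_tilde = rng.multivariate_normal(mu, Sigma)
    return mu, Sigma, y_tilde
```

Repeating these three steps yields samples from the posterior predictive distribution of a new observation.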

Generating normal-inverse-Wishart random variates

Generation of random variates is straightforward:

  1. Sample Σ from an inverse Wishart distribution with parameters Ψ and ν.
  2. Sample μ from a multivariate normal distribution with mean μ₀ and covariance (1/λ)Σ.
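
These two steps could be implemented as follows (a sketch assuming SciPy is available; `sample_niw` is an illustrative name):

```python
import numpy as np
from scipy.stats import invwishart


def sample_niw(mu0, lam, Psi, nu, rng):
    """Draw one (mu, Sigma) pair from NIW(mu0, lam, Psi, nu)."""
    # Step 1: Sigma ~ IW(Psi, nu)
    Sigma = np.atleast_2d(invwishart.rvs(df=nu, scale=Psi, random_state=rng))
    # Step 2: mu | Sigma ~ N(mu0, Sigma / lam)
    mu = rng.multivariate_normal(mu0, Sigma / lam)
    return mu, Sigma
```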

Notes

  1. Murphy, Kevin P. (2007). "Conjugate Bayesian analysis of the Gaussian distribution."
  2. Prince, Simon J.D. (June 2012). Computer Vision: Models, Learning, and Inference. Cambridge University Press. Section 3.8: "Normal inverse Wishart distribution".
  3. Gelman, Andrew, et al. Bayesian data analysis. Vol. 2, p.73. Boca Raton, FL, USA: Chapman & Hall/CRC, 2014.

References

  • Bishop, Christopher M. (2006). Pattern Recognition and Machine Learning. Springer Science+Business Media.
  • Murphy, Kevin P. (2007). "Conjugate Bayesian analysis of the Gaussian distribution."