Marchenko–Pastur distribution

From HandWiki
Short description: Distribution of singular values of large rectangular random matrices
Plot of the Marchenko-Pastur distribution for various values of lambda

In the mathematical theory of random matrices, the Marchenko–Pastur distribution, or Marchenko–Pastur law, describes the asymptotic behavior of singular values of large rectangular random matrices. The theorem is named after Ukrainian mathematicians Volodymyr Marchenko and Leonid Pastur who proved this result in 1967.

If X denotes a m×n random matrix whose entries are independent identically distributed random variables with mean 0 and variance σ2<, let

Yn=1nXXT

and let λ1,λ2,,λm be the eigenvalues of Yn (viewed as random variables). Finally, consider the random measure

μm(A)=1m#{λjA},A.

counting the number of eigenvalues in the subset A included in .

Theorem. Assume that m,n so that the ratio m/nλ(0,+). Then μmμ (in weak* topology in distribution), where

μ(A)={(11λ)𝟏0A+ν(A),if λ>1ν(A),if 0λ1,

and

dν(x)=12πσ2(λ+x)(xλ)λx𝟏x[λ,λ+]dx

with

λ±=σ2(1±λ)2.

The Marchenko–Pastur law also arises as the free Poisson law in free probability theory, having rate 1/λ and jump size σ2.

Cumulative distribution function

Using the same notation, cumulative distribution function reads

Fλ(x)={λ1λ𝟏x[0,λ)+(λ12λ+F(x))𝟏x[λ,λ+)+𝟏x[λ+,),if λ>1F(x)𝟏x[λ,λ+)+𝟏x[λ+,),if 0λ1,

where F(x)=12πλ(πλ+σ2(λ+x)(xλ)(1+λ)arctanr(x)212r(x)+(1λ)arctanλr(x)2λ+2σ2(1λ)r(x)) and r(x)=λ+xxλ.

Moments

For each k1, its k-th moment isr=0k11r+1(kr)(k1r)λr=1kr=0k1(kr)(kr+1)λr

Some transforms of this law

The Cauchy transform (which is the negative of the Stieltjes transformation) is given by

Gμ(z)=z+σ2(λ1)(zσ2(λ+1))24λσ42λzσ2.

Voiculescu's R-transform is given by

Rμ(z)=σ21σ2λz,

and the S-transform by

Sμ(z)=1σ2(1+λz).

Application to correlation matrices

For the special case of correlation matrices, we know that σ2=1 and λ=m/n. This bounds the probability mass over the interval defined by

λ±=(1±mn)2.

Since this distribution describes the spectrum of random matrices with mean 0, the eigenvalues of correlation matrices that fall inside of the aforementioned interval could be considered spurious or noise. For instance, obtaining a correlation matrix of 10 stock returns calculated over a 252 trading days period would render λ+=(1+10252)21.43. Thus, out of 10 eigenvalues of said correlation matrix, only the values higher than 1.43 would be considered significantly different from random.

See also

References