Marchenko–Pastur distribution

Short description: Distribution of singular values of large rectangular random matrices

In the mathematical theory of random matrices, the Marchenko–Pastur distribution, or Marchenko–Pastur law, describes the asymptotic behavior of singular values of large rectangular random matrices. The law is named after the Ukrainian mathematicians Volodymyr Marchenko and Leonid Pastur, who proved this result in 1967.

If $X$ denotes an $m \times n$ random matrix whose entries are independent identically distributed random variables with mean 0 and variance $σ^{2} < \infty$, let

Y_{n} = \frac{1}{n} X X^{T}

and let $λ_{1}, λ_{2}, \dots, λ_{m}$ be the eigenvalues of $Y_{n}$ (viewed as random variables). Finally, consider the random measure

μ_{m} (A) = \frac{1}{m} \# \{λ_{j} \in A\}, \quad A \subset ℝ,

which gives the fraction of the eigenvalues that lie in the subset $A$ of $ℝ$.

Theorem. Assume that $m, n \to \infty$ so that the ratio $m / n \to λ \in (0, + \infty)$. Then $μ_{m} \to μ$ (in the weak* topology, in distribution), where

μ (A) = \begin{cases} (1 - \frac{1}{λ}) 𝟏_{0 \in A} + ν (A), & \text{if } λ > 1 \\ ν (A), & \text{if } 0 < λ \leq 1, \end{cases}

and

d ν (x) = \frac{1}{2 π σ^{2}} \frac{\sqrt{(λ_{+} - x) (x - λ_{-})}}{λ x} 𝟏_{x \in [λ_{-}, λ_{+}]} d x

with

λ_{\pm} = σ^{2} (1 \pm \sqrt{λ})^{2} .
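As a numerical sanity check on the theorem, the following minimal sketch evaluates the density of $ν$ and integrates it with a plain midpoint rule. The function names (`mp_edges`, `mp_density`, `continuous_mass`) and the step count are illustrative choices, not part of the original statement. Under these assumptions, the continuous part should carry mass close to 1 when $λ \leq 1$ and close to $1 / λ$ when $λ > 1$, with the atom at zero supplying the remainder.

```python
import math

def mp_edges(lam, sigma2=1.0):
    """Return (lambda_minus, lambda_plus) = sigma2 * (1 -/+ sqrt(lam))**2."""
    root = math.sqrt(lam)
    return sigma2 * (1.0 - root) ** 2, sigma2 * (1.0 + root) ** 2

def mp_density(x, lam, sigma2=1.0):
    """Density of the continuous part nu at x; zero outside (lambda_-, lambda_+)."""
    lo, hi = mp_edges(lam, sigma2)
    if x <= lo or x >= hi:
        return 0.0
    return math.sqrt((hi - x) * (x - lo)) / (2.0 * math.pi * sigma2 * lam * x)

def continuous_mass(lam, sigma2=1.0, steps=20_000):
    """Midpoint-rule estimate of the total mass of nu (illustrative accuracy only)."""
    lo, hi = mp_edges(lam, sigma2)
    h = (hi - lo) / steps
    return h * sum(mp_density(lo + (i + 0.5) * h, lam, sigma2) for i in range(steps))

for lam in (0.5, 2.0):
    atom = max(0.0, 1.0 - 1.0 / lam)      # point mass at 0, present only when lam > 1
    total = atom + continuous_mass(lam)   # should be close to 1 in both cases
    print(f"lam={lam}: atom={atom:.3f}, total mass={total:.4f}")
```

The midpoint rule is used only because it avoids evaluating the density exactly at the edges; it converges slowly near $λ = 1$, where the density is unbounded at the origin.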

The Marchenko–Pastur law also arises as the free Poisson law in free probability theory, having rate $1 / λ$ and jump size $λ σ^{2}$ (so that its mean, the product of rate and jump size, equals $σ^{2}$).

Cumulative distribution function

Using the same notation, the cumulative distribution function reads

F_{λ} (x) = \begin{cases} \frac{λ - 1}{λ} 𝟏_{x \in [0, λ_{-})} + (\frac{λ - 1}{2 λ} + F (x)) 𝟏_{x \in [λ_{-}, λ_{+})} + 𝟏_{x \in [λ_{+}, \infty)}, & \text{if } λ > 1 \\ F (x) 𝟏_{x \in [λ_{-}, λ_{+})} + 𝟏_{x \in [λ_{+}, \infty)}, & \text{if } 0 < λ \leq 1, \end{cases}

where $F (x) = \frac{1}{2 π λ} (π λ + σ^{- 2} \sqrt{(λ_{+} - x) (x - λ_{-})} - (1 + λ) \arctan \frac{r (x)^{2} - 1}{2 r (x)} + (1 - λ) \arctan \frac{λ_{-} r (x)^{2} - λ_{+}}{2 σ^{2} (1 - λ) r (x)})$ and $r (x) = \sqrt{\frac{λ_{+} - x}{x - λ_{-}}}$ .
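The piecewise formula can be transcribed directly into code. The sketch below is one possible transcription, with the hypothetical name `mp_cdf`; it assumes $λ \neq 1$ (the second arctangent term divides by $1 - λ$) and treats the edges $λ_{-}$ and $λ_{+}$ separately, since $r (x)$ is infinite or zero there.

```python
import math

def mp_cdf(x, lam, sigma2=1.0):
    """Closed-form CDF F_lambda(x); this sketch assumes lam > 0 and lam != 1."""
    if lam <= 0.0 or lam == 1.0:
        raise ValueError("this sketch handles lam > 0 with lam != 1 only")
    root = math.sqrt(lam)
    lo = sigma2 * (1.0 - root) ** 2           # lambda_minus
    hi = sigma2 * (1.0 + root) ** 2           # lambda_plus
    atom = max(0.0, 1.0 - 1.0 / lam)          # mass at zero, only when lam > 1
    if x < 0.0:
        return 0.0
    if x <= lo:
        return atom
    if x >= hi:
        return 1.0
    r = math.sqrt((hi - x) / (x - lo))
    f = (
        math.pi * lam
        + math.sqrt((hi - x) * (x - lo)) / sigma2
        - (1.0 + lam) * math.atan((r * r - 1.0) / (2.0 * r))
        + (1.0 - lam) * math.atan((lo * r * r - hi) / (2.0 * sigma2 * (1.0 - lam) * r))
    ) / (2.0 * math.pi * lam)
    # For lam > 1 the formula adds the offset (lam - 1) / (2 * lam) on the bulk interval.
    offset = (lam - 1.0) / (2.0 * lam) if lam > 1.0 else 0.0
    return offset + f

print(mp_cdf(1.0, 0.5), mp_cdf(0.1, 2.0), mp_cdf(1.0, 2.0))
```

For $λ > 1$ the function returns the atom $(λ - 1) / λ$ everywhere on $[0, λ_{-}]$, which matches the limit of the bulk expression as $x \to λ_{-}$.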

Moments

For each $k \geq 1$, the $k$-th moment of the distribution (taking $σ^{2} = 1$) is $\sum_{r = 0}^{k - 1} \frac{1}{r + 1} \binom{k}{r} \binom{k - 1}{r} λ^{r} = \frac{1}{k} \sum_{r = 0}^{k - 1} \binom{k}{r} \binom{k}{r + 1} λ^{r}$.
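The two expressions for the moments can be compared exactly with rational arithmetic. The following sketch uses the hypothetical names `mp_moment` and `mp_moment_alt` and assumes $σ^{2} = 1$, which is what a first moment of 1 implies. For $λ = 1$ the moments reduce to the Catalan numbers 1, 2, 5, 14, and so on.

```python
from fractions import Fraction
from math import comb

def mp_moment(k, lam):
    """k-th moment via sum_r C(k, r) * C(k-1, r) / (r + 1) * lam**r (sigma^2 = 1)."""
    lam = Fraction(lam)
    return sum(Fraction(comb(k, r) * comb(k - 1, r), r + 1) * lam ** r for r in range(k))

def mp_moment_alt(k, lam):
    """Same moment via (1/k) * sum_r C(k, r) * C(k, r+1) * lam**r."""
    lam = Fraction(lam)
    return sum(comb(k, r) * comb(k, r + 1) * lam ** r for r in range(k)) / k

half = Fraction(1, 2)
for k in range(1, 6):
    # Both forms should agree exactly for every k.
    print(k, mp_moment(k, half), mp_moment_alt(k, half))
```

The first three moments are $1$, $1 + λ$ and $1 + 3 λ + λ^{2}$, so the variance is $λ$ in this normalization.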

Some transforms of this law

The Cauchy transform (which is the negative of the Stieltjes transform) is given by

G_{μ} (z) = \frac{z + σ^{2} (λ - 1) - \sqrt{(z - σ^{2} (λ + 1))^{2} - 4 λ σ^{4}}}{2 λ z σ^{2}} .

Voiculescu's $R$ -transform is given by

R_{μ} (z) = \frac{σ^{2}}{1 - σ^{2} λ z},

and the $S$ -transform by

S_{μ} (z) = \frac{1}{σ^{2} (1 + λ z)} .
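The Cauchy transform and the $R$-transform are linked by the standard relation $R_{μ} (G_{μ} (z)) + 1 / G_{μ} (z) = z$, which gives a quick consistency check on the two formulas. The sketch below is restricted to real $z$ above the upper edge $λ_{+}$, where the principal square root selects the branch with $G_{μ} (z) \sim 1 / z$; that restriction and the function names are assumptions made for this illustration.

```python
import math

def cauchy_transform(z, lam, sigma2=1.0):
    """G_mu(z) for real z above the upper edge sigma2 * (1 + sqrt(lam))**2."""
    upper_edge = sigma2 * (1.0 + math.sqrt(lam)) ** 2
    if z <= upper_edge:
        raise ValueError("this sketch only covers real z above the support")
    disc = (z - sigma2 * (lam + 1.0)) ** 2 - 4.0 * lam * sigma2 ** 2
    return (z + sigma2 * (lam - 1.0) - math.sqrt(disc)) / (2.0 * lam * z * sigma2)

def r_transform(w, lam, sigma2=1.0):
    """Voiculescu's R-transform sigma2 / (1 - sigma2 * lam * w)."""
    return sigma2 / (1.0 - sigma2 * lam * w)

z, lam, sigma2 = 10.0, 0.5, 1.0
g = cauchy_transform(z, lam, sigma2)
recovered = r_transform(g, lam, sigma2) + 1.0 / g   # should reproduce z
print(g, recovered)
```

Extending the check to complex $z$ would require choosing the square-root branch explicitly, which this sketch does not attempt.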

Application to correlation matrices

For the special case of correlation matrices, $σ^{2} = 1$ and $λ = m / n$. The continuous part of the limiting distribution is then confined to the interval with endpoints

λ_{\pm} = {(1 \pm \sqrt{\frac{m}{n}})}^{2} .

Since this distribution describes the spectrum of random matrices with mean 0, the eigenvalues of correlation matrices that fall inside the aforementioned interval could be considered spurious or noise. For instance, a correlation matrix of 10 stock returns calculated over a period of 252 trading days would give $λ_{+} = {(1 + \sqrt{\frac{10}{252}})}^{2} \approx 1.44$. Thus, out of the 10 eigenvalues of said correlation matrix, only the values higher than 1.44 would be considered significantly different from random.
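The thresholding rule described above can be sketched in a few lines. The eigenvalues in this example are made-up numbers chosen only so that they sum to 10, as the eigenvalues of a $10 \times 10$ correlation matrix must; the function names are likewise hypothetical.

```python
import math

def mp_upper_edge(m, n):
    """Upper edge lambda_+ = (1 + sqrt(m / n))**2 for sigma^2 = 1."""
    return (1.0 + math.sqrt(m / n)) ** 2

def signal_eigenvalues(eigenvalues, m, n):
    """Keep only the eigenvalues above the Marchenko-Pastur upper edge."""
    threshold = mp_upper_edge(m, n)
    return [ev for ev in eigenvalues if ev > threshold]

# 10 assets observed over 252 trading days.
edge = mp_upper_edge(10, 252)   # approximately 1.438

# Hypothetical spectrum of a 10 x 10 correlation matrix (illustrative values only).
spectrum = [4.1, 1.6, 1.2, 0.9, 0.7, 0.5, 0.4, 0.3, 0.2, 0.1]
print(edge, signal_eigenvalues(spectrum, 10, 252))
```

With these illustrative values, only the two largest eigenvalues exceed the threshold. In practice the spectrum would come from an eigendecomposition of the sample correlation matrix, and the comparison is asymptotic, so it is only approximate for finite $m$ and $n$.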

References

Götze, F.; Tikhomirov, A. (2004). "Rate of convergence in probability to the Marchenko–Pastur law". Bernoulli 10 (3): 503–548. doi:10.3150/bj/1089206408.
Marchenko, V. A.; Pastur, L. A. (1967). "Распределение собственных значений в некоторых ансамблях случайных матриц" [Distribution of eigenvalues for some sets of random matrices] (in Russian). Mat. Sb. N.S. 72 (114:4): 507–536. doi:10.1070/SM1967v001n04ABEH001994. Bibcode: 1967SbMat...1..457M.
Nica, A.; Speicher, R. (2006). Lectures on the Combinatorics of Free probability theory. Cambridge Univ. Press. pp. 204, 368. ISBN 0-521-85852-6. https://archive.org/details/lecturesoncombin00nica.
Zhang, W.; Abreu, G.; Inamori, M.; Sanada, Y. (2011). "Spectrum sensing algorithms via finite random matrices". IEEE Transactions on Communications 60 (1): 164–175. doi:10.1109/TCOMM.2011.112311.100721.
Epps, Brenden; Krivitzky, Eric M. (2019). "Singular value decomposition of noisy data: mode corruption". Experiments in Fluids 60 (8): 1–30. doi:10.1007/s00348-019-2761-y. Bibcode: 2019ExFl...60..121E.

Original source: https://en.wikipedia.org/wiki/Marchenko–Pastur distribution.