Companion matrix

Short description: Square matrix constructed from a monic polynomial

In linear algebra, the Frobenius companion matrix of the monic polynomial $p (x) = c_{0} + c_{1} x + \dots + c_{n - 1} x^{n - 1} + x^{n}$ is the square matrix defined as

$C (p) = [\begin{matrix} 0 & 0 & \dots & 0 & - c_{0} \\ 1 & 0 & \dots & 0 & - c_{1} \\ 0 & 1 & \dots & 0 & - c_{2} \\ ⋮ & ⋮ & ⋱ & ⋮ & ⋮ \\ 0 & 0 & \dots & 1 & - c_{n - 1} \end{matrix}] .$

Some authors use the transpose of this matrix, $C (p)^{T}$ , which is more convenient for some purposes such as linear recurrence relations (see below).

$C (p)$ is defined from the coefficients of $p (x)$ , while the characteristic polynomial as well as the minimal polynomial of $C (p)$ are equal to $p (x)$ .^[1] In this sense, the matrix $C (p)$ and the polynomial $p (x)$ are "companions".

Similarity to companion matrix

Any matrix $A$ with entries in a field $F$ has characteristic polynomial $p (x) = \det (x I - A)$ , which in turn has companion matrix $C (p)$ . These matrices are related as follows.

The following statements are equivalent:

A is similar over F to $C (p)$ , i.e. A can be conjugated to its companion matrix by matrices in GL_n(F);
the characteristic polynomial $p (x)$ coincides with the minimal polynomial of A , i.e. the minimal polynomial has degree n;
the linear mapping $A : F^{n} \to F^{n}$ makes $F^{n}$ a cyclic $F [A]$ -module, having a basis of the form ${v, A v, \dots, A^{n - 1} v}$ ; or equivalently $F^{n} ≅ F [X] / (p (x))$ as $F [A]$ -modules.

If the above hold, one says that A is non-derogatory.

Not every square matrix is similar to a companion matrix, but every square matrix is similar to a block diagonal matrix made of companion matrices. If we also demand that the polynomial of each diagonal block divides the next one, they are uniquely determined by A, and this gives the rational canonical form of A.

Diagonalizability

The roots of the characteristic polynomial $p (x)$ are the eigenvalues of $C (p)$ . If there are n distinct eigenvalues $λ_{1}, \dots, λ_{n}$ , then $C (p)$ is diagonalizable as $C (p) = V^{- 1} D V$ , where D is the diagonal matrix and V is the Vandermonde matrix corresponding to the $λ$ 's: $D = [\begin{matrix} λ_{1} & 0 & \dots & 0 \\ 0 & λ_{2} & \dots & 0 \\ 0 & 0 & \dots & λ_{n} \end{matrix}], V = [\begin{matrix} 1 & λ_{1} & λ_{1}^{2} & \dots & λ_{1}^{n} \\ 1 & λ_{2} & λ_{2}^{2} & \dots & λ_{2}^{n} \\ ⋮ & ⋮ & ⋮ & ⋱ & ⋮ \\ 1 & λ_{n} & λ_{n}^{2} & \dots & λ_{n}^{n} \end{matrix}] .$ Indeed, an easy computation shows that the transpose $C (p)^{T}$ has eigenvectors $v_{i} = (1, λ_{i}, \dots, λ_{i}^{n - 1})$ with $C (p)^{T} (v_{i}) = λ_{i} v_{i}$ , which follows from $p (λ_{i}) = c_{0} + c_{1} λ_{i} + \dots + c_{n - 1} λ_{i}^{n - 1} + λ_{i}^{n} = 0$ . Thus, its diagonalizing change of basis matrix is $V^{T} = [v_{1}^{T} \dots v_{n}^{T}]$ , meaning $C (p)^{T} = V^{T} D (V^{T})^{- 1}$ , and taking the transpose of both sides gives $C (p) = V^{- 1} D V$ .

We can read the eigenvectors of $C (p)$ with $C (p) (w_{i}) = λ_{i} w_{i}$ from the equation $C (p) = V^{- 1} D V$ : they are the column vectors of the inverse Vandermonde matrix $V^{- 1} = [w_{1}^{T} \dots w_{n}^{T}]$ . This matrix is known explicitly, giving the eignevectors $w_{i} = (L_{0 i}, \dots, L_{(n - 1) i})$ , with coordinates equal to the coefficients of the Lagrange polynomials $L_{i} (x) = L_{0 i} + L_{1 i} x + \dots + L_{(n - 1) i} x^{n - 1} = \prod_{j \neq i} \frac{x - λ_{j}}{λ_{j} - λ_{i}} = \frac{p (x)}{(x - λ_{i}) p^{'} (λ_{i})} .$ Alternatively, the scaled eigenvectors ${\tilde{w}}_{i} = p^{'} (λ_{i}) w_{i}$ have simpler coefficients.

If $p (x)$ has multiple roots, then $C (p)$ is not diagonalizable. Rather, the Jordan canonical form of $C (p)$ contains one diagonal block for each distinct root, an m × m block with $λ$ on the diagonal if the root $λ$ has multiplicity m.

Linear recursive sequences

A linear recursive sequence defined by $a_{k + n} = - c_{0} a_{k} - c_{1} a_{k + 1} \dots - c_{n - 1} a_{k + n - 1}$ for $k \geq 0$ has the characteristic polynomial $p (x) = c_{0} + c_{1} x + \dots + c_{n - 1} x^{n - 1} + x^{n}$ , whose transpose companion matrix $C (p)^{T}$ generates the sequence: $[\begin{matrix} a_{k + 1} \\ a_{k + 2} \\ ⋮ \\ a_{k + n - 1} \\ a_{k + n} \end{matrix}] = [\begin{matrix} 0 & 1 & 0 & \dots & 0 \\ 0 & 0 & 1 & \dots & 0 \\ ⋮ & ⋮ & ⋮ & ⋱ & ⋮ \\ 0 & 0 & 0 & \dots & 1 \\ - c_{0} & - c_{1} & - c_{2} & \dots & - c_{n - 1} \end{matrix}] [\begin{matrix} a_{k} \\ a_{k + 1} \\ ⋮ \\ a_{k + n - 2} \\ a_{k + n - 1} \end{matrix}] .$ The vector $v = (1, λ, λ^{2}, \dots, λ^{n - 1})$ is an eigenvector of this matrix, where the eigenvalue $λ$ is a root of $p (x)$ . Setting the initial values of the sequence equal to this vector produces a geometric sequence $a_{k} = λ^{k}$ which satisfies the recurrence. In the case of n distinct eigenvalues, an arbitrary solution $a_{k}$ can be written as a linear combination of such geometric solutions, and the eigenvalues of largest complex norm give an asymptotic approximation.

From linear ODE to first-order linear ODE system

Similarly to the above case of linear recursions, consider a homogeneous linear ODE of order n for the scalar function $y = y (t)$ : $y^{(n)} + c_{n - 1} y^{(n - 1)} + \dots + c_{1} y^{(1)} + c_{0} y = 0 .$ This can be equivalently described as a coupled system of homogeneous linear ODE of order 1 for the vector function $z (t) = (y (t), y^{'} (t), \dots, y^{(n - 1)} (t))$ : $z^{'} = C (p)^{T} z$ where $C (p)^{T}$ is the transpose companion matrix for the characteristic polynomial $p (x) = x^{n} + c_{n - 1} x^{n - 1} + \dots + c_{1} x + c_{0} .$ Here the coefficients $c_{i} = c_{i} (t)$ may be also functions, not just constants.

If $C (p)^{T}$ is diagonalizable, then a diagonalizing change of basis will transform this into a decoupled system equivalent to one scalar homogeneous first-order linear ODE in each coordinate.

An inhomogeneous equation $y^{(n)} + c_{n - 1} y^{(n - 1)} + \dots + c_{1} y^{(1)} + c_{0} y = f (t)$ is equivalent to the system: $z^{'} = C (p)^{T} z + F (t)$ with the inhomogeneity term $F (t) = (0, \dots, 0, f (t))$ .

Again, a diagonalizing change of basis will transform this into a decoupled system of scalar inhomogeneous first-order linear ODEs.

Cyclic shift matrix

In the case of $p (x) = x^{n} - 1$ , when the eigenvalues are the complex roots of unity, the companion matrix and its transpose both reduce to Sylvester's cyclic shift matrix, a circulant matrix.

Multiplication map on a simple field extension

Consider a polynomial $p (x) = x^{n} + c_{n - 1} x^{n - 1} + \dots + c_{1} x + c_{0}$ with coefficients in a field $F$ , and suppose $p (x)$ is irreducible in the polynomial ring $F [x]$ . Then adjoining a root $λ$ of $p (x)$ produces a field extension $K = F (λ) ≅ F [x] / (p (x))$ , which is also a vector space over $F$ with standard basis ${1, λ, λ^{2}, \dots, λ^{n - 1}}$ . Then the $F$ -linear multiplication mapping

m_{λ} : K \to K

defined by

m_{λ} (α) = λ α

has an n × n matrix $[m_{λ}]$ with respect to the standard basis. Since $m_{λ} (λ^{i}) = λ^{i + 1}$ and $m_{λ} (λ^{n - 1}) = λ^{n} = - c_{0} - \dots - c_{n - 1} λ^{n - 1}$ , this is the companion matrix of $p (x)$ : $[m_{λ}] = C (p) .$ Assuming this extension is separable (for example if $F$ has characteristic zero or is a finite field), $p (x)$ has distinct roots $λ_{1}, \dots, λ_{n}$ with $λ_{1} = λ$ , so that $p (x) = (x - λ_{1}) \dots (x - λ_{n}),$ and it has splitting field $L = F (λ_{1}, \dots, λ_{n})$ . Now $m_{λ}$ is not diagonalizable over $F$ ; rather, we must extend it to an $L$ -linear map on $L^{n} ≅ L \otimes_{F} K$ , a vector space over $L$ with standard basis ${1 \otimes 1, 1 \otimes λ, 1 \otimes λ^{2}, \dots, 1 \otimes λ^{n - 1}}$ , containing vectors $w = (β_{1}, \dots, β_{n}) = β_{1} \otimes 1 + \dots + β_{n} \otimes λ^{n - 1}$ . The extended mapping is defined by $m_{λ} (β \otimes α) = β \otimes (λ α)$ .

The matrix $[m_{λ}] = C (p)$ is unchanged, but as above, it can be diagonalized by matrices with entries in $L$ : $[m_{λ}] = C (p) = V^{- 1} D V,$ for the diagonal matrix $D = diag (λ_{1}, \dots, λ_{n})$ and the Vandermonde matrix V corresponding to $λ_{1}, \dots, λ_{n} \in L$ . The explicit formula for the eigenvectors (the scaled column vectors of the inverse Vandermonde matrix $V^{- 1}$ ) can be written as: ${\tilde{w}}_{i} = β_{0 i} \otimes 1 + β_{1 i} \otimes λ + \dots + β_{(n - 1) i} \otimes λ^{n - 1} = \prod_{j \neq i} (1 \otimes λ - λ_{j} \otimes 1)$ where $β_{i j} \in L$ are the coefficients of the scaled Lagrange polynomial $\frac{p (x)}{x - λ_{i}} = \prod_{j \neq i} (x - λ_{j}) = β_{0 i} + β_{1 i} x + \dots + β_{(n - 1) i} x^{n - 1} .$

Notes

↑ Horn, Roger A.; Charles R. Johnson (1985). Matrix Analysis. Cambridge, UK: Cambridge University Press. pp. 146–147. ISBN 0-521-30586-1. https://books.google.com/books?id=f6_r93Of544C&dq=%22companion+matrix%22&pg=PA147. Retrieved 2010-02-10.

0.00

(0 votes)

Original source: https://en.wikipedia.org/wiki/Companion matrix. Read more

[1] Horn, Roger A.; Charles R. Johnson (1985). Matrix Analysis. Cambridge, UK: Cambridge University Press. pp. 146–147. ISBN 0-521-30586-1. https://books.google.com/books?id=f6_r93Of544C&dq=%22companion+matrix%22&pg=PA147. Retrieved 2010-02-10.

[1]

v t e Matrix classes
Explicitly constrained entries	(0,1) Alternant Anti-diagonal Anti-Hermitian Anti-symmetric Arrowhead Band Bidiagonal Binary Bisymmetric Block-diagonal Block Block tridiagonal Boolean Cauchy Centrosymmetric Conference Complex Hadamard Copositive Diagonally dominant Diagonal Discrete Fourier Transform Elementary Equivalent Frobenius Generalized permutation Hadamard Hankel Hermitian Hessenberg Hollow Integer Logical Markov Metzler Monomial Moore Nonnegative Partitioned Parisi Pentadiagonal Permutation Persymmetric Polynomial Positive Quaternionic Sign Signature Skew-Hermitian Skew-symmetric Skyline Sparse Sylvester Symmetric Toeplitz Triangular Tridiagonal Unitary Vandermonde Walsh Z
Constant	Exchange Hilbert Identity Lehmer Of ones Pascal Pauli Redheffer Shift Zero
Conditions on eigenvalues or eigenvectors	Companion Convergent Defective Diagonalizable Hurwitz Positive-definite Stability Stieltjes
Satisfying conditions on products or inverses	Congruent Idempotent or Projection Invertible Involutory Nilpotent Normal Orthogonal Orthonormal Singular Unimodular Unipotent Totally unimodular Weighing
With specific applications	Adjugate Alternating sign Augmented Bézout Carleman Cartan Circulant Cofactor Commutation Confusion Coxeter Derogatory Distance Duplication Elimination Euclidean distance Fundamental (linear differential equation) Generator Gramian Hessian Householder Jacobian Moment Payoff Pick Random Rotation Seifert Shear Similarity Symplectic Totally positive Transformation Wedderburn X–Y–Z
Used in statistics	Bernoulli Centering Correlation Covariance Design Dispersion Doubly stochastic Fisher information Hat Precision Stochastic Transition
Used in graph theory	Adjacency Biadjacency Degree Edmonds Incidence Laplacian Seidel adjacency Skew-adjacency Tutte
Used in science and engineering	Cabibbo–Kobayashi–Maskawa Density Fundamental (computer vision) Fuzzy associative Gamma Gell-Mann Hamiltonian Irregular Overlap S State transition Substitution Z (chemistry)
Related terms	Jordan canonical form Linear independence Matrix exponential Matrix representation of conic sections Perfect matrix Pseudoinverse Quaternionic matrix Row echelon form Wronskian
List of matrices Category:Matrices

Anonymous

Search

Companion matrix

Namespaces

More

Page actions

Contents

Similarity to companion matrix

Diagonalizability

Linear recursive sequences

From linear ODE to first-order linear ODE system

Cyclic shift matrix

Multiplication map on a simple field extension

See also

Notes

Navigation

Navigation

Help

googletranslator

Navigation

Wiki tools

Wiki tools

Anonymous

Search

Companion matrix

Similarity to companion matrix

Diagonalizability

Linear recursive sequences

From linear ODE to first-order linear ODE system

Cyclic shift matrix

Multiplication map on a simple field extension

See also

Notes

Navigation

Wiki tools

Page tools

Other projects

Categories