chapA.xml

A.1 Vector and Matrix Multiplication

We consider a vector

v

of length

J

(in an abstract vector space of dimensions

J

) to be an ordered sequence of

J

numbers

^{103}

. The vector can be displayed either as a column

v = (\begin{matrix} v_{1} \\ v_{2} \\ : \\ v_{J} \end{matrix})

(A . 1)

or as a row, which we regard as the transpose, denoted

^{T}

, of the column vector:

v^{T} = (v_{1}, v_{2}, . . ., v_{J}) .

(A . 2)

Vectors of the same dimensions can be added together so that the

j^{th}

entry of

u + v

u_{j} + v_{j}

The scalar product of two vectors

u

v

, in vector notation is indicated by a dot, but in matrix notation the dot is usually omitted. Instead we write it

u^{T} v = \sum_{j = 1}^{J} u_{j} v_{j} .

(A . 3)

If we have a set of

k

column vectors

v_{k}

, for

k = 1, \dots, K

, the

j^{th}

element of the

k^{th}

vector can be written,

V_{jk}

, and they can be arrayed compactly one after the other as

V = (\begin{matrix} V_{11} & V_{12} & \dots & V_{1 K} \\ V_{21} & V_{22} & \dots & V_{2 K} \\ : & : & ⋱ & : \\ V_{J 1} & V_{J 2} & \dots & V_{JK} \end{matrix}) .

(A . 4)

This is a matrix. We can consider matrix multiplication to be a generalization of the scalar product. So premultiplying a

J \times K

matrix

V

, by a length

J

row vector

u^{T}

gives a new row vector of length

K

u^{T} V = (\sum_{j = 1}^{J} u_{j} V_{j 1}, \sum_{j = 1}^{J} u_{j} V_{j 2}, \dots, \sum_{j = 1}^{J} u_{j} V_{jK}) .

(A . 5)

If we further have a set of

M

row vectors, we can display them as a matrix

U = (\begin{matrix} U_{11} & U_{12} & \dots & U_{1 J} \\ U_{21} & U_{22} & \dots & U_{2 J} \\ : & : & ⋱ & : \\ U_{M 1} & U_{M 2} & \dots & U_{MJ} \end{matrix})

(A . 6)

(dispensing with the transpose notation for brevity and consistency). And multiplication of the matrices

U

(

M \times J

) and

V

(

J \times K

) can be considered to give an

M \times K

matrix:

U V = (\begin{matrix} \sum_{j = 1}^{J} U_{1 j} V_{j 1} & \sum_{j = 1}^{J} U_{1 j} V_{j 2} & \dots & \sum_{j = 1}^{J} U_{1 j} V_{jK} \\ \sum_{j = 1}^{J} U_{2 j} V_{j 1} & \sum_{j = 1}^{J} U_{2 j} V_{j 2} & \dots & \sum_{j = 1}^{J} U_{2 j} V_{jK} \\ : & : & ⋱ & : \\ \sum_{j = 1}^{J} U_{Mj} V_{j 1} & \sum_{j = 1}^{J} U_{Mj} V_{j 2} & \dots & \sum_{j = 1}^{J} U_{Mj} V_{jK} \end{matrix}) .

(A . 7)

This is the definition of matrix multiplication. A matrix (or vector) can also be multiplied by a single number: a scalar,

λ

(say). The

(jk)

th element of

λ V

λ V_{jk}

The transpose of a matrix

A = (A_{ij})

is simply the matrix formed from reversing the order of suffixes

^{104}

A^{T} = (A_{ij}^{T}) = (A_{ji})

. The transpose of a product of two matrices is therefore the reverse of the product of the transposes:

(A B)^{T} = B^{T} A^{T} .

(A . 8)

A.2 Determinants

The determinant of a square matrix is a single scalar that is an important measure of its character. Determinants may be defined inductively. Suppose we know the definition of determinants of matrices of size

(M - 1) \times (M - 1)

. Define the determinant of an

M \times M

matrix

A

whose

{ij}^{th}

entry is

A_{ij}

, as the expression

det (A) = | A | = \sum_{j = 1}^{M} A_{1 j} {Co}_{1 j} (A)

(A . 9)

where

{Co}_{ij} (A)

is the

{ij}^{th}

cofactor of the matrix

A

. The

{ij}^{th}

cofactor of an

M \times M

matrix is

(- 1)^{i + j}

times the determinant of the

(M - 1) \times (M - 1)

matrix obtained by removing the

i^{th}

row and the

j^{th}

column of the original matrix:

{Co}_{ij} (A) = (- 1)^{i + j} | (\begin{matrix} A_{11} & \dots & A_{1 j - 1} & A_{1 j + 1} & \dots & A_{1 M} \\ : & : & : & : & : & : \\ A_{i - 1, 1} & \dots & A_{i - 1, j - 1} & A_{i - 1, j + 1} & \dots & A_{i - 1, M} \\ A_{i + 1, 1} & \dots & A_{i + 1, j - 1} & A_{i + 1, j + 1} & \dots & A_{i + 1, M} \\ : & : & : & : & : & : \\ A_{M 1} & \dots & A_{Mj - 1} & A_{Mj + 1} & \dots & A_{MM} \end{matrix}) | .

(A . 10)

The inductive definition is completed by defining the determinant of a

1 \times 1

matrix to be equal to its single element. The determinant of a

2 \times 2

matrix is then

A_{11} A_{22} - A_{12} A_{21}

, and of a

3 \times 3

matrix is

A_{11} (A_{22} A_{33} - A_{23} A_{32}) + A_{12} (A_{23} A_{31} - A_{21} A_{13}) + A_{13} (A_{21} A_{32} - A_{22} A_{31})

The determinant of an

M \times M

matrix may equivalently be defined as the sum over all the

M!

possible permutations

P

of the integers

1, . . ., M

, of the product of the entries

\underset{i}{Π} A_{i, P (i)}

times the signum of

P

(plus or minus 1 according to whether

P

is even or odd):

| A | = \sum_{P} sgn (P) A_{1, P (1)} A_{2, P (2)} . . . A_{M, P (M)} .

(A . 11)

This expression shows that there is nothing special about the first row in eq. (A.9). One could equally well have used any row,

i

, giving

| A | = \sum_{j = 1}^{M} A_{ij} {Co}_{ij} (A)

; or one could have used any column,

j

| A | = \sum_{i = 1}^{M} A_{ij} {Co}_{ij} (A)

. All the results are the same.

The determinant of the transpose of a matrix

A

is equal to its determinant:

| A^{T} | = | A |

. The determinant of a product of two matrices is the product of the determinants:

| A B | = | A | | B |

. A matrix is said to be singular if its determinant is zero, otherwise it is nonsingular. If a matrix has two identical (or proportional, i.e. dependent) rows or two identical columns, then its determinant is zero and it is singular

^{105}

A.3 Inverses

The unit matrix is square,

I = (δ_{ij}) = (\begin{matrix} 1 & 0 & \dots & 0 \\ 0 & 1 & \dots & 0 \\ : & : & ⋱ & : \\ 0 & 0 & \dots & 1 \end{matrix})

(A . 12)

with ones on the diagonal and zeroes elsewhere. It may be of any size,

N

, and if need be then denoted

I_{N}

. For any

M \times N

matrix

A

I_{M} A = A and A I_{N} = A .

(A . 13)

The inverse of a square matrix

A

, if it exists, is another matrix written

A^{- 1}

such that

^{106}

A^{- 1} A = A A^{- 1} = I .

(A . 14)

A nonsingular square matrix possesses an inverse. A singular matrix does not.

The inverse of a matrix may be identified by considering the identity

\sum_{j = 1}^{M} A_{ij} {Co}_{kj} (A) = δ_{ik} | A | .

(A . 15)

For

i = k

, this equality arises as the expansion of the determinant by row

i

. For

i \neq k

, the sum represents the determinant, expanded by row

k

, of a matrix in which the row

k

has been replaced by a copy of row

i

. The modified matrix has two rows identical, so its determinant is zero, as is

δ_{ij}, i \neq j

. Now if we regard

Co (A)

as a matrix, consisting of all the cofactors. Then we can consider

\sum_{j = 1}^{M} A_{ij} {Co}_{kj} (A)

as being the matrix product of

A

by the transpose of the cofactor matrix,

A Co (A)^{T}

. So if

| A |

is nonzero we may divide (A.15) through by it and find

A [Co (A)^{T} / | A |] = I .

(A . 16)

This equality shows that

A^{- 1} = Co (A)^{T} / | A | .

(A . 17)

Consequently the solution of the nonsingular matrix equation

A x = b

x = \frac{Co (A)^{T} b}{| A |},

(A . 18)

which for column vectors

x

and

b

is Cramer's rule.

The inverse of the product of two nonsingular matrices is the reversed product of their inverses:

(A B)^{- 1} = B^{- 1} A^{- 1} .

(A . 19)

A.4 Eigenanalysis

A square matrix

A

maps the linear space of column vectors onto itself via

A x = y

, with

y

the vector onto which

x

is mapped. An eigenvector is a vector which is mapped onto a multiple of itself. That is

A x = λ x,

(A . 20)

where

λ

is a scalar called the eigenvalue. In general a square matrix of dimension

N

has

N

different eigenvectors. Obviously an eigenvector times any scalar is still an eigenvector, which is not considered to be different.

Since eq. (A.20), which is

(A - λ I) x = 0

, is a homogeneous equation for the elements of

x

, in order for there to be a non-zero solution,

x

, the determinant of the coefficients must be zero:

| A - λ I | = 0 .

(A . 21)

For an

N \times N

matrix, this determinant gives a polynomial of order

N

for

λ

, whose

N

roots are the

N

eigenvalues.

A

is symmetric, that is if

A^{T} = A

, then the eigenvectors corresponding to different eigenvalues are orthogonal, that is, their scalar product is zero. See this by considering two eigenvectors

e_{1}

and

e_{2}

, corresponding to different eigenvalues

λ_{1}

λ_{2}

, and using the respective versions of eq. (A.20) and the properties of the transpose.

e_{2}^{T} A e_{1} = e_{2}^{T} λ_{1} e_{1}, e_{2}^{T} A^{T} e_{1} = (e_{1}^{T} A e_{2})^{T} = (e_{1}^{T} λ_{2} e_{2})^{T} = e_{2}^{T} λ_{2} e_{1} .

(A . 22)

So by subtraction

0 = e_{2}^{T} (A - A^{T}) e_{1} = (λ_{1} - λ_{2}) e_{2}^{T} e_{1} .

(A . 23)

If there are multiple independent eigenvectors with identical eigenvalues, they can be chosen to be orthogonal. In that standard case, the eigenvectors are all orthogonal:

e_{i}^{T} e_{j} = 0

for

i \neq j

If we then take the eigenvectors also to be normalized such that

e_{j}^{T} e_{j} = 1

, we can construct a square matrix

U

whose columns are equal to these eigenvectors (as in eq. (A.4)). The matrix

U

whose columns are orthonormal is said to be an orthonormal matrix (sometimes just called orthogonal). The inverse of

U

is its transpose:

U^{- 1} = U^{T}

. This

U

is a unitary basis transformation which diagonalizes

A

. This fact follows from the observation that

A U = D U = U D

where

D

is the diagonal matrix constructed from the eigenvalues:

D = (\begin{matrix} λ_{1} & 0 & \dots & 0 \\ 0 & λ_{2} & \dots & 0 \\ : & : & ⋱ & : \\ 0 & 0 & \dots & λ_{N} \end{matrix}) .

(A . 24)

Therefore

U^{T} A U = U^{T} U D = D .

(A . 25)

HEAD

Appendix A Summary of Matrix Algebra

A.1 Vector and Matrix Multiplication

A.2 Determinants

A.3 Inverses

A.4 Eigenanalysis

Appendix A
Summary of Matrix Algebra