1. Recap: A few ways to multiply vectors and matrices
1.1. Vector multiplication operations (4 approaches)
Given we have 2 vectors, $\mathbf{a}$ and $\mathbf{b}$, of same length (i.e. $\mathbf{a}, \mathbf{b} \in \mathbb{R}^n$), we can "multiply" them in the following ways:
- Vector dot (inner) product: $\mathbf{a} \cdot \mathbf{b} = \mathbf{a}^\top \mathbf{b} = \sum_{i=1}^{n} a_i b_i$
- Vector outer product: $\mathbf{a} \otimes \mathbf{b} = \mathbf{a}\mathbf{b}^\top$. The resultant matrix is of size $n \times n$ and its elements are given by: $(\mathbf{a}\mathbf{b}^\top)_{ij} = a_i b_j$
- Vector Hadamard (aka element-wise) product: $\mathbf{a} \odot \mathbf{b}$. Elements of the resultant vector are given by: $(\mathbf{a} \odot \mathbf{b})_i = a_i b_i$
- Vector cross product: $\mathbf{a} \times \mathbf{b}$, defined for $n = 3$; the resultant vector is orthogonal to both $\mathbf{a}$ and $\mathbf{b}$
1.2. Matrix multiplication operations (4 approaches)
Given we have a matrix, $A \in \mathbb{R}^{m \times n}$, following are a few multiplication operations involving $A$. NB: inner dimensions must match! (A minimal sketch follows this list.)
- Matrix and some vector (examples: column vector $\mathbf{x} \in \mathbb{R}^n$, and row vector $\mathbf{y}^\top \in \mathbb{R}^{1 \times m}$):
  - Matrix times vector: $A\mathbf{x}$, where vector $\mathbf{x} \in \mathbb{R}^n$. Hence, resultant column vector $A\mathbf{x} \in \mathbb{R}^m$
  - Vector times matrix: $\mathbf{y}^\top A$, where vector $\mathbf{y} \in \mathbb{R}^m$. Hence, resultant row vector $\mathbf{y}^\top A \in \mathbb{R}^{1 \times n}$
- Matrix and some matrix (examples: $A$ and $B$ matrices are of same size, but matrix $C$ has different size):
  - Matrix Hadamard (aka element-wise) product: $A \odot B$, where $B \in \mathbb{R}^{m \times n}$. Elements of the resultant matrix are given by: $(A \odot B)_{ij} = a_{ij} b_{ij}$
  - Matrix multiplication: $AC$, where $A \in \mathbb{R}^{m \times n}$ and $C \in \mathbb{R}^{n \times p}$. Hence, inner dimensions match, and resultant matrix $AC \in \mathbb{R}^{m \times p}$
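A minimal numpy sketch of these four matrix operations (the arrays `A`, `B`, `C`, `x`, `y` are made-up examples):
import numpy as np
A = np.array([[1, 2, 3],
              [4, 5, 6]])   # A is [2x3]
B = np.array([[1, 0, 1],
              [0, 1, 0]])   # same size as A, for the Hadamard product
C = np.random.randn(3, 4)   # inner dims match for A @ C: [2x3][3x4]=[2x4]
x = np.array([1, 0, 2])     # column vector in R^3
y = np.array([1, -1])       # (row) vector in R^2
print("Ax (matrix times vector):", A @ x)     # dims: [2x3][3x1]=[2x1]
print("yA (vector times matrix):", y @ A)     # dims: [1x2][2x3]=[1x3]
print("A⊙B (Hadamard product):\n", A * B)     # elementwise, [2x3]
print("AC (matrix multiplication) shape:", (A @ C).shape)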
import numpy as np  # more basic functionality
import scipy.linalg  # advanced functionality, built on numpy (submodules need explicit import)
# Find the inner and outer products of two 1D arrays (numpy treats these as flat arrays, not explicit row/column vectors)
a = np.array([4, 5, 6])
b = np.array([7, 8, 9])
print("Given vectors a:", a, "and b:", b)
print("\n4 types of vector multiplication")
print(
"- Inner (aka dot) product: a•b = (a^T)b =", np.inner(a, b)
) # dot prod; dims are: [1x3][3x1]=[1x1] <-- output dim, scalar
print("- Hadamard (elementwise) product: a⊙b", a * b) # elementwise (or hadamard) product
print("- Cross product, a⨉b:", np.cross(a, b))
print("- Outer product, a[3⨉1] ⨂ b[1⨉3]:\n", np.outer(a, b)) # dims are [3x1][1x3]=[3x3] <-- output dim
Given vectors a: [4 5 6] and b: [7 8 9]
4 types of vector multiplication
- Inner (aka dot) product: a•b = (a^T)b = 122
- Hadamard (elementwise) product: a⊙b [28 40 54]
- Cross product, a⨉b: [-3 6 -3]
- Outer product, a[3⨉1] ⨂ b[1⨉3]:
[[28 32 36]
[35 40 45]
 [42 48 54]]
3. Gram-Schmidt Process
Use this to orthonormalise any set of vectors (or to orthogonalise the columns of a matrix), as in the sketch below.
- Transform a set of vectors $\{\mathbf{v}_1, \dots, \mathbf{v}_k\}$ into $\{\mathbf{u}_1, \dots, \mathbf{u}_k\}$, where each vector is in the same vector space,
  - but each vector is unit length, and
  - is mutually orthogonal with the other vectors
I.e. transform a set of vectors into a set of orthonormal vectors in the same vector space
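A minimal implementation sketch of the process itself (classical Gram-Schmidt; the helper name `gram_schmidt` is our own, not a library routine):
import numpy as np
def gram_schmidt(vectors):
    """Orthonormalise the rows of `vectors` via classical Gram-Schmidt."""
    ortho = []
    for v in vectors:
        # Subtract v's projection onto every previously accepted vector
        w = v - sum(np.dot(v, u) * u for u in ortho)
        if np.linalg.norm(w) > 1e-10:  # skip (near-)linearly-dependent vectors
            ortho.append(w / np.linalg.norm(w))
    return np.array(ortho)

U = gram_schmidt(np.array([[3.0, 1.0], [2.0, 2.0]]))
print(U)        # rows are unit length and mutually orthogonal
print(U @ U.T)  # ≈ identity matrix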
4. Matrix decompositions
4.1. Gaussian Elimination
- Purpose: We use Gaussian elimination to simplify a system of linear equations into row echelon form (or reduced row echelon form, which allows solving by simple inspection). A minimal implementation sketch follows this list.
- Application:
  - Solving the linear system $A\mathbf{x} = \mathbf{b}$
  - Computing inverse matrices
  - Computing rank
  - Computing determinants
- Elementary row operations: methods by which the above are done
  - Swapping rows
  - Scaling rows
  - Adding multiples of rows to each other (i.e. creating linear combinations)
- Row echelon form: the first non-zero element from the left in each row (aka leading coefficient, pivot) is always strictly to the right of the pivot in the row above
- Reduced row echelon form: row echelon form whose pivots are all $1$ and whose pivot columns are $0$ elsewhere
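A minimal sketch of forward elimination with partial pivoting (the helper `row_echelon` is our own, not a library routine):
import numpy as np
def row_echelon(A):
    """Reduce A to a row echelon form using elementary row operations."""
    A = A.astype(float).copy()
    m, n = A.shape
    r = 0                                    # current pivot row
    for c in range(n):                       # walk columns left to right
        p = r + np.argmax(np.abs(A[r:, c]))  # partial pivoting: pick biggest entry
        if np.isclose(A[p, c], 0.0):
            continue                         # no pivot in this column
        A[[r, p]] = A[[p, r]]                # swap rows
        A[r + 1:] -= np.outer(A[r + 1:, c] / A[r, c], A[r])  # eliminate below pivot
        r += 1
        if r == m:
            break
    return A

print(row_echelon(np.array([[2., 1., -1.], [-3., -1., 2.], [-2., 1., 2.]])))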
4.2. LU Decomposition
Like Gaussian elimination, but packaged as a reusable matrix factorisation, which is more computationally efficient (a usage sketch follows the list below).
Decompose any matrix (square or not) into:
- a lower triangular matrix $L$
- an upper triangular matrix $U$
- sometimes needing to reorder rows using a permutation matrix $P$, so that $A = PLU$
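A common use is factoring once and then solving $A\mathbf{x} = \mathbf{b}$ cheaply for many right-hand sides; a sketch using scipy's `lu_factor`/`lu_solve` on random data:
import numpy as np
import scipy.linalg
A = np.random.randn(3, 3)                  # a square system
b = np.random.randn(3)
lu, piv = scipy.linalg.lu_factor(A)        # one O(n^3) factorisation...
x = scipy.linalg.lu_solve((lu, piv), b)    # ...then each solve is only O(n^2)
print(np.allclose(A @ x, b))               # True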
a = np.random.randn(3, 4)
print("A:\n", a)
p, l, u = scipy.linalg.lu(a)
print("\nP:\n", p)
print("\nL:\n", l)
print("\nU:\n", u)
print("\n----\n\nRecomposition: PLU = A:\n", p @ l @ u)
A:
[[ 0.15901331 -0.42151338 0.89002566 -0.77368563]
[ 0.76901951 -0.8152131 -0.28904111 -1.03463915]
[ 0.85276297 0.99433259 -0.23377478 -0.12578455]]
P:
[[0. 0. 1.]
[0. 1. 0.]
[1. 0. 0.]]
L:
[[1. 0. 0. ]
[0.90179749 1. 0. ]
[0.18646835 0.354533 1. ]]
U:
[[ 0.85276297 0.99433259 -0.23377478 -0.12578455]
[ 0. -1.71189973 -0.0782236 -0.92120695]
[ 0. 0. 0.9613501 -0.42363252]]
----
Recomposition: PLU = A:
[[ 0.15901331 -0.42151338 0.89002566 -0.77368563]
[ 0.76901951 -0.8152131 -0.28904111 -1.03463915]
 [ 0.85276297  0.99433259 -0.23377478 -0.12578455]]
4.3. QR Decomposition
Decompose a matrix $A$ into:
- an orthogonal matrix $Q$
- an upper triangular matrix $R$
so that $A = QR$. It's used to solve the linear least-squares problem (and underpins the QR algorithm for eigenvalues), as sketched below.
Also, the $Q$ matrix is sometimes what we desire after the Gram-Schmidt process
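As a sketch of the least-squares use: for an overdetermined system $A\mathbf{x} \approx \mathbf{b}$, solve $R\mathbf{x} = Q^\top\mathbf{b}$ by back-substitution (the data here is made up):
import numpy as np
import scipy.linalg
A = np.random.randn(6, 3)                      # overdetermined: 6 equations, 3 unknowns
b = np.random.randn(6)
q, r = np.linalg.qr(A)                         # reduced QR: q is [6x3], r is [3x3]
x = scipy.linalg.solve_triangular(r, q.T @ b)  # back-substitute Rx = Q^T b
print(np.allclose(x, np.linalg.lstsq(A, b, rcond=None)[0]))  # matches lstsq: True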
a = np.random.randn(3, 4)
print("A:\n", a)
q, r = np.linalg.qr(a)
print("\nQ:\n", q)
print("\nR:\n", r)
print("\n----\n\nRecomposition: QR = A:\n", q @ r)
A:
[[-1.36254921 -0.77417909 -2.21614234 0.89851109]
[ 0.04834985 -0.67077294 -2.09042296 1.6127626 ]
[ 0.67829896 0.71423733 0.69209262 -1.74429325]]
Q:
[[-0.8947565 0.14601158 0.4220088 ]
[ 0.0317503 -0.92184032 0.38626719]
[ 0.44542421 0.35901398 0.82018671]]
R:
[[ 1.52281566 0.98954313 2.22481102 -1.52969339]
[ 0. 0.76172761 1.85192465 -1.98174223]
[ 0. 0. -1.17504818 -0.42850929]]
----
Recomposition: QR = A:
[[-1.36254921 -0.77417909 -2.21614234 0.89851109]
[ 0.04834985 -0.67077294 -2.09042296 1.6127626 ]
 [ 0.67829896  0.71423733  0.69209262 -1.74429325]]
4.4. Cholesky Decomposition
Decompose a symmetric (or Hermitian) positive-definite matrix $A$ into:
- a lower triangular matrix $L$
- and its transpose (or conjugate transpose), so that $A = LL^\top$ (or $LL^*$)
Used in algorithms for numerical convenience: solving an SPD system via Cholesky costs roughly half as much as LU, and $L$ is also used to sample correlated Gaussian random variables (see the sketch below).
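For example, solving a symmetric positive-definite system; a sketch using scipy's `cho_factor`/`cho_solve` on a random SPD matrix:
import numpy as np
import scipy.linalg
M = np.random.randn(4, 4)
A = M @ M.T + 4 * np.eye(4)             # symmetric positive-definite by construction
b = np.random.randn(4)
c, low = scipy.linalg.cho_factor(A)     # Cholesky factorisation of A
x = scipy.linalg.cho_solve((c, low), b)
print(np.allclose(A @ x, b))            # True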
x = np.diagflat([[1, 2], [3, 4]])
print("x:\n", x)
L = np.linalg.cholesky(x)
print("\nL:\n", L)
print("\n----\n\nRecomposition: LL^T:\n", L @ L.T)
x:
[[1 0 0 0]
[0 2 0 0]
[0 0 3 0]
[0 0 0 4]]
L:
[[1. 0. 0. 0. ]
[0. 1.41421356 0. 0. ]
[0. 0. 1.73205081 0. ]
[0. 0. 0. 2. ]]
----
Recomposition: LL^T:
[[1. 0. 0. 0.]
[0. 2. 0. 0.]
[0. 0. 3. 0.]
 [0. 0. 0. 4.]]
Questions
- When exactly do we use decompositions?