Definition 1.1.1 (Matrix) A rectangular array of numbers is called a matrix.
The horizontal arrays of a matrix are called its rows and the vertical arrays are called its columns. A matrix is said to have the order m × n if it has m rows and n columns. An m × n matrix A can be represented in either of the following forms:

then
a11 = 1, a12 = 3,a13 = 7,a21 = 4,a22 = 5, and a23 = 6.
A matrix having only one column is called a column vector; and a matrix with only one row is called a row vector. Whenever a vector is used, it should be understood from the context whether it is a row vector or a column vector. Also, all the vectors will be represented by bold letters.
Definition 1.1.2 (Equality of two Matrices) Two matrices A = [aij] and B = [bij] having the same order m × n are equal if aij = bij for each i = 1,2,…,m and j = 1,2,…,n.
In other words, two matrices are said to be equal if they have the same order and their corresponding entries are equal.
Example 1.1.3 The linear system of equations 2x + 3y = 5 and 3x + 2y = 5 can be identified
with the matrix
. Note that x and y are indeterminate and we can think of x being
associated with the first column and y being associated with the second column.
![[0 0] [0 0 0]
02×2 = 0 0 and 02×3 = 0 0 0 .](mth1023x.png)
are a few diagonal matrices.
A diagonal matrix D of order n with the diagonal entries d1,d2,…,dn is denoted by D = diag(d1,…,dn). If di = d for all i = 1,2,…,n then the diagonal matrix D is called a scalar matrix.
For example, I2 =
and I3 =
. The subscript n is suppressed in case the
order is clear from the context or if no confusion arises.
A square matrix A = [aij] is said to be a lower triangular matrix if aij = 0 for i < j.
A square matrix A is said to be triangular if it is an upper or a lower triangular matrix.
For example,
is upper triangular,
is lower triangular.
Exercise 1.1.5 Are the following matrices upper triangular, lower triangular or both?
Definition 1.2.1 (Transpose of a Matrix) The transpose of an m × n matrix A = [aij] is defined as the n × m matrix B = [bij], with bij = aji for 1 ≤ i ≤ m and 1 ≤ j ≤ n. The transpose of A is denoted by At.
That is, if A =
then At =
. Thus, the transpose of a row vector is a column vector and
vice-versa.
Proof. Let A = [aij],At = [bij] and (At)t = [cij]. Then, the definition of transpose gives

Definition 1.2.3 (Addition of Matrices) let A = [aij] and B = [bij] be two m × n matrices. Then the sum A + B is defined to be the matrix C = [cij] with cij = aij + bij.
Note that, we define the sum of two matrices only when the order of the two matrices are same.
Definition 1.2.4 (Multiplying a Scalar to a Matrix) Let A = [aij] be an m × n matrix. Then for any element k ∈ ℝ, we define kA = [kaij].
For example, if A =
and k = 5, then 5A =
.
Proof. Part 1.
Let A = [aij] and B = [bij]. Then
![A + B = [aij]+ [bij] = [aij + bij] = [bij + aij] = [bij]+ [aij] = B + A](mth10215x.png)
The reader is required to prove the other parts as all the results follow from the properties of real numbers. □
Definition 1.2.6 (Additive Inverse) Let A be an m × n matrix.
and B =
. Compute A + Bt and B + At.
Definition 1.2.8 (Matrix Multiplication / Product) Let A = [aij] be an m×n matrix and B = [bij ] be an n × r matrix. The product AB is a matrix C = [cij] of order m × r, with

That is, if Am×n =
and Bn×r =
then
![AB = [(AB )ij]m×r and (AB )ij = ai1b1j + ai2b2j + ⋅⋅⋅+ ainbnj.](mth10221x.png)
Observe that the product AB is defined if and only if
the number of columns of A = the number of rows of B.
For example, if A =
and B =
then
![]() | (1.2.1) |
Observe that in Equation (1.2.1), the first row of AB can be re-written as
![[ ] [ ] [ ]
a ⋅α β γ δ + b⋅ x y z t + c⋅ u v w s .](mth10225x.png)
Remark 1.2.9 Observe the following:
However, for square matrices A and B of the same order, both the product AB and BA are defined.
![⌊ ⌋
a1B
||a2B ||
AB = [Ab1,Ab2, ...,Abp ] = |⌈ .. |⌉.
.
anB](mth10228x.png)
Example 1.2.10 Let A =
and B =
. Use the row/column method of
matrix multiplication to
![1⋅[1,0,- 1]+ 0 ⋅[0,0,1]+ 1⋅[0,- 1,1] = [1,- 1,0].](mth10231x.png)

Definition 1.2.11 (Commutativity of Matrix Product) Two square matrices A and B are said to commute if AB = BA.
Remark 1.2.12 Note that if A is a square matrix of order n and if B is a scalar matrix of
order n then AB = BA. In general, the matrix product is not commutative. For example, consider
A =
and B =
. Then check that the matrix product
![[2 0] [1 1]
AB = ⁄= = BA.
0 0 1 1](mth10235x.png)
Theorem 1.2.13 Suppose that the matrices A,B and C are so chosen that the matrix multiplications are defined.
A similar statement holds for the columns of A when A is multiplied on the right by D.
Proof. Part 1. Let A = [aij]m×n,B = [bij]n×p and C = [cij]p×q. Then


Part 5. For all j = 1,2,…,n, we have

The reader is required to prove the other parts. □
. Compute A + 3A2 - A3 and aA3 + bA + cA2.
and B =
. Use the row/column method of matrix multiplication
to compute the
![[ ] ⌊1 1 1⌋ ⌊1 1 1⌋
1 1 , ⌈0 1 1⌉, ⌈1 1 1⌉ .
0 1 0 0 1 1 1 1](mth10242x.png)
=
. Determine the matrix A.
=
. Can you construct the matrix A satisfying the
above? Why!
Definition 1.2.15 (Inverse of a Matrix) Let A be a square matrix of order n.
Lemma 1.2.16 Let A be an n×n matrix. Suppose that there exist n×n matrices B and C such that AB = In and CA = In, then B = C.
Proof. Note that

Proof. Proof of Part 1.
By definition AA-1 = A-1A = I. Hence, if we denote A-1 by B, then we get AB = BA = I. Thus, the
definition, implies B-1 = A, or equivalently (A-1)-1 = A.
Proof of Part 2.
Verify that (AB)(B-1A-1) = I = (B-1A-1)(AB).
Proof of Part 3.
We know AA-1 = A-1A = I. Taking transpose, we get

We will again come back to the study of invertible matrices in Sections 2.2 and 2.5.
and
.
Ar is also an invertible
matrix.
. Determine the matrix
A.
. Determine the matrix A [Hint: See
Example 1.2.18.2 and Theorem 1.2.19.1].

.
and B =
. Then A is a symmetric matrix and B is
a skew-symmetric matrix.
. Then A is an orthogonal matrix.
. Then A2 = A. The matrices that satisfy the condition that A2 = A are
called idempotent matrices.
(A + At) is symmetric, T =
(A - At) is
skew-symmetric, and A = S + T.
+ (-1)k-1Ak-1].
Definition 1.3.4 A matrix obtained by deleting some of the rows and/or columns of a matrix is said to be a submatrix of the given matrix.
For example, if A =
, a few submatrices of A are
![[ ] [ ]
[1],[2], 1 ,[15], 1 5 ,A.
0 0 2](mth10275x.png)
But the matrices
and
are not submatrices of A. (The reader is advised to give
reasons.)
Let A be an n × m matrix and B be an m × p matrix. Suppose r < m. Then, we can decompose the
matrices A and B as A = [PQ] and B =
; where P has order n×r and H has order r ×p. That is,
the matrices P and Q are submatrices of A and P consists of the first r columns of A and Q consists of
the last m - r columns of A. Similarly, H and K are submatrices of B and H consists of the first r
rows of B and K consists of the last m - r rows of B. We now prove the following important
theorem.
Proof. First note that the matrices PH and QK are each of order n × p. The matrix products PH and QK are valid as the order of the matrices P,H,Q and K are respectively, n × r,r × p,n × (m - r) and (m - r) × p. Let P = [Pij],Q = [Qij],H = [Hij], and K = [kij]. Then, for 1 ≤ i ≤ n and 1 ≤ j ≤ p, we have

Remark 1.3.6 Theorem 1.3.5 is very useful due to the following reasons:
For example, if A =
and B =
, Then
![[ ][ ] [ ] [ ]
AB = 1 2 a b + 0 [ef] = a+ 2c b+ 2d .
2 5 c d 0 2a+ 5c 2b + 5d](mth10284x.png)
If A =
, then A can be decomposed as follows:
, or A =
, or
and so on.Suppose A =
m1m2 |
|||
| ![]() |
s1s2 |
|||
| ![]() |
Even if A + B is defined, the orders of P and E may not be same and hence, we may not be able to
add A and B in the block form. But, if A + B and P + E is defined then A + B =
.
Similarly, if the product AB is defined, the product PE need not be defined. Therefore, we can talk of
matrix product AB as block product of matrices, if both the products AB and PE are defined. And in
this case, we have AB =
.
That is, once a partition of A is fixed, the partition of B has to be properly chosen for purposes of block addition or multiplication.
,B =
and C =
. Compute
and y =
. Determine the 2 × 2 matrix
Now, let C and D be two 2×2 matrices such that y = Cx gives rise to counter-clockwise rotation through an angle β and y = Dx gives rise to the reflection along the line y = (tanδ)x, respectively. Then prove that
,B =
and C =
. If x =
and y =
then geometrically interpret the following:
x1 | = a11y1 + a12y2 |
x2 | = a21y1 + a22y2 |
y1 | = b11z1 + b12z2 |
y2 | = b21z1 + b22z2 |

.
and B =
. Compute the matrix product AB using
the block matrix multiplication.
. If P,Q,R and S are symmetric, is the matrix A symmetric? If A is symmetric,
is it necessary that the matrices P,Q,R and S are symmetric?
, where A11 is a 2 × 2 invertible matrix and c is a
real number.

In this chapter, we started with the definition of a matrix and came across lots of examples. In particular, the following examples were important:
We also learnt product of two matrices. Even though it seemed complicated, it basically tells the following:
Let us look at some examples of linear systems.
.

The readers are advised to supply the proof.
Definition 2.1.1 (Linear System) A system of m linear equations in n unknowns x1,x2,…,xn is a set of equations of the form
where for 1 ≤ i ≤ n, and 1 ≤ j ≤ m;aij,bi ∈ ℝ. Linear System (2.1.1) is called homogeneous if b1 = 0 = b2 =
= bm and non-homogeneous otherwise.
We rewrite the above equations in the form Ax = b, where
A =
,x =
, and b =
The matrix A is called the coefficient matrix and the block matrix
, is called the augmented
matrix of the linear system (2.1.1).
That is, for i = 1,2,…,m and j = 1,2,…,n, the entry aij of the coefficient matrix A corresponds to the ith linear equation and the jth variable xj.
Definition 2.1.3 For a system of linear equations Ax = b, the system Ax = 0 is called the associated homogeneous system.
Definition 2.1.4 (Solution of a Linear System) A solution of Ax = b is a column vector y with entries y1,y2,…,yn such that the linear system (2.1.1) is satisfied by substituting yi in place of xi . The collection of all solutions is called the solution set of the system.
That is, if yt = [y1,y2,…,yn] is a solution of the linear system Ax = b then Ay = b holds. For
example, from Example 3.3a, we see that the vector yt = [1,1,1] is a solution of the system Ax = b,
where A =
,xt = [x,y,z] and bt = [3,7,13].
We now state a theorem about the solution set of a homogeneous system. The readers are advised to supply the proof.
Example 2.1.7 Solve the linear system y + z = 2,2x + 3z = 5,x + y + z = 3.
Solution: In this case, the augmented matrix is
and the solution method proceeds along the
following steps.

.



.

The last equation gives z = 1. Using this, the second equation gives y = 1. Finally, the first equation gives x = 1. Hence the solution set is {(x,y,z)t : (x,y,z) = (1,1,1)}, a unique solution.
In Example 2.1.7, observe that certain operations on equations (rows of the augmented matrix) helped us in getting a system in Item 5, which was easily solvable. We use this idea to define elementary row operations and equivalence of two linear systems.
Definition 2.1.8 (Elementary Row Operations) Let A be an m×n matrix. Then the elementary row operations are defined as follows:
Definition 2.1.9 (Equivalent Linear Systems) Let [Ab] and [Cd] be augmented matrices of two linear systems. Then the two linear systems are said to be equivalent if [Cd] can be obtained from [Ab] by application of a finite number of elementary row operations.
Definition 2.1.10 (Row Equivalent Matrices) Two matrices are said to be row-equivalent if one can be obtained from the other by a finite number of elementary row operations.
Thus, note that linear systems at each step in Example 2.1.7 are equivalent to each other. We also prove the following result that relates elementary row operations with the solution set of a linear system.
Lemma 2.1.11 Let Cx = d be the linear system obtained from Ax = b by application of a single elementary row operation. Then Ax = b and Cx = d have the same solution set.
Proof. We prove the result for the elementary row operation Rjk(c) with c≠0. The reader is advised to prove the result for other elementary operations.
In this case, the systems Ax = b and Cx = d vary only in the kth equation. Let (α1,α2,…,αn) be a solution of the linear system Ax = b. Then substituting for αi’s in place of xi’s in the kth and jth equations, we get

![]() | (2.1.2) |
But then the kth equation of the linear system Cx = d is
![]() | (2.1.3) |
Therefore, using Equation (2.1.2), (α1,α2,…,αn) is also a solution for kth Equation (2.1.3).
Use a similar argument to show that if (β1,β2,…,βn) is a solution of the linear system Cx = d then it is also a solution of the linear system Ax = b. Hence, the required result follows. □
The readers are advised to use Lemma 2.1.11 as an induction step to prove the main result of this subsection which is stated next.
We first define the Gauss elimination method and give a few examples to understand the method.
Definition 2.1.13 (Forward/Gauss Elimination Method) The Gaussian elimination method is a procedure for solving a linear system Ax = b (consisting of m equations in n unknowns) by bringing the augmented matrix
![|
⌊ a11 a12 ⋅⋅⋅ a1m ⋅⋅⋅ a1n |b1 ⌋
| a21 a22 ⋅⋅⋅ a2m ⋅⋅⋅ a2n |b2 |
[Ab] = || .. .. .. .. .. | .. ||
⌈ . . . . . | . ⌉
am1 am2 ⋅⋅⋅ amm ⋅⋅⋅ amn |bm](mth102137x.png)

We have already seen an example before defining the notion of row equivalence. We give two more examples to illustrate the Gauss elimination method.
Example 2.1.14 Solve the following linear system by Gauss elimination method.

Solution: Let A =
and b =
. The Gauss Elimination method starts with the augmented
matrix [Ab] and proceeds as follows:



Thus, the solution set is {(x,y,z)t : (x,y,z) = (1,2-z,z)} or equivalently {(x,y,z)t : (x,y,z) = (1,2,0)+z(0,-1,1)}, with z arbitrary. In other words, the system has infinite number of solutions. Observe that the vector yt = (1,2,0) satisfies Ay = b and the vector zt = (0,-1,1) is a solution of the homogeneous system Ax = 0.
Example 2.1.15 Solve the following linear system by Gauss elimination method.

Solution: Let A =
and b =
. The Gauss Elimination method starts with the augmented
matrix [Ab] and proceeds as follows:



The third equation in the last step is

Remark 2.1.16 Note that to solve a linear system Ax = b, one needs to apply only the row operations to the augmented matrix [Ab].
Definition 2.1.17 (Row Echelon Form of a Matrix) A matrix C is said to be in the row echelon form if
Example 2.1.18 The matrices
and
are in
row-echelon form. Whereas, the matrices

Definition 2.1.19 (Basic, Free Variables) Let Ax = b be a linear system consisting of m equations in n unknowns. Suppose the application of Gauss elimination method to the augmented matrix [Ab] yields the matrix [Cd].
The free variables are called so as they can be assigned arbitrary values. Also, the basic variables can be written in terms of the free variables and hence the value of basic variables in the solution set depend on the values of the free variables.
Remark 2.1.20 Observe the following:

Before proceeding further, we have the following definition.
Definition 2.1.21 (Consistent, Inconsistent) A linear system is called consistent if it admits a solution and is called inconsistent if it admits no solution.
We are now ready to prove conditions under which the linear system Ax = b is consistent or inconsistent.
Theorem 2.1.22 Consider the linear system Ax = b, where A is an m × n matrix and xt = (x1 , x2,…,xn). If one obtains [Cd] as the row-echelon form of [Ab] with dt = (d1,d2,…,dm) then
Proof. Part 1: The linear equation corresponding to the row [0t1] equals

Part 2: Suppose [Cd] has r non-zero rows. As [Cd] is in row echelon form there exist positive integers 1 ≤ i1 < i2 < … < ir ≤ n such that entries cℓiℓ for 1 ≤ ℓ ≤ r are leading terms. This in turn implies that the variables xij, for 1 ≤ j ≤ r are the basic variables and the remaining n - r variables, say xt1,xt2,…,xtn-r, are free variables. So for each ℓ,1 ≤ ℓ ≤ r, one obtains xiℓ + ∑ k>iℓcℓkxk = dℓ (k > iℓ in the summation as [Cd] is an upper triangular matrix). Or equivalently,



We omit the proof of the next result as it directly follows from Theorem 2.1.22.
Corollary 2.1.23 Consider the homogeneous system Ax = 0. Then
We end this subsection with some applications related to geometry.

d,0,-
d,d). Hence, taking d = 13, the equation of the
required circle is


d,-
,-
d,d). Hence, taking d = 3, the equation of the
required plane is -4x - y + 2z + 3 = 0.
.
Solution of Part 3a: Solving for Ax = 2x is same as solving for (A - 2I)x = 0. This leads
to the augmented matrix
. Check that a non-zero solution is given by
xt = (1,0,0).
Solution of Part 3b: Solving for Ay = 4y is same as solving for (A - 4I)y = 0. This leads to
the augmented matrix
. Check that a non-zero solution is given by
yt = (2,0,1).
The Gauss-Jordan method consists of first applying the Gauss Elimination method to get the row-echelon form of the matrix [Ab] and then further applying the row operations as follows. For example, consider Example 2.1.7. We start with Step 5 and apply row operations once again. But this time, we start with the 3rd row.

times 3rd equation.

Definition 2.1.26 (Row-Reduced Echelon Form) A matrix C is said to be in the row-reduced echelon form or reduced row echelon form if
A matrix which is in the row-reduced echelon form is also called a row-reduced echelon matrix.
Example 2.1.27 Let A =
and B =
. Then A and
B are in row echelon form. If C and D are the row-reduced echelon forms of A and B, respectively
then C =
and D =
.
Definition 2.1.28 (Back Substitution/Gauss-Jordan Method) The procedure to get The row-reduced echelon matrix from the row-echelon matrix is called the back substitution. The elimination process applied to obtain the row-reduced echelon form of the augmented matrix is called the Gauss-Jordan elimination method.
That is, the Gauss-Jordan elimination method consists of both the forward elimination and the backward substitution.
Remark 2.1.29 Note that the row reduction involves only row operations and proceeds from left to right. Hence, if A is a matrix consisting of first s columns of a matrix C, then the row-reduced form of A will consist of the first s columns of the row-reduced form of C.
The proof of the following theorem is beyond the scope of this book and is omitted.
Remark 2.1.31 Consider the linear system Ax = b. Then Theorem 2.1.30 implies the following:
Example 2.1.32 Consider Ax = b, where A is a 3 × 3 matrix. Let [Cd] be the row-reduced echelon form of [Ab]. Also, assume that the first column of A has a non-zero entry. Then the possible choices for the matrix [Cd] with respective solution sets are given below:
. Ax = b has a unique solution, (x,y,z) = (d1,d2,d3).
,
or
. Ax = b has no solution for any
choice of α,β.
,
,
. Ax = b has Infinite number of
solutions for every choice of α,β.
| x | + y | – 2 u | + v | = 2 | ||
| z | + u | + 2 v | = 3 | |||
| v | + w | = 3 | ||||
| v | + 2 w | = 5 |
In the previous section, we solved a system of linear equations with the help of either the Gauss Elimination method or the Gauss-Jordan method. These methods required us to make row operations on the augmented matrix. Also, we know that (see Section 1.2.1 ) the row-operations correspond to multiplying a matrix on the left. So, in this section, we try to understand the matrices which helped us in performing the row-operations and also use this understanding to get some important results in the theory of square matrices.
Definition 2.2.1 A square matrix E of order n is called an elementary matrix if it is obtained by applying exactly one row operation to the identity matrix, In.
Remark 2.2.2 Fix a positive integer n. Then the elementary matrices of order n are of three types and are as follows:

and B =
. Then B is obtained from A by the
interchange of 2nd and 3rd row. Verify that

. Then B =
is the row-reduced echelon form of A.
The readers are advised to verify that


Remark 2.2.4 Observe the following:
That is, all the elementary matrices are invertible and the inverses are also elementary matrices.
![Ek ⋅Ek-1⋅⋅⋅E2 ⋅E1 ⋅[Ab] = [Cd ].](mth102194x.png)
We are now ready to prove a equivalent statements in the study of invertible matrices.
As A is invertible, we have A-1A = In = AA-1. Let x0 be a solution of the homogeneous system Ax = 0. Then, Ax0 = 0 and Thus, we see that 0 is the only solution of the homogeneous system Ax = 0.
Let xt = [x1,x2,…,xn]. As 0 is the only solution of the linear system Ax = 0, the final equations are x1 = 0, x2 = 0,…,xn = 0. These equations can be rewritten as

Suppose that the row-reduced echelon form of A is In. Then using Remark 2.2.4.4, there exist elementary matrices E1,E2,…,Ek such that
![]() | (2.2.4) |
Now, using Remark 2.2.4, the matrix Ej-1 is an elementary matrix and is the inverse of Ej for 1 ≤ j ≤ k. Therefore, successively multiplying Equation (2.2.4) on the left by E1-1,E2-1,…,Ek-1, we get

Suppose A = E1E2
Ek; where the Ei’s are elementary matrices. As the elementary matrices are
invertible (see Remark 2.2.4) and the product of invertible matrices is also invertible, we get the required
result. □
As an immediate consequence of Theorem 2.2.5, we have the following important result.
Proof. Suppose there exists a matrix C such that CA = In. Let x0 be a solution of the homogeneous system Ax = 0. Then Ax0 = 0 and

Using the first part, it is clear that the matrix B in the second part, is invertible. Hence

Remark 2.2.7 Theorem 2.2.6 implies the following:
EkA = In. Then A-1 = E1E2
Ek.Remark 2.2.7 gives the following method of computing the inverse of a matrix.
Summary: Let A be an n×n matrix. Apply the Gauss-Jordan method to the matrix [AIn]. Suppose the
row-reduced echelon form of the matrix [AIn] is [BC]. If B = In, then A-1 = C or else A is not
invertible.
Example 2.2.8 Find the inverse of the matrix
using the Gauss-Jordan method.
Solution: let us apply the Gauss-Jordan method to the matrix
.



|


.Thus, the inverse of the given matrix is
.
, (ii)
, (iii)
,(iv)
.

. Find the elementary matrices E1,E2,E3 and E4 such that E4 ⋅ E3 ⋅ E2 ⋅
E1 ⋅ A = I2.
. Determine elementary matrices E1,E2 and E3 such that E3 ⋅E2 ⋅E1 ⋅
B = I3.
We end this section by giving two more equivalent conditions for a matrix to be invertible.
Observe that x0 = A-1b is the unique solution of the system Ax = b.
The system Ax = b has a solution and hence by definition, the system is consistent.
For 1 ≤ i ≤ n, define ei = (0,…,0,
ith position,0,…,0)t, and consider the linear system
Ax = ei . By assumption, this system has a solution, say xi, for each i,1 ≤ i ≤ n. Define a
matrix B = [x1,x2,…,xn]. That is, the ith column of B is the solution of the system Ax = ei.
Then
![AB = A[x1,x2...,xn] = [Ax1, Ax2...,Axn] = [e1,e2...,en] = In.](mth102231x.png)
We now state another important result whose proof is immediate from Theorem 2.2.10 and Theorem 2.2.5 and hence the proof is omitted.
In the previous section, we gave a few equivalent conditions for a square matrix to be invertible. We also used the Gauss-Jordan method and the elementary matrices to compute the inverse of a square matrix A. In this section and the subsequent sections, we will mostly be concerned with m × n matrices.
Let A by an m×n matrix. Suppose that C is the row-reduced echelon form of A. Then the matrix C is unique (see Theorem 2.1.30). Hence, we use the matrix C to define the rank of the matrix A.
Definition 2.3.1 (Row Rank of a Matrix) Let C be the row-reduced echelon form of a matrix A. The number of non-zero rows in C is called the row-rank of A.
For a matrix A, we write ‘row-rank (A)’ to denote the row-rank of A. By the very definition, it is clear that row-equivalent matrices have the same row-rank. Thus, the number of non-zero rows in either the row echelon form or the row-reduced echelon form of a matrix are equal. Therefore, we just need to get the row echelon form of the matrix to know its rank.
. 
. 
The following remark related to the augmented matrix is immediate as computing the rank only involves the row operations (also see Remark 2.1.29).
Remark 2.3.3 Let Ax = b be a linear system with m equations in n unknowns. Then the row-reduced echelon form of A agrees with the first n columns of [Ab], and hence
![row- rank(A) ≤ row-rank([Ab ]).](mth102236x.png)
Now, consider an m × n matrix A and an elementary matrix E of order n. Then the product AE corresponds to applying column transformation on the matrix A. Therefore, for each elementary matrix, there is a corresponding column transformation as well. We summarize these ideas as follows.
Definition 2.3.4 The column transformations obtained by right multiplication of elementary matrices are called column operations.

Remark 2.3.6 After application of a finite number of elementary column operations (see Definition 2.3.4) to a matrix A, we can obtain a matrix B having the following properties:
We define column-rank of A as the number of non-zero columns in B.
It will be proved later that row-rank(A) = column-rank(A). Thus we are led to the following definition.
Definition 2.3.7 The number of non-zero rows in the row-reduced echelon form of a matrix A is called the rank of A, denoted rank(A).
we are now ready to prove a few results associated with the rank of a matrix.
Theorem 2.3.8 Let A be a matrix of rank r. Then there exist a finite number of elementary matrices E1,E2,…,Es and F1,F2,…,Fℓ such that
![[ ]
Ir 0
E1E2...EsAF1F2 ...Fℓ = 0 0 .](mth102239x.png)
Proof. Let C be the row-reduced echelon matrix of A. As rank(A) = r, the first r rows of C are non-zero rows. So by Theorem 2.1.22, C will have r leading columns, say i1,i2,…,ir. Note that, for 1 ≤ s ≤ r, the is th column will have 1 in the sth row and zero, elsewhere.
We now apply column operations to the matrix C. Let D be the matrix obtained from C by
successively interchanging the sth and isth column of C for 1 ≤ s ≤ r. Then D has the form
,
where B is a matrix of an appropriate size. As the (1,1) block of D is an identity matrix, the block (1,2)
can be made the zero matrix by application of column operations to D. This gives the required
result. □
The next result is a corollary of Theorem 2.3.8. It gives the solution set of a homogeneous system Ax = 0. One can also obtain this result as a particular case of Corollary 2.1.23.2 as by definition rank(A) ≤ m, the number of rows of A.
Corollary 2.3.9 Let A be an m × n matrix. Suppose rank(A) = r < n. Then Ax = 0 has infinite number of solutions. In particular, Ax = 0 has a non-trivial solution.
Proof. By Theorem 2.3.8, there exist elementary matrices E1,…,Es and F1,…,Fℓ such that
E1 E2
EsAF1F2
Fℓ =
. Define P = E1E2
Es and Q = F1F2
Fℓ. Then the
matrix PAQ =
. As Ei’s for 1 ≤ i ≤ s correspond only to row operations, we get
AQ =
, where C is a matrix of size m × r. Let Q1,Q2,…,Qn be the columns of the
matrix Q. Then check that AQi = 0 for i = r + 1,…,n. Hence, the required results follows (use
Theorem 2.1.5). □
and B =
. Find P and Q such that B = PAQ.
,AC1 =
,B2AC2 =
and B3AC3 =
, where the (1,1)
block of each matrix is of size r × r. Also, prove that A1 is an invertible matrix.
,
P(AB) = (PAQ)(Q-1B) =
. Now find R an invertible matrix with P(AB)R =
. Define
X = R
Q-1.]
![[ ]- 1 [ ]
A B = 0 C-1 .
C 0 B -1 - B -1AC- 1](mth102258x.png)
and B =
. Also, assume that A11 is invertible
and define P = A22 - A21A11-1A12. Then prove that
We end this section by giving another equivalent condition for a square matrix to be invertible. To do so, we need the following definition.
Definition 2.3.11 A n × n matrix A is said to be of full rank if rank(A) = n.
Let if possible rank(A) = r < n. Then there exists an invertible matrix P (a product of elementary
matrices) such that PA =
, where B1 is an r ×r matrix. Since A is invertible, let A-1 =
,
where C1 is an r × n matrix. Then
![[ ][ ] [ ]
P = P I = P (AA - 1) = (P A)A- 1 = B1 B2 C1 = B1C1 + B2C2 .
n 0 0 C2 0](mth102266x.png)
Suppose A is of full rank. This implies, the row-reduced echelon form of A has all non-zero rows. But A has as many columns as rows and therefore, the last row of the row-reduced echelon form of A is [0, 0, … , 0, 1]. Hence, the row-reduced echelon form of A is In.
Using Theorem 2.2.5.3, the required result follows. □
In Section 2.2, we studied the system of linear equations in which the matrix A was a square matrix. We will now use the rank of a matrix to study the system of linear equations even when A is not a square matrix. Before proceeding with our main result, we give an example for motivation and observations. Based on these observations, we will arrive at a better understanding, related to the existence and uniqueness results for the linear system Ax = b.
Consider a linear system Ax = b. Suppose the application of the Gauss-Jordan method has reduced the augmented matrix [Ab] to
![⌊ 1 0 2 - 1 0 0 2 8⌋
|| ||
|| 0 1 1 3 0 0 5 1||
[Cd ] = || 0 0 0 0 1 0 - 1 2|| .
|| 0 0 0 0 0 1 1 4||
| |
⌈ 0 0 0 0 0 0 0 0⌉
0 0 0 0 0 0 0 0](mth102269x.png)
Hence, the solution set is given by

,u1 =
,u2 =
and u3 =
. A similar idea is used in the proof of the next theorem and is omitted. The proof appears on page § as Theorem 3.3.26.
Theorem 2.4.1 (Existence/Non-Existence Result) Consider a linear system Ax = b, where A is an m × n matrix, and x,b are vectors of orders n× 1, and m× 1, respectively. Suppose rank (A) = r and rank([Ab]) = ra. Then exactly one of the following statement holds:
Remark 2.4.2 Let A be an m × n matrix. Then Theorem 2.4.1 implies that
Exercise 2.4.3 In the introduction, we gave 3 figures (see Figure 2) to show the cases that arise in the Euclidean plane (2 equations in 2 unknowns). It is well known that in the case of Euclidean space (3 equations in 3 unknowns), there
Determine all the figures.
In this section, we associate a number with each square matrix. To do so, we start with the
following notation. Let A be an n × n matrix. Then for each positive integers αi’s 1 ≤ i ≤ k
and βj ’s for 1 ≤ j ≤ ℓ, we write A(α1,…,αk
β1,…,βℓ) to mean that submatrix of A, that is
obtained by deleting the rows corresponding to αi’s and the columns corresponding to βj’s of
A.
Example 2.5.1 Let A =
. Then A(1|2) =
, A(1|3) =
and
A(1, 2|1, 3) = [4].
With the notations as above, we have the following inductive definition of determinant of a matrix. This definition is commonly known as the expansion of the determinant along the first row. The students with a knowledge of symmetric groups/permutations can find the definition of the determinant in Appendix 11.1.15. It is also proved in Appendix that the definition given below does correspond to the expansion of determinant along the first row.
Definition 2.5.2 (Determinant of a Square Matrix) Let A be a square matrix of order n. The determinant of A, denoted det(A) (or |A|) is defined by
,
det(A) = ( ∑ (- 1)1+ja1jdet A(1|j) , otherwise.
j=1](mth102280x.png)
. Then, det(A) = |A| = a
A(1|1)
- b
A(1|2)
= ad - bc. For example, if
A =
then det(A) =
= 1 ⋅ 5 - 2 ⋅ 3 = -1.
. Then, Let A =
. Then |A| = 1 ⋅
- 2 ⋅
+ 3 ⋅
= 4 - 2(3) + 3(1) = 1.
Exercise 2.5.4 Find the determinant of the following matrices.
i)
,ii)
,iii)
.
Definition 2.5.5 (Singular, Non-Singular) A matrix A is said to be a singular if det(A) = 0. It is called non-singular if det(A)≠0.
We omit the proof of the next theorem that relates the determinant of a square matrix with row operations. The interested reader is advised to go through Appendix 11.2.
Theorem 2.5.6 Let A be an n × n matrix. If
Since det(In) = 1, where In is the n×n identity matrix, the following remark gives the determinant of the elementary matrices. The proof is omitted as it is a direct application of Theorem 2.5.6.
Remark 2.5.7 Fix a positive integer n. Then
Remark 2.5.8 Theorem 2.5.6.1 implies that “one can also calculate the determinant by expanding along any row.” Hence, the computation of determinant using the k-th row for 1 ≤ k ≤ n is given by

. Determine det(A). 


|
.
Thus, using Theorem 2.5.6, det(A) = 2 ⋅ 1 ⋅ 2 ⋅ (-1) = -4.
. Determine det(A). 
.
Recall the following: The dot product of ut = (u1,u2) and vt = (v1,v2), denoted u ∙ v, equals
u ∙ v = u1v1 + u2v2, and the length of a vector u, denoted ℓ(u) equals ℓ(u) =
.
Also, if θ is the angle between u and v then we know that cos(θ) =
. Therefore



In general, for any n×n matrix A, it can be proved that |det(A)| is indeed equal to the volume of the n-dimensional parallelepiped. The actual proof is beyond the scope of this book.
Exercise 2.5.11 In each of the questions given below, use Theorem 2.5.6 to arrive at your answer.
,B =
and C =
for some complex
numbers α and β. Prove that det(B) = α det(A) and det(C) = det(A).
and B =
. Prove that 3 divides det(A) and det(B) = 0.
Definition 2.5.12 (Minor, Cofactor of a Matrix) The number det
is called the
(i, j)th minor of A. We write Aij = det
. The (i,j)th cofactor of A, denoted Cij, is the
number (-1)i+jAij.
Definition 2.5.13 (Adjoint of a Matrix) Let A be an n × n matrix. The matrix B = [bij] with bij = Cji, for 1 ≤ i,j ≤ n is called the Adjoint of A, denoted Adj(A).
Example 2.5.14 Let A =
. Then Adj(A) =
as

Proof. Part 1: It directly follows from Remark 2.5.8 and the definition of the cofactor.
Part 2: Fix positive integers i,ℓ with 1 ≤ i≠ℓ ≤ n. And let B = [bij] be a square matrix whose ℓth row equals the ith row of A and the remaining rows of B are the same as that of A.
Then by construction, the ith and ℓth rows of B are equal. Thus, by Theorem 2.5.6.5, det(B) = 0. As A(ℓ|j) = B(ℓ|j) for 1 ≤ j ≤ n, using Remark 2.5.8, we have
This completes the proof of Part 2.Part 3:, Using Equation (2.5.3) and Remark 2.5.8, observe that
![[ ] ∑n ∑n {
A(Adj(A)) = aik(Adj(A)) = aikCjk = 0, if i ⁄= j,
ij k=1 kj k=1 det(A), if i = j.](mth102326x.png)
= In. Hence, by
Theorem 2.2.6,

Example 2.5.16 For A =
, Adj(A) =
and det(A) = -2. Thus,
by Theorem 2.5.15.3, A-1 =
.
The next corollary is a direct consequence of Theorem 2.5.15.3 and hence the proof is omitted.
Corollary 2.5.17 Let A be a non-singular matrix. Then

The next result gives another equivalent condition for a square matrix to be invertible.
Proof. Let A be non-singular. Then det(A)≠0 and hence A-1 =
Adj(A) as .
Now, let us assume that A is invertible. Then, using Theorem 2.2.5, A = E1E2
Ek, a product of
elementary matrices. Also, by Remark 2.5.7, det(Ei)≠0 for each i,1 ≤ i ≤ k. Thus, by a repeated
application of the first three parts of Theorem 2.5.6 gives det(A)≠0. Hence, the required result
follows. □
We are now ready to prove a very important result that related the determinant of product of two matrices with their determinants.
Proof. Step 1. Let A be non-singular. Then by Theorem 2.5.15.3, A is invertible. Hence, using
Theorem 2.2.5, A = E1E2
Ek, a product of elementary matrices. Then a repeated application of the
first three parts of Theorem 2.5.6 gives

Step 2. Let A be singular. Then using Theorem 2.5.18 A is not invertible. Hence, there exists
an invertible matrix P such that PA = C, where C =
. So, A = P-1C and therefore
![( [ ])
det(AB ) = det((P -1C)B ) = det(P -1(CB )) = det P -1 C1B
0
( [C1B])
= det(P-1)⋅det 0 as P -1 is non-singular
= det(P)⋅0 = 0 = 0 ⋅det(B) = det(A )det(B ).](mth102339x.png)
The next result relates the determinant of a matrix with the determinant of its transpose. As an application of this result, determinant can be computed by expanding along any column as well.
Proof. If A is a non-singular, Corollary 2.5.17 gives det(A) = det(At).
If A is singular, then by Theorem 2.5.18, A is not invertible. Therefore, At is also not invertible (as At
is invertible implies A-1 =
(At)-1
t)). Thus, using Theorem 2.5.18 again, det(At) = 0 = det(A). Hence
the required result follows. □
Let A be a square matrix. Then using Theorem 2.2.10 and Theorem 2.5.18, one has the following result.
Thus, Ax = b has a unique solution for every b if and only if det(A)≠0. The next theorem gives a direct method of finding the solution of the linear system Ax = b when det(A)≠0.
Theorem 2.5.22 (Cramer’s Rule) Let A be an n × n matrix. If det(A)≠0 then the unique solution of the linear system Ax = b is

Proof. Since det(A)≠0, A-1 =
Adj(A). Thus, the linear system Ax = b has the solution
x =
Adj(A)b. Hence xj, the jth coordinate of x is given by

In Theorem 2.5.22 A1 =
, A2 =
and so on till
An =
.
Example 2.5.23 Solve Ax = b using Cramer’s rule, where A =
and b =
.
Solution: Check that det(A) = 1 and xt = (-1,1,0) as
