Matrices Compress
Matrices Compress
Matrices
Pam Norton
MATRICES
Matrices are used in many areas of mathematics, and have also have
applications
applications
in diverse
in
diverse
areas such
areas
as such
engineering,
as engineering,
computer computer
graphics,graphics,
image processing,
image processing,
physicalphysical
sciences,
sciences, biological
biological sciences and
sciences
socialand
sciences.
social Powerful
sciences. Powerful
calculators calculators
and computers
and computers
can now
can now
carry outcarry
complicated
out complicated
and difficult
and numeric
difficult numeric
and algebraic
and algebraic
computations
computations
involving
involving
matrix methods,
matrix and
methods,
such technology
and such technology
is a vital tool
is ain
vital
related
tool real-life,
in relatedproblem-
real-life,
Pam Norton
problem-solving
solving applications.
applications.
This book provides mathematics teachers with an elementary introduction to
matrix algebra and its uses in formulating and solving practical problems, solving
systems of linear equations, representing combinations of affine (including linear)
transformations of the plane and modelling finite state Markov chains. The basic
theory in each of these areas is explained and illustrated using a broad range of
examples. A feature of the book is the complementary use of technology, particularly
computer algebra systems, to do the calculations involving matrices required for
the applications. A selection of student activities with solutions and text and web
references are included throughout the book.
Series overview
MathsWorks for Teachers has been developed to provide a coherent and contemporary
framework for conceptualising and implementing aspects of middle and senior
mathematics curricula.
Matrices
Titles in the series are:
Functional Equations
David Leigh-Lancaster
Contemporary Calculus
Michael Evans
Pam Norton
Functional Contemporary Matrices Foundation Numeracy Data Analysis Complex Numbers
Matrices Equations
David Leigh-Lancaster
Calculus
Michael Evans
Pam Norton in Context
David Tout and Gary Motteram
Applications
Kay Lipson
and Vectors
Les Evans
Pam Norton MathsWorks for Teachers MathsWorks for Teachers MathsWorks for Teachers MathsWorks for Teachers MathsWorks for Teachers MathsWorks for Teachers
Foundation Numeracy
in Context
David Tout & Gary Motteram
ISBN 978-0-86431-508-3
Data Analysis
Applications
Kay Lipson
Complex Numbers
and Vectors
9 780864 315083 MathsWorks for Teachers
Les Evans
Matrices
Pam Norton
All rights reserved. Except under the conditions described in the Copyright Act 1968 of
Australia and subsequent amendments, no part of this publication may be reproduced, stored
in a retrieval system or transmitted in any form or by any means, electronic, mechanical,
photocopying, recording or otherwise, without the written permission of the publishers.
Norton, Pam.
Matrices.
For teachers of senior secondary mathematics courses.
ISBN 9780864315083.
1. Matrices – Study and teaching (Secondary).
I. Leigh-Lancaster, David. II. Australian Council for Educational Research.
III. Title. (Series: Mathsworks for teachers).
512.94340712
Introduction v
About the author vi
1 An introduction to matrices 1
History 2
Matrices in the senior secondary mathematics curriculum 5
A b o u t t h e a u t h o r
vi
C ha p t e r 1
A n i n t r o d u c t i o n t o
m a t ric e s
Throughout history people have collected and recorded various data using sets
(unordered lists), vectors (ordered one-dimensional lists) and matrices
(ordered two-dimensional lists of lists, tables or rectangular arrays). Today
arrays and tables of numbers and other information are found widely in
everyday life. For example, sports ladders give numbers of wins, losses, draws,
points and other information for teams in a competition. Each day stock tables
are given in local newspapers. Results of opinion polls are usually given in
table form in newspapers. Information of many forms is held in tables, and the
spreadsheet is an electronic digital technology that is now widely used in
everyday life wherever data is to be entered, stored and manipulated using
tables and rectangular arrays.
The word matrix comes from the Latin word for ‘womb’. The term matrix
is also used in areas other than mathematics, and generally means an
environment, milieu, substance or place in which something is cast, shaped
or developed.
It has become conventional in contemporary mathematics to describe a
matrix in terms of the specification of its elements by reference to row by
column, following the use of this form in developments by 19th century
mathematicians, but it is interesting to speculate whether an element of a
matrix would most likely have been referenced by column first then by row,
had computerised spreadsheets been around prior to the modern development
of matrices! Indeed, Chinese mathematicians who first recorded the use of
matrices used an early form of computer algebra, the counting board, and did
reference elements by column before row (see Hoe, 1980).
MathsWorks for Teachers
Matrices
Hi s t o r y
chapter 1
An introduction to matrices
numerical elements for matrices. Where quick and accurate computations are
required, modern calculator and computer technologies are indispensable
tools.
Gauss’s method was extended to become known as the Gauss–Jordan
method for solving systems of simultaneous linear equations. Jordan’s
contribution to this method is the incorporation of a systematic technique for
back-substitution. The Gauss–Jordan method is used to obtain the reduced row
echelon form of a matrix and hence to solve a system of simultaneous linear
equations directly. The Jordan part was first described by Wilhelm Jordan, a
German professor of geodesy, in the third edition (1888) of his Handbook of
Geodesy. He used the method to solve symmetric systems of simultaneous
linear equations arising out of a least squares application in geodesy.
It is perhaps somewhat surprising that the idea of a matrix did not evolve
until well after that of a determinant. In 1545, Cardan gave a rule for solving
a system of two linear equations, which is essentially Cramer’s rule using
determinants for solving a 2 × 2 system. The idea of a determinant appeared
in Japan and Europe at approximately the same time. Seki Kowa in a
manuscript in 1683 introduced the notion of a determinant (without using
this name for them). He described their use and showed how to find
determinants of matrices up to order 5 × 5. In the same year in Europe,
Leibniz also introduced the idea of a determinant to explain when a system of
equations had a solution.
In 1750, Gabriel Cramer published his Introduction to the Analysis of
Algebraic Curves. He was interested in the problem of determining the
equation of a plane curve of a given degree that passes through a certain
number of points. He stated a general rule, now known as Cramer’s rule, in an
appendix, but did not explain how it worked. The term ‘determinant’ was first
introduced by Gauss in 1801 while discussing quadratic forms. Binary
quadratic forms are expressions such as ax 2 + 2bxy + cy2, where x and y are
variables and a, b and c coefficients, which can be represented in matrix
form as:
6 x y@ =
a b x
G= G
b c y
Gauss considered linear substitutions for the variables x and y of the form
x = ax1 + by1
y = gx1 + hy1
and composition of substitutions, which led to matrix multiplication.
MathsWorks for Teachers
Matrices
chapter 1
An introduction to matrices
awarded the Nobel Prize in Economics for his work on input–output models,
which relied heavily on matrices and solving systems of simultaneous linear
equations. Matrices are also used extensively in business optimisation
contexts, in particular where networks (graph theory) are applied to problems
involving representation, connectedness and allocation. The mathematician
Olga Taussky Todd developed matrix applications to analyse vibrations on
airplanes during World War II, and made an important contribution to the
development and application of matrix theory.
The development of the electronic digital computer has had a big impact on
the use of matrices in many areas. Matrix methods are used extensively in
computer graphics, a developing area especially driven by the demands of the
movie and computer games industries. Matrix methods are also used
extensively in the communication industry (especially for encryption and
decryption), in engineering and the sciences, and in economic modelling
and industry.
In mathematics, the study of matrices and determinants is part of linear
algebra, and it is recognised that matrices and their natural operations provide
models for the algebraic structures of a vector space and of a non-
commutative ring.
Ma t ric e s i n t h e s e n i o r s e c o n d ar y
m a t h e m a t ic s c u rric u l u m
During the 1960s and 1970s matrices began to be incorporated into aspects of
the senior secondary mathematics curriculum, in particular following the
innovations of the new mathematics program from the 1950s (see, for
example, Allendoerfer & Oakley, 1955; Adler, 1958), and the increasing use of
them as a tool for business-related applications. In curriculum terms this can
also be related to a greater emphasis on discrete mathematics in curriculum
design during the second half of the 20th century.
There have been several purposes for which matrices have been introduced
into the senior secondary mathematics curriculum:
• to represent and solve systems of m simultaneous linear equations in
n variables, where m, n ≥ 2
• to represent and apply transformations, and combinations of
transformations of the cartesian plane, in particular considering those
subsets of the cartesian plane that represent graphs of functions and other
relations
MathsWorks for Teachers
Matrices
chapter 1
An introduction to matrices
MathsWorks for Teachers
Matrices
SUMM A R Y
chapter 1
An introduction to matrices
SUMM A R Y (Cont.)
References
Adler, I 1958, The new mathematics, John Day, New York.
Allendoerfer, CB & Oakley, CO 1955, Principles of mathematics, McGraw-Hill, New
York.
Cameron, N, Clements, K, Green, LJ & Smith, GC 1970, School Mathematics
Research Foundation: Pure mathematics, Chesire, Melbourne.
Fitzpatrick, JB & Galbraith, P 1971, Pure mathematics, Jacaranda, Milton, Queensland.
Fraleigh, JB & Beauregard, AR 1995, Linear algebra (3rd edition with historical notes
by VJ Katz), Addison-Wesley, Reading, MA.
Garner, S, McNamara, A & Moya, F 2004, CAS analysis supplement for
Mathematical Methods CAS Units 3 and 4, Pearson, Melbourne.
Grattan-Guinness, I (ed.) 1994, Companion encyclopedia of the history and
philosophy of the mathematical sciences, volume 1, Routledge, London.
MathsWorks for Teachers
Matrices
Websites
http://www.ualr.edu/lasmoller/matrices.html – History Department, University of
Arkansas at Little Rock
This website includes a concise overview of the history of matrices with a good
list of links to related sites, including online applications for matrix computation.
http://www-groups.dcs.st-and.ac.uk/~history/HistTopics/Matrices_and_
determinants.html – School of Mathematics and Statistics, University of St
Andrews Scotland
This website contains a comprehensive historical coverage of matrices,
determinants and related mathematics from ancient Babylonia and China
through to the modern era, with extensive cross references to key
mathematicians and related topics.
10
C ha p t e r 2
R e c t a n g u l ar arra y s ,
m a t ric e s a n d o p e ra t i o n s
11
MathsWorks for Teachers
Matrices
S == G
150 40 10
70 20 10
12
chapter 2
Rectangular arrays, matrices and operations
this can be represented by the matrix = G, and we can easily update the
0 2 0
0 0 0
stock can be represented by the matrix = G, and the total stock of each
10 5 0
10 5 0
13
MathsWorks for Teachers
Matrices
14
chapter 2
Rectangular arrays, matrices and operations
S t u d e n t ac t i v i t y 2 . 1
a Use a suitable matrix product to calculate the total amount of change held by each
of Michael, Jay, Sam and Lin in the given week.
b If the Australian–US dollar exchange rate is A$1 = US$0.76, use a suitable scalar
multiple of the matrix in part a to find the equivalent value of their change in US
dollars.
D e f i n i t i o n o f a m a t ri x
15
MathsWorks for Teachers
Matrices
Two special cases of note are that a m × 1 matrix is usually called a column
matrix or column vector, while a 1 × n matrix is usually called a row matrix
or row vector. If a rectangular array is not available for visual display, then a
matrix can be written as a list of lists of equal size, where a list is an ordered
set. For example, the matrix
R V
S 4 0 2 W
S 1 1 1 W
S W
S- 5 10 3 W
S 0 0 3.4 W
T X
is the 4 × 3 matrix uniquely defined by the list {{4, 0, 2}, {1, 1, 1}, {–5, 10, 3},
{0, 0, 3.4}}.
When technology is used, the data to specify a matrix is either entered into
a template of a specified size (where the dimensions of the matrix needs to be
specified first to obtain the desired template), or as a list of lists.
Example 2.1
O p e ra t i o n s o n m a t ric e s
16
chapter 2
Rectangular arrays, matrices and operations
A d d i t i o n a n d s u b t rac t i o n o f t w o
m a t ric e s
If matrices A and B are of the same size m × n then A + B is the m × n matrix
with (i, j) entry aij + bij, for i = 1 to m, j = 1 to n. That is,
A + B = [aij] + [bij] = [aij + bij]. In other words, we simply add all the entries
in their corresponding positions throughout the matrix.
Subtraction can be defined in the same way, and
A – B = [aij] – [bij ] = [aij – bij]. In other words, we simply subtract all the
entries in matrix B from their corresponding entries in matrix A.
Example 2.2
R V R V R
S 1 - 2 W S 7 12 W S 1 + 7 ]- 2g + 12 VW RS 8 10 VW
S 3 5 W+ S- 3 1 W = S 3 + ]- 3g 5 + 1 W= S 0 6 W
SS- 4 1 WW SS- 1 2 WW SS]- 4g + ]- 1g 1 + 2 WW SS- 5 3 WW
T X T X T X T X
R V R V R
S 1 - 2 W S 7 12 W S 1 - 7 ]- 2g - 12 W S- 6 - 14 VW
V R
S 3 5 W- S- 3 1 W = S 3 - ]- 3g 5 - 1 W= S 6 4 W
SS- 4 1 WW SS- 1 2 WW SS]- 4g - ]- 1g 1 - 2 WW SS- 3 - 1 WW
T X T X T X T X
M u l t i p l ica t i o n b y a n u m b e r ( s ca l ar
m u lt i p l e )
Given a matrix A of size m × n and a number (scalar) k, then kA is the m × n
matrix with (i, j) entry kaij for i = 1 to m and j = 1 to n. That is, if A = [aij ]
and k is a scalar then kA = k[aij] = [k × aij].
17
MathsWorks for Teachers
Matrices
Example 2.3
R V
S 2 - 2W
If k = 3 and A = S 3 7 W, then
SS- 1 5 WW
T X
R V R V
S 2 - 2W S 6 - 6W
3A = A + ]A + Ag = 3 S 3 7 W = S 9 21 W
SS- 1 5 WW SS- 3 15 WW
T X T X
A - B = A + ]- Bg
with entries the sums aij + (–bij) of the corresponding entries in A and (–B).
Example 2.4
S 1 - 2 W S 7 12 W S1 + ]- 7g - 2 + ]- 12gW S- 6 - 14 W
R V R V R V R V
S 3 5 W- S- 3 1 W = S 3 + 3 5 + ]- 1g W = S 6 4 W
SS- 4 1 WW SS- 1 2 WW SS - 4 + 1 1 + ]- 2g WW SS- 3 - 1 WW
T X T X T X T X
There is a special matrix, called the zero matrix O = [oij] where oij = 0 for
all i and j. For any matrix A, A - A = O .
Example 2.5
1 -2 3 2 4 -1
If A = > H and B = > H, then
4 2 -1 -1 3 2
1 -2 3 2 4 -1
i A+B => H+> H
4 2 -1 -1 3 2
1 + 2 ]- 2g + 4 3 + ]- 1g
== G== G
3 2 2
4 + ]- 1g 2 + 3 ]- 1g + 2 3 5 1
18
chapter 2
Rectangular arrays, matrices and operations
1 -2 3 2 4 -1
A-B => H-> H
4 2 -1 -1 3 2
ii
1 - 2 ]- 2g - 4 3 - ]- 1g -1 -6 4
== G => H
4 - ]- 1g 2 - 3 ]- 1g - 2 5 -1 -3
1 -2 3 2 4 -1
iii 2A - 3B = 2 > H - 3> H
4 2 -1 -1 3 2
2 -4 6 6 12 - 3
=> H-> H
8 4 -2 -3 9 6
2 - 6 ]- 4g - 12 6 - ]- 3g - 4 - 16 9
== G=> H
]
8 - -3 g 4-9 ] g
-2 - 6 11 - 5 - 8
1 -2 3 1 -2 3
H== G
0 0 0
iv A - A = > H->
4 2 -1 4 2 -1 0 0 0
S t r u c t u r e p r o p e r t i e s o f m a t ri x
a d d i t i o n a n d s ca l ar m u l t i p l ica t i o n
Let A, B and C be any matrices of a given size m × n, where m and n are non-
zero, then
1 the sum of any two such matrices
is always defined (Closure property for addition)
2 (A + B) + C = A + (B + C) (Associative property for addition)
3 A + O = A = O + A (Identity property for addition)
4 A + (-A) = O = (-A) + A (Inverse property for addition)
5 A + B = B + A (Commutative property for addition)
This collection of properties can be established by working from the
general definition of matrix addition as applied to the matrices A = [aij ],
B = [bij ], C = [cij ] and O = [oij ] of the same order. It may be helpful to have
students undertake some general case calculations for matrices of a given
order, for example 2 × 3 matrices. The results may appear to be obvious, or
even trivial to students; however, care should be taken to draw to their
attention that they do apply to matrices drawn from the same set of a given
order (for any order) by virtue of the component-wise definition of addition of
matrices, and the corresponding number properties of their elements and the
corresponding number operations. This may be summarised by saying that
19
MathsWorks for Teachers
Matrices
such a set of matrices forms a commutative (or abelian) group under addition.
If we also consider multiplication of a matrix from this set by a scalar (scalar
multiple), then for any scalars r and s the following properties also hold:
1 r(sA) = (rs)A (Associative property of scalar multiples)
2 (r + s)A = rA + sA (Right distributive property of scalar multiple over
scalar addition)
3 r(A + B) = rA + rB (Left distributive property of scalar multiple over
matrix addition)
Again, it may be useful for students to consider the general case for
matrices of a given order. Taken together, these properties show that such a set
of matrices with these operations of addition and scalar multiple form what is
called a vector space.
Ma t ri x m u l t i p l ica t i o n
That is, the (i, j) entry of the product is obtained by multiplying each of the
entries in the ith row of A by the corresponding entries in the jth column of
B, and then adding all these products.
20
chapter 2
Rectangular arrays, matrices and operations
Example 2.6
R V
S 2 W
a If A = = G and B = S- 1W, then
1 -2 3
4 2 -1 SS 3 WW
R V T X
1 - 2 3 S W 1 # 2 + ]- 2g # ]- 1g + 3 # 3
2
AB = = G S- 1W = = G== G
13
4 2 - 1 S W 4 # 2 + 2 # ]- 1g + ]- 1g # 3 3
S 3 W
T X
Note that we cannot form the product BA, since the number of
columns of B is not equal to the number of rows of A.
R V
S- 1 2 W
b If A = = G and B = S- 2 3 W, then
1 -2 3
4 2 -1 SS 1 4 WW
R VT X
- 1 2W
1 -2 3 S
AB = > H S- 2 3 W
4 2 -1 S W
S 1 4W
1 # ]- 1g + ]- 2g # ]- 2g + 3 # 1 1 # 2 + ]- 2g # 3 + 3 # 4
T X
== G
4 # ]- 1g + 2 # ]- 2g + ]- 1g # 1 4 # 2 + 2 # 3 + ]- 1g # 4
6 8
=> H
- 9 10
and
R V
S- 1 2 W 1 - 2 3
BA = S- 2 3 W> H
S W 4 2 -1
S 1 4W
S]- 1g # 1 + 2 # 4 ]- 1g # ]- 2g + 2 # 2 ]- 1g # 3 + 2 # ]- 1gW
TR X V
= S]- 2g # 1 + 3 # 4 ]- 2g # ]- 2g + 3 # 2 ]- 2g # 3 + 3 # ]- 1gW
SS 1 # 1 + 4 # 4 1 # ]- 2g + 4 # 2 1 # 3 + 4 # ]- 1g WW
RT V X
S 7 6 - 5W
= S10 10 - 9 W
S W
S17 6 - 1 W
T X
21
MathsWorks for Teachers
Matrices
== G
4 13
11 13
]- 1g # 2 + 2 # ]- 1g ]- 1g # 3 + 2 # 5
H== G
-1 2 2 3
and BA = > H>
2 3 -1 5 2 # 2 + 3 # ]- 1g 2#3+3#5
-4 7
=> H
1 21
Clearly, AB ≠ BA in this case. Inspection of a range of other examples will
generally show that it is not the case that AB is the same as BA. This is also
the case for square matrices of other orders, a situation which students can
readily investigate using suitable technology. They will readily develop a
collection of cases which show that multiplication of square matrices is, in
general, not commutative. A related investigation is to see if students can
identify examples, and then sets, of square matrices for which the product is
commutative, for example:
= G and = G
3 0 1 0
0 2 0 6
There are some important situations in which matrix products are
commutative, for example when they are used to represent and compose
certain types of transformations of the cartesian plane, such as rotations about
the origin.
22
chapter 2
Rectangular arrays, matrices and operations
Teachers may also wish to consider such arguments from the general
definition of matrix multiplication for the case of, for example, 2 × 2 square
matrices, and consideration of the equality of two matrices. Thus:
if A = = G and B = = G
a b e f
c d g h
ae + bg af + bh
then AB = > H
ce + dg cf + dh
ea + fc eb + fd
and BA = > H
ga + hc gb + hd
If the elements of A and B are real numbers, then AB = BA when bg = fc ,
af + bh = eb + fd , and ce + dg = ga + hc .
Z e r o a n d i d e n t i t y m a t ric e s
A matrix with all entries zero is called the zero matrix (of appropriate size),
and denoted O. A square matrix of size n × n with all entries zero except the
diagonal entries, that is those in position (j, j) for j = 1 to n, which are all 1, is
called the identity matrix of size n × n, denoted I. When the size (order) of
matrices being considered is fixed, the symbols O and I can be used without
ambiguity, otherwise the notation Om,n and In,n or just On (when m = n) and
In can be used, as applicable.
23
MathsWorks for Teachers
Matrices
Example 2.7
R V
S0 0 W
O 3, 2 = S0 0 W is the zero matrix of size 3 × 2.
SS0 0 WW
T X
I2, 2 = = G is the identity matrix of size 2 × 2.
1 0
0 1
R V
S1 0 0 W
I 3 = S0 1 0 W is the identity matrix of size 3 × 3.
SS0 0 1 WW
T X
Zero and identity matrices have properties similar to the numbers 0 and 1
with respect to addition and multiplication. Students should be able to
convince themselves that if A is a matrix and O is the zero matrix of the same
size, then A + O = A = O + A and A + (–A) = O = (–A) + A, and hence
A – A = O. Similarly if A is an m × n matrix and I is the identity matrix of
size n × n then AI = A, and if B is an n × m matrix then IB = B. If A is also an
n × n matrix, then students should also be able to observe that AI = A = IA.
Initially this may be tested by a judicious range of examples, and subsequently
argued in terms of the general definitions of the relevant operations.
Exploration of these operations and their conformable computations for
various combinations of matrices will enable students to see matrices as
rectangular arrays that can be thought of as a list of lists of the same size, as a
rectangular array and as abstract reified objects in their own right. They
should also be aware that while particular computations involving matrices of
relatively low order can be carried out fairly readily (if somewhat tediously)
by hand, more complex computations and/or computations involving matrices
of higher order are generally best carried out by technology designed for
this purpose.
However, students should also be aware that general analysis of matrix
operations is likely to involve component-wise operations with numbers using
indexed sets of sums and/or products of these numbers. For example, students
might be asked to show that for square matrices of a given size AO = O = OA
but AB = O does not necessarily imply that A = O or B = O.
24
chapter 2
Rectangular arrays, matrices and operations
S t u d e n t ac t i v i t y 2 . 2
1 3 2 -1 2 4 6
a Given A = = G, B = = G, C = = G calculate
-1 2 2 4 8 10 12
i 2A
ii A–B
iii A + B
iv AB
v BA
vi AC
4 1 p
b Let M = = G and P = = G where p and q are non-zero real numbers. Find all real
8 6 q
values of the scalar k such that MP = kP.
11
c Let A = = G . Evaluate An for n = 2 to 5 and find a general form for n > 1.
11
d Show that for square matrices of a given size AO = O = OA, but AB = O does not
necessarily imply that A = O or B = O.
0 -1
e Let J = = G and I be the identity matrix for multiplication for 2 × 2 matrices.
1 0
Show that J 2 = –I and that J 4 = I.
f Explain why, if X and Y are 2 × 2 matrices, then, in general,
X 2 – Y 2 ≠ (X + Y)(X – Y) and illustrate this with a suitable (counter) example. Find
two matrices X and Y for which this relationship is true.
T h e t ra n s p o s e o f a m a t ri x
Example 2.8
R V
S1 3 W
If A is the 3 × 2 matrix S2 4 W, then AT is a 2 × 3 matrix = G.
1 2 5
SS5 8 WW 3 4 8
T X
25
MathsWorks for Teachers
Matrices
Example 2.9
1 -2 2 -1 4
Let A = > H and B = > H.
3 0 0 1 3
R V
T T S 2 6 W
Then ]ABgT = f>
1 2 2 1 4 2 3 2
H> Hp = > H = S- 3 - 3 W
- - - -
3 0 0 1 3 6 - 3 12 S W
S- 2 12 W
R V R V T X
S 2 0W 1 3 S 2 6 W
and BT AT = S- 1 1 W> H = S- 3 - 3 W
SS WW - 2 0 S W
4 3 S- 2 12 W
T X T X
Note that we cannot form the product AT BT, since the number of columns
in AT, namely 2, is not equal to the number of rows in BT, namely 3.
Most equations involving matrices can be written in two forms: the
original and its equivalent using transposes. For example, the matrix equation
R V
50
160 43 10 S W 10090
= G # 30 W = =
S G
80 25 10 S W 5550
S80 W
T X
that we have seen earlier can also be written in transposed form:
R V
S160 80 W
650 30 80@ # S 43 25 W = 610090 5550@
SS W
10 10 W
T X
Commonly, matrix equations involving transformations (see Chapter 4)
and transitions (see Chapter 5) may also be written in a form that is the
transpose of the form given in this book.
A symmetric matrix is the same as its transpose. If A = [aij] is a symmetric
matrix, then aij = aji and AT = A. A symmetric matrix must be a square
matrix.
26
chapter 2
Rectangular arrays, matrices and operations
Example 2.10
R V
S 1 - 2 4W
= G and S- 2 2 0 W are symmetric matrices.
1 3
3 2 S W
S 4 0 3W
T X
A diagonal matrix is a square matrix that has non-zero entries only on the
main diagonal.
Example 2.11
R V
S1 0 0 W
= G and S0 - 2 0 W are diagonal matrices.
1 0
0 2 SS W
0 0 3W
T X
T h e i n v e r s e o f a m a t ri x
Not all square matrices have inverses. For example, = G does not have an
2 4
3 6
inverse.
Properties of inverses
It is important that students are familiar with some of the key properties of
matrices and their inverses:
1 A square matrix has at most one inverse, where A–1 A = I = AA–1.
2 If A is invertible, then so is AT, and (AT) –1 = (A–1)T.
3 If A is invertible, then so is A–1, and (A–1) –1 = A.
4 If A and B are invertible matrices of the same size, then AB is invertible
and (AB) –1 = B –1 A–1.
These properties can be established fairly readily, and provide some good
examples to students of proofs that are not lengthy, but are illustrative of
important aspects of mathematical reasoning using structural properties such
27
MathsWorks for Teachers
Matrices
Then AB = = G= G== G
2 4 1 2 1 14 20 10
3 6 3 4 2 21 30 15
and AC = = G= G== G
2 4 5 10 1 14 20 10
3 6 1 0 2 21 30 15
Hence AB = AC and yet B ≠ C.
2 The inverse matrix can be used to solve a system of simultaneous linear
equations with a unique solution. Consider the following system of
simultaneous equations:
2x + 3y = 7
4x + y = 3
This system can be written in matrix form as AX = B, where A = = G,
2 3
4 1
x
X = = G and B = = G. If A–1 exists, then we can multiply both sides of the
7
y 3
equation on the left by A–1, and we thus have X = A–1B, and so we can find
the solution by matrix multiplication.
28
chapter 2
Rectangular arrays, matrices and operations
There are several ways to find the inverse of a matrix. The most useful one
is via the Gauss–Jordan method (see, for example, Anton & Rorres, 2005, or
Nicholson, 2003 for details). Since we shall generally only be concerned with
the inverse of a 2 × 2 matrix, we will find it in a simple way.
Example 2.12
Solution
Suppose = G is its inverse.
e f
g h
Then = G= G==
a b e f
G.
1 0
c d g h 0 1
Hence:
ae + bg = 1 (i)
af + bh = 0 (ii)
ce + dg = 0 (iii)
cf + dh = 1 (iv)
Considering equation (ii), if we put f =- b and h = a , then the
equation is satisfied. Similarly for equation (iii) we can put e = d and
g =- c .
Now consider the matrix product:
If ad – bc = 0, then the matrix A does not have an inverse. The number ad – bc
is called the determinant of A and denoted det(A) or |A|. (For more
information on determinants, see the references on linear algebra.)
29
MathsWorks for Teachers
Matrices
S t u d e n t ac t i v i t y 2 . 3
b Use matrix inverses to solve the following system of simultaneous linear equations:
2x + 3y = 7
4x + y = 3
A p p l ica t i o n s o f m a t ric e s
Fibonacci’s rabbits
Suppose that newborn pairs of rabbits produce no offspring during the first
month of their lives, but then produce one new pair every subsequent month.
Start with F1 = 1 newborn pair in the first month and determine the number,
Fr = number of pairs in the rth subsequent month, assuming that no rabbit dies.
Since the newborn pair do not produce offspring in the second month, we
have F2 = F1 = 1. In the third month, the original pair will produce one pair of
offspring, so F3 = 2. In the fourth month, the pairs in the second month will
each produce another pair, so the total will be these newborn pairs added to
the number of pairs from the previous month, that is F4 = 1 + 2 = 3, or
F4 = F2 + F3.
Continuing in this manner, we see that
F5 = (Number of pairs alive in the third month, which each produce one
pair of offspring in the fifth month) + (Number of pairs alive in the
fourth month)
= F3 + F4
30
chapter 2
Rectangular arrays, matrices and operations
S t u d e n t ac t i v i t y 2 . 4
Codes
Governments, national security agencies, telecommunications providers,
banks and other companies are often interested in the transmission of coded
messages that are hard to decode by others, if intercepted, yet easily decoded
at the receiving end. There are many interesting ways of coding a message,
most of which use number theory or linear algebra. We will discuss one that
is effective, especially when a large-size invertible matrix is used.
At a personal level, bank accounts, credit cards, superannuation funds,
computer networks, phone companies, frequent flyer schemes, electronic
31
MathsWorks for Teachers
Matrices
M == G
-3 4
-1 2
Suppose we want to code the message
L E A V E N O W
I J K L M N O P Q
9 10 11 12 13 14 15 16 17
R S T U V W X Y Z
18 19 20 21 22 23 24 25 26
We can use this table to replace each letter with the number that
corresponds to the letter’s position in the alphabet.
L E A V E N O W
↑ ↑ ↑ ↑ ↑ ↑ ↑ ↑ ↑ ↑
↓ ↓ ↓ ↓ ↓ ↓ ↓ ↓ ↓ ↓
12 5 1 22 5 0 14 15 23 0
32
chapter 2
Rectangular arrays, matrices and operations
The message has now been converted into the sequence of numbers
12, 5, 1, 22, 5, 0, 14, 15, 23, 0, which we group as a sequence of column vectors:
= G, = G, = G, = G, = G
12 1 5 14 23
5 22 0 15 0
and multiply on the left by M:
G== G, M = G = = G, M = G = = G, M = G = = G, M = G = =
- 16 - 15
M = G
12 1 85 5 14 18 23 - 69
5 -2 22 43 0 -5 15 16 0 23
giving the sequence of numbers –16, –2, 85, 43, –15, –5, 18, 16, –69, 23. This is
the coded message. Note that this could have been done in one step by the
matrix multiplication:
- 16 85 - 15 18 - 69
M= G=>
12 1 5 14 23
H
5 22 0 15 0 - 2 43 - 5 16 23
To decode it, the recipient needs to compute M–1:
-1 2
M- 1 = > 1 3 H
-2 2
M- 1 = G = = G, M- 1 = G = = G, M- 1 = G = = G, M- 1 = G = = G,
- 16 12 85 1 - 15 5 18 14
-2 5 43 22 -5 0 16 15
M- 1 = G== G
- 69 23
23 0
- 16 85 - 15 18 - 69
H== G
12 1 5 14 23
or M- 1 >
- 2 43 - 5 16 23 5 22 0 15 0
Note: It is possible to use 2 × 2 matrices so that both the matrix and its
inverse have integer entries—simply ensure that the determinant is equal to 1
or –1. It is also easy to extend this simple coding system to take account of
alphanumeric data such as PINs.
33
MathsWorks for Teachers
Matrices
S t u d e n t ac t i v i t y 2 . 5
a Based on this approach, code the message SAVE THE WHALES using the matrix
5 3
= G.
3 2
b The following message was received and was decoded using the coding matrix
5 3
M == G.
3 2
65, 42, 75, 50, 138, 87, 90, 54, 85, 54, 80, 49, 160, 99, 123, 76
Determine the original message.
SUMM A R Y
34
chapter 2
Rectangular arrays, matrices and operations
SUMM A R Y (Cont.)
= G, provided
d -b
For a 2 × 2 matrix A = = G, A- 1 =
a b 1
•
c d ad - bc - c a
ad – bc ≠ 0. (ad – bc is called the determinant of matrix A, written
det(A) or |A|.)
35
MathsWorks for Teachers
Matrices
References
Anton, H & Rorres, C 2005, Elementary linear algebra (applications version), 9th
edn, John Wiley and Sons, New York.
Cirrito, F (ed.) 1999, Mathematics higher level (core), 2nd edn, IBID Press, Victoria.
Hill, RO Jr 1996, Elementary linear algebra with applications, 3rd edn, Saunders
College, Philadelphia.
Lipschutz, S & Lipson, M 2000, Schaum’s outlines of linear algebra, 3rd edn,
McGraw-Hill, New York.
Nicholson, KW 2001, Elementary linear algebra, 1st edn, McGraw-Hill Ryerson,
Whitby, ON.
Nicholson, KW 2003, Linear algebra with applications, 4th edn, McGraw-Hill
Ryerson, Whitby, ON.
Poole, D 2006, Linear algebra: A modern introduction, 2nd edn, Thomson Brooks/
Cole, California.
Sadler, AJ & Thorning, DWS 1996, Understanding pure mathematics, Oxford
University Press, Oxford.
Victorian Curriculum and Assessment Authority (VCAA) 2005, VCE Mathematics
study design, VCAA, East Melbourne.
Wheal, M 2003, Matrices: Mathematical models for organising and manipulating
information, 2nd edn, Australian Association of Mathematics Teachers,
Adelaide.
Websites
http://wims.unice.fr/wims/en_tool~linear~matrix.html
This website provides a matrix calculator.
http://en.wikipedia.org/wiki/Matrix_(mathematics)
This website gives a concise introduction to matrices and matrix arithmetic, and
has links to other resources and references.
http://www.sosmath.com/matrix/matrix.html
This site has some notes on basic matrix concepts and operations at quite a simple
level.
36
C ha p t e r 3
S o lv i n g s ys t e m s o f
s i m u l t a n e o u s l i n e ar
eq u at i o n s
From the middle years of secondary schooling, students become familiar with
linear relations of the form ‘a certain number of x s added to a certain number
of y s are equal to a given number’, such as 2x + 3y = 24. Usually this is done
by considering a table of whole number ordered pairs of values that satisfy the
relation, and then plotting the corresponding points on a set of cartesian axes.
This is then typically extrapolated (by an implicit continuity assumption) to
consideration of the continuous straight line on which these points lie. Thus
students learn to draw the graph, part of which is shown in Figure 3.1, of such
15
10
x
–20 –15 –10 –5 O 5 10 15 20
–5
–10
–15
–20
Figure 3.1: Part of the graph of the linear relation 2x + 3y = 24
37
MathsWorks for Teachers
Matrices
38
chapter 3
Solving systems of simultaneous linear equations
15
10
y=7 (4, 7)
5
x
–4 –2 O 2 4 6 8 10
–5 x=4
–10
–15
–20
Figure 3.2: Parts of the graphs of the lines with equations x = 4 and y = 7
showing their point of intersection at (4, 7)
39
MathsWorks for Teachers
Matrices
[x = 44 49
23 ^ y = – 23 ]
This ordered pair represents the point of intersection of the corresponding
straight line graphs in the cartesian plane, as shown in Figure 3.3.
x
O
–2 –1 1 2 3 4
–2
–4
–6
40
chapter 3
Solving systems of simultaneous linear equations
5
z
–5
25
–25
y
x
–25
25
Figure 3.4: Graph of parts of 3x – 2y + z = 0 and x – y – z = 10 showing intersection
41
MathsWorks for Teachers
Matrices
42
chapter 3
Solving systems of simultaneous linear equations
R V R V
Sx W Sx W
matrix equations, 7 3 - 2 1A # Sy W = 5 0 ? and 71 0 - 1A # Sy W = 5 4 ?
SS WW SS WW
z z
T X T X
respectively, or simultaneously via the single matrix equation
R V
3 -2 1 Sx W 0
> H # Sy W = = G.
1 0 -1 S W 4
Sz W
T X
S t u d e n t ac t i v i t y 3 . 1
a Write down a system of two simultaneous linear equations in two variables that has
(–5, 6) as its unique solution.
b Write down a system of two simultaneous linear equations in two variables that has
(–5, 6) as one of its many solutions.
c Write down a system of two simultaneous linear equations in three variables that
have (0, 0, 0) and (1, 1, 1) as solutions.
d Use the Solve functionality of a CAS, or a like functionality of other suitable
technology, to find the intersection of 3x – 2y + z = 0 and x – y – z = 10, and
express the solution set in terms of y.
S o l v i n g s y s t e m s o f s i m u l t a n e o u s l i n e ar
e q u a t i o n s u s i n g m a t ri x i n v e r s e
If a system of n simultaneous linear equations in n variables is consistent and
has a unique solution, then square matrices and their inverse matrices may be
used to find this unique solution. We will begin by considering two simple
examples, give some geometric interpretations and then introduce some
general notation. To start with, it is important for students to see how a
familiar set of two simultaneous linear equations in two variables may be
written as an array, as is often the case for by hand techniques, and
subsequently as a single matrix equation.
43
MathsWorks for Teachers
Matrices
Example 3.1
44
chapter 3
Solving systems of simultaneous linear equations
AX = B. This equation can be solved for X by left multiplying both sides of the
matrix equation by A–1 and applying matrix algebra:
AX = B
A–1(AX) = A–1B (multiplying by A–1 on left)
(A–1 A)X = A–1B (by associativity of matrix multiplication)
IX = A–1B (since (A–1 A) = I)
X = A–1B (since IX = X)
For the system of two simultaneous linear equations in two unknowns this
gives:
R V
x 1 - 1 -1 1 S 1 1W 1
= G== G = G = S 21 12 W= G = = G
2
y 1 1 3 S- W3 1
S 2 2W
T X
So x = 2 and y = 1 is the (simultaneous) solution to both the equations, as
expected. Using the matrix inverse, both values are obtained at the same time,
unlike other techniques in which the values are determined successively.
Geometrically, each equation corresponds to a straight line in the cartesian
plane, and they intersect at the point with coordinates (2, 1), as shown in
Figure 3.5.
3
x+y=3
2
x
–1 O 1 2 3 4 5
–1
–2
x–y=1 –3
Figure 3.5: Graphs of x – y = 1 and x + y = 3 and their point of intersection
45
MathsWorks for Teachers
Matrices
Figure 3.6.
–5
5
5 x
y
–5 –5
Figure 3.6: Graphs of x – y – z = 0, 6x – 4y = 20 and –4y + 2z = 10 and their point of intersection
46
chapter 3
Solving systems of simultaneous linear equations
These are called consistent systems, as there is a solution. In both cases the
solution is also unique. This simple matrix method is ideal if there is a unique
solution. However, in many cases there may be no solution or infinitely many
solutions, and in these cases the coefficient matrix does not have an inverse.
Intuitively students should be able to discuss and identify geometric
interpretations of possible intersections of graphs of straight lines (using
rulers) and planes (using sheets of paper) corresponding to the two-
dimensional and three-dimensional cases respectively. If two students write
down the equations of two straight lines of the form ax + by = c
independently and then compare, there are three possibilities:
• They correspond to distinct lines with different gradients which have a
unique point of intersection, for example {3x + 2y = 4, –x + y = 0}.
• They correspond to distinct lines with the same gradient which have no
points of intersection, that is they are parallel with different axis intercepts,
for example, {3x + 2y = 4, 3x + 2y = 0}.
• They correspond to the same line with the same gradient and have
infinitely many points of intersection, that is they are parallel with the
same axis intercepts, for example {3x + 2y = 4, 1.5x + y = 2}.
If three students write down the equations of three planes of the form
ax + by +cz = d independently and then compare, there are also several
possibilities:
• The three planes are distinct and intersect in a unique point.
• The three planes are distinct and intersect in a line (like a three-page book).
• The three planes are identical and intersect in infinitely many points that
form a single common plane.
• The three planes do not intersect all together. (There are several ways in
which this might occur geometrically.)
Students should be able to discuss the idea that, as a single equation in
three variables of the form ax + by + cz = d represents a single plane in R3,
then a system of two simultaneous linear equations in three variables could
have zero or infinitely many solutions, the former because the planes are
parallel but distinct, the latter either because the planes are identical or
because they define a line in R3 by their set of intersection points. Student
exploration of these possibilities will be aided by access to CAS with two-
dimensional and three-dimensional graphing functionalities.
47
MathsWorks for Teachers
Matrices
Example 3.2
3
x+y=4
2
x+y=2
1
x
O
–2 –1 1 2 3 4 5
–1
–2
–3
Figure 3.7: Graphs of x + y = 2 and x + y = 4, parallel straight lines with
no points of intersection
For the second system, the corresponding lines are in fact the same
line, as shown in Figure 3.8, hence each point on the line is a solution to
the system. This system is consistent, but with infinitely many
solutions.
48
chapter 3
Solving systems of simultaneous linear equations
x
O
–2 2 4
–2
–4
–6
Figure 3.8: Graphs of 3x – y = 4 and –6x + 2y = –8, identical straight lines with
infinitely many points of intersection
If we use y = k as the free variable, a set of solutions, or solution set, can be
written parametrically in the form $` 4 +3 k , kj: k ! R. .There are infinitely
many other ways of writing the solution set for this system. For example,
another solution set, using x = t as the free variable, is {(t, 3t – 4): t ! R}.
Each value of k or t, as applicable, generates the coordinates of a solution point.
S t u d e n t ac t i v i t y 3 . 2
49
MathsWorks for Teachers
Matrices
Although we cannot use the inverse of the coefficient matrix to solve the
above systems, we can use another technique called Gaussian elimination to
solve such systems. This is a generalisation, of the process of eliminating
variables, carried out systematically, and represented in matrix form. This
algorithm can be applied in cases where there are not the same number of
equations and variables—an important technical generalisation.
T h e m e t h o d o f Ga u s s ia n e l i m i n a t i o n
Example 3.3
x-y = 1
The system * 4 has augmented matrix = G.
1 -1 1
x+y = 3 1 1 3
Z x - y - z = 0_ R V
] b S1 - 1 - 1 0 W
The system [ 6x + 4y + 0z = 20` has augmented matrix S6 4 0 20 W.
] 0x - 4y + 2z = 10b SS0 - 4 2 10 WW
\ a T X
50
chapter 3
Solving systems of simultaneous linear equations
51
MathsWorks for Teachers
Matrices
augmented matrix rather than the equations. These row operations have the
same effect as the operation on equations, where the equations have been
written as a rectangular array with the coefficients of the variables and the
constants vertically aligned. The following are the corresponding elementary
row operations for a matrix:
1 Interchange: Interchange any two rows in their entirety.
2 Scaling: Multiply any row by a non-zero constant.
3 Elimination: Add a constant multiple of one row to another row.
A matrix is said to be in (row) echelon form if it satisfies the following two
conditions:
1 If there are any zero rows, they are at the bottom of the matrix.
2 The first non-zero entry in each non-zero row (called the leading entry or
pivot) is to the right of the pivots in the rows above it.
Matrices which are in row echelon form have a ‘staircase’ appearance:
R V
S0 * * * * W
S0 0 * * * W
S0 0 0 0 * W
S W
S0 0 0 0 0 W
T X
Example 3.4
Example 3.5
52
chapter 3
Solving systems of simultaneous linear equations
Two matrices A and B are said to be equivalent if one can be obtained from
the other by a finite sequence of elementary row operations, and we write
A ~ B to denote this equivalence.
Gaussian elimination is a procedure for bringing a matrix to its equivalent
row echelon form, and is described in steps 1–4 of the Gauss–Jordan algorithm
given below. At this stage the corresponding system of simultaneous linear
equations is in triangular form and could be solved by back-substitution. The
Gauss–Jordan algorithm, described in Table 3.1 is an extension of Gaussian
elimination which brings the matrix to its equivalent reduced row echelon
form from which the solution (if there is one) can be directly written down.
Gauss–Jordan algorithm
Step 1 Identify the leftmost non-zero column.
Step 2 If the first row has a zero in the column of Step 1, interchange it
with one that has a non-zero entry in the same column.
Step 3 Obtain zeros below the leading entry (also called a pivot) by
adding suitable multiples of the top row to the rows below it.
Step 4 Cover the top row and repeat the same process with the leftover
sub-matrix and starting at step 1. Repeat this process with each
row. (At this stage the matrix is in echelon form.)
Step 5 Start with the last non-zero row, work upwards. For each row,
obtain a leading 1 (by dividing by the value of the pivot) and
introduce zeros above it by adding suitable multiples of the row
with the leading 1 to the corresponding rows.
Example 3.6
53
MathsWorks for Teachers
Matrices
As noted earlier, in this case the solution of x = 2 and y = 1 can
readily be obtained by inspection, however this will not always be the
case. A simple example such as this enables students to attend to the
process being illustrated rather than focus on the manipulations
involved.
Solution
This system has the corresponding augmented matrix = G.
1 -1 1
1 1 3
• The leftmost non-zero column is the first column (Step 1).
• The top entry in this column is non-zero, so proceed to Step 3 of the
algorithm (Step 2).)
• The new second row will be the old second row minus the first row
(Step 3).
• The resulting matrix is = G, which is in row echelon form.
1 -1 1
0 2 2
(This corresponds to the system of simultaneous linear equations
x-y = 1
* 4 , which is in a triangular form, and could easily be solved
2y = 2
by back-substitution. The last equation is 2y = 2, and so y = 1.
Substitute this value for y in the first equation, and solve for x,
which gives x – 1 = 1, and so x = 2, hence the solution of the system
is (2, 1).)
• Now we must use Step 5 to convert the matrix to reduced row
echelon form. First, we need to turn the leading entry in the second
row, into a leading ‘1’. So divide the second row by 2 to obtain
54
chapter 3
Solving systems of simultaneous linear equations
Example 3.7
55
MathsWorks for Teachers
Matrices
56
chapter 3
Solving systems of simultaneous linear equations
Students may well inquire what happens when this algorithm is applied to
a system of simultaneous linear equations that corresponds to a pair of
parallel lines or identical lines in the cartesian plane.
Example 3.8
this system is = G.
1 1 4
1 1 2
The first step in the Gauss–Jordan algorithm is to replace the second
row with the previous second row minus the first row, to obtain
= G.
1 1 4
0 0 -2
(The system is now in triangular form, and we can see that the last
row corresponds to the equation 0x + 0y = –2, which is impossible.
Hence there are no solutions to this system.)
Next, we divide the elements in the second row by –2 to obtain
= G, and finally we subtract four times the second row from the
1 1 4
0 0 1
first row to obtain = G. Now we see that the last equation is
1 1 0
0 0 1
0x + 0y = 1, which is impossible, and so there are no solutions to this
system of equations.
57
MathsWorks for Teachers
Matrices
Example 3.9
) 3
- 6x + 2y = - 8
3x - y = 4
Solution
This system corresponds to a pair of identical parallel lines (same
gradient, same intercepts). The corresponding augmented matrix for this
system is = G.
-6 2 -8
3 -1 4
Replace the second row by itself + 12 times the first row. This gives
58
chapter 3
Solving systems of simultaneous linear equations
S t u d e n t ac t i v i t y 3 . 3
For the following systems of equations, enter the augmented matrix into a CAS, or other
suitable technology, and use this to obtain the reduced row echelon form. Hence solve the
following systems of simultaneous linear equations.
Z _
] x + 2y - z = 2b
a [ x + 4y - 3z = 3`
]2x + 5y - 3z = 1b
\ a
Z _
] - x + y + z = - 1b
b [ x - y + 3z = 5`
] 3x - 2y + z = - 2b
\ a
Z _
] 2x + 2y - z = 5b
c [- 2x + y + z = 7`
]- 4x + y + 2z = 10b
\ a
S y s t e m s o f s i m u l t a n e o u s l i n e ar
e q u a t i o n s i n v ari o u s c o n t e x t s
Many different contexts give rise to systems of simultaneous linear equations
in several variables, even when the relations involved may themselves be non-
linear. Substitution of values of the variables in such contexts often results in
a system of simultaneous linear equations relating coefficients and/or
arbitrary constants.
59
MathsWorks for Teachers
Matrices
Example 3.10
Solution
Since (–2, 3) lies on the circle, substitution of these values into the
equation for the circle gives (–2)2 + 32 – 2a + 3b + c = 0. This simplifies
to the linear equation:
–2a + 3b + c = –13
Similarly, as (6, 3) also lies on the circle we have
62 + 32 + 6a + 3b + c = 0, which simplifies to the linear equation:
6a + 3b + c = –45
For the point (2, 7), which also lies on the circle, we have
22 + 72 + 2a + 7b + c = 0, and hence:
2a + 7b + c = –53
We thus have a system of three simultaneous linear equations in a, b
and c:
Z _
]- 2a + 3b + c =- 13b
[ 6a + 3b + c =- 45`
] b
\ 2a + 7b + c =- 53a
The augmented matrix form is:
R V
S- 2 3 1 - 13 W
S 6 3 1 - 45W
SS 2 7 1 - 53 WW
T X
and using by hand or CAS manipulation to bring this to reduced row
echelon form yields:
R V
S1 0 0 - 4 W
S0 1 0 - 6 W
SS0 0 1 - 3 WW
T X
The solution can be read directly from this: a = – 4, b = – 6 and c = – 3,
and so the equation of the circle is x2 + y2 – 4x – 6y – 3 = 0. Completing
the square, on both x and y, results in the alternative equation of the form
(x – 2)2 + (y – 3)2 = 16 for the relation. The graph of this relation is a circle
with centre (2, 3) and radius 4, as shown in Figure 3.9.
60
chapter 3
Solving systems of simultaneous linear equations
x
O
–6 –4 –2 2 4 6 8
–2
–4
–6
Figure 3.9: Graph of the relation x2 + y2 – 4x – 6y – 3 = 0 or (x – 2) 2 + (y – 3) 2 = 16
Example 3.11
Three Toyotas, two Fords and four Holdens can be rented for $212 per
day. Alternatively, two Toyotas, four Fords and three Holdens can be
rented for $214 per day, or four Toyotas, three Fords and two Holdens
could be rented for $204 per day.
Assuming that the rate for renting any type of car is fixed by the
make, find the rental rates for each type of car per day.
Solution
Let a, b and c be the respective costs of renting a Toyota, a Ford and a
Holden per day. Then we have three simultaneous linear equations in
the three unknowns a, b and c.
3a + 2b + 4c = 212
2a + 4b + 3c = 214
4a + 3b + 2c = 204
61
MathsWorks for Teachers
Matrices
Example 3.12
A girl finds $5.20 in coins: 50 cent coins, 20 cent coins and 10 cent coins.
She finds 21 coins in total. How many coins of each type must she have?
Solution
Suppose she has a 50 cent coins, b 20 cent coins and c 10 cent coins. Then
50a + 20b + 10c = 520
She has 21 coins in total, so:
a + b + c = 21
This gives us two simultaneous linear equations in three unknowns.
We write the augmented matrix corresponding to the system:
= G
50 20 10 520
1 1 1 21
and find its reduced row echelon form:
R V
S1 0 - 1 10 W
S 3 3W
S0 1 4 53 W
S 3 3W
T X
Generally, there would be infinitely many possible solutions to these
equations, but we require non-negative integers as solutions. The leading
variables correspond to the columns with leading 1s, so are a and b.
62
chapter 3
Solving systems of simultaneous linear equations
The free variable is c, and it can take integer values between 0 and 21. We can
write a and b in terms of c, from the reduced row echelon matrix above, as
10 c 10 + c
a= 3 +3= 3
53 4c 53 - 4c
b= 3 - 3 = 3
To find the possible integer solutions, we need to consider integer values of
c between 0 and 21, and determine when 10 + c and 53 – 4c are both divisible
by 3. This could easily be done by technology, forming a 22 × 3 matrix, with
10 + c 53 - 4c
the first column containing c, the second 3 and the third 3 .
10 + c 53 - 4c
c 3 3
0 3.3 17.7
1 3.7 16.3
2 4.0 15.0
3 4.3 13.7
4 4.7 12.3
5 5.0 11.0
6 5.3 9.7
7 5.7 8.3
8 6.0 7.0
9 6.3 5.7
10 6.7 4.3
11 7.0 3.0
12 7.3 1.7
13 7.7 0.3
14 8.0 –1.0
15 8.3 –2.3
16 8.7 –3.7
17 9.0 –5.0
18 9.3 –6.3
19 9.7 –7.7
20 10.0 –9.0
21 10.3 –10.3
63
MathsWorks for Teachers
Matrices
Example 3.13
The scores of three players in a tournament have been lost. The only
information available is the total of the scores for players 1 and 2, the
total for players 2 and 3, and the total for players 3 and 1. Show that the
original scores can be recovered.
Solution
Let x, y and z be the scores for players 1, 2 and 3 respectively, and a, b
and c the totals for players 1 and 2, 2 and 3, and 3 and 1 respectively.
Then
x+y = a
y+z = b
z+x = c
is a system of three simultaneous linear equations in three unknowns x,
y and z. The augmented matrix is:
R V
S1 1 0 a W
S0 1 1 b W
SS W
1 0 1 cW
T X
Its corresponding reduced row echelon form is
R V
S1 0 0 a - b + c W
S 2 W
S a+b-c W
S0 1 0 2 W
S -a + b + c W
SS0 0 1 2 WW
T X
a-b+c a+b-c
So the original scores are x = 2 , y = 2 and
-a + b + c
z= 2 .
64
chapter 3
Solving systems of simultaneous linear equations
Example 3.14
Find the rule for the family of parabolas which pass through the points
(1, 2) and (3, 4).
Solution
Let the rule for the family of parabolas be y = ax 2 + bx + c, where a is
non-zero. Since (1, 2) lies on any member of this family of curves:
a+b+c = 2
Similarly, since (3, 4) lies on any member of the family of curves:
9a + 3b + c = 4
Hence we have a system of two equations in three unknowns a, b
and c. We first write the augmented matrix:
= G
1 1 1 2
9 3 1 4
and use CAS to reduce it to its reduced row echelon form:
1 0 - 13 - 13
> 4 7 H
0 1 3 3
c-1 k-1 4 7
a= 3 = 3 . The second row gives b + 3 c = 3 and so
7 - 4c 7 - 4 k
b= 3 = 3 .
Hence the rule for the family of parabolas is:
k - 1 2 7 - 4k
y= 3 x + 3 x + k , k ∈ R
We will graph a few of these curves.
If k = 1, then a = 0 and we obtain the straight line with equation
y = x + 1, which passes through the two points.
If k = 4, then we obtain the parabola with equation y = x 2 – 3x + 4.
If k = – 5, then we obtain the parabola y = –2x 2 + 9x – 5. The graphs
of these curves are shown in Figure 3.10.
65
MathsWorks for Teachers
Matrices
y
6
g
5
h
4 (3, 4)
2 f
(1, 2)
1
x
–1 O 1 2 3 4
–1
–2
Figure 3.10: Graph of f(x) = –2x2 + 9x – 5, g(x) = x2 – 3x + 4 and h (x) = x + 1
Example 3.15
Find the rule for the family of cubic polynomials which passes through
the points (1, 0) and (–1, 0), with slope –4 when x = 1.
Solution
Let f(x) = ax3 + bx2 + cx + d be the rule of a cubic polynomial function,
with a, b, c and d the unknown coefficients. Since (1, 0) lies on the curve:
a+b+c+d = 0
66
chapter 3
Solving systems of simultaneous linear equations
-a + b - c + d = 0
Now f ′(x) = 3ax 2 + 2bx + c, and the slope at x = 1 is –4, so
3a + 2b + c = –4.
We now have a system of three simultaneous linear equations in four
unknowns:
Z _
] a + b + c + d = 0b
[- a + b - c + d = 0`
] 3a + 2b + c = - 4b
\ a
We can write down the augmented matrix corresponding to this
system:
R V
S 1 1 1 1 0 W
S- 1 1 - 1 1 0 W
S W
S 3 2 1 0 - 4W
T X
The reduced row echelon form of this is:
R V
S1 0 0 - 1 - 2 W
S0 1 0 1 0 W
SS W
0 0 1 1 2 W
T X
There are three leading variables, a, b and c, and one free variable, d.
Let d = k where k ∈ R, and express the leading variables in terms of k.
The first row of the reduced matrix tells us that a – d = –2, and so
a = –2 + k. The second row tells us that b + d = 0, and so b = –k and the
third row tells us that c + d = 2, so c = 2 – k. Thus, the rule for the
family of polynomials is:
f: R → R, where f(x) = (k – 2)x3 – kx2 +(2 – k)x + k, and k ∈ R.
If k = 2, the function will be the quadratic with rule f(x) = –2x 2 + 2.
We can check that this is the general form by drawing the graphs of
some members of this family:
If k = 0, f(x) = –2x3 + 2x.
If k = 3, f(x) = x3 – 3x 2 – x + 3.
If k = –1, f(x) = –3x3 + x 2 + 3x – 1.
The graphs of these, and for k = 2, are shown in Figure 3.11 on the
following page.
67
MathsWorks for Teachers
Matrices
y
4
y = –2x + 2x
3
p 3 y = x3 –3x2 – x + 3
2
f
1
x
O
–4 –3 –2 –1 g 1 2 3 4
–1
y = –3x + x + 3x – 1
3 2
–2
h
y = –2x2 + 2 –3
–4
Example 3.16
Solution
Let f(x) = ax4 + bx3 + cx 2 + dx + e be the rule for the family of quartic
polynomials.
Since they pass through (1, 2), f(1) = 2, so:
a + b + c + d + e = 2
Since they pass through (–2, –1), f(–2) = –1, so:
16a – 8b + 4c – 2d + e = –1
Now f ′(x) = 4ax3 + 3bx 2 + 2cx + d. Since we must have f ′(1) = 5:
4a + 3b + 2c + d = 5
We now have a system of three simultaneous linear equations in the
five unknowns a, b, c, d and e:
Z _
] a + b + c + d + e = 2b
[ 16a - 8b + 4c - 2d + e = - 1`
] 4a + 3b + 2c + d = 5b
\ a
68
chapter 3
Solving systems of simultaneous linear equations
f]xg = b 2 s + 4 t + 12 l x 4 + b 2 t + 6 l x 3 + b- 2 s - 4 t + 12 l x2 + sx + t
1 3 1 1 5 3 9 13
where s, t d R .
As a solution, this looks fairly formidable, so it’s a useful strategy to
plot a few of its members.
• If s = 0 and t = 0 , then we have the function
f]xg = 12 x 4 + 6 x 3 + 12 x2 .
1 5 13
69
MathsWorks for Teachers
Matrices
2
1
x
–4 –3 –2 –1 O 1 2 3 4
–1
–2
–3
–4
–5
Figure 3.12: Parts of the graphs of some members of the family of functions
for various values of s and t
S t u d e n t ac t i v i t y 3 . 4
a The scores of four players in a tournament have been lost. The only information
available is the total of the scores for players 1 and 2, the total for players 2 and 3,
the total for players 3 and 4 and the total for players 4 and 1. Can the original
scores be recovered?
b Find the equation of the cubic polynomial which passes through the points (1, 0)
and (–1, 0), with slope –4 when x = 1 and slope 12 when x = –1.
c Find the rule for the family of quartic polynomials (polynomials of degree 4) that
pass through the points (1, 2), (–2, –1) and (2, 0), and have slope 5 at x = 1.
d Find the rule for the quartic polynomial (polynomial of degree 4) that passes
through the points (1, 2), (–2, –1), (2, 0) and (–1, 5), and has slope 5 at x = 1.
70
chapter 3
Solving systems of simultaneous linear equations
SUMM A R Y
71
MathsWorks for Teachers
Matrices
SUMM A R Y (Cont.)
or
– there are no solutions, and the graphs of the three planes are all
parallel but distinct; or one pair is parallel (and distinct) and the
other oblique to this pair; or they are configured like a triangular
prism.
• The system of simultaneous linear equations can be written in
matrix form AX = B, where
R V R V
S a11 a12 f a1n W S x1 W
S a21 a22 f a2n W S x2 W
A=S W is the m × n coefficient matrix, X = S W the
S h h h W ShW
Sam1 am2 f amn W S xn W
T X T X
R V
S b1 W
Sb W
n × 1 column matrix (vector) of variables, and B = S 2 W the m × 1
column matrix (vector) of constant terms. Sh W
Sbm W
T X
• If A is an invertible (non-singular) square matrix (m = n) then the
inverse method can be employed and X = A–1B.
• The m × (n + 1) augmented matrix of the system is the following
matrix:
R V
S a11 a12 f a1n b1 W
S a21 a22 f a2n b2 W
S W
S h h h h W
Sam1 am2 f amn bm W
T X
• The system of simultaneous linear equations of the form AX = O is
said to be homogeneous and is always consistent, with X = O
(the relevant zero vector) a solution.
• To solve such systems of equations using the Gauss–Jordan method,
there are three steps.
Step 1: Write the augmented matrix for the system of equations.
Step 2: Enter the augmented matrix into CAS, or other suitable
technology, and obtain the reduced row echelon form of the
matrix (using exact arithmetic where possible).
72
chapter 3
Solving systems of simultaneous linear equations
SUMM A R Y (Cont.)
References
Anton, H & Rorres, C 2005, Elementary linear algebra (applications version),
9th edn, John Wiley and Sons, New York.
Cirrito, F (ed.) 1999, Mathematics higher level (core), 2nd edn, IBID Press, Victoria.
Hill, RO Jr 1996, Elementary linear algebra with applications, 3rd edn, Saunders
College, Philadelphia.
Lipschutz, S & Lipson, M 2000, Schaum’s outline of linear algebra, 3rd edn,
McGraw-Hill, New York.
Nicholson, KW 2001, Elementary linear algebra, 1st edn, McGraw-Hill Ryerson,
Whitby, ON.
Nicholson, KW 2003, Linear algebra with applications, 4th edn, McGraw-Hill
Ryerson, Whitby, ON.
Poole, D 2006, Linear algebra: A modern introduction, 2nd edn, Thomson Brooks/
Cole, California.
Wheal, M 2003, Matrices: Mathematical models for organising and manipulating
information, 2nd edn, Australian Association of Mathematics Teachers,
Adelaide.
73
MathsWorks for Teachers
Matrices
Websites
http://en.wikipedia.org/wiki/Gaussian_elimination – Wikipedia
This site provides a comprehensive discussion with links to other resources and
references.
http://mathworld.wolfram.com/GaussianElimination.html – Wolfram Research
This site is the online mathematical encyclopaedia from the developers of the
CAS Mathematica. It provides a concise but comprehensive mathematical
overview and includes links to related topics and a good list of other references.
http://www.sosmath.com/matrix/system1/system1.html – SOS Mathematics
This site provides an accessible discussion with worked examples using Gaussian
elimination for some simple cases of systems of simultaneous linear equations.
http://aleph0.clarku.edu/~djoyce/ma105/simultaneous.html – Department of
Mathematics and Computer Science, Clark University.
This site includes a first principles discussion of a practical problem based on
ancient Chinese methods.
http://www.jgsee.kmutt.ac.th/exell/PracMath/SimLinEq.html – Practical
Mathematics.
This site covers straightforward examples for 2 × 2 and 3 × 3 systems, with a
collection of related exercises.
http://mathforum.org/linear/choosing.texts/ – Drexel University
This site provides information on selected linear algebra texts, including those
that are technology based.
http://www.sosmath.com/matrix/matrix.html – SOS Mathematics
This site has some notes with examples on solving systems of linear equations,
and is at quite a simple level.
74
C ha p t e r 4
Tra n s f o r m a t i o n s o f t h e
car t e s ia n p l a n e
75
MathsWorks for Teachers
Matrices
Matrices are well suited to the analysis of linear transformations, and provide
a convenient notation for distinguishing between these two senses of function.
Indeed, linear transformations can be used to provide a strong motivation for the
definition of matrix multiplication, as applied to 2 × 2 matrices. The application
of matrices to the analysis of linear transformations involves the solution of
systems of simultaneous linear equations and matrix inverses.
Li n e ar t ra n s f o r m a t i o n s
76
chapter 4
Transformations of the cartesian plane
2
x
–8 –6 –4 –2 O 2 4 6 8
–2
–4
–6
–8
Figure 4.1: Intersecting lines indicating a subset of grid points of Z × Z = Z 2,
where Z is the set of integers
x
Since any u = (x, y) ∈ R2 can be written as a column matrix u = = G, we
y
can write u = x = G + y = G; that is, we can write u as a linear combination of
1 0
0 1
= G and = G.
1 0
0 1
By the linearity properties, to determine the image of u under T we simply
need to determine the image of = G and = G under T. The set of vectors
1 0
0 1
{[1, 0], [0, 1]} or )= G, = G3 is said to be a basis of the vector space they
1 0
0 1
generate, in this case the set of all coordinate vectors in the cartesian plane, as
77
MathsWorks for Teachers
Matrices
Then T e= Go = T e x = G + y = Go = xT e= Go + yT e= Go
x 1 0 1 0
y 0 1 0 1
= x= G + y= G = = G==
a ax + by a b x
G= G
b
c d cx + dy c d y
Such a linear transformation can be accomplished by a matrix
multiplication, with the matrix determined by the image of the two points
T e= Go = = G = G = = G and T e= Go = =
a
G= G = = G
1 a b 1 0 a b 0 b
0 c d 0 c 1 c d 1 d
Example 4.1
Solution
a To find the image of the point (2, 4), we simply find the matrix
product:
= G= G = = G
0 1 2 4
1 0 4 2
78
chapter 4
Transformations of the cartesian plane
If the points whose images are known are not = G and = G, then pairs of
1 0
0 1
simultaneous linear equations or inverse matrices are used to find the matrix
for the transformation.
Example 4.2
Solution
Then = G = G = = G and = G = G = = G.
a b 1 -1 a b 3 0
c d 2 1 c d 1 1
a + 2b =- 1 3a + b = 0
That is, and .
c + 2d = 1 3c + d = 1
Combining the four equations, we have two equations involving
a and b, and two equations involving c and d. For integer values of
a, b, c and d, these can usually be readily, if somewhat tediously, solved
by hand.
79
MathsWorks for Teachers
Matrices
Example 4.3
-1 2
The matrix A = > H transforms the point P(x, y) onto the point
3 -8
Q(1, –7). Find the coordinates of the point P.
Solution
We have = G = G = = G.
-1 2 x 1
3 -8 y -7
-4 -1 1
G = G = > 3 1 H= G = = G
- 1 2 -1 1
Hence = G = =
x 3
y 3 -8 -7 - 2 - 2 -7 2
So P has coordinates (3, 2).
S t u d e n t ac t i v i t y 4 . 1
a Show that any linear transformation maps the origin to the origin.
3 -1 -3 1
b Explain why a linear transformation T with T f= Gp = = G and T e= Go = = G is
-1 1 1 -1
not uniquely determined. Find at least two linear transformations that satisfy the
above conditions.
c Find the points that are mapped to the points (1, 0) and (0, 1) by the linear
4 3
transformation with matrix = G.
5 4
80
chapter 4
Transformations of the cartesian plane
Li n e ar t ra n s f o r m a t i o n o f a s t rai g h t l i n e
image of the graph of the relation with the equation ax + by + c = 0 (which is
a straight line) is the graph of the relation with the equation
1] g 1] g
2 ^x1 + y1h + 2 ^x1 - y1h + c = 0 , or 2 a + b x1 + 2 a - b y1 + c = 0
a b
(which is also a straight line). In particular, the straight line with equation
2x + 3y - 6 = 0 is transformed onto the straight line with equation
5 1
2 x - 2 y - 6 = 0 , as shown for part of the original line (that is a line segment
subset of the original line) and the corresponding image points in Figure 4.2.
y
+ + + + 4 + + + +
+ + + + + + + +
+ + + + 2 + + + +
2x + 3y – 6 = 0 5x – y – 6 = 12
+ + + + + + + +
x
–4 –2 O 2 4
+ + + + + + + +
+ + + + –2 + + + +
+ + + + + + + +
+ + + + –4 + + + +
Figure 4.2: Graph of 2x + 3y – 6 = 0, 0 < x < 3, and its image under T(x, y) = (x + y, x – y)
81
MathsWorks for Teachers
Matrices
An equation for any straight line can easily be written in parametric form.
Although not commonly used for straight lines in the cartesian plane (unless
as a simple application of vector kinematics), the parametric form of an
equation for a straight line is simply a vector equation for the line. It gives the
position vector of each point on the line with respect to the origin. In this
sense the coordinates of a point in the plane correspond to its position vector
relative to the origin (0, 0). Parametric forms are very useful in computer
graphic applications.
The following discussion should be developed through an exposition that
connects the graphical picture with the conceptual and symbolic argument.
Consider the straight line that passes through the two distinct points P, with
position vector p = (x1, y1), and Q, with position vector q = (x 2, y2). A
direction vector for this line from P to Q is d = q – p = (x 2 – x1, y2 – y1), and
the position vector r of any point on the line is given by r = p + td, t ∈ R. That
is, any point on the line that passes through P and Q must be some distance
(a scalar multiple of the length of the directed line segment PQ ) along this
line from the point P. This is illustrated in Figure 4.3. If we restrict t to the
interval [0, 1], then we have exactly the directed line segment from P to Q.
As d = q – p, another way of writing this position vector is
r = p + t(q – p) = (1 – t)p + t q. When t = 0, r = p, and when t = 1, r = q, the
endpoints of the line segment. If 0 ≤ t ≤ 1, then the vector r is clearly the
position vector of some point on the directed line segment PQ .
R
Q
d
P
r
p q
O
Figure 4.3: Vector representation of a line through two distinct points in the plane
For example, consider the line passing through the points P(1, 2) and
Q(–3, 4). Then the position vector r of any point R on the line through P and
Q can be written r = (1, 2) + t (–4, 2), where PQ = (–4, 2).
82
chapter 4
Transformations of the cartesian plane
It is natural for students to inquire how this relates to the more common
representation of a straight line by the rule y = mx + c, where c is interpreted
as the y-axis intercept and m represents the slope of the line (the ratio of
relative difference between the y values of two points with respect to their
corresponding difference in x values). A simple parameterisation of
y = mx + c is to let x = t, where t ∈ R, then y = mt + c and the corresponding
vector parametric form is (t, mt + c), t ∈ R. This simple parameterisation can
x
also be written in matrix form as = G = = G + t = G.
0 1
y c m
In vector terms, the position vector of any point S on the line is the sum of
the vector = G, the position vector of the point (0, c), the y-axis intercept, and
0
c
a scalar multiple of the vector = G, which gives the direction of the line. Note
1
m
that the vector = G has a horizontal component of 1 unit and a vertical
1
m
component of m units, as shown in Figure 4.4.
S
y
m
(0, c)
1
x
O
Figure 4.4: Vector representation of y = mx + c in the plane
Using matrices and vectors, it can easily be shown that under a linear
transformation with non-singular matrix:
1 The origin is mapped onto itself.
2 The transformation is a one-to-one mapping of R2 onto R2.
3 A straight line (line segment) is mapped onto a straight line (line segment).
83
MathsWorks for Teachers
Matrices
4 Any pair of distinct parallel lines is mapped onto another pair of distinct
parallel lines.
5 A straight line that passes through the origin is mapped onto another
straight line that passes through the origin.
If the matrix of the transformation is singular, then any line will be
mapped to a line or point.
There are several methods for determining the equation of the image of a
line under a linear transformation. Some involve the use of the inverse of a
2 × 2 matrix, while others do not. The following discussion illustrates three
different approaches. Teachers should encourage students to think about the
mathematical strategies involved, and where various constructs arise in each
case.
Example 4.4
A == G. Find the image of the line with rule y = 2x + 3 under this
1 2
4 -3
transformation.
Solution
Method 1
Since straight lines are transformed onto straight lines under a linear
transformation, we can find the image by finding the image of any two
distinct points on the original straight line. For example, the points (0, 3)
and (1, 5) clearly lie on the straight line with equation y = 2x + 3.The
corresponding
^y + 9h =- 5 ]x - 6g or y =- 5 ]2x + 33g.
2 1
84
chapter 4
Transformations of the cartesian plane
Method 2
Method 3
1 2 -1 1 2 x 1 2 - 1 x1
> H > H= G = > H > H
4 -3 4 -3 y 4 -3 y1
R V R V
-1 S 3 2 W S 3x1 + 2y1 W
x 1 2 x1 11 11 W x1
= G=> H > H= S > H = SS
11 W
y 4 -3 y 1 S 4 - 1 W y 1 4 x 1 - y1 W
S 11 11 W S 11 W
T X T X
= 2 c 1 11 1 m + 3, which
4x1 - y1 3 x + 2 y
Hence y = 2x + 3 becomes 11
simplifies to 5y1 =- 2x1 - 33 or y1 =- 5 ^2x1 + 33h and so the equation
1
x1
This method can be implemented directly by finding = G = A- 1 f> Hp,
x
y y 1
85
MathsWorks for Teachers
Matrices
will not work if we cannot find the inverse of the transformation matrix, that
is, if the transformation matrix is singular. In this case the transformation
maps the plane onto a line through the origin or onto the origin itself.
Method 3 is easy to adapt to finding the image of any function, and is the one
we will use in general.
Example 4.5
Solution
We cannot use Method 3 because the matrix A does not have an inverse,
since det(A) = 1 – 1 = 0. We begin in a similar way, and find the image
of the point with coordinates (x, y).
1 1 x x+y x1
= G= G = > H=> H
1 1 y x+y y 1
Since the x1- and y1-coordinates are the same, and x + y is not
constant, the image of the line is y = x. In fact, one can show that the
image of any line other than those of the form y = –x + c is y = x, and
that the image of y = –x + c is the point (c, c), which of course is on the
line y = x. This transformation corresponds to a projection onto the line
y = x. Projections onto lines will not be considered in any detail, since
they do not occur when transforming functions.
S t u d e n t ac t i v i t y 4 . 2
86
chapter 4
Transformations of the cartesian plane
d Find the equations of lines which are mapped to points under the linear
11
transformation with matrix = G .
11
e Find the image of the unit square (that is, the region bounded by line segments
joining vertices (0, 0), (0, 1), (1, 1) and (1, 0)) and the area of this region under the
2 1
transformation with matrix = G.
1 3
Li n e ar t ra n s f o r m a t i o n o f a c u r v e
To find the image of the graph of y = f(x) or f(x, y) = c under the linear
transformation with matrix A, we can proceed as for a straight line, provided
A has an inverse. Consider an arbitrary point (x, y) on the curve which is the
graph of the function or relation we are interested in. If this is mapped to the
x x1 x x1
point (x1, y1) where A = G = > H, we can equivalently write = G = A- 1 > H,
y y 1 y y 1
which gives expressions for x and y. These can simply be substituted into
y = f(x) or f(x, y) = c to find the equation of the image function or relation.
Example 4.6
Find the image of the function y = x 2 and the relation 4x 2 + y2 = 1 under
1 2
the linear transformation represented by the matrix A = > H.
4 -3
Solution
Consider an arbitrary point (x, y) on the curve, which is mapped to the
1 2 x x1
point (x1, y1) where > H = G = > H. This gives us equations for x1 and
4 -3 y y1
y1 in terms of x and y. What we want is to solve these for x and y in
terms of x1 and y1, and substitute into the original equation of the curve
to give an equation involving x1 and y1. This can be done easily by pre-
multiplying both sides of the matrix equation by A–1.
1 2 -1 1 2 x 1 2 - 1 x1
> H > H= G = > H > H
4 -3 4 -3 y 4 -3 y1
R V R V
-1 S 3 2 W S 3x1 + 2y1 W
x 1 2 x1 11 11 W x1
= G=> H > H= S > H = SS
11 W
y 4 - 3 y 1 S 4 - 1 W y 1 4 x 1 - y 1 W
S 11 11 W S 11 W
T X T X
87
MathsWorks for Teachers
Matrices
Hence the rule of the image of the function y = x 2 is the relation
4x1 - y1 c 3x1 + 2y1 m2
11 = 11 . Expanding this expression and replacing x1 by
x and y1 by y gives 44x – 11y = 9x 2 + 12xy + 4y2. The graphs of both
the original function and the image relation are shown in Figure 4.5.
y
10
O
x
–4 –3 –2 –1 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20
–6
–10
Figure 4.5: Graph of the function y = x2 and its image relation
under transformation by the matrix A
4c
3x1 + 2y1 m2 c 4x1 - y1 m2
11 + 11 = 1. Expanding this expression and
replacing x1 by x and y1 by y we have 52x 2 + 40xy + 17y2 = 121. The
graphs of the original curve and its image are shown in Figure 4.6.
y
4
3
2
1
x
–6 –5 –4 –3 –2 –1 O 1 2 3 4 5 6
–1
–2
–3
–4
Figure 4.6: Graph of the relation 4x2 + y2 = 1 and its image relation
under transformation by matrix A
88
chapter 4
Transformations of the cartesian plane
S t u d e n t ac t i v i t y 4 . 3
3 0
a Find the image of y = sin(x) under the linear transformation with matrix = G.
0 2
3 5
b Find the image of y = x2 under the transformation with matrix = G.
1 2
11
c Find the image of y = x2 under the transformation with matrix = G .
11
S t a n d ar d t y p e s o f l i n e ar
t ra n s f o r m a t i o n s
It is a key part of many senior secondary mathematics curricula to investigate
the effects of certain standard types of transformations on the graphs of
familiar functions and relations. In the following discussion, such an
investigation is carried out with respect to the unit square—defined as the
region bounded by and including the set of four line segments joining (0, 0)
with (1, 0); (1, 0) with (1, 1); (1, 1) with (0, 1) and (0, 1) with (0, 0)—and the
graphs of the functions with domain R and rules f(x) = x 2 and g(x) = sin(x)
respectively. Note that if a linear transformation with matrix A maps a region
of area a1 onto a region of area a2, then a2 = |det(A)| × a1.
Computer algebra systems can be used to good effect to apply transformations
to points, find algebraic relations corresponding to the transformation of
variables, carry out computation involving compositions of transformations
and their inverses, and draw graphs of original and transformed sets of points
in the plane.
Solution
Under this transformation, each point (x, y) is mapped to the point
(x, 3y), and so the corners (vertices) of the unit square (0, 0), (1, 0), (1, 1)
89
MathsWorks for Teachers
Matrices
and (0, 1) are mapped to the points (0, 0), (1, 0), (1, 3), (0, 3) respectively.
The square has been stretched vertically, and the resulting rectangle has
area 3 square units, as shown in Figure 4.7.
y
3 (1, 3)
(1, 1)
1
x
–1 O 1 2 3
–1
Figure 4.7: Graph of the unit square and its image under the transformation with matrix < F
1 0
0 3
(corresponding to a dilation by factor 3 from the x-axis)
In general, the point (x, y) is mapped to the point (x1, y1), where
1 0 x x x1
= G = G = = G = > H, and so x = x1 and y = 31 .
y
0 3 y 3y y1
The effect on the graph of y = f(x) is given by substituting for x and y
y
in y = x 2, which results in 31 = x12. The corresponding rule for the
transformed function is y = f 1(x) = 3x 2. Part of the graph of the original
function and its image function is shown in Figure 4.8.
y
7
6
5
4
3
2
1
O x
–4 –3 –2 –1 1 2 3 4
–1
Figure 4.8: Graph of part of f(x) = x2 and its image under the transformation with matrix < F
1 0
0 3
90
chapter 4
Transformations of the cartesian plane
y in y = sin ]xg, which results in 31 = sin (x1) or y1 = 3 sin ^x1h. The rule
y
4
3
2
1
x
–4 –3 –2 –1 O 1 2 3 4
–1
–2
–3
–4
Example 4.8
What effect does the transformation with matrix = G have on the unit
3 0
0 1
square and the graphs of the functions f and g?
Solution
3 0 x 3x
Under the transformation, = G = G = = G, so each point (x, y) is mapped
0 1 y y
to the point (3x, y), and so the vertices of the unit square (0, 0), (1, 0),
(1, 1) and (0, 1) are mapped to the points (0, 0), (3, 0), (3, 1), (0, 1)
respectively. The square has been stretched horizontally, and the area of
the resulting rectangle is 3 square units, as shown in Figure 4.10.
91
MathsWorks for Teachers
Matrices
(1, 1) (3, 1)
1
x
–1 O 1 2 3 4
–1
Figure 4.10: Graph of the unit square and its image under the transformation with matrix < F
3 0
0 1
In general, the effect on any curve or region is described by
3 0 x 3x x1
= G = G = = G = > H, and so x = 31 and y = y1.
x
0 1 y y y1
The effect on the graph of y = f(x) is given by substituting for x and y
x
O
–10 –8 –6 –4 –2 2 4 6 8 10
–1
Figure 4.11: Graph of part of f(x) = x2 and its image under the transformation with matrix < F
3 0
0 1
92
chapter 4
Transformations of the cartesian plane
The rule for the transformed function is g1 ]xg = sin b 3 l. The graphs
x
of g and g 1 are shown in Figure 4.12.
y
x
–10 –8 –6 –4 –2 O 2 4 6 8 10
–1
Example 4.9
93
MathsWorks for Teachers
Matrices
Solution
3 0 x 3x
Under the transformation = G = G = = G. Each point (x, y) is mapped to
0 3 y 3y
the point (3x, 3y), and so the vertices of the unit square (0, 0), (1, 0),
(1, 1) and (0, 1) are mapped to the points (0, 0), (3, 0), (3, 3), (0, 3)
respectively. The square has been scaled by a factor of 3 and the
resultant square has an area of 9 square units, as shown in Figure 4.13.
y
(3, 3)
3
(1, 1)
1
x
–1 O 1 2 3
–1
Figure 4.13: Graph of the unit square and its image
3 0 x 3x x1
Since = G = G = = G = > H, we have x = 31 and y = 31 , so the
x y
0 3 y 3y y1
graphs of y = x 2 and y = sin(x) are transformed to the graphs of the
respectively.
94
chapter 4
Transformations of the cartesian plane
1 0
for ky > 0 (a dilation from the y-axis) and the transformation > H for kx > 0
0 kx
(a dilation from the x-axis), the product
ky 0
H=> H
ky 0 1 0 1 0 ky 0
> H> H=> H>
0 1 0 kx 0 kx 0 1 0 kx
is a composite dilation, and this composition is commutative (it doesn’t matter
in which order the transformations are applied, the final image is the same).
x
After such a composite dilation, y = f(x) becomes y = k x f e o.
ky
Reflections in lines through the origin
-1 0 1 0
H and = G represent (i) a reflection in the y-axis,
0 1
The matrices > H, >
0 1 0 -1 1 0
(ii) a reflection in the x-axis and a (iii) reflection in the line y = x respectively,
since for any (x, y) we have:
-1 0 x -x
i > H= G = > H
0 1 y y
1 0 x x
ii > H= G = > H
0 -1 y -y
0 1 x y
iii = G= G = = G
1 0 y x
That these matrices are indeed the correct representations for the
corresponding transformation can readily be seen by considering a general
G and choosing a, b, c and d so that the requisite
a c
transformation matrix =
b d
transformation of coordinates applies. For example, reflection in the y-axis
-x
maps the point (x, y) to the point (–x, y), or in matrix form = G = G = > H.
a b x
c d y y
Thus, as we require ax + by = –x, this can be obtained by having a = –1 and
b = 0. Similarly, as we require cx + dy = y, this is obtained by having c = 0
95
MathsWorks for Teachers
Matrices
Example 4.10
-1 0
What is the effect of the matrix > H corresponding to a reflection in
0 1
the y-axis on the graphs of the functions f and g?
Solution
-1 0 x -x x1
Under this transformation > H = G = > H = > H, and so x = –x1 and
0 1 y y y1
y = y1. Thus, in general, y = f(x) is transformed to y1 = f(–x1) or
y = f(–x). So y = x 2 is transformed to y = (–x)2 = x 2 and y = sin(x) is
transformed to y = sin(–x) = –sin(x). When the image of a function or
relation is the same as the original function under a transformation, the
transformation illustrates a symmetry of the function or relation (see
Leigh-Lancaster, 2006). In this case, for y = x 2, the symmetry exhibited
is reflection (mirror) symmetry in the vertical coordinate axis.
Example 4.11
1 0
What is the effect of the matrix > H corresponding to a reflection in
0 -1
the x-axis on the graphs of the functions f and g?
Solution
1 0 x x x1
Under this transformation > H = G = > H = > H, and so x = x1 and
0 -1 y -y y1
y = –y1. Thus, in general, y = f(x) is transformed to –y1 = f(x1) or
y = –f(x). So y = x 2 is transformed to y = –(x)2 and y = sin(x) is
transformed to y = –sin(x).
Example 4.12
Solution
0 1 x y x1
Under this transformation = G = G = = G = > H, and so x = y1 and
1 0 y x y1
y = x1. Thus, in general, y = f(x) is transformed to x1 = f(y1) or x = f(y).
So y = x 2 is transformed to x = y2 and y = sin(x) is transformed to
x = sin(y).
96
chapter 4
Transformations of the cartesian plane
The transformed functions are no longer functions but relations. In fact each
function and its transformed relation are, by definition, inverse relations. For
any one-to-one function h, the inverse relation is also a one-to-one function.
Such functions are useful in solving equations by hand and using technology,
since, for such a function, h, the equation h(x) = k will have the corresponding
solution x = h–1(k).
P
1
y = mx
Q
θ
θ A x
–2 –1 O 1 2
–1
Figure 4.14: Finding the image of A(1, 0) under reflection in the line y = mx, where 0 < m < 1
First we find the image of point A with coordinates (1, 0). The line through
A perpendicular to the line y = mx cuts the line y = mx at Q and passes
through the point P, where length of PQ = length of AQ. So P is the image of
A after reflection in the line y = mx, and OP is also of length 1 unit. Let θ be
the angle that the line makes with the positive x-axis, so tan (θ) = m. Then the
angle POA = 2θ and so the coordinates of P are (cos(2θ), sin(2θ)) by
definition. Next we find the image of point B with coordinates (0, 1), as shown
in Figure 4.15.
97
MathsWorks for Teachers
Matrices
B 1
θ y = mx
R
θ T x
–2 –1 O 1 2
θ
S
–1
Figure 4.15: Finding the image of B (0, 1) under reflection in the line y = mx, where 0 < m < 1
Example 4.13
The graph of the function y = f(x) is reflected in the line y = 2x. Find the
equation of the transformed function.
Solution
Since m = tan(θ) = 2, it follows that θ = arctan(2), sin(2θ) = 0.8 and
- 0.6 0.8 x x1
cos(2θ) = –0.6. Consider > H = G = > H.
0.8 0.6 y y 1
98
chapter 4
Transformations of the cartesian plane
1 P
θ θ
x
–1 O 1
Figure 4.16: Rotation of points (1, 0) and (0, 1) anticlockwise about the origin through an angle q 2 0
The point (1, 0) is rotated to point P with coordinates (cos(θ), sin(θ)) and
the point (0, 1) is rotated to the point Q with coordinates
(cos(90° + θ), sin(90° + θ)) = (–sin(θ), cos(θ)). Hence the matrix
corresponding to an anti-clockwise rotation about the origin through the
angle θ is:
cos ]qg - sin ]qg
= G
sin ]qg cos ]qg
This result can be used to establish the compound angle formulas used in
the previous section, by considering a rotation through an angle of θ1 + θ2 as
both a single rotation through an angle of θ1 + θ2 and as a composition of two
rotations, one through an angle of θ1 followed by another through an angle of
θ2. By definition, the two corresponding matrices must be equal, so the
corresponding elements give the required identities (see Leigh-Lancaster,
2006, pp. 66–8).
99
MathsWorks for Teachers
Matrices
Example 4.14
Solution
The matrix corresponding to an anticlockwise rotation of 60° about the
cos ]60cg - sin ]60cg
origin is = G.
sin ]60cg cos ]60cg
give y =- x c 11 + 11 m .
5 3 8
100
chapter 4
Transformations of the cartesian plane
C o m p o s i t i o n o f l i n e ar t ra n s f o r m a t i o n s
Example 4.15
Solution
-1 0
The matrix for the first transformation is > H, while the matrix for
0 1
R1 1 V
] g ] g S - W
the second transformation is = G= S 1
cos 45c - sin 45c 2 2
sin ]45cg cos ]45cg 1
W.
SS W
2 2 W
T X
To obtain the image of each point (x, y) in the plane under the given
R1 1 V
S 2 - 2 W -1 0 x
sequence of these two transformations we use S 1 1
W> H = G.
SS WW 0 1 y
2 2
T X
So the matrix acting upon each point is the product of the rotation
matrix and the reflection matrix:
R1 1 V R 1 1 V
S 2 - 2 W -1 0 S- 2 - 2 W
A=S 1 1
W> H= S 1 1
W
SS WW 0 1 SS- W
2 2 2 2 W
T X T X
101
MathsWorks for Teachers
Matrices
S t u d e n t ac t i v i t y 4 . 4
a Find the coordinates of the images of the vertices of the unit square under the
k 0
transformation with matrix > y H , and show that its area is given by kxky.
0 kx
b Show that the rule of the image of y = f(x) under the transformation with matrix
k 0 x
> y H , where k x ! 0 and k y ! 0 , is y = k x f e o .
0 kx ky
c Find the image of the relation x2 + y2 = 1 (the unit circle) under the transformation
k 0
with matrix > y H where kx = b and ky = a and a ! 0 and b ! 0 . Hence find the
0 kx
formula for the area of the ellipse with horizontal axis length 2a and vertical axis
length 2b.
d Find the transformation matrix for an anticlockwise rotation about the origin
through an angle θ followed by an anticlockwise rotation about the origin through
an angle φ. Use the transformation matrix for an anticlockwise rotation about the
origin through an angle θ + φ to show:
i sin(θ + φ) = sin(θ)cos(φ) + cos(θ)sin(φ)
ii cos(θ + φ) = cos(θ)cos(φ) – sin(θ)sin(φ)
102
chapter 4
Transformations of the cartesian plane
1 k
e The matrix = G represents a shear transformation in the x direction and the
0 1
1 0
matrix = G represents a shear transformation in the y direction. Find the image
k 1
1 3
of the unit square under the shear transformation with matrix = G and draw the
0 1
original and its image on the same graph.
f Find the image of the function y = f(x) under the shear transformation with matrix
1 3
= G . In particular, find the image of y = x2 under this transformation.
0 1
A f f i n e t ra n s f o r m a t i o n s
103
MathsWorks for Teachers
Matrices
x x
T f= Gp = I = G + = G. We can form the composite of these transformations to
0
y y q
translate a point p units in the positive direction of the x-axis and q units in
x x p
the positive direction of the y-axis, by T f= Gp = I = G + = G. In this last case,
y y q
x x p x1 x x1 p x1 - p
T f= Gp = I = G + = G = > H, so = G = > H - = G = > H and y = f(x) will be
y y q y1 y y1 q y1 - q
transformed to y - q = f^x - ph.
Example 4.16
Find the image of the point (2, 6), the line y = 3x and the parabola y = x 2
under a translation by three units in the x-direction and two units in the
y-direction.
Solution
x 1 0 x
T f= Gp = = G= G + = G
3
y 0 1 y 2
so T f= Gp = = G= G + = G = = G + = G = = G
2 1 0 2 3 2 3 5
6 0 1 6 2 6 2 8
Suppose (x1, y1) is the image of the point with coordinates (x, y).
x 1 0 x x1
Then T f= Gp = = G= G + = G = > H
3
y 0 1 y 2 y1
x x1 x1 - 3
Hence = G = > H - = G = > H
3
y y1 2 y1 - 2
Then x = x1 − 3 and y = y1 − 2, and the line y = 3x becomes the line
y1 − 2 = 3(x1 − 3) or y = 3x − 7.
For the case of the parabola y = x 2, we have y1 − 2 = (x1 − 3)2 or
y = (x − 3)2 + 2.
C o m p o s i t i o n o f a f f i n e t ra n s f o r m a t i o n s
If S and T are affine transformations, then so is the composite transformation
T °S defined by T °S(u) = T(S(u)). Here S is applied to u first, and T is then
applied to the result. Since u is a 2 × 1 coordinate (vector) matrix, S(u) is also
a 2 × 1 coordinate (vector) matrix by conformability of matrix multiplication
104
chapter 4
Transformations of the cartesian plane
and addition. The same argument then applies for the application of T to the
2 × 1 coordinate matrix S(u).
Example 4.17
a Find the image of the point u = (x, y) after a reflection in the y-axis
followed by a dilation from the x-axis by a factor of 2.
b Find the image of the point u = (x, y) after a dilation from the x-axis
by a factor of 2 followed by a reflection in the y-axis.
Solution
a Let S be a reflection in the y-axis, so S has the matrix representation
-1 0
> H; and let T be a dilation from the x-axis by a factor of 2, so T
0 1
has the matrix representation = G.
1 0
0 2
1 0 -1 0 x -x
Then T °S(u) =T(S(u)) = = G> H = G = > H, where T °S has the
0 2 0 1 y 2y
1 0 -1 0 -1 0
matrix representation = G> H=> H.
0 2 0 1 0 2
b In the reverse order of application we have
-1 0 1 0 x -x
S°T(u) = S(T(u)) = > H= G = G = > H, where S°T also has the
0 1 0 2 y 2y
-1 0 1 0 -1 0
matrix representation > H= G=> H.
0 1 0 2 0 2
Thus composition of these two transformations is commutative, that
is T °S = S°T.
Example 4.18
105
MathsWorks for Teachers
Matrices
Solution
x
The dilations are applied by multiplying an arbitrary position vector = G
y
by the matrix = G. The rotation is then applied by multiplying the
3 0
0 2 R V
S 1 - 1 W
2 2W
result by the matrix SS , and the translation is applied by
1 1 W
S 2 2 W
T X
addition of the matrix = G.
1
1
R V
S 1 - 1 W
x1 2W3 0 x
Hence, in combination, > H = SS = G= G + = G
2 1
y1 1 1 W 0 2 y 1
S 2 2 W
T X
Reversing the order (that is, applying the inverse transformations) we
obtain: R V
S 1 - 1 W
x1 2W3 0 x
> H - = G = SS = G= G
1 2
y1 1 1 1 W0 2 y
S 2 2 W
T RX V
S 1 - 1 W
x1 - 1 2W3 0 x
which gives > H = SS = G= G
2
y1 - 1 1 1 W0 2 y
S 2 2 W
R T V X R V
1 1 W- 1 S x1 2 + y1 2 - 2 W
-1 S -
x 2 W x1 - 1
and so = G = = G S H= S
3 0 S 2
>
S 6 6 3 W
y 0 2 1 1 W y1 - 1 y1 2 x1 2 W.
S 2 2 W S
S 4 - 4 WW
T X T X
Substituting these values for x and y into the equation for the unit
circle x 2 + y2 = 1 yields the relation:
13x 2 − 2x(5y + 8) + 13y2 − 16y = 56
The area of the original region is π square units.
106
chapter 4
Transformations of the cartesian plane
x
–6 –5 –4 –3 –2 –1 O 1 2 3 4 5 6
–1
–2
–3
S t u d e n t ac t i v i t y 4 . 5
107
MathsWorks for Teachers
Matrices
SUMM A R Y
108
chapter 4
Transformations of the cartesian plane
SUMM A R Y (Cont.)
109
MathsWorks for Teachers
Matrices
SUMM A R Y (Cont.)
References
Anton, H & Rorres, C 2005, Elementary linear algebra (applications version), 9th
edn, John Wiley and Sons, New York.
Cirrito, F (ed.) 1999, Mathematics higher level (core), 2nd edn, IBID Press, Victoria.
Evans, L 2006, Complex numbers and vectors, ACER Press, Camberwell.
Leigh-Lancaster, D 2006, Functional equations, ACER Press, Camberwell.
Nicholson, KW 2003, Linear algebra with applications, 4th edn, McGraw-Hill
Ryerson, Whitby, ON.
Sadler, AJ & Thorning, DWS 1996, Understanding pure mathematics, Oxford
University Press, Oxford.
Websites
http://wims.unice.fr/wims/en_tool~linear~matrix.html – WIMS
This website provides a matrix calculator.
http://en.wikipedia.org/wiki/Linear_transformation – Wikipedia
This site provides a comprehensive discussion on linear transformations with
links to other resources and references.
http://en.wikipedia.org/wiki/Affine_transformation – Wikipedia
This site provides a comprehensive discussion on affine transformations with
links to other resources and references.
http://www.ies.co.jp/math/java/misc/don_trans/don_trans.html
This website contains an applet that shows how the shape of a dog is transformed
by a 2 × 2 matrix.
http://merganser.math.gvsu.edu/david/linear/linear.html
This website contains an applet that allows you to move a slider adjusting
coefficients in a 2 × 2 matrix and see the effect of the equivalent transformation
on the unit square.
110
C ha p t e r 5
Tra n s i t i o n m a t ric e s
Conditional probability
One of the key ideas that students come across early in their study of
probabilities related to compound events for a given event space is the notion
of conditional probability and the associated ideas of dependent and
independent events. While students’ experience with subjective probability
makes them well acquainted with events that may or may not be dependent,
such as the likelihood of scoring on the second of two free shots in a basketball
game, given that one may or may not have scored on the first shot, their
school study of probability often begins with compound events that are
(physically) independent events. On the other hand, a combination of
experience and knowledge indicates that certain events are dependent, for
example, gender and colour blindness to red and green, or having a disease
and the likelihood of testing positive for the disease. Whether events in a
given context are independent or not, is not always clear. Thus, experiments
with tossing coins and rolling dice, which are physically independent, can lead
to an implicit willingness, or preference, to assume that two events from a
given event space are independent. One of the more intriguing and counter-
intuitive scenarios involving conditional probability is the game show problem
called the Monty Hall dilemma (see the UCSD website). This is a good context
for stimulating student interest, many people think the events involved are (or
should be) independent.
If we consider two events A and B from the same event space, U, then the
conditional probability of A given B, that is the probability that event A
occurs given that event B has occurred, written as Pr ^ A | Bh, corresponds to the
proportion of B events that are also A events, relative to the proportion of B
Pr ]A k Bg
events; that is, Pr ^ A | Bh =
Pr ]Bg
. Conditional probability can be used to
111
MathsWorks for Teachers
Matrices
Tra n s i t i o n p r o b a b i l i t i e s
Many natural systems undergo a process of change where at any time the
system can be in one of a finite number of distinct states. For example, the
weather in a city could be sunny, cloudy and fine, or rainy. Such a system
changes with time from one state to another, and at scheduled times, or stages,
the state of the system is observed. At any given stage, the state to which it
changes at the next stage cannot be determined with certainty, but the
probability that a given state will occur next can be specified by knowing the
current state of the system. That is, the probability that the system will be in
a given state next is conditional only on the current state. Such a process of
change is called a Markov chain or Markov process. The conditional
probabilities involved (that is, the probabilities of going to one state given that
the system was in a certain state) are called transition probabilities, and the
process can be modelled using matrices.
112
chapter 5
Transition matrices
A
1 B
A
0.25
B
0.75 B
Figure 5.1: Tree diagram representation for the first transition
Note that the columns of the table sum to 1. The store at which Jane shops
in a given week is not determined. The most we can expect to know is the
probability that she will shop at A or B in that week. Let s(1m) denote the
probability that she shops at A in week m, and s(2m) the probability that she
shops at B in week m.
113
MathsWorks for Teachers
Matrices
Example 5.1
Use the law of total probability to find the probability that Jane shops at
store A in week 1 and the probability that Jane shops at store B in
week 1 given that she shops at store A initially.
Solution
As she shops at A initially, s(10) = 1 and s(20) = 0. For the next week (using
the law of total probability):
s(11) = Pr ^shops at A in week 1 | shopped at A in week 0h
# Pr ^shopped at A in week 0h
+ Pr ^shops at A in week 1 | shopped at B in week 0h
# Pr (shopped at B in week 0)
= 0 # 1 + 0.25 # 0 = 0
s(21) = Pr ^shops at B in week 1 | shopped at A in week 0h
# Pr ^shopped at A in week 0h
+ Pr ^shops at B in week 1 | shopped at B in week 0h
# Pr (shopped at B in week 0)
= 1 # 1 + 0.75 # 0 = 1
114
chapter 5
Transition matrices
Hence, from S1 we can see that given that she shopped at A in week 0, the
probability that she shops at A in week 1 is 0 and the probability that she shops
at B in week 1 is 1. What happens two weeks after shopping at A initially?
If we use a tree diagram representation, as shown in Figure 5.2, we can
calculate the corresponding probabilities for the first two transitions.
Week 2
Initially Week 1 0 A
0 A 1
B
A A
1 0.25
B
0.75 B
Figure 5.2: Tree diagram representation for the first two transitions
115
MathsWorks for Teachers
Matrices
Example 5.2
Find the probabilities that Jane shops at A (i) 3, (ii) 4, (iii) 5, (iv) 10,
(v) 50 and (vi) 100 weeks later.
Solution
116
chapter 5
Transition matrices
If this is indeed the case, then we can say that the long-term probability of
Jane shopping at A is 0.2 and shopping at B is 0.8. That is, in the long term she
will shop at A 20% of the time and at B 80% of the time, provided that she
doesn’t change her pattern of behaviour in this regards.
If instead of shopping at A in week 0, we knew she shopped at B in
Example 5.3
Find the probabilities that Jane shops at A in (i) 1, (ii) 2, (iii) 3, (iv) 4,
(v) 5, (vi) 10, (vii) 50 and (viii) 100 weeks later, given that she initially
shopped at B.
Solution
S1 = PS0 = = G= G = = G, and so the probability that she
0 0.25 0 0.25
i
1 0.75 1 0.75
shops at A 1 week later is 0.25. This state vector is the same as that
after two transitions if she shops at A initially.
117
MathsWorks for Teachers
Matrices
Table 5.2: Summary of state and transition matrices for five transitions from either
initial state
= G = G = G
1 0 0.25 0 0.25
1 0.75 1 0.75
= G = G = G
2 0.25 0.1875 0.25 0.1875
0.75 0.8125 0.75 0.8125
= G = G = G
3 0.1875 0.203125 0.1875 0.203125
0.8125 0.796875 0.8125 0.796875
= G = G = G
4 0.203125 0.199219 0.203125 0.199219
0.796875 0.800781 0.796875 0.800781
= G = G = G
5 0.199219 0.200195 0.199219 0.200195
0.800781 0.799805 0.800781 0.799805
We can see from this table that the columns of Pn contain the state vectors
(after n transitions) after initially shopping at A or at B respectively. So the
(i, j)th element of Pn gives the probability of starting in state j and moving to
state i after n transitions.
118
chapter 5
Transition matrices
S t u d e n t ac t i v i t y 5 . 1
a Suppose initially we are unsure of where Jane shops, but it could be at either A or
0. 5
B with equal probability. Then S0 = = G . Find S1, S2, S5, S10, and S50.
0. 5
b Consider a Markov chain, with two states 1 and 2, with transition probability matrix
P, with pij = probability of going from state j to state i in one transition.
0.445 0.444
Suppose P3 = = G . What is the probability of
0.555 0.556
i going from state 1 to state 2 in three transitions?
ii going from state 2 to state 2 in three transitions?
T h e s t e a dy- s tat e v e c t o r
It appears that in the long term, after many transitions, the state vectors
converge to the same vector regardless of where Jane initially shopped. Such a
vector is called a steady-state vector. If this is the case, how can we find the
steady-state vector for this problem?
We can phrase this question more generally, and suppose that P is the
transition matrix of a Markov chain, and assume that the state vectors Sm
converge to a limiting state vector S. Then Sm is very close to S for sufficiently
large m, so Sm + 1 is also very close to S. Then the equation
Sm + 1 = PSm
is closely approximated by
S = PS
where S is a solution to this matrix equation. It is easily solved as it can be
written as a system of linear equations in matrix form
(I – P)S = O
where the entries of S are the unknowns and I is the identity matrix. This
homogeneous system has many solutions; the one we are most interested in is
the one whose entries sum to 1.
119
MathsWorks for Teachers
Matrices
Example 5.4
Solution
1 - 0.25
I-P == G-= G=>
1 0 0 0.25
H
0 1 1 0.75 - 1 0.25
1 - 0.25
Using Gaussian elimination, this reduces to > H, and so the
0 0
s1
solution for S = > H is s1 = 0.25 × s2 with s1 + s2 = 1, hence s1 = 0.2,
s2
s2 = 0.8.
It is a natural question to ask if we can always find a steady-state vector,
and whether the powers of the transition matrix converge to a matrix whose
columns are equal to the steady-state vector. To answer this we need to
introduce the notion of a regular transition probability matrix.
We say that a transition probability matrix P is regular if, for some positive
integer m, the matrix Pm has no zero entries. It can be shown that, if P is a
regular transition probability matrix, then it has a unique steady-state vector S
(see, for example, the Iowa State maths website). Further, the matrix defined
by L = lim Pm exists, and is given by L = 6S | S | f | S@, that is a matrix where
m"3
each column is a copy of the steady-state vector S, and if L = [lij] then lij is the
long-term probability of being in state i if the system began in state j.
It is straightforward to determine the steady state vector for a 2 × 2
transition probability matrix, given that it exists. To do this we use a general
= G.
a -b
0 0
x
Writing S = = G, we have ax = by. There are now two possible cases:
y
ax ax
Case 1: If b 2 0, then y = , and as x + y = 1, x + = 1, and so
b b
]a + bg
x b1 + l = 1 or x
a b a
= 1. Since b 2 0, x = and hence y = .
b b a+b a+b
120
chapter 5
Transition matrices
0 1
b Show that the transition probability matrix P = = G is not regular. Does the limit
1 0
matrix L = lim P m exist? Does it have a steady-state vector?
m"3
a 0
c For 0 1 a 1 1, show that the matrix P = = G is not regular. Find a
1-a 1
steady-state vector and lim P n if it exists.
n"3
A p p l ica t i o n s o f t ra n s i t i o n m a t ric e s
Examples such as the following, and others from various practical and
research contexts, or from the literature, can be used to help students develop
the formulation, solutions and interpretation skills associated with modelling
and problem solving that employs transition matrices and Markov chains.
Keys aspects of these processes are:
• consideration of features of the context that indicate a Markov process is
likely to provide a suitable model
121
MathsWorks for Teachers
Matrices
Example 5.5
OzBank offers customers two choices of credit card: Ordinary and Gold.
Currently 70% of its customers have an Ordinary card and 30% have a
Gold card. The bank wants to increase the percentage of its customers
with a Gold card, as it gets higher fees from these customers, and so
sends out an offer to all Ordinary cardholders offering a free upgrade to
a Gold card for twelve months. It expects that each month for the next
three months, 10% of its Ordinary cardholders will upgrade to a Gold
card, but 1% of Gold cardholders will downgrade to an Ordinary card.
What percentage of its customers would have Gold cards at the end of
the three months?
Solution
This information can be summarised as a table:
Current
Ordinary Gold
Ordinary 0.90 0.01
One month later
Gold 0.10 0.99
122
chapter 5
Transition matrices
Example 5.6
A wombat has its burrow beside a creek and each night it searches for
food on either the east or west side of the creek. The side on which it
searches for food each night depends only on the side on which it
searched the night before. If the wombat searches for food on the east
side one night, then the probability of the wombat searching on the east
side of the creek the next night is 0.2. The transition matrix for the
probabilities of the wombat searching for food on either side of the creek
given the side searched on the previous night is
= G
0.2 0.7
0.8 0.3
a If the wombat searches for food on the west side one night, what is the
probability that it searches for food on the west side the next night?
b If the wombat searches for food on the west side on the Monday
night, what is the probability it searches for food on the west side
again on the following Saturday (5 days later)?
c In the long term, what proportion of nights will it spend searching
for food on the west side?
123
MathsWorks for Teachers
Matrices
Solution
a We can view the transition matrix as below. It is clear that if it
searches on the west side one night then the probability that it
searches for food on the west side the next night is 0.3.
Side searched for food
current night
East West
124
chapter 5
Transition matrices
Example 5.7
A wombat has its burrow beside a creek and each night it searches for
food on either the other side of the creek or north or south of its burrow
on the same side of the creek. The area in which it searches for food each
night depends only on the area in which it searched for food the night
before. If the wombat searches for food on the other side of the creek one
night, then the probabilities of the wombat searching on the other side
of the creek, or north or south of its burrow the next night are 0.2, 0.4
and 0.4 respectively. If the wombat searches for food north of its burrow
one night, then the probability that it will search for food north of its
burrow the next night is 0.1. The transition matrix for the probabilities
of the wombat searching for food in each area given the area searched
for food on the previous night is
R V
S0.2 0.5 0.5 W
S0.4 0.1 0.3 W
SS W
0.4 0.4 0.2 W
T X
a If the wombat searches for food on the south side of its burrow one
night, what is the probability that he searches for food on the north
side the next night?
b If the wombat searches for food on the north side of its burrow on
the Monday night, what is the probability it searches for food on the
north side of its burrow again on the following Saturday (5 days
later)?
c In the long term, what proportion of nights will the wombat spend
searching for food north of its burrow?
125
MathsWorks for Teachers
Matrices
Solution
a We can view the transition matrix as below, and it is clear that if it
searches for food on the south side one night then the probability
that he searches for food on the north side the next night is 0.3.
Side searched for food
current night
Other North South
Other 0.2 0.5 0.5
Side searched
for food the North 0.4 0.1 0.3
next night
South 0.4 0.4 0.2
R V
7711
R V5 R V S 20000 W R V
S0.2 0.5 0.5 W S 0 W S 28101 W S 0.38555 W
b We need to find S0.4 0.1 0.3 W S 1 W = SS 100000 WW . S 0.28101 W, and so
SS W S W S W
0.4 0.4 0.2 W S 0 W S 1042 W S0.33344 W
T X T X SS 3125 WW T X
T X
the probability it searches for food north of its burrow on the
28101
following Saturday is 100000 = 0.28101.
JR1 0 0 V R0.2 0.5 0.5 VN Rx V R0 V
KS W S WOS W S W
c Solve K S0 1 0 W- S0.4 0.1 0.3 WOSy W = S 0 W for x, y and z, with
K SS0 0 1 WW SS0.4 0.4 0.2 WWOSS z WW SS 0 WW
LT X T XP T X T X
x + y + z = 1.
R V R V R V
S1 0 0 W S0.2 0.5 0.5 W S 0.8 - 0.5 - 0.5W
S0 1 0 W- S0.4 0.1 0.3 W = S- 0.4 0.9 - 0.3 W
SS W S W S W
0 0 1 W S0.4 0.4 0.2 W S- 0.4 - 0.4 0.8 W
T X T R X TV X
S1 0 - 15 W
S 13 W
S - 11 W
which reduces to S0 1 13 W by Gaussian elimination.
S0 0 0 W
T X
15z 11z
Hence we have x = 13 and y = 13 with x + y + z = 1. This
1 11 5
gives 3z = 1, so z = 3 , y = 39 and x = 13 . Thus in the long term
11
the wombat will spend 39 (≈ 0.28205) of nights searching for food
north of its burrow.
126
chapter 5
Transition matrices
Example 5.8
127
MathsWorks for Teachers
Matrices
Solution
a We need to calculate P2, and obtain the number in the (1, 3)
position.
R V
S3 1 1W
S8 4 8 W
1 1 1
P = SS 2 2 2 WW
2
S1 1 3W
SS 8 4 8 WW
T X
Hence one-eighth of the third generation offspring of the recessive
population has the dominant trait.
b From P2, we see that one-half of the third generation offspring of
the hybrid population has the hybrid trait, while one-quarter has
dominant and one-quarter has the recessive trait. So one-half is not
hybrid.
128
chapter 5
Transition matrices
R V
R V S1W
S0 W S4W
1
c Here S0 = S 1 W, and so PS0 = SS 2 WW and this is the population
SS WW
0 S1W
T X S4W
T X
distribution vector for the next period.
R V R V
S1W S1W
S3W S4W
1 1
d Here S0 = SS 3 WW, and so P2 S0 = SS 2 WW is the population vector for the
S1W S1W
SS 3 WW S4W
T X T X
third generation. This seems familiar.
e As P2 has no zero entries, P is regular. If we approximate a suitably
R V
S0.25 0.25 0.25W
high power of P, say P20, we get S 0.5 0.5 0.5 W. So we can guess
SS W R V
0.25 0.25 0.25W S0.25W
T X
that the steady-state population distribution should be S 0.5 W. This
SS W
should be checked. 0.25W
Begin with I - P: T X
R V R V
1 1 1 1
R V S2 4 0W S 2 - 4 0 W
S1 0 0 W S 1 1 1 W S 1 1 W
1W
S0 1 0 W - S W = S- -
SS W S2 2 2W S 2 2 2W
0 0 1W S 1 1 W S 1 1 W
T X S0 4 2W S 0 - 4 2 W
T X T X
and use technology to obtain the reduced row echelon form:
R V
S1 0 - 1 W
S0 1 - 2 W
S W
S0 0 0 W
R V T X
Sx W
Let S = Sy W. From the above, x = z and y = 2z. Then since
SS WW
z R V
T X S 0.25W
1
x + y + z = 1, z = 4 and so S = S 0.5 W.
SS W
0.25W
T X
At this stage it is useful to recall that a state in a Markov chain is called an
absorbing state if it is not possible to leave that state over the next time
period. If state i is an absorbing state, then in the ith column of the transition
129
MathsWorks for Teachers
Matrices
matrix, there will be a 1 in the ith row and zeroes everywhere else. When we
take powers of the transition matrix, the ith column will remain the same,
and so the transition matrix is not regular, and may not have a steady-state
vector.
Now, suppose that we always cross with recessives.
R V
S0 0 0 W
S 1 W
Then the transition matrix is P = S1 2 0 W. This has an absorbing state,
S 1 W
SS0 W
2 1W
the recessive state. Let us take some powers T X P, and see what happens.
of
R V
S 0 0 0W
S1 1 W
P2 = S 2 4 0 W
S1 3 W
SS 4 1 WW
2
T X
R V
S 0 0 0 W
3 S1 1 W
P = S 4 8 0W
S3 7 W
SS 4 8 1 WW
R T X V
S 0 0 0W
S 1 1 W
P10 = S 512 1024 0 W
S 511 1023 W
SS W
512 1024 1 W
T X
Continuing with higher powers, it would appear that there will be a
R V
S0 W
steady-state vector, equal to S 0 W. In the long term, we will end up with
recessives. SS WW
1
R V T X
0
S W
Check that S = S 0 W is a steady-state vector; that is, show PS = S.
SS WW
1
T X
Example 5.9
Humans have two sets of chromosomes, one obtained from each parent,
which determine their genetic makeup. In this example we investigate
the inbreeding problem.
130
chapter 5
Transition matrices
131
MathsWorks for Teachers
Matrices
132
chapter 5
Transition matrices
S t u d e n t ac t i v i t y 5 . 3
a For Example 5.8, suppose that we always cross with dominants. Determine the
transition matrix P, calculate P10 and P20, find lim P n and the steady-state vector if
n"3
it exists.
b For Example 5.9, find the long-term distribution of the population if the initial state
vector is
R V R V R V
S 0.1W S 0 W S 0 W
S 0.2 W S 0. 1 W S0.25 W
S 0.2 W S 0. 3 W S W
i S W ii S W iii S0.35 W
S 0.2 W S 0.4 W S0.15 W
S 0.2 W S 0.2 W S0.25 W
S W S W S W
S 0.1W S 0 W S 0 W
T X T X T X
c There are three states in a country, called A, B and C. Each year 10% of the
residents of state A move to state B and 30% to state C, 20% of the residents of
State B move to state A and 20% to state C, and 5% of the residents of state C
move to state A and 15% to state B. Suppose initially the population is equally
divided between the three states.
i Find the percentage of the population in the states after 3 years.
ii Find the percentage of the population in the three states after a long period
of time.
133
MathsWorks for Teachers
Matrices
SUMM A R Y
References
Anton, H & Rorres, C 2005, Elementary linear algebra (applications version), 9th
edn, John Wiley and Sons, New York.
Nicholson, KW 2001, Elementary linear algebra, 1st edn, McGraw-Hill Ryerson,
Whitby, ON.
134
chapter 5
Transition matrices
Websites
http://math.ucsd.edu/~crypto/Monty/monty.html
This website simulates the Monty Hall dilemma.
http://orion.math.iastate.edu/msm/AthertonRMSMSS05.pdf#search=%22proof%20
that%20regular%20transition%20matrices%20converge%20to%20a%20steady
%20state%22 – Iowa State Department of Mathematics
This pdf discusses Markov chains.
http://math.rice.edu/~pcmi/mathlinks/montyurl.html
This website has links to many other sites that discuss or simulate the Monty
Hall dilemma.
135
C ha p t e r 6
C u rric u l u m c o n n e c t i o n s
136
chapter 6
Curriculum connections
Type 1
Courses designed to consolidate and develop the foundation and numeracy
skills of students with respect to the practical application of mathematics in
other areas of study. These often have a thematic basis for course
implementation.
Type 2
Courses designed to provide a general mathematical background for students
proceeding to employment or further study with a numerical emphasis, and
likely to draw strongly on data analysis and discrete mathematics. Such
courses typically do not contain any calculus material, or only basic calculus
material, related to the application of average and instantaneous rates of
change. They may include, for example, business-related mathematics, linear
programming, network theory, sequences, series and difference equations,
practical applications of matrices and the like.
Type 3
Courses designed to provide a sound foundation in function, coordinate
geometry, algebra, calculus and possibly probability with an analytical
emphasis. These courses develop mathematical content to support further
studies in mathematics, the sciences and sometimes economics.
Type 4
Courses designed to provide an advanced or specialist background in
mathematics. These courses have a strong analytical emphasis and often
incorporate a focus on mathematical proof. They typically include complex
numbers, vectors, theoretical applications of matrices (for example
transformations of the plane), higher level calculus (integration techniques,
differential equations), kinematics and dynamics. In many cases Type 4
courses assume that students have previous or concurrent enrolment in a
Type 3 course, or subsume them.
Table 6.1 provides a mapping in terms of curriculum connections between
the chapters of this book, the four types of course identified above, and the
courses currently (2006) offered in various Australian states and territories.
137
MathsWorks for Teachers
Matrices
As this book is a teacher resource, these connections are with respect to the
usefulness of material from the chapters in terms of mathematical background
of relevance, rather than direct mapping to curriculum content or syllabuses
in a particular state or territory.
Table 6.1: Curriculum connections for senior secondary final year mathematics
courses in Australia
138
chapter 6
Curriculum connections
Table 6.2: Curriculum connections for senior secondary final year mathematics
courses in some jurisdictions around the world
Content from the chapters of the book may be mapped explicitly to topics
within particular courses, and teachers will perhaps find it useful to
informally make these more specific connections in terms of their intended
implementation of a given course of interest to them.
References
The following are the website addresses of Australian state and territory curriculum
and assessment authorities, boards and councils. These include various teacher
reference and support materials for curriculum and assessment.
The Senior Secondary Assessment Board of South Australia (SSABSA)
http://www.ssabsa.sa.edu.au/
The Victorian Curriculum and Assessment Authority (VCAA)
http://www.vcaa.vic.edu.au/
The Tasmanian Qualifications Authority (TQA)
http://www.tqa.tas.gov.au/
The Queensland Studies Authority (QSA)
http://www.qsa.qld.edu.au/
The Board of Studies New South Wales (BOS)
http://www.boardofstudies.nsw.edu.au/
The Australian Capital Territory Board of Senior Secondary Studies (ACTBSSS)
http://www.decs.act.gov.au/bsss/welcome.htm
139
MathsWorks for Teachers
Matrices
140
C ha p t e r 7
Solution notes to student
ac t i v i t i e s
A+B == G
3 2
iii
1 6
AB = = G
8 11
iv
2 9
141
MathsWorks for Teachers
Matrices
3 4
v BA = > H
- 2 14
AC = = G
26 34 42
vi
14 16 18
b Suppose MP = kP. Then we have two simultaneous equations
4p + q = kp
8p + 6q = kq
and collecting terms in p and q, we have
]4 - kgp + q = 0 (i)
8p + ]6 - kgq = 0 (ii)
Multiply equation (i) by 6 - k and subtract equation (ii) from it:
]6 - kg ]4 - kgp - 8p = 0
That is, ^16 - 10k + k2h p = 0. Since p ≠ 0, 16 - 10k + k2 = 0, so k = 2 or 8.
c A2 = = G = 2 = G, A 3 = = G, A 4 = = G, A 5 = = G
2 2 1 1 4 4 8 8 16 16
2 2 1 1 4 4 8 8 16 16
Hence An = 2n - 1 = G = 2n - 1 A.
1 1
1 1
d Suppose A and O are of size n × n. Consider AO. The element in ith row
n n
and jth column of this product will be / aik okj = / aik 0 = 0, since
k=1 k=1
1 1
Take A = = G, B = >
2 1
H, then AB = O, but neither A = O nor B = O.
6 3 -2 -2
0 -1 0 -1 -1 0
H =-= G =- I
1 0
e J2 = > H#> H=>
1 0 1 0 0 -1 0 1
-1 0 -1 0
H== G= I
1 0
J 4 = J2 # J2 = > H#>
0 -1 0 -1 0 1
f (X − Y)(X + Y) = X2 − YX + XY − Y2. Since generally YX ≠ XY for
matrices, then (X − Y)(X + Y) ≠ X2 − Y2.
Take X = = G and Y = = G.
4 1 3 0
2 2 0 1
142
chapter 7
Solution notes to student activities
Take X = = G and Y = = G.
4 0 3 0
0 2 0 1
143
MathsWorks for Teachers
Matrices
= G= G== G
5 3 19 22 0 8 0 8 12 19 98 125 60 55 69 43 75 95
3 2 1 5 20 5 23 1 5 0 59 76 40 34 46 26 46 57
and so the message sent would be 98, 59, 125, 76, 60, 40, 55, 34, 69, 46, 43,
26, 75, 46, 95, 57.
b To find the original message, we must first find the inverse of the coding
matrix:
5 3 -1 2 -3
M- 1 = = G => H
3 2 -3 5
Arrange the received message in column vectors of length 2, and put them
together into one matrix:
= G
65 75 138 90 85 80 160 123
42 50 87 54 54 49 99 76
Now multiply this on the left by M−1 to recover the original message as a
matrix:
2 - 3 65 75 138 90 85 80 160 123
H= G== G
4 0 15 18 8 13 23 18
>
- 3 5 42 50 87 54 54 49 99 76 15 25 21 0 15 5 15 11
So the original message is represented by the numbers
4, 15, 0, 25, 15, 21, 18, 0, 8, 15, 13, 5, 23, 15, 18, 11
Checking the coding listed previously, we see that the message reads DO
YOUR HOMEWORK.
144
chapter 7
Solution notes to student activities
145
MathsWorks for Teachers
Matrices
146
chapter 7
Solution notes to student activities
two leading 1s) x and y, and one free variable z. So let z = k, an arbitrary
real number. Then the solution set is 'b 2 , 4, kl: k ! R1.
k-3
147
MathsWorks for Teachers
Matrices
Next, the new row 4 will be the old row 4 with row 3 subtracted from it.
R V
S1 1 0 0 a W
S0 1 1 0 b W
S W
S0 0 1 1 c W
S0 0 0 0 d - a + b - c W
T X
Now the last equation is 0v + 0x + 0y + 0z = d − a + b − c.
Generally, if a + c ≠ b + d, there will be no solution.
In this example, (v + x) + (y + z) = a + c, and (x + y) + (z + v) = b + d, so
a + c = b + d = sum of all players scores in the tournament, and this means
R V
S1 1 0 0 a W
S0 1 1 0 b W
that the echelon form of the augmented matrix is S W, which is
S0 0 1 1 c W
S0 0 0 0 0 W
R T X V
S1 0 0 1 a - b + c W
S0 1 0 - 1 b - c W
equivalent to the reduced row echelon matrix S W.
S0 0 1 1 c W
S0 0 0 0 0 W
T X
So there will be three basic variables v, x, and y, and one free variable z,
and hence there will be an infinite number of solutions and thus the scores
will not be able to be uniquely determined.
Writing z = k, k ∈ R, the first row of the matrix tells us that
v = −k + a − b + c, the second row that x = k + b − c, and the third row that
y = −k + c.
b Let f(x) = ax3 + bx 2 + cx + d be the equation of a cubic polynomial
function, with a, b, c and d the unknown coefficients. From Example 3.15
we have three equations:
a+b+c+d = 0
-a + b - c + d = 0
3a + 2b + c =- 4
Since f ′(x) = 3ax 2 + 2bx + c, and the slope at x = −1 is 12, so
3a − 2b + c = 12 is a fourth equation to add to the system of three above.
148
chapter 7
Solution notes to student activities
2
1
x
–4 –3 –2 –1 O 1 2 3
–1
–2
–3
c Let f(x) = ax4 + bx3 + cx 2 + dx + e be the rule for the family of quartic
polynomials.
Since they pass through (1, 2) and (−2, −1) and have slope 5 at x = 1, we
have the following equations (see Example 3.16 for details):
a+b+c+d+e = 2
16a - 8b + 4c - 2d + e =- 1
4a + 3b + 2c + d = 5
In addition, they must pass through the point (2, 0), so f(2) = 0 and
16a + 8b + 4c + 2d + e = 0.
149
MathsWorks for Teachers
Matrices
When t = 1, f]xg = b- 24 l x 4 + b 3 l x 3 + b 24 l x2 + b- 12 l x + 1
41 4 155 61
When t =- 8, f]xg = b 24 l x 4 + b- 6 l x 3 + b- 24 l x2 + b 12 l x - 8
13 19 7 155
The graphs of these are given in the figure on the following page.
150
chapter 7
Solution notes to student activities
20
15
10
5
x
–10 –8 –6 –4 –2 O 2 4 6 8 10
–5
–10
–15
–20
–25
–30
d Let f(x) = ax4 + bx3 + cx 2 + dx + e be the rule for the family of quartic
polynomials.
Since they pass through (1, 2), (−2, −1) and (2, 0), and have slope 5 at
x = 1, we have the following equations (see example above for details):
a + b + c + d + e =2
16a - 8b + 4c - 2d + e = - 1
4a + 3b + 2c + d =5
16a + 8b + 4c + 2d + e = 0
Now we are looking for the particular curve that passes through (−1, 5),
so f(−1) = 5, and this gives a fifth equation a − b + c − d + e = 5.
The augmented matrix for this system of 5 equations is
R V
S1 0 0 0 0 - 4 W
R V S 3W
S 1 1 1 1 1 2 W S0 1 0 0 0 7 W
S16 - 8 4 - 2 1 - 1W S 12 W
S W S 16 W
S 4 3 2 1 0 5 W, which has reduced echelon form S0 0 1 0 0 3 W.
S16 8 4 2 1 0 W S W
S W S0 0 0 1 0 - 25 W
S 1 -1 1 -1 1 5 W S 12 W
T X S 1W
S0 0 0 0 1 - 2 W
T X
Hence f]xg =- 3 x 4 + 12 x 3 + 3 x2 - 12 x - 2 , and its graph is given in
4 7 16 25 1
151
MathsWorks for Teachers
Matrices
8
6
4
2
x
–5 –4 –3 –2 –1 O 1 2 3 4
–2
–4
–6
–8
–10
) 3 and ) 3
3a - b =- 1 3c - d = 1
- 3a + b = 1 - 3c + d =- 1
But the two equations in each set are in fact a multiple of one another,
so we only have two distinct equations in 4 unknowns. If we choose any
values for b and d, then the values of a and c will be determined by these
equations.
1 4
Take b = 4 and d = −1. Then a = 1 and c = 0, giving > H as a matrix
0 -1
for T.
152
chapter 7
Solution notes to student activities
153
MathsWorks for Teachers
Matrices
x1 2 1 x
b > H = = G= G
y1 1 3 y
x 2 1 - 1 x1
= G== G > H
y 1 3 y1
3 -1
=> H >y H
5 5 x1
-1 2
5 5 1
x = 53 x1 - 15 y1
-1 2
y= 5 x1 + 5 y1
y = 5 - 3x becomes
- 15 x1 + 25 y1 = 5 - 3` 53 x1 - 15 y1j
which simplifies to y1 = 8x1 - 25 or y = 8x - 25
x1 1 1 x
c > H = = G = G
y1 1 1 y
x1 = x + y
y1 = x + y
` Since x + y ! constant, x1 = y1 so y = x is image of y = 5 - 3x.
d If the line maps to a point, then from c x1 = y1 = k, where k is a real
constant. So lines of form x + y = k are mapped to the point (k, k) under
this transformation.
e = G= G = = G
2 1 0 0
1 3 0 0
= G= G = = G
2 1 0 1
1 3 1 3
= G= G = = G
2 1 1 3
1 3 1 4
= G= G = = G
2 1 1 2
1 3 0 1
So the transformation maps the unit square to the parallelogram with
vertices (0, 0), (1, 3), (3, 4) and (2, 1), and the area of the parallelogram is
det = G × area of unit square = 5 square units.
2 1
1 3
154
chapter 7
Solution notes to student activities
3 5 x x1
b = G= G = > H
1 2 y y 1
x 2 - 5 x1 2x1 - 5y1
= G=> H> H = > H
y - 1 3 y1 - x1 + 3y1
Hence the image of y = x 2 is −x1 + 3y1 = (2x1 − 5y1)2, or
4x 2 − 20xy + 25y2 + x − 3y = 0.
1 1 x x1
c = G= G = > H
1 1 y y1
So x + y = x1 = y1, and so the image of y = x 2 is y = x.
Hence the area of the rectangle formed by the transformed vertices is kxky.
ky 0 x x1
b > H = G = >y H
0 kx y 1
x1 y y x
So x = and y = 1 , and so y = f(x) is transformed to 1 = f e 1 o, that is,
ky kx kx ky
x
y = kx f e
ky o
.
155
MathsWorks for Teachers
Matrices
x y
c From part b, x = a1 and y = 1 , and so x 2 + y2 = 1 is transformed to
b
a b a b
with horizontal axis length 2a and vertical axis length 2b.
G = πab.
a 0
Hence area of ellipse = area of unit circle × det =
0 b
d The matrix for an anticlockwise rotation about the origin through an angle
cos ]qg - sin ]qg
of θ is = G.
sin ]qg cos ]qg
The matrix for an anticlockwise rotation about the origin through an
cos ^fh - sin ^fh
angle of φ is > H, and the matrix for an anticlockwise rotation
sin ^fh cos ^fh
cos ^q + fh - sin ^q + fh
about the origin through an angle of θ + φ is > H.
sin ^q + fh cos ^q + fh
Then, since a rotation through an angle of θ followed by a rotation
through an angle φ is equivalent to a rotation through an angle θ + φ:
cos ]qg - sin ]qg cos ^fh - sin ^fh cos ^q + fh - sin ^q + fh
= G> H=> H
sin ]qg cos ]qg sin ^fh cos ^fh sin ^q + fh cos ^q + fh
156
chapter 7
Solution notes to student activities
x
–1 O 1 2 3 4
–1
1 3 x x1
f = G= G = > H
0 1 y y1
x 1 3 - 1 x1 1 - 3 x1 x1 - 3y1
so = G = = G > H=> H> H = > H.
y 0 1 y1 0 1 y1 y1
Hence the function y = f(x) is transformed to y = f(x − 3y) under this
transformation.
The image of y = x 2 will be y = (x − 3y)2.
157
MathsWorks for Teachers
Matrices
1 0 2
b > H gives reflection in the x-axis, > H gives a translation of 2 units to
0 -1 -1
the right and 1 unit down.
x1 1 0 x 2
Then > H = > H= G + > H
y1 0 -1 y -1
x1 2 1 0 x
and > H - > H = > H= G
y1 -1 0 -1 y
x1 - 2 1 0 x
that is, > H=> H = G, and so
y1 + 1 0 -1 y
1 0 - 1 x1 - 2 1 0 x1 - 2 x1 - 2
H
x
= G=> H > H=> H> H=>
y 0 -1 y 1 + 1 0 -1 1 y + 1 - ^y1 + 1h
So x − y2 = 0 is transformed to (x − 2) − (y + 1)2 = 0.
158
chapter 7
Solution notes to student activities
a S1 = PS0 = = G= G = = G
0 0.25 0.5 0.125
1 0.75 0.5 0.875
0 0.25 2 0.5
S2 = P2 S0 = = G = G== G
0.21875
1 0.75 0.5 0.78125
0 0.25 5 0.5
S5 = P5 S0 = = G = G.= G
0.19971
1 0.75 0.5 0.80029
0 0.25 10 0.5
S10 = P10 S0 = = G = G.= G
0.20000
1 0.75 0.5 0.80000
0 0.25 50 0.5
S 50 = P 50 S0 = = G = G.= G
0.20000
1 0.75 0.5 0.80000
b P 3 = = G
0.445 0.444
0.555 0.556
Hence the probability of going from state 1 to state 2 in 3 transitions is
the element in (2, 1) position of P3, which is 0.555.
Similarly, the probability of going from state 2 to state 2 in 3 transitions
is the element in (2, 2) position of P3, which is 0.556.
b P = = G = P3 = P5 = f
0 1
1 0
P2 = = G = I = P 4 = P6 = f
1 0
0 1
P is not regular. The limit matrix L does not exist, as powers of P
oscillate between = G and = G. There is no steady-state vector; however,
0 1 1 0
1 0 0 1
if S = = G, then PS = S.
0.5
0.5
c It is not regular since any power of P will have a zero in (1, 2) position. (In
an
H.)The steady-state vector is S = = G and lim Pn = = G.
0 0 0
fact, Pn = >
0
n
1-a 1 1 n"3 1 1
159
MathsWorks for Teachers
Matrices
S0 0 0 W
T R V X
S1 1 1 W
lim Pn = S0 0 0 W
n"3 SS W
0 0 0W
T X R V
S1 W
Steady-state vector S = S 0 W
SS WW
0
J R V T X N
S 1 W
K 1 2 0 R V R V O
K S WS 1 W S 1 W O
K Check: PS = S0 1 1 WS 0 W = S 0 W = SO
K S 2 WS W S W O
K S0 0 0 W S 0 W S 0 W O
L T X T X
R T X VR V P R V
S1 0.75 0.5 0.5 0.25 0 WS0.1 W S0.5W
S0 0 0 0 0 0 WS0.2 W S 0 W
S WS W S W
S 0 0 0 0 0 0 WS0.2 W S 0 W
b i =
S0 0 0 0 0 0 WS0.2 W S 0 W
S WS W S W
S0 0 0 0 0 0 WS0.2 W S 0 W
S0 0.25 0.5 0.5 0.75 1 WS0.1 W S0.5W
T X T X TR XV
R V R V S 19 W
S 1 0 . 75 0 . 5 0 . 5 0 . 25 0 WS 0 W S 40 W
S0 0 0 0 0 0 WS 0.1 W S 0 W
S WS W S W
0 0 0 0 0 0 WS 0.3 W S 0 W
ii S =
S0 0 0 0 0 0 WS0.4 W S 0 W
S WS W S W
S0 0 0 0 0 0 WS0.2 W S 0 W
S0 0.25 0.5 0.5 0.75 1 WS 0 W S 21 W
T X T X S 40 W
T X
160
chapter 7
Solution notes to student activities
R VR V R V
S1 0.75 0.5 0.5 0.25 0 WS 0 W S0.5W
S0 0 0 0 0 0 WS0.25W S 0 W
S WS W S W
iii S0 0 0 0 0 0 WS0.35W S 0 W
=
S0 0 0 0 0 0 WS0.15W S 0 W
S WS W S W
S0 0 0 0 0 0 WS0.25W S 0 W
S0 0.25 0.5 0.5 0.75 1 WS 0 W S0.5W
T XT X T X
c
Current state
A B C
A 0.6 0.2 0.05
State next year B 0.1 0.6 0.15
C 0.3 0.2 0.8
R V
S0.6 0.2 0.05W
Then the transition matrix is P = S0.1 0.6 0.15W, and the initial state
R V SS W
S1W 0.3 0.2 0.8 W
S3W T X
S 1W
vector is S0 = S 3 W, and so the distribution after 3 years is
S1W
SS 3 WW
T X R V
R V3 S 13 W R V
S0.6 0.2 0.05W S 1 W S0.226 W
P 3 S0 = S0.1 0.6 0.15W S 3 W . S0.256 W
SS W S W
0.3 0.2 0.8 W SS 1 WW S0.518 W
T X S3W T X
T X
After 3 years approximately 22.6% of the population will live in
state A, 25.6% in state B and 51.8% in state C.
To find the long term population distribution, we can investigate powers
of P.
R V
S0.1961 0.1961 0.1961 W
P15 . S0.2549 0.2549 0.2549 W
SS W
0.5490 0.5490 0.5490 W
TR XV
S 0.1961 0.1961 0.1961 W
P20 . S0.2549 0.2549 0.2549 W
SS W
0.5490 0.5490 0.5490 W
T X
161
MathsWorks for Teachers
Matrices
162
R e f e r e n c e s a n d f u r t h e r
reading
163
MathsWorks for Teachers
Matrices
164
Notes
Notes
Notes
Notes
Notes
Notes
MathsWorks for Teachers
MathsWorks for Teachers Series editor David Leigh-Lancaster
Matrices
Pam Norton
MATRICES
Matrices are used in many areas of mathematics, and have also have
applications
applications
in diverse
in
diverse
areas such
areas
as such
engineering,
as engineering,
computer computer
graphics,graphics,
image processing,
image processing,
physicalphysical
sciences,
sciences, biological
biological sciences and
sciences
socialand
sciences.
social Powerful
sciences. Powerful
calculators calculators
and computers
and computers
can now
can now
carry outcarry
complicated
out complicated
and difficult
and numeric
difficult numeric
and algebraic
and algebraic
computations
computations
involving
involving
matrix methods,
matrix and
methods,
such technology
and such technology
is a vital tool
is ain
vital
related
tool real-life,
in relatedproblem-
real-life,
Pam Norton
problem-solving
solving applications.
applications.
This book provides mathematics teachers with an elementary introduction to
matrix algebra and its uses in formulating and solving practical problems, solving
systems of linear equations, representing combinations of affine (including linear)
transformations of the plane and modelling finite state Markov chains. The basic
theory in each of these areas is explained and illustrated using a broad range of
examples. A feature of the book is the complementary use of technology, particularly
computer algebra systems, to do the calculations involving matrices required for
the applications. A selection of student activities with solutions and text and web
references are included throughout the book.
Series overview
MathsWorks for Teachers has been developed to provide a coherent and contemporary
framework for conceptualising and implementing aspects of middle and senior
mathematics curricula.
Matrices
Titles in the series are:
Functional Equations
David Leigh-Lancaster
Contemporary Calculus
Michael Evans
Pam Norton
Functional Contemporary Matrices Foundation Numeracy Data Analysis Complex Numbers
Matrices Equations
David Leigh-Lancaster
Calculus
Michael Evans
Pam Norton in Context
David Tout and Gary Motteram
Applications
Kay Lipson
and Vectors
Les Evans
Pam Norton MathsWorks for Teachers MathsWorks for Teachers MathsWorks for Teachers MathsWorks for Teachers MathsWorks for Teachers MathsWorks for Teachers
Foundation Numeracy
in Context
David Tout & Gary Motteram
ISBN 978-0-86431-508-3
Data Analysis
Applications
Kay Lipson
Complex Numbers
and Vectors
9 780864 315083 MathsWorks for Teachers
Les Evans