Calculation in Laboratory
Calculation in Laboratory
IN
LABORATORY SCIENCE
UPDATED FOR 2023
ALLAN DEACON
CALCULATIONS IN LABORATORY
SCIENCE
Editors:
Roy Sherwood BSc, MSc, DPhil
Consultant Clinical Scientist, King’s College Hospital, London
A catalogue record for the book is available from the British Library
© Copyright 2009. Association for Clinical Biochemistry, 130-132 Tooley St, London
SE1 2TU
All rights reserved. No part of this publication may be reproduced, stored in a retrieval
system or transmitted in any form or by any means electronic, mechanical, photocopying,
recording or otherwise, without written permission from the publisher.
Preface
“What one fool can do, another can” (Ancient Simian Proverb)
I have previously tried to address this problem in two ways. Firstly, by holding
tutorials: initially for local trainees in the Clinical Biochemistry Department at
King’s College Hospital and in later years on regional and national training courses
organised by the Association for Clinical Biochemistry. Secondly, by publishing
worked answers to past FRCPath examination calculation questions in the
ACB News. However, trainees often express the need for a comprehensive textbook
which not only presents worked examples, but brings together the relevant
mathematics and basic science. This book is an attempt to meet that need.
This book was originally intended for trainees in clinical biochemistry, particularly
those preparing for the FRCPath examination. However, I hope it will also prove
useful to trainees in other clinical sciences and to undergraduate and postgraduate
students in any of the life sciences. Each chapter takes a topic and explains the
relevant physical chemistry and/or physiology. I have tried to derive the various
formulae and mathematical procedures from first principles whenever possible in
the belief that an understanding of their basis leads not only to their correct
application, but helps the reader develop these methods to new problems which
he/she may encounter in the future. Questions are included in the text, which I
hope the reader will attempt before looking at the worked answer. At the end of
each chapter there is a set of additional questions, the answers to which are
provided in the Appendix.
Allan Deacon
March 2009
Contents
Chapter Page
2. Laboratory manipulations 13
4. Spectrophotometry 51
5. Renal function 77
6. Osmolality 105
9. Enzymology 157
Index 492
Important notice
Although ACB Venture Publications has made every effort to ensure the accuracy
of the information contained in this book, the responsibility for the patient is
ultimately that of the medical practitioner ordering or performing/supervising the
investigations. All drugs and intravenous fluids must be prescribed by a registered
medical practitioner and administered by an individual authorised to do so. The
publishers, authors and editors do not assume any liability for any injury and/or
damage to persons or property arising from this publication.
UNITS AND THEIR MANIPULATION
Chapter 1
Failure to pay attention to units is probably the commonest cause of error when
performing even the simplest of calculations. In clinical biochemistry we often use
several units for the same property. For example concentration may be expressed as
mmol/L, mg/100 mL or g/L and volume may be expressed as L, mL or μL. Therefore
ability to manipulate units is an essential prerequisite for the successful completion of
most calculations.
Under this system, volume is defined as a cubic quantity of length, for example m3, cm3.
However, most laboratories (and scientific journals) have retained the old metric unit of
volume, the litre (L). 1L is equivalent to the SI volume of 1 dm3.
Where the molecular weight (MW) of the substance being measured is known, the SI unit
of quantity is the mole or multiple of a mole but the metric unit of volume, the litre, is
still used. For example, glucose concentration is expressed as mmol/L. One mole is the
formula weight of a substance measured in grams. Often the term molar is used instead of
mol/L and is abbreviated to M. A term in common use for the symbol “ / ” is “per” and
means the value obtained when one quantity is divided by another. Alternatively, the
divisor may be written to the power of “ -1”. For example 1 mol/L may also be written
1 mol L-1.
1
CHAPTER 1
Question: Q1(1)
Answer: Q1(1)
a) Since 180 g of glucose was dissolved in 1L of water the concentration is 180 g/L.
When the molecular weight is not known, or the analyte being measured is a mixture of
substances of differing molecular weights (such as plasma total protein) then mass units
may be used. e.g. plasma total protein is expressed as g/L.
When concentration is measured indirectly by some property which is not easily related
to a given mass of analyte (e.g. enzyme activity), or using a method calibrated against a
standard of unknown purity (e.g. some hormones) then arbitrary units are used. e.g.
creatine kinase activity is expressed as U/L. When an internationally accepted standard
preparation is used to calibrate the assay, results may be expressed as IU/L.
Units are often adapted by adding a prefix to simplify dealing with very small and very
large numbers. For example, volume may be expressed in litres, mL or μL and
concentration may be expressed as mmol/L or μmol/L, which may cause problems, for
example, when calculating glomerular filtration rate from creatinine concentrations
expressed as μmol/L in blood and mmol/L in urine, particularly when by convention the
result is required in mL/min. This practice has arisen in order to avoid numbers which
are clumsy. For example, 25 μL is far more convenient to handle than 0.000025 L. A list
of common prefixes is given in Figure 1.1.
2
UNITS AND THEIR MANIPULATION
centi- c 10-2 x cm
(1/100th of a metre)
micro- μ 10-6 x μL
(1/1,000,000 th of a litre)
nano- n 10-9 x nm
(1/1,000,000,000th of a metre)
pico- p 10-12 x pg
(1/1,000,000,000,000th
of a gram)
3
CHAPTER 1
The general term 10x means 10 multiplied by itself x number of times, so that the
expression y x 10x means y multiplied by 10, x times. Since our number system is based
on 10 all this means is moving the decimal place to the right x number of times. For
example 3.125 x 102 simply means that the decimal point is moved 2 places to the right
to become 312.5. The number 14 x 103 does not have a decimal point (or to be correct it
is implied that the decimal point is placed after the last digit, i.e. 14) so instead of moving
the decimal point 3 digits to the right 3 noughts are added to give 14000. Similarly,
for 6.25 x 104 the decimal point is moved two places to the right then two noughts added
to give 62500.
On the other hand, 10-x means 1 divided by 10x i.e. 1/10x (in a similar way that 1 mol L-1
means 1 mol/L). Therefore, instead of moving the decimal point one place to the right or
adding a nought for each increment in x, the opposite applies and the decimal point is
moved to the left; when the number of digits is exhausted, noughts are placed in front of
it. For example 6834 x 10-3 means 6.834, 24.52 x 10-2 means 0.2452 and 6.35 x 10-4
means 0.000635.
Significant figures
The way in which a numerical value is written makes a statement about the reliability or
accuracy of that value. For example, the concentration of an analyte determined by a
colourimetric assay written as 0.103562 mol/L implies a greater degree of accuracy (and
hence precision of its method of measurement) than if it were written 0.104 mol/L. This
result must have been calculated from an absorbance reading with only three (or possibly
four) digits, so it is misleading to quote the result to 6 significant figures. The reliability
of the measurement process permits the reporting of the result to only 3 significant
figures at the most.
As a general rule one should express a result with no more (or only a little more)
precision than the accuracy of the data from which it was calculated. When the result is
calculated from more than one piece of data then the accuracy of the least precise piece
of data should be used.
4
UNITS AND THEIR MANIPULATION
For example, consider a creatinine clearance calculated from plasma and urine creatinine
measurements and a 24 h urine volume of 1836 mL. Creatinine measurements are made
to only 3 figures and the reliability of the third must be doubtful. Although the volume is
known to 4 figures, the accuracy of timed urine collections is notoriously poor and the
volume is only likely to be accurate to 2 figures at the most.
The accuracy of the final result of a calculation can never be greater than that of the least
accurate measurement used in its calculation.
When the final result has been calculated and clearly has a greater number of significant
figures than is warranted, the unwanted digits can be removed either by rounding or
truncation. In rounding, the value of the retained digit is increased by 1 only if the
discarded digit(s) begin with 5, 6, 7, 8 or 9. In truncation the unwanted digits are simply
removed. For example the value 2.3478 can be rounded to 2.35 or truncated to 2.34.
Rounding is slightly more accurate than truncation and is preferred.
Use one more significant figure in the calculations than one expects to retain in the final
result.
The exception to this rule is calculations which involve small differences between nearly
equal numbers. For example, if a beaker weighs 20.4675 g empty but 20.5796 g after
addition of a sample then it would be wrong to subtract 20.5 from 20.6 since this would
give only one significant figure in the answer, whereas the data would clearly support
more.
5
CHAPTER 1
• For the operations of addition and subtraction, the dimensions and the units must
be the same and remain unchanged after the calculation.
• For the operations of multiplication and division, the dimensions are multiplied
and divided just as are the numbers, the result being the product or quotient of the
dimensions.
For example, to add together the two weights 0.952 g and 0.23 mg it is necessary to either
convert the first weight to mg then add it to the second, yielding a result in mg, or, to
convert the second weight to grams then add it to the first, in which case the answer will
be in g.
The calculation of molar absorptivity involves combining several units. The expression
for calculating molar absorptivity (ε) is:
ε = Absorbance
Concentration x path length
Where as absorbance does not have any units, concentration is in mol/L and the optical
path length is expressed in cm. Substituting these units into this expression gives:
ε = 1 = L
mol/L x cm mol x cm
Care needs to be taken when calculating ratios of concentrations. If the two analytes and
their units are identical, then the result does not have any units. For example, to calculate
the ratio of plasma to urinary calcium, when both are expressed in the same units, then
the units cancel:
6
UNITS AND THEIR MANIPULATION
If both concentrations are in mass units, then both concentrations can be converted to SI
units by multiplying the mass concentration by the atomic weight (40). Since both
operations are identical, the ratio will be the same and will not have any units:
If the concentrations are for different analytes then this is no longer true. The ratio of
their concentrations calculated from mass units will be different to that calculated from SI
units. For example, calculation of the calcium:creatinine ratio in a random urine in which
the concentration of calcium is 2.5 mmol/L and that of creatinine is 5.0 mmol/L gives:
Question Q1(2)
7
CHAPTER 1
Answer Q1(2)
The units of the two volumes are different (i.e. 45 mL and 2.65 L)
Before they can be added together one must be converted to the other so
that the units are the same. There are 1000 mL in a L, therefore if 45 mL
is divided by 1000 then it becomes 0.045L. (Alternatively 2.65 L could be
multiplied by 1000 to convert it to 2650 mL).
Both volume and concentration contain a volume term, but their units are
different i.e. mL and L. These need to be converted to the same units. If
the volume is converted from mL to L by dividing by 1000 then it
becomes 0.35 L So that:
To find out what units to use, carry out the calculation with units rather
than numbers:
Since the litres (the Ls) cancel, then the final units are g
so the amount of substance contained in 350 mL is 17.5 g.
8
UNITS AND THEIR MANIPULATION
This relationship still applies to units with different prefixes as long as the same prefix is
used on both sides of the equation. For example if the mass concentration is in mg/L then
in SI units the concentration will be in mmol/L and not mol/L; and if the mass
concentration is in mg/100 mL then the SI units will be mmol/100 mL. The above
relationship can be manipulated to carry out the reverse calculation, e.g. to convert
mmol/L to mg/L.
Chemists have, in the past, attempted to simplify calculations by introducing the concept
of equivalent weight. The equivalent weight of one substance reacts exactly with the
equivalent weight of another. For example in titrimetric analysis one mole of
hydrochloric acid neutralises one mole of sodium hydroxide, whereas one mole of
sulphuric acid (which yields two titratable hydrogens) will neutralize two moles of
sodium hydroxide. Therefore one mole of sulphuric acid is equivalent to two moles of
hydrochloric acid. These principles also apply to redox reactions involving metal ions.
Monovalent ions, such as sodium and potassium, are equivalent to one hydrogen ion so
that their equivalent weights are equal to their atomic weights. However, divalent ions ,
such as calcium and magnesium are equivalent to two hydrogen ions and their equivalent
weight is one half their atomic weights. In general:
In the case of univalent ions the units will be numerically the same e.g. 140 mmol/L of
sodium is the same as 140 mEq/L. For divalent ions 1 mol contains 2 Eq, e.g. 1 mmol/L
of magnesium is the same as 2 mEq/L. Results are no longer reported as mEq/L in the
UK but may be found in the literature. The term “normal,” often abbreviated as N, may
also be encountered which is the concentration in equivalents per litre e.g. 1 molar
(or M) saline (1 mol/L) is equivalent to 1 normal (or N) saline (1 Eq/L). This should not
be confused with “normal” or “physiological saline (9 g/L).
Question: Q1(3)
9
CHAPTER 1
Answer Q1(3)
a) mmol/L = mg/L
MW
b) g/L = mol/L x MW
And NaCl (g/100 mL) = g/L = 8.19 = 0.82 g/100 mL (2 sig figs)
10 10
10
UNITS AND THEIR MANIPULATION
It is common practice in the USA to express plasma urea concentration in terms of its
contribution to the nitrogen content of plasma, usually using mg% (i.e. mg/100 mL or
mg/dL) as units. The formula of urea is CO(NH2)2 so that each molecule contains
2 nitrogen atoms. Therefore one mole of urea contains one mole of molecular nitrogen
(N2) and the molecular weight of nitrogen (N2) is twice the atomic weight (14) i.e. 28.
Since:
and if BUN is to be expressed as mg/100 mL (mg%), then both sides are divided by 10:
The overall result is the same. The decision of whether to work in atoms or moles of
nitrogen often causes confusion. The two expressions for the inter-conversion of BUN
and urea concentrations are:
Question Q1 (4)
11
CHAPTER 1
Answer Q1 (4)
Further questions
1. Convert the following: a) 125 mg% to g/L; b) 0.25 mol/L to mmol/L; c) 0.236
nmol/L to μmol/L; d) 1.6 mg/L to ng/mL.
2. Convert the following concentrations to “SI” units: a) plasma glucose 120 mg%;
b) serum calcium 4.0 mEq/L; c) BUN 21 mg%; d) Serum creatinine 0.66 mg%.
6. If an acid dissociates in solution to give its conjugate base and hydrogen ions,
what are the units of its dissociation constant if urine contains 0.1 M of
undissociated acid, 25 x 10-5 mol/L of its conjugate base and 120 nmol/L of
hydrogen ions? NB the dissociation constant is the product of the concentrations
of conjugate base and hydrogen ions divided by the concentration of
undissociated acid.
12
LABORATORY MANIPULATIONS
Chapter 2
Laboratory manipulations
It is important that the units should be compatible. If the concentration is in g/L then the
volume should be expressed as litres and the calculated amount to be weighed out will be
in grams.
Question Q2(1)
13
CHAPTER 2
Answer Q2(1)
Since the weight is required in grams and the final volume in litres then the final
concentration is first converted from mg/dL to g/L.
Since there are 1000 mg in a gram, 150 mg is equivalent to 150 / 1000 = 0.15 g
Therefore weight required (g) = Concentration (1.5 g/L) x final vol (2 L) = 3.0 g
If the target concentration is given in SI units then the weight of the substance to be
weighed out must be calculated using both the molecular weight (MW) of the substance
and the final volume required. It is usually simplest to first convert the target
concentration from SI to mass units:
Again the prefix to the concentration terms (i.e. m, n or μ) must be the same for both the
mass and SI units. For example, if the SI concentration is in mmol/L then the mass
concentration will be in mg/L.
The atomic weights of sodium and chlorine are 23 and 35.5 respectively. Therefore the
molecular weight of sodium chloride (NaCl) is 23 + 35.5 = 58.5.
The amount of NaCl required to prepare 500 mL of solution will be half of this:
8190 / 2 = 4095 mg. Since most balances have scales in g rather than mg, division by
1000 (since there are 1000 mg in a g) gives the weight in g (4095/1000 = 4.095 g).
Sometimes the chemical required to prepare a solution may not be in exactly the same
form as that described in a method. For example, a method sheet for a manual glucose
14
LABORATORY MANIPULATIONS
Since one mole of glucose monohydrate (formula C6H12O6.H2O) contains one mole of
glucose, then the concentration of glucose monohydrate will also be 0.0278 mol/L.
Conversion of the glucose monohydrate concentration to mass units will give the weight
of glucose monohydrate to be weighed out.
Therefore 5.50 g of glucose monohydrate will need to be weighed out instead of 5.00 g of
glucose. Note that the same result can be obtained by multiplying the weight of glucose
by the ratio of the molecular weights of the hydrated to the anhydrous form:
Question 2(2)
15
CHAPTER 2
Answer Q2(2)
Na2 = 2 x 23 = 46
H = 1 = 1
P = 31 = 31
O4 = 4 x 16 = 64
142
For many chemicals used in the laboratory the percentage purity is significantly less than
100%. If a compound has a purity of x %, this means that each 100 g of the material
contains x g of the compound. It follows that each g will contain only x/100 g.
Therefore, to prepare a solution containing W g/L then the weight required is W x 100 /x
g/L. In general:
16
LABORATORY MANIPULATIONS
Again the weight units must be the same on both sides of the expression.
Some chemicals used to prepare reagents are themselves liquids. If the liquid is weighed
then the procedure is the same as for solids, providing allowance is made for any
departure from a purity of 100%. However, if the liquid is measured in volume then
allowance must be made for the fact that most liquid chemicals do not have a density of
one.
The units of density are weight/volume. If a liquid has a density of x g/mL, then this
means that each mL contains x g of the compound. In general:
Density = weight
volume
The term specific gravity (SG) is often used. This is the density expressed as a ratio to the
density of water (which is 1 g/mL). For most practical purposes SG and density can be
considered as being the same thing (although SG, being a ratio, does not have units). By
re-arranging the above equation, it is a simple matter to calculate the volume which
contains the target weight of a compound. Suppose we wished to prepare 1 L of an
ethanol standard solution, containing 800 mg/L of ethanol, from ethanol which has an
SG of 0.79. The units for density must be the same as for concentration, the density
(=SG) is in g/mL, the weight of ethanol per litre must also be in g (i.e. 800 mg / 1000 =
0.8 g /L).
Question Q2(3)
How many mL of hydrochloric acid (SG 1.16) are required to prepare 500 mL of 2.0
molar hydrochloric acid. The purity of the acid is 32 % w/w.
17
CHAPTER 2
Answer Q2(3)
Weight (g) of pure acid required to make 1 L 2.0 M HCl = 2.0 x 36.5 = 73 g
Since HCl has a purity of 32 %w/w, the weight of HCl (SG 1.16) required is
In the laboratory we often need to calculate the final concentration of a substance after a
given dilution, the volume of a stock solution which has to be diluted to give a target final
concentration or how much liquid to add to a set volume of a stock solution to give a
desired concentration. All of these problems are variations on a single theme and are
approached in the same way. First it is important to realise that the total amount of a
substance (whether expressed in mass or SI units) in a solution is the product of
concentration and volume (the volume in the concentration term must be in the same
units as the volume of solution).
For example:
2 L of 0.1 M sodium hydroxide (0.1 mol/L) will contain 2 x 0.1 = 0.2 mols
500 mL of 100 mM glucose (100 mmol/L) will contain 0.5 x 100 = 50 mmol
1.5 L of 20% sodium chloride (20% = 20 g/100 mL = 200 g/L) contains 1.5 x 200 =
300 g
18
LABORATORY MANIPULATIONS
If a finite amount of solution is diluted with solvent then the total amount of the solute in
the final solution will be the same. It is only the volume (which has become larger) and
the concentration (which has become lower) that have changed:
Since the amount of a solute is the product of concentration and volume, then the
following expression can be written:
All dilution problems can be solved using this equation. If any three of the terms are
known then this formula can be rearranged to obtain the remaining term. In the
laboratory there are four applications which are commonly used:
This equation can be rearranged if both sides are divided by the initial volume
(the initial volume terms on the left hand side cancel each other).
19
CHAPTER 2
In other words the result for the diluted sample is simply multiplied by the
dilution (2.0/0.1 = 20). If the dilution is prepared manually, it is often simpler to
add 0.1 mL of urine to 2.0 mL of water, giving a final volume of 2.1 mL. The
method of calculation is exactly the same, and the final dilution will be 21
(2.1/0.1) instead of 20.
20
LABORATORY MANIPULATIONS
The volume to be added can be calculated from the initial and final volumes:
Question Q2(4)
A working reagent for a phosphate assay is prepared by mixing 100 mL of stock reagent
with 900 mL of diluent. If only 360 mL of diluent is available, how much stock reagent
must be added to obtain the maximum volume of working reagent?
21
CHAPTER 2
Answer Q2(4)
In this situation, the initial concentration of the stock reagent can be regarded as 100%.
The final concentration in the diluted stock will be 10% (since 1 part of stock reagent is
mixed with 9 parts of diluent giving a working dilution of 1 in 10).
Taking the “volume stock” term outside of the brackets, re-arranging and solving:
22
LABORATORY MANIPULATIONS
For example, to set up a manual glucose method then a likely set of standards to cover the
range of clinical interest would be solutions containing 5, 10, 15, 20 and 25 mmol/L of
glucose. The first step is to prepare a stock glucose solution containing the highest
concentration of glucose required i.e. 25 mmol/L. The lowest concentration required is
5 mmol/L which is 1/5th the concentration of the stock standard. As the steps are equal
(5 mmol/L difference between each adjacent standard) then the second lowest
concentration would be 2/5th of the stock, the next 3/5th etc. If 1 mL of each standard is
required, then the lowest standard will contain 1/5 of 1 which is 0.2 mL of stock, the next
standard will require twice this (0.4 mL of stock) etc. To ensure that the total volume is
the same (1 mL) the volume of diluent (water) will decrease by 0.2 mL each time the
volume of stock increases by 0.2 mL:
The scale can be altered to either increase or decrease the volumes of standards prepared
– as long as the ratios of stock standard to diluent remain the same. Intermediate
concentrations can be introduced by extrapolation. For example to prepare a glucose
standard containing 7.5 mmol/L, 0.3 mL of stock would be mixed with 0.7 mL of diluent,
and to prepare a standard containing 2.5 mmol/L, 0.1 mL of stock would be mixed with
0.9 mL of diluent.
23
CHAPTER 2
For example, if a stock solution has a concentration of 200 mmol/L, then preparing serial
dilutions would give the following concentrations:
Tube number 1 2 3 4 5 6 7
Vol diluent (mL) 1.0 1.0 1.0 1.0 1.0 1.0 1.0
1 / final concentration (L/mmol) 0.01 0.02 0.04 0.08 0.16 0.32 0.64
The main drawback (apart from the concentrations not being evenly spaced) is that
awkward numbers are soon encountered. This need not be a problem if the data is
subsequently processed and plotted by computer. However, if the reciprocal of
concentration is to be plotted (e.g. to determine the Km of an enzyme) then the numbers
produced are much easier to handle. If the material is in short supply then preparing
doubling dilutions affords an economical and easy way to produce a wide range of
concentrations provided minimal volumes are used. Serological titrations always employ
doubling dilutions.
Further questions
4. A solution contains 5 % sucrose. How much of this solution would you dilute to
prepare 500 mL of 1 % sucrose?
24
LABORATORY MANIPULATIONS
6. Concentrated sulphuric acid (SG 1.84) is 96% by weight H2SO4. Calculate the
volume of concentrated acid required to prepare 1 L of 0.1M H2SO4.
8. If you have available 650 mL of 95 % ethanol, how much water would you add to
obtain the maximum volume of 65 % ethanol?
10. Solution A contains 12.0 g of anhydrous sodium dihydrogen phosphate per litre.
What is the phosphate concentration expressed as mmol/L? What volume of
solution A needs to be diluted to 1 L to give a phosphate concentration of
4 mmol/L.
25
CHAPTER 2
26
ACID-BASE, pH AND BUFFERS
Chapter 3
AH H+ + B-
Acid Base
The acid and base (which differ only by a proton) are said to form a conjugate pair;
every acid must have its conjugate base, and every base its conjugate acid e.g.
CH3COOH H+ + CH3COO-
acetic acid acetate ion
OH- + H+ H2O
hydroxide ion water
Strictly speaking, an alkali is any substance which can produce hydroxyl ions in
aqueous solution but the terms base and alkali are often interchanged.
Some species, such as the bicarbonate ion can act as both proton donors and acceptors
and therefore function as both an acid and a base:
HCO3- H+ + CO32-
bicarbonate ion carbonate ion
H+ + HCO3- H2CO3
carbonic acid
Water can dissociate to produce a proton and its conjugate base (hydroxyl ion):
H2O H+ + OH-
27
CHAPTER 3
When as acid is added to water this equilibrium is driven to the left and the
concentration of hydroxide ions decrease i.e. H+ >> OH-. When a base
(e.g. hydroxide) is added to water then this equilibrium is again driven to the left but
this time protons are removed i.e. OH- >> H+.
The fact that in aqueous solutions hydrogen ions are hydrated to form hydroxonium
ions (H3O+) and free hydrogen ions do not exist is usually ignored.
K = [H+][OH-] ………………………………………….Eq.3.1
[ H2O ]
where the brackets denote the concentrations of each species in mol/L. The
concentration of water can be regarded as constant (since its concentration is very
high and the molecule is only weakly ionized) and therefore the water concentration
term is incorporated into the ionization constant, which is then referred to as the ionic
product of water (Kw):
Since [H+] = [OH-] then the concentration of each of these ions must be 10-7 mol/L.
Therefore any solution in which [H+] >10-7 mol/L will be acidic with the degree of
acidity related to [H+]. Conversely, any solution with [H+] < 10-7 mol/L will be
alkaline.
The range of [H+] usually encountered by the chemist is very wide and quite often the
[H+] is very low e.g. the [H+] of blood is typically 0.000000040 mol/L. In order to
compress the scale and simplify the expression of low concentrations (and hopefully
make calculations simpler) Sorensen, in 1909, devised the logarithmic pH scale.
28
ACID-BASE, pH AND BUFFERS
Mathematicians have found that working with logarithms has advantages in some
situations. In clinical biochemistry we only need to use logarithms to two different
bases: 10 (known as common logarithms) and 2.718 (known as natural or Napierian
logarithms). Values for both types of logarithms are readily available on most pocket
calculators or can be obtained from tables of logarithms. For example the logarithm of
31.62 is 1.5, a result which is difficult to obtain by manual calculation. In this chapter
we will only use common logarithms (natural logarithms may be used in later
chapters).
4 2 2 4 = 22 = 2 x 2
8 2 3 8 = 23 = 2 x 2 x 2
16 2 4 16 = 24 = 2 x 2 x 2 x 2
9 3 2 9 = 32 = 3 x 3
16 4 2 16 = 42 = 4 x 4
25 5 2 25 = 52 = 5 x 5
125 5 3 125 = 53 = 5 x 5 x 5
The reverse of a logarithm is the antilogarithm which is the number which gave rise
to the logarithm in the first place. The practical importance of antilogarithms is that at
the end of a calculation we often end up with a result which is a logarithm; we then
need to determine the number which would give rise to this antilogarithm.
Fortunately antilogarithms are also available on most pocket calculators or can be
obtained from tables of antilogarithms (or by using tables of logarithms backwards).
The abbreviations “log” and “antilog” are respectively used for logarithm and
antilogarithm.
29
CHAPTER 3
A general notation relating a number (N) to its logarithm (x) to a base (b) can be used:
Log10N (x): -3 -2 -1 0 1 2 3
3
Log N (x)
0
0.0001 0.001 0.01 0.1 1 10 100 1000
-1
Number (N)
-2
-3
30
ACID-BASE, pH AND BUFFERS
Question: Q3(1)
31
CHAPTER 3
Answer to Q3(1)
a) Reverse the expression for pH so that the concentration term is on the left:
- log 10 [H+] = pH
N.B. 10-7 means “move the decimal point seven places to the left” whereas
109 means “move the decimal point nine places to the right”. The net result is
to move the decimal point 2 places (9-7 = 2) to the right so that 1.12 becomes
112.
Some calculators are cannot cope a large number of digits. Use can be made of the
following property of logarithms:
In Q3(1) to evaluate the log of 0.000000056 first write the number in exponential
form
evaluate each component separately then add to give the final result:
log10 (5.6 x 10-8) = log10 5.6 + log10 10-8 = 0.75 + (-8) = - 7.25
32
ACID-BASE, pH AND BUFFERS
pH 6.95 = 10-6.95
Since 1 mol = 1,000,000,000 nmol, then 1 mol/L can be written as 109 mol/L. If
concentration of hydrogen ions is expressed as nmol/L, then pH 6.95 can be written:
The antilog of 2.05 is 112 and so the hydrogen ion concentration is 112 nmol/L.
Similarly for part (b) where the concentration of hydrogen ions is halved to
56 nmol/L:
As
• Since the hydrogen ion concentration of pure water is 10-7 mol/L it follows that
neutral pH is 7.
• The small ‘p’ means ‘minus the logarithm to the base 10’ and should not be
confused with capital ‘P’ which denotes partial pressure e.g. Pco2.
33
CHAPTER 3
Weak acid: HA H+ + A-
Salt: MA M+ + A-
The high concentration of the common anion (A-) from the salt component suppresses
ionisation of the weak acid (HA). The pH of the buffer solution will therefore depend
on the relative amounts of weak acid and salt present. If a small amount of strong
acid (HX) is added then the concentration of hydrogen ion will increase. To maintain
constant Ka these hydrogen ions are “removed” by combination with the common
anion (A-) to form undissociated weak acid:
H+ + X- + A- HA + X-
Hence the effect of the addition of the strong acid has been buffered. Consequently
the strong acid anion (X-) replaces the common anion (A-) with only a small change in
the [A-]/[HA] ratio so that pH does not change significantly.
If some alkali (XOH) is added then the hydroxyl ions (OH-) react with, and therefore
lower, the hydrogen ion concentration (with the formation of water). In order to
maintain constant [H+][A-]/[HA] ratio more weak acid dissociates so that the H+
consumed by reaction with hydroxyl ion is replaced. This process can be considered
as the titration of hydroxyl ions with weak acid (HA) so that the common anion (A-)
replaces the added hydroxyl ions:
OH- + HA H2O + A-
34
ACID-BASE, pH AND BUFFERS
BUFFER CALCULATIONS
The expression for the dissociation constant of a weak acid (Eq.3.5) can be written
slightly differently:
Ka = [H+] x [A-]
[HA]
Note that the logarithms of two numbers multiplied together is the same as the sum of
their individual logarithms e.g. log (A x B) = log A + log B.
This equation can be rearranged by transferring log10Ka to the right hand side (which
then becomes negative) and log10 [H+] to the left hand side (which also becomes
negative):
-log10 [H+] is the same as pH and a term pKa can be defined as –log10 Ka, so that this
equation becomes:
This is known as the Henderson Hasselbalch equation and can be used to calculate
the amounts of salt and acid which must be used to prepare a buffer solution of a
desired pH. Important quantitative properties of buffers are summarised in fig 3.4.
Question: Q3(2)
Calculate the amount in grams of lactic acid which must be added to 3 g of sodium
hydroxide to give 1 litre of a solution with a pH of 4.5 (pKa of lactic acid is 3.86;
atomic weight of sodium is 23).
35
CHAPTER 3
Answer to Q3(2)
The following reaction occurs between lactic acid (LactH) and sodium hydroxide:
Both sodium lactate and lactic acid can dissociate to give lactate ions (Lact-):
LactH Lact- + H+
The relationship between the concentrations of lactate (a salt) and lactic acid (an acid)
is governed by the Henderson Hasselbalch equation:
In order to make use of this equation it is necessary to assume that the concentration
of Lact- is equal to that of sodium, i.e.
2. That the proportion of Lact- derived from lactic acid is negligible compared to
that derived from sodium lactate. This proportion will be the same as the
hydrogen ion concentration which can be calculated from the pH and is
0.0001 mol/L – clearly insignificant.
Next calculate the molar concentration of sodium hydroxide and use it in place of
[Lact-]:
Reaction of sodium hydroxide with lactic acid (LactH) yields approximately an equal
amount of lactate (Lact-) ions:
36
ACID-BASE, pH AND BUFFERS
Substitute Lact- = 0.075, pH = 4.5 and pKa = 3.86 into the Henderson
Hassebalch equation and solve for [LactH]:
4.365 = 0.075
[LactH]
The total lactate needed is the sum of the ionised and unionised acid:
Since all of this lactate must originate from the lactic acid added to the sodium
hydroxide, calculate the weight of lactic acid required to give a concentration of
0.0922 mol/L:
Question: Q3(3)
37
CHAPTER 3
Buffers are a mixture of a weak acid and its salt, the pH of which is defined by
the Henderson Hasselbalch equation:
Buffering capacity (i.e. the ability to resist a change in pH) is maximal when
pH = pKa. Outside the pH range pKa - 1 to pKa + 1 buffering action is
minimal since the further the [salt]/[acid] ratio is from 1 the greater the resulting
change in the logarithm of this ratio and hence pH when the same amount of
acid or alkali is added.
• Outside 2 pH units either side of the pKa the solution can be considered, for
practical purposes, to consist entirely of acid or salt.
• For a weak base (B) the pKa describes the dissociation of its conjugate acid:
BH+ B + H+
38
ACID-BASE, pH AND BUFFERS
Answer to Q3(3)
H2PO4- HPO42- + H+
dihydrogen hydrogen
phosphate ion phosphate ion
Phosphoric acid is a tribasic acid (with 3 pKa values) but at near neutral pH the
contributions from the other species (H3PO4 and PO43-) are negligible.
Substituting [salt] = [HPO42-], [acid] = [H2PO4-], pKa = 6.82 and pH = 7.0 into the
Henderson Hasselbalch equation gives:
Rearranging gives:
Taking antilogs:
This equation now contains two unknowns and so at first sight appears impossible to
solve. However, a further piece of information is given, namely, that the total
phosphate concentration must be 0.1 mol/L. There are only two forms of phosphate
to consider, therefore:
Rearranging to obtain an expression for one phosphate species in terms of the other (it
doesn’t matter which one) gives
which can be substituted into equation Eq. 3.7 to give an expression containing one
variable which can then be solved:
39
CHAPTER 3
The other unknown, [HPO42-], can be obtained by substituting [H2PO4-] = 0.040 into
equation Eq.38:
MW = 23 + (2 x 1) + 31 + (4 x 16) = 120
H2CO3 H+ + HCO3-
40
ACID-BASE, pH AND BUFFERS
These two reactions can be linked together (each with their own equilibrium
constants, K1 and K2):
The bicarbonate buffer system is central to acid-base homeostasis for two reasons:
• It is an open system. The two components feeding into the system (carbon
dioxide and bicarbonate ions) can be generated by the lungs and kidney
respectively to ensure a constant CO2/HCO3- ratio, and hence pH.
The equilibrium constants for these two reactions involving carbonic acid are given
by:
K1 = [H2CO3] and K2 = [H+] [HCO3-]
[H2O] [CO2] [H2CO3]
In practice it is not possible to measure the very low concentrations of carbonic acid
present in blood directly. Instead carbon dioxide is measured and so the above two
equations are combined so that the carbonic acid term is eliminated and the buffer
system can be described in terms of carbon dioxide and bicarbonate ion terms only:
Rearranging the first term gives an expression for carbonic acid concentration:
This expression can then be substituted for [H2CO3] in the expression for K2:
K2 = [H+] [HCO3-]
K1 [H2O] [CO2]
The concentration of water can be considered constant and combined with K1 and K2
to give a new constant, K1'
41
CHAPTER 3
In routine practice the CO2 content of blood is expressed as its partial pressure, Pco2.
Pco2 is the partial pressure of a gaseous phase which is in equilibrium with the
sample. This practice has arisen because calibrants are prepared by equilibrating
blood with gaseous mixtures with known CO2 content. Henry’s Law states that the
amount of gas physically dissolved in a solution is proportional to the partial
pressure of that gas. The constant of proportionality is the Bunsen solubility
coefficient, α:
[CO2] = α Pco2
The constant K1' can be combined with the solubility coefficient, α, to give a new
constant. If [H+] is expressed in nmol/L, [HCO3-] in mmol/L and Pco2 in kPa then
this constant is approximately 180:
Alternatively, if equation 3.10 is inverted and logarithms of both sides taken, then it
becomes:
By definition log10 1/ [H+] is the pH and log10 1/ K1' is the pK1' and if these values are
substituted into the above equation then the result is the familiar Henderson
Hasselbalch equation. If 6.1 is substituted for pK1' , and Pco2 is expressed in kPa,
then it becomes:
42
ACID-BASE, pH AND BUFFERS
Both equations Eq.3.11 and Eq.3.12 contain three variables. Change in one variable
must be accompanied by a change in at least one other. This is of importance for two
reasons:
In recent years there has been a move towards expressing the acid-base status of blood
in terms of concentration using nanomolar units. The normal pH (7.4) then becomes
40 nmol/L (antilog10 (-7.4) = 4.0 x 10-8 mol/L = 40 nmol/L) – quite an easy number
to manage. The advantages of using hydrogen ion concentration instead of pH are:
• The changes are linear. If pH is used then [H+] has to increase 10-fold before
the pH increases by a value of 1
• Unlike pH, [H+] is linearly related to both Pco2 and [HCO3-]. This makes it
easier to calculate an expected compensatory change and helps interpretation
of patients’ blood gas results.
There is little doubt that using concentration is simpler than using pH (otherwise the
three variables would have three very different types of units). Adoption of [CO2]
instead of Pco2 would further simplify matters since all components would then be
expressed as concentrations!
Question Q3(4)
The SHO in ITU carried out a blood gas analysis but failed to record all of the results
in the patient’s notes. The only available results are:
H+ concentration = 93 nmol/L
Actual bicarbonate = 21 mmol/L
Calculate the pH, Pco2 (in kPa) and carbon dioxide concentration (in mmol/L).
Assume that the solubility coefficient of CO2 (in kPa) is 0.225.
43
CHAPTER 3
Answer Q3(4)
Since there are 1,000,000,000 (i.e. 109) nmol in a mol, then 93 nmol/L can be written
as 93 x 10-9 mol/L.
0.93 = log 10 21
0.225 Pco2
antilog10 0.93 = 21
0.225 Pco2
44
ACID-BASE, pH AND BUFFERS
URINARY BUFFERS
The maximum hydrogen ion gradient from tubular lumen to blood which can be
generated by the kidney tubule is approximately 600:1. Since the hydrogen ion
concentration of normal blood is about 40 nmol/L (pH = 7.4) this means that the
maximum hydrogen ion concentration in urine will be 40 x 600 = 24000 nmol/L,
which corresponds to a pH of 4.62.
The human body normally produces approximately 70 mmol of hydrogen ions per day
of fixed acid (i.e. from sources other than carbon dioxide) which is excreted almost
exclusively in the urine. The vast majority of these hydrogen ions are buffered by
base in the urine, principally phosphate and ammonia:
The high pKa of ammonia means that at physiological pH and below the majority
exists as ammonium ions. For this reason it has been debated whether or not
excretion of urinary ammonium ions truly reflects excretion of acid. However, unlike
phosphate, where its availability in the tubular fluid is relatively independent of
hydrogen ion status, ammoniogenesis increases during acidosis and is therefore
closely linked to acid-base status.
A quantity called the titratable acidity can be obtained by titrating urine with alkali
back to the pH of blood (7.4). It therefore approximates to:
Whereas the total acid excretion includes hydrogen ions buffered with ammonia:
Question: Q3(5)
45
CHAPTER 3
Answer to Q3(5)
pH = - log10 [H+]
= - (- 3.16 x 10-6)
Since the urine volume is 1 L, then in 24 h approx 0.003 mmol of acid is excreted
as free hydrogen ions (H+).
The concentration of H2PO4- both before and after buffering the secreted
hydrogen ions can be calculated using the Henderson Hasselbalch equation since
the total phosphate concentration (50 mmol/L) is known.
At pH 7.40:
[HPO42-] = 50 - [H2PO4-]
46
ACID-BASE, pH AND BUFFERS
3.80 = 50 - [H2PO4-]
[H2PO4-]
4.80 [H2PO4-] = 50
At pH 5.5:
= 22.9 + 10
47
CHAPTER 3
FURTHER QUESTIONS
1. What is the pH of 0.5 per cent (w/v) hydrochloric acid (assume complete
dissociation, atomic weight Cl = 35.5)?
2. The reference range for blood pH is often quoted as 7.35-7.45. Express this
range in terms of nannomoles of hydrogen ion per litre.
3. If the pH of urine is 6.0 and of blood 7.40, what is the gradient of hydrogen
ion concentrations across the tubular cell walls?
8. A buffer solution (pH 4.74) contains acetic acid (0.1 mol/L) and sodium
acetate (0.1 mol/L) i.e. it is a 0.2M acetate buffer. Calculate the pH after
addition of 4 mL of 0.025 M hydrochloric acid to 10 mL of the buffer.
48
ACID-BASE, pH AND BUFFERS
49
CHAPTER 3
50
SPECTROPHOTOMETRY
Chapter 4
Spectrophotometry
Basic principles
A photometer is a device for measuring the amount of light transmitted through (or
absorbed by) a solution. The wavelength of light absorbed will depend on the chemical
structure of the compound present in the solution. A photometer therefore consists of a
light source which generates a beam of light which passes through a cell (cuvette)
containing the solution under analysis; the light which is transmitted through the solution
falls on photodetector and generates an electric current proportional to the intensity of the
light, which is then translated into a reading. Maximum sensitivity and specificity is
achieved if the beam of light reaching the sample cell is parallel and of a constant
wavelength (i.e. is monochromatic) and is of the wavelength which gives maximum
absorption (minimum transmission) of the light. To achieve this, instruments isolate a
portion of spectrum of white light generated from the light source (usually a bulb) by
placing a monochromator in the light path before reaching the sample cell. In simple
filter photometers a glass filter is used, whereas spectrophotometers use a prism or
diffraction grating. Further details can be found in the standard textbooks of analytical
chemistry.
Consider an incident light beam with intensity Io passing through a square cell containing
a solution of a compound which absorbs light at the wavelength being used. The
intensity of the light reaching the detector, I, will be less than Io. However it is the
fraction of light absorbed (or transmitted) which is related to the concentration of the
compound of interest. The fraction of incident light reaching the detector, I/Io, is known
as the transmittance (T). If expressed as a percentage then the term percentage
transmittance (%T) is used:
51
CHAPTER 4
definition of these terms). Other factors affect the value of Io when a reading of the
sample is made, including a small amount of incident light reflected by the surface of the
cell, absorbed by the material of the cell and by the solvent and/or components of the
reagent. Therefore instead of setting the instrument to 100 %T or zero absorbance with
the cell compartment empty (i.e. against air) it is customary to use a “blank” consisting of
a cell containing either solvent alone or reagent without the analytical sample added. In a
single beam spectrophotometer the blank is inserted into the cell compartment and the
instrument blank value set, then the sample inserted etc. This has the disadvantage that
error will be introduced if the instrument “drifts”. This difficulty is overcome in double
beam spectrophotometers in which the light source is split into two equal beams, one
passing through the blank or reference and the other through the sample position,
enabling the blank to be monitored continuously. Note that T and %T are ratios and so do
not have units.
Light is absorbed only when a photon collides with a molecule. It is not surprising
therefore that the chance of a photon of light colliding with a molecule in solution, and
hence the amount of light absorbed, will depend on the concentration of the compound in
solution and the path length or thickness of the cell. This simple concept gives rise to the
two absorption laws:
Bouger’s Law or Lambert’s Law: The fraction of light absorbed is proportional to the
thickness of the absorber.
If the concentration is increased by 1 g/L to 2 g/L (i.e. doubled) then a half of the light
which would have been transmitted by the solution containing 1 g/L, will be absorbed
with a result that only a quarter of Io will reach the detector Therefore of the incident light
intensity I0, one half is transmitted after passing through a cell containing 1 g/L of the
compound and a quarter after passing through a solution containing 2 g/L. If the
concentration is increased by a further 1 g/L to 3 g/L then only an eighth is transmitted
52
SPECTROPHOTOMETRY
Concentration (g/L) 0 1 2 3 4
and if it is increased to 4 g/L, one sixteenth is transmitted. Therefore, of the incident light
1/2 is absorbed by 1 g/L, 3/4 by 2 g/L, 7/8 by 3 g/L and 15/16 by 4 g/L. It is clear that
the relationship between concentration and transmittance is non-linear (Fig 4.1) and is in
fact a geometric progression in which subsequent increase in concentration by 1 g/L
decreases the transmitted light by a factor of 2. Taking logs of % transmittance (it
doesn’t matter to which base, but 10 is usually used) converts to a linear relationship with
53
CHAPTER 4
Note that absorbance is a logarithmic function and so does not have units. Absorbance is
some times called “optical density” and abbreviated “OD”.
A = log10 100
%T
Since log10 100 can also be written as : log10 100 - log10 T and since log10 100 = 2
T
it follows that: A = 2 - log10 %T ……… Eq.4.3
This is a very convenient way to inter-convert A and %T. For example, when the
concentration of the absorbing species is zero the %T is 100, the logarithm of 100 is 2 so
that the absorbance must be 2 – 2 which is zero. When %T is 50, the logarithm of 50 is
1.699 so that the absorbance is 2 – 1.699 which is equal to 0.301. A plot of absorbance
versus concentration is linear and passes through the origin (Fig 4.1). Similar reasoning
shows that absorbance is also linearly related to cell path length. Therefore both Beer’s
and Lambert’s Laws can be redefined as follows:
Question Q4(1)
54
SPECTROPHOTOMETRY
Answer Q4(1)
According to Lambert’s law, if the cell path length is halved by making the reading in a
0.5 cm cell, then the absorbance will be halved:
A = 2 - log10 %T
0.078 = 2 - log10 %T
If the initial concentration is doubled, then, according to Beer’s law, the absorbance will
also double:
A = 2 x 0.155 = 0.310
Convert to %T as above:
0.310 = 2 - log10 %T
55
CHAPTER 4
Where a is a proportionality constant known as the “absorptivity”. The units for a will
be the reciprocal of the units of b and c, and can be evaluated by substituting the units for
A, b and c into equation Eq 4.4 then rearranging it. For example, if b is in cm and c is in
mol/L (A of course, has no units), then the units of a will be L.mol-1,cm-1:
A = a x (mol/L) x (cm)
a = A = L = L/mol/cm OR L.mol-1.cm-1
(mol/L) x (cm) mol x cm
When concentration is in molar units, then a is termed the “molar absorptivity,” and the
symbol ε is used. Terms and units used in spectrophotometry are defined in Fig 4.2.
Question Q4(2)
56
SPECTROPHOTOMETRY
Answer Q4(2)
A = a bc
a = molar absorptivity = ?
c = molar concentration =
= concentration (mg/L)
1000 x Molecular weight
a = 0.268 (units: L x 1)
0.5 x 0.000 00685 mol cm
Note that the answer is rounded to 3 significant figures because the absorbance
measurement is only given to 3 decimal places.
57
CHAPTER 4
Various approaches can be used, but all are based on the simple relationship between
absorbance and concentration (Eq. 4.4):
A = abc
A = ac
If the absorptivity of the absorbing species (that is, the analyte or a chromogen formed by
the reaction of the analyte with a reagent) is known, then the concentration of the
unknown can be calculated from its absorbance reading. It is vital to allow for any
dilution involved and differences in concentration units.
Question 4(3):
0.1 mL of serum is mixed with 3.0 mL of a reagent which forms a coloured product with
glucose. After the reaction has reached equilibrium the absorbance (versus a reagent
blank) in a 1 cm cuvette was found to be 0.250. If the absorpivity of the chromogen is
933 L.mol-1cm-1 what is the serum glucose concentration expressed as mmol/L?
58
SPECTROPHOTOMETRY
Answer Q4(3):
c = A
a
c = 0.250 mol/L
933
Note that as the units of absorptivity of the chromogen are L.mol-1cm-1, the calculated
concentration of glucose is in mol/L NOT mmol/L. If the above result is multiplied by
1000 (since there are 1000 mmol in a mol) then the result will be in mmol/L. Since the
cuvette path length is also 1 cm then no correction need be made for differences in path
length.
The absorptivity relates to the concentration of glucose derived chromogen in the solution
of which the absorbance is being measured. However, 0.1 mL of serum was mixed with
3.0 mL reagent (i.e. diluted to 3.1 mL) before the absorbance reading was made.
Therefore to convert the calculated concentration of chromogen (c) to glucose
concentration in the undiluted sample, it is multiplied by 3.1 and divided by 0.1:
Note that the absorbance reading is multiplied by the reciprocal of absorptivity. Therefore
if a large number of analyses are to be carried out then it is relatively easy to program this
factor into the memory of a pocket calculator so that concentrations may be read off
directly upon entering absorbance readings. Alternatively, using the spreadsheet facility
of a PC, a table can be generated with absorbance values and their corresponding
concentrations. It is important to allow for any differences in units and any dilution of
the biological sample. If question 4(3) is taken as an example:
59
CHAPTER 4
c = A x 33.2 mmol/L
so that multiplication of the absorbance reading by 33.2 gives the serum glucose
concentration in mmol/L.
a = A1 and a = A2
c1 c2
Question Q4(4):
A method for the measurement of serum glucose involves adding 0.1 mL of sample
(serum, water or standard) to 3 mL of reagent, then after 10 min incubation at room
temperature, measuring the absorbance at 500 nm in a cuvette with a 1 cm path length
using an identical cuvette containing distilled water as reference. The readings using
serum, standard or water as sample were 0.302, 0.353 and 0.052 respectively. If the
concentration of glucose in the standard is 10 mmol/L, calculate the glucose
concentration in the serum.
60
SPECTROPHOTOMETRY
Answer Q4(4):
Since all the absorbances are measured against a cuvette containing water as reference,
the first step is to subtract the absorbance of the blank (in which 0.1 mL of distilled water
is used as the sample) so that in the absence of analyte the absorbance reading will be
zero:
Failure to correct each reading for the absorbance reading of the blank (0.052) would
result in an erroneous answer of 8.6 mmol/L (0.302 x 10 / 0.353).
Since the dilutions of the sample (serum or standard) with reagent are identical, the
dilutions cancel and do not need to be included in the calculation. However, any dilution
of the sample which is different to that of the standard must be taken into account.
If Beer’s Law is obeyed and cuvettes with a constant path length are used then the
equation relating absorbance to concentration (Eq. 4.4) is a linear function:
A = ac
In other words if, for a series of solutions, absorbance (A) is plotted on the vertical axis
and concentration (c) on the horizontal axis then a straight line is obtained which passes
61
CHAPTER 4
through the origin and has a slope (i.e. gradient) of a (Fig 4.3). Such a plot is called a
calibration curve. There are various ways in which a calibration curve can be used to
calculate the concentrations of analyte in an unknown sample:
c A A-A0 A4- A0
c0 A0 0
A3- A0
c1 A1 A1-A0
Absorbance (A)
A2- A0
c2 A2 A2-A0
A1 - A0
c3 A3 A3-A0
c4 A4 A4-A0 A0
c3 c4
-0.5 0.5 1.5
c2
2.5 3.5
0 c1
Question Q4(5)
A manual method for the determination of plasma glucose involves mixing 0.1 mL of
sample (water, glucose standard solution or plasma sample for the reagent blank, standard
and unknown sample respectively) with 3 mL of reagent, then after 10 min incubation at
room temperature, measuring the absorbance of the chromogen at 500 nm. The following
results were obtained:
Sample Absorbance
Before the absorbance measurements were made, the instrument was set to zero using an
identical 1 cm cuvette containing distilled water. Determine the concentrations of glucose
in plasma samples A and B.
62
SPECTROPHOTOMETRY
Answer Q4(5):
There are several approaches which can be used. The simplest is to plot the data with
absorbance (A) on the vertical axis glucose concentration (c) on the horizontal axis:
0.5
0.4
A (absorbance vs water)
Plasma B (diluted 1 in 4)
0.3
Plasma A
0.2
0.1
0
0 1 2 3 4 5 6 7 8 9 10 11 12
c (glucose concentration, mmol/L)
A straight line is then drawn through the points in such a way as to produce the best
visual fit to the data. It is then an easy matter to read off the values for the plasma
samples from this curve.
For plasma A (absorbance = 0.220) a horizontal line is drawn from the absorbance value
on the vertical scale until it intersects the calibration curve. From this point a vertical line
is drawn downwards until it intersects the horizontal axis. The reading at this point gives
the glucose concentration in the sample as 4.5 mmol/L.
63
CHAPTER 4
This procedure cannot be followed for the neat (undiluted) absorbance value for plasma B
(0.80) since this value is beyond the range of the calibration curve i.e. the absorbance
value of the highest standard is only 0.445. Using a ruler the calibration curve could be
extrapolated to this value but this would assume that the calibration is linear to this point.
This is unlikely to be true if either the linear range of the instrument is exceeded or the
capacity of the reagent is exhausted. The absorbance for plasma B which had been
previously diluted 1 in 4 is well within the working range of the assay (0.310) and
reading the plasma glucose concentration from the calibration curve using an identical
procedure as for plasma A gives a value of 7.5 mmol/L. Since the 0.1 mL of plasma B
sample had been diluted 1 in 4 prior to assay, this result is multiplied by 4 to give a final
glucose result of 30 mmol/L.
Another approach is to determine the equation which describes the line through the
standards (the calibration curve) then use it to arithmetically calculate the concentration
of glucose in the plasma sample. The first step is to draw a right angled triangle (ABC)
for which the calibration curve is the hypotenuse:
0.5 B
0.4
A (absorbance vs water)
0.3
0.360
0.2
0.1
A
0.085 C
0
0 1 2 3 4 5 6 7 8 9 10 11 12
c (glucose concentration, mmol/L)
The linear equation describing the calibration curve takes the form:
A = Intercept + ( c x slope)
64
SPECTROPHOTOMETRY
The “intercept” is the point at which the calibration curve crosses the vertical axis i.e. the
value of the absorbance (A) when glucose concentration (c) equals zero, and is 0.085.
The slope is the change in absorbance which is observed for a 1mmol/L increment in
glucose concentration i.e. BC divided by AC:
A = 0.085 + 0.03 c
c = (A - 0.085)
0.03
By substituting absorbances obtained for the plasma samples, their glucose concentration
is easily calculated:
For plasma B the absorbance obtained from the diluted sample should again be used:
Some modern spectrophotometers can be calibrated directly as the blank and standard
absorbances are read so that as unknown samples are read the result is displayed directly
in concentration units rather than absorbance values. Again it is important to emphasize
that this equation should only be used if the calibration curve is linear and for the
concentration range covered by the standards. The glucose concentration can be
calculated for the undiluted plasma B sample (A = 0.80) from the above equation:
Clearly this value is erroneously low and illustrates the danger of extrapolating the
standard curve beyond the highest standard used in its construction.
65
CHAPTER 4
Sometimes the scatter of the points of the calibration curve make it difficult to decide
where to draw the line of best fit. Under these circumstances the statistical line of best fit
can be calculated using statistical methods. Modern instruments often use very complex
mathematical techniques to calculate the best equation which describes the calibration
curve.
This has the effect of forcing the calibration curve through the origin:
0.4
A (absorbance corrected for blank)
0.3
0.2
0.1 Plasma A
0
0 1 2 3 4 5 6 7 8 9 10 11 12
c (glucose, mmol/L)
66
SPECTROPHOTOMETRY
Note that the slope is the same (0.03) although the intercept on the absorbance axis
(previously 0.085) is now zero. Thus the equation describing the calibration curve is now
simpler:
and used to calculate the glucose concentration for plasma samples. It is vital that the
reagent blank is also subtracted from the absorbance readings for the plasma samples
before their concentrations are either calculated from the above formula or read directly
from the calibration curve.
An alternative approach when the absorbance of a plasma sample is beyond the working
range of the calibration curve is to dilute the final reaction mixture, either with distilled
water or reagent until the absorbance falls within range. This practice overcomes non-
linearity due to the spectrophotometer but not if non-linearity is due to exhaustion of the
reagent or one of its components. If the reaction mixture was diluted with more reagent
then the correction to be applied is the same as if the assay had been repeated with diluted
plasma sample. However, if the reaction mixture has been diluted with water then the
reagent blank is no longer appropriate since the contribution of the reagent to the final
absorbance reading has been reduced. The reagent blank to be subtracted should first be
divided by the dilution factor used.
If the relationship between absorbance and concentration is linear (i.e. Beer’s Law is
obeyed) then the minimum number of data points required to determine the linear
equation relating the two quantities is 2. The term single point calibration is often used
which is, strictly speaking, incorrect. A second point is always required to define a
straight line (the shortest distance between two points in the same plane). It is always
inferred that this second point is the value when concentration is zero i.e. the blank.
There are other approaches to using a set of standard absorbances. In general if there are
n standards with their corresponding absorbances then the ratios of their absorbances to
their concentrations are equal:
A1 = A2 = A3 = A4 = …… An
c1 c2 c3 c4 cn
The only time this relationship does not hold is when the concentration (c0) is zero, since
any number divided by zero becomes infinite. A further requirement is that all
measurements should be made using an appropriate blank as reference.
67
CHAPTER 4
The individual ratios of A/c can be averaged to produce a mean value for A/c which can
then be used to calculate the concentrations in unknown samples:
Although this procedure is simple and takes account of the imprecision of absorbance
measurements of the standards it is not recommended. The absorbances of the individual
standards provide a further piece of information: confirmation that the relationship
between absorbance and concentration is in fact linear. Therefore, a calibration curve
should always be plotted or accessed mathematically.
Beer’s law states that absorbance (A) is equal to absorptivity (a) multiplied by
concentration (c):
A = ac
AA = aA x cA
AB = aB x cB
Provided there is no interaction between A and B, their absorbances are additive, so that
the total absorbance (Atotal) is the sum of their individual absorbances:
Atotal = AA + AB
68
SPECTROPHOTOMETRY
A+B
A
A B
λA λB
Wavelength (λ)
At first sight it may appear impossible to use the measured absorbances to calculate the
individual concentrations of A and B in a mixture of both, even if the individual
absorptivites of A and B are known i.e. the equation contains two unknowns. However,
if the absorbance is also measured at a second wavelength (usually the λmax of the other
species) then another equation for the measured absorbance can be set up similar to
Eq. 4.7.
69
CHAPTER 4
Then two equations can be written for the measured absorption, one for each wavelength:
A1 = aAλ1 cA + aBλ1 cB
……….. Eq.4.8
A2 = aAλ2 cA + aBλ2 cB
These form a set of 2 simultaneous equations (each containing the same two unknowns)
which can then be solved in the usual way (see Fig 4.5).
The same principle can be applied to mixtures containing 3, 4 or even more components.
Absorbance measurements must be made at the same number of wavelengths as the
components in the mixture. However, the simultaneous equations become increasing
complex and difficult to solve.
Question Q4(6)
260 nm 280 nm
Drug A 100 500
Drug B 1000 200
Fractions from peaks A and B were pooled and the absorbance of the mixture measured
in a cuvette with a 1 cm light path using solvent as reference. The absobance reading at
260 nm was 0.4 and at 280 nm 0.8. Calculate the individual concentrations of A and B in
the pooled fractions.
70
SPECTROPHOTOMETRY
Simultaneous equations are a set of two or more equations in two or more unknowns that
are simultaneously true. For example, consider two equations in which x and y are
unknown but in which the values of x and y are identical:
2x + 3y = 16 ……………… (i)
3x + 2y = 14 …………..….. (ii)
Either or both equations are multiplied by factors in order to render one of the terms
equal. Subtraction of one equation from the other then eliminates one of the variables.
The resulting equation is then solved for the other variable. Simple inspection often
suggests a suitable factor to use but an approach which always works is to multiply the
first equation by the constant in front of one of the variables in the second equation and
the second equation by the constant in front of the same variable in the first equation. In
the above example if the whole of equation (i) is multiplied by 3 (the constant in front of
x in equation (ii)) which then becomes equation (iii), and the whole of equation (ii) is
multiplied by 2 (the constant in front of x in equation (i)), which then becomes equation
(iv) then both equations contain 6x. Subtraction of equation (iv) from (iii) eliminates
variable x, and the result (equation (v)) can be solved for the other variable (y):
6x + 9y = 48 ……………… (iii)
6x + 4y = 28 ………………. (iv)
5y = 20 ………………. (v)
y = 20 = 4
5
This value for y can then be substituted into either equation (i) or (ii) which is then solved
for the other variable (x). Using equation (i):
2x + (3 x 4) = 16
2x = 16 - (3 x 4x) = 16 - 12 = 4
x = 4/2 = 2
71
CHAPTER 4
Answer Q4(6):
At each wavelength the measured absorbance is equal to the sum of the absorbances due
to dugs A and B. The absobance due to either drug is given by its molar concentration
multiplied by its molar absorptivity at that particular wavelength. Therefore an eauation
for the measured absorbance at each wavelength can be set up:
Substituting values for the absorbances, molar concentrations and molar absorptivities
into equations (i) and (ii):
Multiply equation (i) by 5 throughout (to become equation (iii)), then subtract equation
(ii) from (iii) so as to eliminate the cA term:
1.2 = 4800 cB
72
SPECTROPHOTOMETRY
FURTHER QUESTIONS
a) 95 b) 75 c) 50 d) 25 e) 10 f) 1
73
CHAPTER 4
8. The absorbances of a solution containing NAD and NADH in a 1cm light path
cuvette were 0.337 at 340 nm and 1.23 at 260 nm. The molar extinction
coefficients are:
Given that the molar absorptivity of bilirubin under these conditions is 6.07 x 104,
calculate the percentage purity of the bilirubin.
10. A method for creatinine determination based on the Jaffe reaction involved
mixing 0.1 mL of sample with 2.5 mL alkaline picrate reagent, incubating for
10 min at room temperature, then measuring the absorbance at 530 nm in a 1-cm
cuvette in a spectrophotometer set to read zero using a cuvette containing distilled
water. The following readings were obtained:
Calculate the creatinine concentration in the serum (in μmol/L) and urine (in
mmol/L).
11. A standard curve for a plasma glucose method was set up by preparing a series of
dilutions of a stock glucose standard (containing 50 mmol glucose/L) and
measuring the absorbance at 500 nm in a 1 cm cuvette using a blank with zero
glucose concentration to zero the instrument. The following readings were
obtained:
Glucose (mmol.L): 5 10 15 20 25 30
Absorbance: 0.102 0.203 0.305 0.375 0.410 0.432
Does the method obey Beer’s Law? What glucose concentration corresponds to
an absorbance reading of 0.250?
74
SPECTROPHOTOMETRY
75
CHAPTER 4
76
RENAL FUNCTION
Chapter 5
Renal function
The kidney has multiple functions but in routine clinical practice very few of these are
formally assessed. A proportion of the blood supplied to each kidney is filtered at the
glomerulus to produce a cell-free ultrafiltrate which is virtually protein-free but
otherwise has the same composition as plasma. The tubules then modify this filtrate by
reclaiming components (reabsorption) or by adding further components to it (secretion)
before it is transported via the ureters to the bladder then excreted. In other words the
tubules convert the glomerular filtrate into urine. These two processes, filtration or
glomerular function and tubular function can be quantitatively assessed in the laboratory.
Question Q5(1)
A normal subject has a GFR of 120 mL/min and a plasma creatinine concentration of
100 μmol/L. Calculate total volume (in litres) of filtrate produced over a 24 h period and
the 24 h excretion of creatinine (in mmol) assuming that none of the filtered creatinine is
reabsorbed by the tubules.
77
CHAPTER 5
Answer Q5(1)
The longer the time period the more filtrate is produced. Since 120 mL is produced per
minute then twice this amount is produced in 2 minutes (2 x 120 mL = 240 mL), three
times this amount in 3 minutes (3 x 120 mL = 360 mL) etc. Therefore if the rate of
filtration (the GFR) is multiplied by the time period (using the same units of time) then
the result is the total volume of filtrate produced over that time period:
In this instance the GFR is 120 mL/min and the time period is 24 h. Obviously the same
units for time must be used so 24 h is multiplied by number of minutes in an hour (60) to
give 24 x 60 = 1440 min. From this the volume of filtrate formed in 24 h can be
calculated:
This volume is considerably greater than the total amount of plasma in the human body
(approx 3.5 L) or of the total water content (approx 42 L) and emphasises the importance
of the renal tubules in reclaiming the vast majority of filtered water to reduce the volume
of filtrate to the daily output of urine (in the order of 1-2 L) and thus avoid dehydration.
Division by 1000 converts to mmol/L (since there are 1000 μmol in 1 mmol):
If none of this filtered creatinine is reabsorbed by the tubules then this represents the
24 urinary excretion of creatinine.
78
RENAL FUNCTION
It is vital to always ensure that the units are compatible. These calculations always
assume that the plasma concentration of the substance remains constant over the time
period being considered and that it is freely filtered at the glomerulus.
Again, care must be taken that units are compatible. Note that if rate of urine production
is used instead of urine volume then this calculation gives the rate of urinary excretion of
the compound.
Question Q5.2
A patient was asked to collect urine over a 24 h period. The volume was found to be
1.5 L and the concentration of creatinine in an aliquot of this urine 8.0 mmol/L.
Calculate the 24 h urinary excretion of creatinine in mmol.
79
CHAPTER 5
Answer Q 5(2)
Provided this compound is freely filtered at the glomerulus then its rate of filtration is
given by Eq.5.4:
If all of the compound that is filtered at the glomerulus is excreted in the urine i.e. it is
neither reabsorbed from nor further amounts secreted into the filtrate, then:
If we substitute for rates of filtration and excretion the following expression is obtained:
80
RENAL FUNCTION
Therefore, provided the above conditions are met, if a timed urine is collected and the
concentration of the analyte is measured in both plasma and urine, then the GFR is easily
calculated. Again it is vital that all the units are compatible.
Another way of looking at the GFR is that it is the clearance of the substance being
considered i.e. the volume of plasma from which the substance is completely removed or
cleared in unit time. This volume can easily be determined if urinary excretion is divided
by plasma concentration:
If the substance is reabsorbed or secreted by the tubules then of course the measured
urinary excretion of the compound will not be equal to the rate of its filtration at the
glomerulus. In other words the GFR will not be equal to the measured clearance. If a
compound is reabsorbed by the tubules (for example urea) then only a proportion of the
filtered compound will be excreted in the urine and the measured clearance will be much
lower than the GFR. In the case of urea, the amount reabsorbed is very dependent upon
the state of hydration (and hence the urine flow rate) and attempts have been made to
correct for this using the square root of the urine volume in the calculation. If a substance
is actually secreted into the tubules then the urinary excretion will be greater than the
amount filtered and the measured clearance will be greater than the GFR.
The clearance of any substance can be measured, but the value obtained will only give a
measure of GFR if it is excreted by glomerular filtration alone.
Question Q 5(3)
A 24 h urine (volume 2.4 L) was collected from a patient in ITU and the creatinine
concentration of an aliquot was found to be 6.0 mmol/L. The creatinine concentration of
a plasma sample collected during this 24 h period is 500 μmol/L. Calculate this patient’s
creatinine clearance in mL/min.
81
CHAPTER 5
Answer Q 5(3)
Urine flow rate (L/min) = Urine volume (L/24 h) = 2.4 = 0.00167 L/min
24 x 60 1440
(Division by 24 converts the urine flow to L/h, then division by 60 converts to L/min).
Rate of excretion (μmol/min) = Urine flow rate (L/min) x Urine concentration (μmol/L)
Since the urine creatinine concentration is given in mmol/L it is first multiplied by 1000
to covert to μmol/L, then:
Division of this rate of urinary excretion of creatinine by its plasma concentration gives
the volume of plasma completely cleared of creatinine in each minute . i.e. the clearance:
It should be noted that the calculation of clearance involves three measurements: urine
volume, urine concentration and plasma concentration. The combination of the errors
involved in these measurements is considerable. It cannot be emphasised too strongly
that the largest source of error in a clearance measurement is the accuracy of the timed
urine collection. Although clearance measurements are conventionally expressed as
mL/min, the result implies an unrealistic degree of accuracy and it would be better if
results were expressed as L/min using only 2 significant figures. For example a clearance
of 123 mL/min would become 0.12 L/min.
82
RENAL FUNCTION
The units must always be compatible. In the case of creatinine clearance, the clearance is
usually expressed as mL/min so the units of the individual measurements are first
adjusted as follows:
V: urine is usually collected over a 24 h period and its volume expressed in litres.
Since the clearance is required in mL/min this volume is multiplied by 1000 (to
convert from litres to mL) and divided by 24 (to convert from 24 h to 1 h) then 60
(to convert from hours to minutes).
If different units or urine collection times are used then the factor 700 must be adjusted.
83
CHAPTER 5
Alteration of one variable must result in the alteration of at least one other. Consider a
steady state in which clearance, P and UV are constant (Fig 5.2).
P
200
150 Change in
clearance
UV
% of 100
initial
value
Clearance
50
0 Time
If at some point (shown by the arrow) the clearance is halved (for example by removal of
one kidney) then the immediate effect will be for the urinary excretion (UV) to halve. As
time progresses the plasma concentration (P) will rise (since less of the substance is being
84
RENAL FUNCTION
removed from the plasma by filtration) and as a result the amount filtered per unit time
(given by: clearance x P) will also rise. This will be reflected in the urinary excretion
(UV) which will also increase. Eventually a new steady state will be achieved in which
the urinary excretion is unchanged but the plasma concentration is doubled (Fig 5.2).
The relationship between clearance, P and UV is summarised in Fig 5.3:
3000
2500 0.004
2000
1500
1/P
P
0.002
1000
500
0 0
0 50 100 150
0 50 100 150
Clearance Clearance
Question Q 5(4)
85
CHAPTER 5
Answer Q 5(4)
Substitution of these values into the rearranged Eq. 5.8 allows calculation of the new
value of U:
U x V = Clearance x (h x 60) x P
1000 x 1000
86
RENAL FUNCTION
To compensate for variations in body size, GFR or clearance is often related to body
surface area. Therefore results are either expressed as mL/min/m2 or relative to an
“average” body surface area of 1.73 m2 i.e. as mL/min/1.73 m2. To do this it is first
necessary to calculate the body surface area. Body surface area can be estimated from
both height (in cm) and body weight (in Kg) using the following formula:
where A is body surface area in m2 , W is body weight (in Kg) and H is height (in cm).
If the measured GFR or clearance is divided by this surface area then the result becomes
mL/min/m2. If this result is then multiplied by 1.73 then the GFR or clearance is corrected
to the average or standard surface area of 1.73 m2. These calculations are summarised in
Fig 5.6.
Question Q 5(5)
87
CHAPTER 5
Figure 5.6 Correction of GFR (or clearance) for body surface area
Again it should be emphasized that this correction is NOT the same as estimating
clearance from plasma creatinine measurement. In order to calculate clearance form
plasma creatinine then the body weight (W) will be used twice: once to estimate clearance
(uncorrected for body size) from plasma creatinine (using Eq. 5.9) then again to calculate
body surface area (using Eq. 5.11) for correction of this value to standard body surface
area.
Answer Q 5(5)
88
RENAL FUNCTION
Next correct the clearance for a body surface area of 1.73m2 using Eq. 5.12:
In this example correction of a low clearance for the small body size resulted in a normal
value.
Since the largest source of error in a clearance measurement is the accuracy of the timed
urine collection and continuous urine collections are very inconvenient, attempts have
been made to derive the urinary excretion of creatinine from sources other than urinary
analysis. Creatinine originates from two sources:
Therefore methods of calculation have been devised which attempt to relate muscle mass
to the rate of creatinine added to the body pool (and hence excreted in urine). Dietary
sources of creatinine are usually ignored.
Cockroft and Gault measured urinary creatinine output and derived its relationship with
both body weight and age which they then used to develop a formula which estimates
89
CHAPTER 5
creatinine clearance from plasma creatinine, age and body weight (Fig 5.5).
This equation assumes that the relationship between body weight and muscle mass is
constant and may be unreliable in obese and oedematous subjects. Note that body weight
and age are ONLY used to estimate urinary creatinine output which is then incorporated
into the standard equation for creatinine clearance (Eq. 5.8). It does NOT correct the
clearance to body surface area – if this is required then an additional calculation is
required.
Question Q 5(5)
A 40-year old lady has a plasma creatinine concentration of 153 μmol/L. Estimate her
creatinine clearance (in mL/min) given that her body weight is 60 kg.
90
RENAL FUNCTION
Slope = -0.0018
Rate of urinary
creatinine
excretion
mmol/24 h/kg
Intercept = 0.248
Age (years)
Simplifying:
Creatinine clearance (mL/min) = (140 - age (y)) x Body wt (kg) x 1.3 …. Eq. 5.13
Plasma creatinine (μmol/L)
The above data was validated for men. It is common practice to multiply this value
by 0.85 for women to correct for a lower muscle mass.
Figure 5.5. The Cockroft and Gault equation for predicting creatinine clearance
from age, body weight and plasma creatinine
91
CHAPTER 5
Answer Q 5(6)
Since the patient is female this result is multiplied by 0.85 to correct for a lower muscle
mass than males (for which the formula was derived).
The earliest equation to be widely recommended in most official guidelines was derived
by the Modification of Diet in Renal Disease (MDRD) study group. Later the Chronic
Kidney Disease Epidemiology Collaboration (CKD-EPI) published a new equation using
pooled data from multiple studies for both its development and validation1. Their
approach was a little different in that they divided the data into two groups – those with
serum creatinine below a certain value (called a “knot”) and those above it – resulting in
two equations (although they can be combined into one provided appropriate constants
are selected according to plasma creatinine and sex). The CKD-EPI equation performed
better than the MDRD equation, especially at higher GFR. At the time of writing the
CKD-EPI equation is recommended by the National Institute of Clinical Excellence
(NICE). An easy to use version of this equation is included in Fig 5.6.
1
Levey AS, Stevens LA, Schmid CH et al. A new equation to estimate glomerular
filtration rate. Ann Intern Med 2009; 150(9): 604-612.
92
RENAL FUNCTION
These formulae are still undergoing development and it is likely that guidelines may
change in the near future.
Male Female
Serum creatinine (µmol/L) ≤80 >80 ≤62 >62
k 0.9 0.9 0.7 0.7
α -0.411 -1.209 -0.329 -1.209
Fig 5.6 CKD-EPI formula for estimation of GFR (eGFR) from serum creatinine
concentration
Question Q 5(7)
Use the CKD-EPI formula to estimate the GFR for a 55-year old Caucasian woman
whose serum creatinine is 125 μmol/L.
93
CHAPTER 5
Answer Q 5(6)
The rate the substance is excreted in urine can be calculated from the concentration of the
substance in urine (Usubstance) and the urine flow rate (V):
The rate of filtration is the product of GFR and plasma concentration (P):
94
RENAL FUNCTION
If GFR is estimated at the same time and using the same urine collection by measuring
the concentrations of creatinine in the same plasma and urine, then:
GFR = (Ucreatinine x V)
Pcreatinine
This value for GFR can then be substituted into Eq. 5.14 to give:
Question Q 5.8
The following results were obtained in a 20 year-old male admitted after a car crash and
found to be oliguric:
95
CHAPTER 5
Answer Q5(8)
Substituting the relevant values after making sure that units are appropriate:
Many substances filtered at the glomerulus are reabsorbed by the tubules. If the rate of
filtration is less than the rate of tubular re-absorption for the substance then the urinary
excretion is zero. The proportion of the filtered substance that is reabsorbed is called the
tubular reabsorption (TR). Since the rate of filtration of a substance is given by the
product of its concentration and GFR, the rate of filtration increases with plasma
concentration. Most tubular transport mechanisms are concentration dependent so as the
concentration of the substance in the glomerular filtrate increases the rate of reabsorption
also increases but eventually a threshold is reached as the tubular transport mechanism
becomes saturated and no further substance can be reabsorbed and any excess appears in
the urine. This maximal rate of tubular transport is called the Tm value of the substance.
This gives rise to the concept of the renal threshold for a substance and is illustrated in
Fig 5.7. Since by definition the proportion filtered is 1, the numerical relationship
between FE and TR is:
If the rate of filtration (GFR x P) is less then the value for TR then none of the substance
appears in the urine and the value for FE is negative. However, if the rate of filtration
exceeds the TR value then the renal threshold is exceeded and the substance appears in
urine and is possible to calculate TR by rearranging Eq. 5.17:
96
RENAL FUNCTION
In other words the fraction that is not excreted must have been reabsorbed. To convert
this fraction to an absolute concentration (i.e. the amount of substance reabsorbed per
given volume of filtrate) then the value for TR is multiplied by the plasma concentration.
This is the same thing as the ratio of the maximal rate of reabsorption (Tm) to the GFR –
i.e. the Tm/GFR:
This method of calculation assumes that the plasma concentration (P) is well above the
renal threshold. Units are concentration e.g. mmol/L.
Question Q 5(9)
The following results were obtained for urine and plasma from a fasting adult:
97
CHAPTER 5
Rate of Filtered
Reabsorbed
filtration
or
reabsorption
or
excretion Tm Excreted
Renal
Threshold P
Tm/GFR
Figure 5.7 The renal threshold for a substance filtered at the glomerulus then
reabsorbed by the tubules
Answer Q5(9)
98
RENAL FUNCTION
TRP = 1 - FEP
c) Since the TRP is the fraction of filtered phosphate that is reabsorbed, multiplication
by the plasma phosphate concentration gives the maximum rate of reabsorption of
phosphate (in mmol) per litre of glomerular filtrate (TmP/GFR):
The calculated difference between the actual urine volume and the osmolar clearance is
known as free water clearance. In a hyposmolar urine the free water clearance is
positive; in a hyperosmolar urine it is negative. The free water clearance can also be
viewed as the volume of pure water subtracted from (positive free water clearance) or
added to (negative free water clearance) the plasma per unit time.
99
CHAPTER 5
The renal excretion of a large water load is limited by the diluting capacity of the tubules
and the amount of solute available for excretion. The minimum osmolality of urine is
about 50 mmol/kg. On a medium protein diet 1200 mmol is excreted per day (mainly
urea) and the maximum urine volume is 1200/50 = 24 L. In starvation however, the
source of solute is tissue breakdown (approx. 100-200 mmol/day. There is little ability to
excrete free water and hyponatraemia myoccur if water intake is greater than 2 to 4 L/day
in starvation.
Question Q 5(10)
100
RENAL FUNCTION
Answer Q 5(10)
Since Cosm is required in mL/min, the flow rate must first be converted to
mL/min by multiplying by 1000 (1000 mL in 1L) and dividing by 24 (to
convert to h) then 60 (to covert to min):
Cwater = V - Cosm
V = 0.833 mL/min
Cosm = 2.08 mL/min
101
CHAPTER 5
FURTHER QUESTIONS
2. A patient has a GFR of 110 mL/min. If the plasma creatinine concentration is 180
μmol/L how many mmol of creatinine are filtered in 12 h?
3. A urine collection (volume 3.2 L) was handed in by a patient which he said he had
collected over the previous day. Calculate the creatinine clearance given that the
urine was found to have a creatinine concentration of 7.2 mmol/L. The plasma
creatinine concentration taken during the collection was 94 µmol/L. Give the
most likely cause for this result.
5. A subject with a GFR of 100 mL/min was infused with a 'drug' X at a rate of
100 µmol/min and the plasma concentration reaches a steady state value of
200 µmol/L. It is known that this drug is not metabolized or excreted by organs
other than the kidney. What is the clearance of this drug? Comment on the result.
6. A patient who is severely water depleted and excreted only 100 mL of urine in the
last 6 hours was a short time before, found to have a creatinine clearance of
100 mL/min with a plasma creatinine concentration of 100 µmol/L. If renal
function has remained unchanged what concentration of creatinine would you
expect to find in the latest 100 mL (6 h collection) specimen of urine?
8. The following data were obtained for a hypertensive patient on a low sodium diet:
102
RENAL FUNCTION
If the renal tubules reabsorb 90% of filtered sodium, how many grams of
sodium are excreted in the same 24 h period?
9. The following results were obtained in a 20 year old male admitted after a car
crash and found to be oliguric:
Urine Na 60 mmol/L
Creatinine 1.2 mmol/L
Osmolality 200 mOsm/Kg
10. A 45 year old lady has a body weight of 56 kg and a height of 155 cm. If her
plasma creatinine is 150 μmol/L estimate her GFR expressing the result as
mL/min/1.73 m2.
11. Calculate the tubular maximum reabsorptive capacity (Tm/GFR) for glucose from
the following data:
13. An estimation of glomerular filtration rate can be calculated using the abbreviated
MDRD (Modified Diet in Renal Disease) formula:
103
CHAPTER 5
Calculate the GFR for a 55 year old Caucasian women whose serum creatinine is
125 μmol/L, and her creatinine clearance, given that a 24 h urine collection with a
volume of 1.1 L had a creatinine concentration of 4.7 mmol/L.
104
OSMOLALITY
Chapter 6
Osmolality
What is osmolality?
Osmosis is the process by which solvent moves from an area of low solute concentration,
through a semi-permeable membrane, to an area of high solute concentration. A semi-
permeable membrane is one which is permeable to solvent but not solute. The pressure
of the solvent movement will depend on the gradient in solute concentrations separated
by the membrane and is known as osmotic pressure. In the human body there is no active
mechanism for the transport of water into and out of cells; water always follows an
osmotic gradient. Most cell membranes are freely permeable to water. The main
exception to this are the membranes of cells lining the collecting tubules in the kidney,
which only become permeable to water in the presence of antidiuretic hormone (ADH).
The cell membranes are however, selectively permeable to solutes such as sodium and
glucose a property which is modulated by hormonal action e.g. aldosterone and insulin.
105
CHAPTER 6
solute and solvent, electrolytes do not behave in an ideal manner. In these circumstances
osmolality can be calculated in the following manner:
In routine clinical biochemistry it is usual to assume that osmotic coefficients are always
equal to 1.
Confusion often arises between the terms osmolality and osmolarity. These terms may
be defined as follows:
Properties other than osmotic pressure are also dependent upon the concentration of
solute particles in solution. These include increasing vapour pressure, raising boiling
point and decreasing freezing point. These are all known as colligative properties and
each may be used to measure to obtain a measure of the active solute concentration of a
solution. In routine clinical practice osmometers based on depression of freezing point
are usually used to obtain a measure of osmolality.
Question Q 6(1)
Estimate the osmolality (in mOsmol/Kg) of (a) 20% glucose, (b) Physiological saline,
(c) a mixture containing equal volumes of 20% glucose and physiological saline, and (d)
50 mmol/L calcium chloride. Assume ideal behaviour.
106
OSMOLALITY
Answer Q6(1)
(a) Glucose does not dissociate into ions in solution so its osmolality will be
approximately equal to its molar concentration.
107
CHAPTER 6
Then assuming ideal behaviour the osmolality will be approximately twice the
molar concentration of sodium chloride:
(c) If equal volumes of 20% glucose and physiological saline are mixed then the
resulting osmolality will be one half the sum of the individual osmolalities:
108
OSMOLALITY
The major osmotically active species present in normal plasma are Na+, Cl-, glucose and
urea. The simplest formula for calculating osmolality from the concentrations of these
species is:
Another version of this formula includes a term for potassium concentration i.e. 2[K+].
The concentration of sodium is multiplied by 2 to allow for the associated anions (mainly
chloride and bicarbonate). However, this simple formula does not give very good
agreement with measured osmolality since the following assumptions are made:
• That the anions associated with Na+ and K+ are free to contribute to omolality and
are not part of a macromolecule (e.g. protein).
• That the activity of each species is the same as concentration i.e. the ions exhibit
ideal behaviour.
• That the millimolal concentration of each ion (mmol/kg water) is the same as its
millimolar concentration (mmol/L plasma). This is not true since plasma is
approximately 95% water.
109
CHAPTER 6
In routine clinical practice the commonest reason for comparing measured with
calculated osmolality is to obtain evidence for the presence of, or to quantify, an
unmeasured osmotically active species. If no such species is present then, within the
limitations discussed above, there should be good agreement between measured and
calculated osmolality. In other words the difference between the two values (known as
the osmotic gap) should be approximately zero. If the osmolal gap has a significant
numerical value then this is good evidence that an unmeasured osmotically active species
is present and the size of the gap is proportional to its concentration.
The osmolal gap is often calculated if a patient is suspected of having ingested large
amounts of a volatile compound such as ethanol, methanol or ethylene glycol. Provided
only a single compound has been ingested and its identity is known then it is possible to
derive an approximate value for its concentration which is adequate to act as a guide for
treatment. Apart from the limitations inherent in calculating osmolality, errors may occur
because volatile solvents such has ethanol do not behave entirely as expected with some
osmometers.
Question Q 6(2)
The following data were obtained on the plasma from an unconscious man:
Assuming that the osmolar gap is due solely to ethanol, calculate the plasma ethanol
concentration in mg/dL.
110
OSMOLALITY
Answer Q 6(2)
First calculate the osmolality from the concentrations of sodium, urea and glucose:
= 283 mOsmol/kg
The osmolal gap is the difference between measured and calculated osmolality:
= 353 - 283
= 70 mOsmol/kg
If this osmolal gap is due entirely to ethanol, then the ethanol concentration is
70 mmol/L.
111
CHAPTER 6
ADDITIONAL QUESTIONS
3. A patient was mistakenly given 500 mL 20% mannitol (C6H14O6 ) intended for the
patient in the next bed instead of the same volume of normal (0.9%) saline.
Calculate the extra osmolal load given over that which would have resulted from
isotonic saline.
5. A 45 year old man is brought to casualty following a fit. He had been working
alone late in a garage, when he was found by the security guard who called an
ambulance. On admission, he has a large bruise on the left temple and is semi-
comatose, he smells of alcohol. The admitting team request urea and electrolytes,
glucose and an alcohol and blood gas estimation and arrange an urgent CT scan.
The results are as follows:
The CT scan does not show any bony injury or evidence of intracranial bleed.
The neurological registrar is called and asks for an osmolal gap to help provide a
quick estimation of whether there is a possibility that other toxic substances
present in the garage, such as antifreeze, have been taken in any quantity.
112
BASIC PHARMACOKINETICS
Chapter 7
Basic pharmacokinetics
Pharmacokinetics may be defined as what the body does with drugs and includes such
processes as absorption, distribution, metabolism and elimination. In practice
phamacokinetic factors determine not only the plasma concentration of the “active” drug
but the amount of active drug reaching its site of action. Therapeutic drug monitoring
(TDM) may be defined as the use of drug measurements in body fluids as an aid to the
use of drugs in the management (i.e. cure, alleviation or prevention) of disease.
Bio-availability (F)
The proportion of administered drug absorbed is termed its bio-availability (F) and
depends on the drug, the individual and the dosage form of the drug. Bio-availability is
defined as the proportion of administered drug which reaches the circulation:
This expression can be rearranged to give the dose absorbed for any administered dose:
Salt conversion factor (S) = Molecular weight of free drug .. Eq. 7.3
Molecular weight of compound administered
113
CHAPTER 7
Therefore for sodium phenytoin S would be 252/274 = 0.92 equation 7.2 can be modified
to incorporate the salt conversion factor:
Question Q7(1)
Calculate the theoretical maximum plasma concentration if 500 mg of the sodium salt of
a drug is administered to a 70 kg male. Assume the drug (MW of the parent drug 345
Daltons) is only distributed throughout the extra-cellular fluid (the volume of which is
20% of the body weight) and its bio-availability is 0.85.
114
BASIC PHARMACOKINETICS
Answer Q7(1)
S = MW parent drug
MW sodium salt of drug
For its sodium salt, Na (atomic weight 23) replaces a hydrogen atom (atomic weight 1).
S = 345 = 0.94
367
The volume of distribution (Vd) is the ECF volume which is 20% of the body weight
(70 kg) – assuming density of ECF is approximately 1:
Vd = 70 x 20 = 14 litres
100
Vd = Dose absorbed
Plasma concentration
115
CHAPTER 7
CLEARANCE OF A DRUG
In practice hepatic function is difficult to quantify and most dosage calculations take into
account impairment of renal function only.
Following a single dose of a drug a plot of plasma drug concentration versus time is
usually non-linear (Fig 7.1). As the drug is cleared the amount of drug removed from
plasma in unit time (i.e. the rate of fall in concentration) decreases as plasma
concentration falls. In fact the rate of fall with time can be expressed as:
The symbols d Cpt does NOT mean d multiplied by Cpt but a very small (infinitesimally
so ) change in Cpt. Similarly d t is a minute change in t. Mathematicians call d Cpt / d t
the first differential of concentration with respect to time and it can be regarded as the
slope of a tangent drawn to the curve at any defined point ( i.e. any value of t ). When the
differential d Cpt / d t is proportional to a single concentration term then the elimination
process is said to follow first-order kinetics. If d Cpt / d t is independent of concentration
(i.e. constant) then the elimination is said to follow zero-order kinetics and, if
proportional to two concentration terms, second-order kinetics etc.
where kd is known as the elimination rate constant which has units time.-1 This equation
can be rearranged to give:
116
BASIC PHARMACOKINETICS
d Cpt = kd . d t
Cpt
Therefore if ln Cpt is plotted against t then a straight line is obtained with slope -kd and
intercept on the vertical axis equal to ln Cp0 (see Fig 7.1).
If log10Cpt is plotted instead of ln Cpt then a straight line is still obtained, but this time the
slope is equal to - kd/2.303. This value can be derived using the relationship between
natural and common logarithms: ln x = ln 10 x log10 x = 2.303 log10 x. Eq. 7.7
can then be written:
117
CHAPTER 7
then each pair of values can be substituted into Eq 7.7 to give the following two
equations:
ln Cp1 = ln Cp0 - kd . t 1
ln Cp2 = ln Cp0 - kd . t 2
Subtraction of one equation from the other eliminates the Cp0 term to give:
Question Q 7(2)
A 15 year old boy presents to casualty following a convulsion. It turns out that he had
swallowed 30 of his mother’s lithium tablets about 10 hours previously. On admission
his lithium concentration is 4.1 mmol/L. A decision needs to be made whether to
haemodialyse him to reduce the lithium concentration. As this is not going to be
available quickly, the physicians want to know how long he will have toxic levels just
with endogenous clearance. How long it will be before his lithium concentration drops
to the relatively safe level of 1.5 mmol/L below which toxicity is unlikely, given an
elimination rate constant of 0.05 h.-1
118
BASIC PHARMACOKINETICS
Answer Q 7(2)
Substitute these values into the first order rate equation (Eq. 7.7) and solve for t:
ln Cpt = ln Cp0 - kd . t
A quantity known as the elimination half-life (t 1/2) can be defined as the time taken for
the plasma drug concentration or total body content of the drug to fall by 50%. Fig 7.1
shows that after one half-life the plasma concentration falls to 50%, after two half-lives to
25%, after three half-lives to 12.5% etc.
The elimination rate constant is related to the elimination half-life. If the initial drug
concentration is Cp0, then after one half-life (when t = t ½), the concentration Cpt will be
one half of the initial value i.e. Cp0/2. If these values are substituted into the logarithmic
form of the first-order rate equation (Eq. 7.7) then the following expression is obtained:
kd . t ½ = ln Cp0 - ln Cp0 / 2
119
CHAPTER 7
Since the difference between the logarithms of two individual numbers is the same as the
logarithm of one number divided by the other, this expression can also be written:
kd . t ½ = ln. Cp0 x 2
Cp0
kd . t ½ = ln 2
Rearranging Eq 7.9 gives kd = 0.693 / t ½ which can then be substituted into the
Eq. 7.7 to give an alternative formula:
which can be rearranged to give the following alternative expression for half-life:
Question Q 7(3)
120
BASIC PHARMACOKINETICS
0 100 4.61
1 50 3.91
2 25 3.22
3 12.5 2.53
4 6.25 1.83
5 3.125 1.14
100
Cp
50
0
0 1 2 3 4 5 6
t
5
Slope = - k d
4
3
ln Cp
2 Intercept = ln
Cp 0
1
0
0 1 2 3 4 5 6
121
CHAPTER 7
Answer Q 7(3)
The 2.5 h blood sample can be considered as the initial sample (Cp0) and the 5 h sample
as the sample when t = 2.5 h (i.e. 5.0-2.5) with concentration Cpt. Therefore:
Cp0 = 32 mg/L
Cpt = 10 mg/L
t = 2.5 h
These values can then be substituted into Eq. 7.10 and solved for t ½ :
t½ = 0.693. t
ln Cpt - ln Cp0
t½ = 0.693 x 2.5
ln 32 - ln 10
t½ = 1.73
3 .47 - 2.30
Note that ln Cp means the natural logarithm of concentration and not ln multiplied by Cp.
Therefore ln Cpt - ln Cp0 is NOT the same as ln (Cpt - Cp0).
An alternative approach to this problem would be to plot the natural logarithm of the two
drug concentrations (32 and 10 mg/L) against the times (2.5 and 5 h), measure the slope
to derive kd, then divide 0.693 by kd to obtain the half-life.
So far we have been dealing with the kinetics of elimination following administration of a
single dose of a drug. In many situations patients receive maintenance therapy: that is the
drug is taken at regular intervals. The dose is repeated well before the previous dose has
been eliminated from the body. Eventually, after multiple dosing, the situation is
reached where the rate of administration of a drug (RA) is equal to the rate of elimination
(RE) so that a constant steady state plasma concentration (Cpss) is achieved. The
clearance of a drug (Cl) can be defined as the volume of plasma from which the drug is
completely cleared per unit of time. Therefore the rate of elimination is the product of
clearance and plasma steady state concentration:
122
BASIC PHARMACOKINETICS
RE = Cl x Cpss
The rate of administration is the amount of drug administered per unit time and depends
on the dose administered, bioavailability (F), salt factor (S) and interval between doses
(τ):
RA = Dose x F x S
τ
When a steady state is reached, RA = RE, and substitution of the expressions for RA
and RE gives:
Dose x F x S = Cl x Cpss
τ
For a drug which is eliminated by glomerular filtration alone, its clearance is equal to the
patient’s GFR (se chapter 5).
For drugs which exhibit first order kinetics, the elimination rate constant (kd) can be
defined as the fraction of the drug which is cleared in unit time:
Ab = Vd x Cpss
Substituting these values into the expression for kd gives the following:
kd = Cl x Cpss
Vd x Cpss
123
CHAPTER 7
Canceling the Cpss terms gives a relationship between Vd, clearance and kd:
The equations developed so far can be applied to answer many practical problems in drug
therapy.
First the theoretical initial plasma concentration (Cp0) is calculated from the dose and
volume of distribution (Vd) by substituting the expression for total amount absorbed
(Eq. 7.4) into the rearranged expression for Vd (Eq. 7.5):
The values for Cp0, t and kd (if kd is not known it can be calculated from clearance and Vd
using Eq. 7.12) can be substituted into Eq. 7.7 and solved for Cpt.
Question Q 7(4)
A 70 kg lady is given an oral dose of carbamazepine of 400 mg. What is the plasma
carbamazepine concentration 24 h later assuming a volume of distribution of 1.0 L/kg,
a clearance of 0.05 L/h/kg, salt factor of 1 and a bioavailability of 0.75.
124
BASIC PHARMACOKINETICS
Answer Q 7(4)
First calculate the theoretical initial drug concentration (Cp0) by substituting values for
dose, S, F and Vd into Eq. 7.13:
Cp0 = Dose x S x F
Vd
Next substitute values for Cp0, t and kd (which must first be calculated from Vd and the
clearance) into Eq. 7.7 and solve for Cpt:
ln Cpt = ln Cp0 - kd . t
125
CHAPTER 7
Sometimes it is desirable to know which dose to give a patient (to be repeated at a time
interval, τ) to achieve a target steady state plasma concentration (Cps). This is easily
calculated by rearranging Eq. 7.11 to give:
It is important to appreciate that a new a steady state will not be achieved until at least
5 half lives have elapsed.
This is simply calculated by inserting the new dose into Eq. 7.14. Alternatively the old
steady state concentration can be multiplied by the fractional increase in dose.
Alternatively to calculate the dose necessary to change the steady state concentration by a
set amount all that is necessary is to calculate the maintenance dose (using Eq. 7.14)
required to achieve the difference in steady state concentration between the original value
and the target level then add this to the original maintenance dose. Again it is necessary
to wait at least 5 half-lives before the new steady state is achieved.
Question Q 7(5)
b) The new average steady state plasma phenobarbitone concentration if the dose
was increased to 120 mg.
Assume a clearance of 5 mL/h/kg and that both bioavailability and salt conversion factors
are 1.
126
BASIC PHARMACOKINETICS
Answer Q 7(5)
a) The maintenance dose is calculated by substituting values for Cpss, t and clearance
into Eq. 7.14:
= 90 mg
a) If the dose is increased from 90 mg to 120 mg, then the fractional increase is
given by:
The new average steady state concentration can be calculated by multiplying the
old average steady state plasma concentration by this fractional increase in dose:
= 25 x 1.33
127
CHAPTER 7
Equation Eq. 7.5 can be re-arranged to give an expression for the amount of drug in the
body:
The amount of drug in the body is the dose reaching the circulation so that Eq 7.4 can be
substituted for the left hand side of this equation:
F x S x Dose administered = Vd x Cp
The dose administered is then the loading dose (LD) required to achieve the desired
plasma concentration of the drug (Cp) which can be rearranged to give the following
expression:
If the patient is already receiving the drug in question then this formula is modified to
take account of the pre-existing drug concentration. The loading dose is then the amount
of drug which needs to be administered to raise the plasma concentration from the initial
concentration (Cpinitial) to the desired concentration (Cpdesired):
Question Q7(6)
Calculate the loading dose of digoxin (bioavailability = 0.75, salt factor = 1) required to
achieve an initial plasma concentration of 1.5 μg/L in a 60 kg man (assume volume of
distribution = 7 L/kg):
128
BASIC PHARMACOKINETICS
Answer Q 7(6)
= 7 x 60
= 420 L
LD = Vd x (Cpdesired - Cpinitial)
F x S
Figure 7.1 shows that following a single dose of a drug its concentration falls to 50% of
the original value after one half-life, 25% after two halve lives, 12.5% after three half
lives etc. After five half lives the amount remaining is negligible at 3.25%.
Consider the situation when the same dose of a drug is administered using a dosing
interval of one half-life. If the theoretical maximum plasma concentration achieved (Cp0)
is 100%, then after one half-life the concentration of the original dose will be 50%.
Administration of a second dose at the end of one half-life contributes 50% to the plasma
129
CHAPTER 7
concentration when two half lives have passed since the first dose, whereas the
contribution at this time from the first dose will be one half of 50% which is 25%.
Therefore after two halve lives the plasma concentration will be 50% + 25% = 75% of
the theoretical maximum (i.e 75% of the steady state concentration). If this process is
repeated for 5 half lives then the following pattern emerges:
Dose 1 2 3 4 5
For practical purposes the plasma concentration after five half lives have elapsed
approximates to the average steady state concentration. A similar argument, although
more complex can be applied if the dosage interval is shorter than the half-life. A dosage
interval of less than the half-life minimises oscillations around the average steady state
concentration. If the dosage interval is considerably longer than the half-life then a
steady state is never achieved and virtually all of the drug is removed from the body
between each dose.
130
BASIC PHARMACOKINETICS
Suppose that the dose is given at time t0 and that samples are drawn at times t1 and t2 and
that the measured plasma concentrations of these latter two samples are Cp1 and Cp2
respectively. The elimination rate constant can then be calculated by substituting these
values into Eq. 7.8:
The half-life can then be calculated by substituting the value for kd into Eq. 7.9:
t½ = 0.693
kd
The theoretical initial plasma concentration (Cp0) can then be calculated by substituting
either pair of concentration and times (it doesn’t matter which) into the logarithmic
integrated first-order rate equation (Eq. 7.7), together with the value for kd then solving
for Cp0:
ln Cpt = ln Cp0 - kd . t
The volume of distribution (Vd) is obtained by substituting the value of Cp0 and the dose
into Eq. 7.13:
Cp0 = Dose x F x S
Vd
Vd = Dose
F x S Cp0
131
CHAPTER 7
Note that F and S are unknown but are incorporated into the apparent value for Vd.
Dosage adjustments can be made using the apparent Vd without knowledge of the actual
values for F and S.
Substitution of kd and Vd/ (F x S) into Eq. 7.12 allows calculation of clearance (or more
precisely Cl / (F x S):
Cl = kd x Vd
F x S F x S
The maintenance dose at dosage interval τ to achieve an average steady state drug
concentration Cpss can then be obtained from Eq. 7.14.
Again values for F and S do not need to be determined since they are already
incorporated into the apparent clearance Cl / (F x S).
Question Q 7(7)
A 60 kg patient is given 4 g of a drug and blood samples drawn at timed intervals with
the following results:
2 30
3 7
132
BASIC PHARMACOKINETICS
Answer Q 7(7)
a) To calculate the half-life first calculate the elimination rate constant using the
logarithmic form of the integrated first-order rate equation (Eq. 7.7):
ln Cpt = ln Cp0 - kd . t
Use the 1st (2 h) sample as the initial sample so that Cp0 = 30 mg/L
Use the 2nd (4 h) sample as Cpt (7 mg/L )
Therefore t = 4 - 2 = 2 h
ln 7 = ln 30 - kd . 2
1.95 = 3.40 - 2 kd
b) First calculate the theoretical value for Cp0 by substituting values for kd, Cpt and t
(either set of t and Cpt values can be used) into Eq. 7.7 but this time use the actual
sample times.
ln 30 = ln Cp0 - 2 kd
133
CHAPTER 7
Cp0 = Dose x F x S
Vd
Which can be rearranged to give a value for the apparent volume of distribution:
Vd = 4000 = 31.3 L
F x S 128
c) The apparent clearance can be calculated by substituting kd and Vd into Eq. 7.12:
= 0.725 x 0.52
= 0.38 L/h/kg
Cpss = 15 mg/L
Cl / (F x S) = 0.38 L/h/kg = 0.38 x 60 = 23 L/h/kg
τ = dosing interval = 24 h
134
BASIC PHARMACOKINETICS
Although many drugs follow the principle outlined above (i.e. single compartment
models obeying first-order kinetics) there are some notable exceptions. These drugs are
often cleared by saturable mechanisms so that a concentration is reached at which the
rate of elimination becomes independent of concentration. This is very important
clinically since only small increases in dose above a threshold value can easily lead to
toxic levels of the drug. A good example of this is phenytoin the clearance of which
displays a mixture of zero and first-order kinetics. The best model to use under these
circumstances is the equation of Michaelis and Menton which describes the rate of an
enzyme reaction (v) in terms of substrate concentration (s) in terms of two constants the
Km (which can be shown to be the substrate concentration at half-maximal velocity) and
the maximal velocity (Vmax):
v = Vmax x s
Km + s
Substituting the administration rate (which in a steady state is equal to the rate of
elimination) for v:
v = Dose x S x F
τ
and the average steady state drug concentration (Cpss) for s, gives the expression:
which can be rearranged to give an equation to calculate the dose required to give the
desired average steady state plasma drug concentration for the dosing interval:
135
CHAPTER 7
ADDITIONAL QUESTIONS
2. A 15 year old boy presents to casualty following a convulsion. It turns out that he
had swallowed 30 of his mother’s lithium tablets about 10 hours previously.
On admission his lithium concentration is 4.1 mmol/L. A decision needs to be
made whether to haemodialyse him to reduce the lithium concentration. As this is
not going to be available quickly, the physicians want to know how long he will
have toxic levels just with endogenous clearance. Estimate the following,
indicating clearly any assumptions you have made:
b) How long it will be before his lithium concentration drops to the relatively
safe level of 1.5 mmol/L below which toxicity is unlikely, given a
clearance of 0.03 L/h/kg.
Time mg/L
2.5h 32
5h 10
7.5h 3
136
BASIC PHARMACOKINETICS
5. The plasma concentration of a drug is found to be 200 nmol/L at 9.00 am. It’s
elimination follows first order kinetics with a rate constant is 0.34/h. Calculate the
times at which the plasma concentrations will reach 100 nmol/L and 75 nmol/L.
7. The SHO decides to treat a patient (weight 80 kg) with intravenous theophylline (salt
factor = 0.8). What loading dose would you recommend in order to achieve a
theophylline level of 12 mg/L given a volume of distribution of 0.5 L/kg and an
initial plasma theophylline level of 4 mg/L?
8. A patient, unable to take oral medication, had been receiving intravenous valproate
for a number of days and an average steady state level of 75 mg/L. After regaining
consciousness the doctors wished two change to an oral twice daily regimen.
In order to maintain the same average steady state concentration what dose would
you recommend. Assume a clearance of 10 mL/h/kg, a bioavailability of 0.7 and a
salt factor of 0.85.
137
CHAPTER 7
138
BODY FLUIDS AND ELECTROLYTES
Chapter 8
The human body contains approximately sixty per cent of its weight of water. For an
average adult male weighing 70 kg this amounts to 42 L. Of this about a third (14 L) is
contained in the extracellular fluid (ECF) and two thirds (28 L) in the intracellular fluid
(ICF). The ECF and ICF are separated from each other by cell membranes which are
semi-permeable but allow ready diffusion of water into and out of cells. Both of these
compartments are in osmotic equilibrium with each other. Blood plasma constitutes
approximately one quarter of the ECF (3.5 L), most of the remainder is present as
interstitial fluid (10.5 L) which surrounds cells in the various tissues of the body. Plasma
and interstitial fluid are separated by capillary walls which are freely permeable to water
and electrolytes but minimally permeable to proteins such as albumin. The distribution
of ECF between plasma and the interstitial fluid is controlled mainly by haemodynamic
factors, integrity of capillary walls and the plasma albumin concentration. A small
proportion of the ECF is present in the various body cavities (e.g. peritoneal, pleural,
pericardial, synovial, spinal cavities). These relationships are illustrated in Fig 8.1.
139
CHAPTER 8
% composition of body
0 5 20 60
ECF (14 L)
ICF (28 L)
Interstitial fluid
(10.5 L)
Plasma
(3.5L)
140
BODY FLUIDS AND ELECTROLYTES
This can be extremely difficult since the mechanisms and consequences are complex and
vary according to the nature of the fluid lost. In general however, several approaches can
be used:
• Loss of body weight – short term changes in body weight are due to changes in
hydration. Difficult to assess in immobilised patients e.g. in ITU
• Fluid balance charts recording fluid input versus output – negative fluid balance
consistent with fluid losses (this is not helpful if dehydration is already
established). Accurate fluid balance charts are difficult to maintain and rely on
estimating fluid loss via lungs, skin etc
A full discussion is beyond the scope of this book, but here are two areas where
calculations are involved:
The difficulty is identifying and accurately quantifying all the fluid losses and gains.
Small errors in the calculation of daily balance can accumulate over time with
catastrophic consequences. If the net fluid loss exceeds the net fluid gain then the patient
becomes water depleted i.e. is in negative fluid balance:
Fluid balance = Net fluid intake - Net fluid loss ……….. Eq. 8.1
141
CHAPTER 8
Fluid is gained by drinking. As food has a water content fluid is also gained by feeding,
either orally, by naso-gastric tube feeding or by intravenous infusion. Another source of
water gain which must be considered is water produced metabolically when fats and
carbohydrates are oxidised. Fluid can be lost in urine, as water vapour via the lungs,
sweating and evaporation through the skin and in faeces. In the hospitalised patient
losses via the lungs may be increased if the patient is on a ventilator, losses via the skin
may increased in burns patients or if the patient is pyrexial, and further losses may occur
by vomiting, nasogastric suction and through drains and fistulae. Typical values for a
normal adult in fluid balance are shown in figure 8.2.
Figure 8.2. Typical gains and losses in a normal adult in perfect fluid balance
In practice it is difficult to measure the so-called insensible losses i.e. losses via the lungs,
skin and faeces, and it is customary to assume a value of approximately 900 mL per day.
Water production via metabolism, the insensible gain, is also difficult to determine, so an
average value of about 500 mL per day is assumed. Therefore, the net insensible loss for
an adult is normally in the order of 900 - 500 = 400 mL per day.
Question Q 8(1)
During a 24 h period a patient recovering from intestinal surgery receives 1.5 L of fluid
by intravenous infusion. The total urine output during this period is 1200 mL and a
further 450 mL of fluid was removed by nasogastric suction. Estimate the patients fluid
balance.
142
BODY FLUIDS AND ELECTROLYTES
Answer Q8(1)
= 1500 - 2050
= - 550 mL
The consequences of fluid loss depend upon the nature of the fluid lost. If the loss is
isotonic, i.e. water and sodium are lost in the same proportion, due for example to
haemorrhage, then the osmolality of the plasma (and ECF) remains unchanged and there
is no stimulus for the osmotic shift of fluid from the ICF to the ECF. In other word acute
loss of isotonic fluid is confined to the ECF. On the other hand if pure water loss occurs
from the plasma (and ECF) then the osmolality (and hence sodium concentration) of the
plasma (and ECF) rises and provides an osmotic stimulus for the shift of water from the
ICF to the ECF until a point is reached when the two compartments are again in osmotic
equilibrium. Note that at equilibrium the osmolality in both compartments will be higher
than normal, but not as high as it would be if there had been no fluid shift from the ICF to
ECF. In other word the loss in fluid volume is shared between the two compartments and
so helps protect the circulating plasma volume (see Fig 8.3). In reality pure water loss
rarely occurs, so calculation of the approximate fluid loss from changes in plasma sodium
often serve as a (rough) guide for fluid replacement. The small difference between
osmolality and osmolarity (see Chapter 6) will be ignored and it will be assumed that
each mmol/L of solute contributes an osmolality of 1 mOsm/kg.
143
CHAPTER 8
100%
100%
ICF ECF
Pure Water
water loss shift
Figure 8.3 Comparison of the effects on ECF and ICF volumes of a) isotonic
fluid loss, and b) pure water loss
144
BODY FLUIDS AND ELECTROLYTES
If only pure water loss has occurred then the total amount of osmotically active species
present in the body (both in the ICF and ECF) remains constant. The total amount of
osmotically active species is the product of the volume of total body water and the plasma
osmolality (the osmolality of all compartments must be the same since they are in
osmotic equilibrium). Before the fluid loss occurred:
Total solutes (mOsm) = Initial osmolality (mOsm/kg) x Initial vol body water (L)
Total solutes (mOsm) = Final osmolality (mOsm/kg) x Final vol body water (L)
Assuming no solutes have been lost then these two quantities are equal and we can write:
which can be rearranged to give an expression for the final body water volume:
The volume of fluid lost is the difference between the initial volume and the final
volume:
Initial vol (L) - [Initial osmolality (mOsm/kg) x Initial vol (L)] ……….. Eq. 8.2
Final osmolality (mOsm/kg)
This formula can be simplified by taking the value for initial body water of 42 L (based
on the average 70 kg male) and an average normal initial osmolality of 285 mOsm/kg:
Which becomes:
145
CHAPTER 8
Since sodium and its associated anions normally account for most of the osmolality of
plasma then this expression can be further simplified by using the plasma sodium
concentration assuming that it was initially normal (140 mmol/L), so that 140 x 42 =
5880 mmol:
It cannot be emphasised too strongly that this formula can only give a crude estimate of
fluid loss and is based on the following assumptions:
• That the plasma sodium was initially normal at 140 mmol/L (if the previous
sodium concentration was known then it could be taken into account).
• The initial body weight of the patient is 70 kg and contains 60 per cent water. If
the body weight is known then a correction could be made, but this is more
difficult for obese patients.
Question Q 8(2)
An adult dehydrated male has a plasma sodium of 165 mmol/L due to water depletion.
Estimate the fluid deficit.
146
BODY FLUIDS AND ELECTROLYTES
Answer Q 8(2)
Initial vol (L) x Intial Na (mmol/L) = Final vol (L) x Final Na (mmol/L)
= 42 - 36 = 6L
The initial rise in osmolality due to the hyperglycaemia stimulates thirst via the
hypothalamic osmoreceptors. The patient will respond to thirst by taking in fluids until a
point is reached at which the plasma (and hence ECF) osmolality is returned to normal
but at the expense of diluting sodium and other solutes i.e. to produce a dilutional
hyponatraemia. Since the plasma osmolality is unchanged it follows that the rise in
147
CHAPTER 8
plasma glucose concentration must be equal to the fall in sodium and its associated
anions. Therefore the fall in sodium concentration must be equal to one half the rise in
plasma glucose concentration:
Fall in plasma sodium (mmol/L) = Rise in plasma glucose (mmol/L) ….. Eq. 8.5
2
If there is no intake of water then the plasma osmolality will remain elevated (because the
cell membranes are essentially impermeable to sodium and, when insulin deficient,
impermeable to glucose). The osmotic stimulus will result in a shift of water from the
ICF to the ECF (and hence plasma). An equilibrium is established in which the
osmolality of both compartments is again equal but higher than normal. In other words
the osmotic load is shared between both the ECF and ICF compartments. Therefore the
increase in osmolality must be equal to the increase in plasma glucose concentration:
The osmotic load in the ECF due to this accumulated glucose can be calculated from the
rise in plasma glucose concentration (Δ glucose) and the ECF volume:
At equilibrium the rise in osmolality (Δ osmolality) (which must be the same for both
compartments) is given by the expression:
Substituting Δ glucose x ECF vol for the osmotic load and (ECF + ICF) for the total
body fluid volume gives:
148
BODY FLUIDS AND ELECTROLYTES
Hyperosmolar
Iso-osmolar
ECF
No intake of fluids
Free access to Water shifts from
oral fluids ICF to ECF
ECF osmolality Both compartments
is restored become hyper-osmolar
Hyperosmolar
Iso-osmolar
Figure 8.4 Effect of a rise in plasma (and hence ECF) glucose on the osmolality
of both the ECF and ECF compartments when there is either free
intake of water or no intake at all. In both instances addition of water
to the ECF compartment dilutes sodium and other electrolytes whilst
the plasma glucose remains elevated
149
CHAPTER 8
Although some fluid has shifted between compartments, the ratio of (ECF + ICF) to ECF
is still approximately 3:1 (since one third of body water is in the ECF). Therefore this
expression becomes:
Since the rise in osmolality is less than the rise in plasma glucose, the concentration of
other solutes (predominantly sodium and its associated anions) must have fallen by an
amount equal to the difference between the two:
This calculation assumes that there has been no net gain or water loss by the body and no
transfer of solutes between the ECF and ICF.
Question Q 8(3)
A male adult insulin dependent diabetic forgot to take his insulin. His blood glucose rose
from 5 mmol/L to 20 mmol/L in 2 hours. During this period he did not pas any urine.
Calculate the likely effect on his plasma sodium concentration assuming:
150
BODY FLUIDS AND ELECTROLYTES
Answer Q 8(3)
= 20 - 5
= 15 mmol/L
This rise in the amount of glucose present in the body is responsible for the rise in
overall osmolality. Therefore the increase in osmolality can be calculated by
dividing this amount by the volume of total body water (i.e. ICF + ECF):
∆ osmolality = 20 - 5 = 5 mOsm/kg
3
151
CHAPTER 8
As the plasma osmolality has risen by 5 mOsm/kg and the plasma glucose has risen by 15
mmol/L, it follows that the concentrations of the other solutes in plasma (principally
sodium and its associated anions) must have fallen by an amount equal to the difference
between the two:
= ∆ osmolality - ∆ glucose
= 5 - 15 = - 10 mmol/L
The advent of ion-specific electrodes for the determination of plasma/serum sodium soon
led to discrepancies when compared with values determined by traditional flame
photometry. The difficulty arose because these electrodes measure sodium activity in
plasma water. Plasma contains appreciable protein (normally about 70 g/L). This
protein, together with its hydration shell, occupies a significant proportion of plasma
volume and reduces the amount of plasma water available to dissolve sodium and other
ions. The “normal” plasma sodium determined by flame photometry is approximately
140 mmol/L of plasma. If this plasma contains 70 g/L of protein and we assume that its
volume is approximately 70 mL/L (or 0.07 L/L), then this means that a litre of plasma
contains only 1-0.07 = 0.93 L of water. Therefore the concentration of sodium in
plasma water can be calculated as follows:
This is the value which will be obtained for plasma sodium containing 140 mmol/L
plasma when measured on whole undiluted plasma in an ion-specific electrode system.
Since the measurement is made directly on whole plasma rather than diluted plasma it is
said to be a “direct reading electrode”. In general:
152
BODY FLUIDS AND ELECTROLYTES
If, on the other hand, the plasma sample is first diluted with an aqueous diluent before the
electrode measurement is taken then the discrepancy almost disappears. These
instruments are known as “indirect reading electrodes” Consider the same sample,
containing 140 mmol of sodium per L plasma, first diluted 1 in 20 before the sodium
measurement is made. This is equivalent to diluting 0.05 L of plasma to 1 L. The
sodium present in 1 L of plasma diluted 1 in 20 will be 140 x 0.05 = 7 mmol. The
amount of water this sodium is dissolved in will be equal to the amount of water in the
0.05 L plasma sample plus the water added to it.
If the protein is 70 g/L (approx 0.07 L/L), then 0.05 L of plasma is occupied by 0.05 x
0.07 = 0.0035 L of protein. The plasma sample therefore contains 0.05 - 0.0035 =
0.0465 L of water. To dilute the plasma 1 in 20, the amount of diluent added is 1 - 0.05
= 0.95 L. Therefore:
The reason for this discrepancy is that dilution results in a decrease in the proportion of
water displaced by protein. Plasma sodium measured with a flame photometer gives
similar readings to indirect reading electrodes.
It has become common practice for instrument manufacturers to “adjust” the calibration
of their direct reading ISE instruments so that the discrepancy with flame photometers
disappears, but only at a “normal” plasma protein concentration (usually 70 g/L):
[Na+] (plasma) = [Na+] (water) x 0.93
If the plasma protein differs markedly from “normal” then the discrepancy with flame
photometer readings reappears. In the above example, if the instrument is adjusted so
that the plasma water sodium of 151 mmol/L reads 140 mmol/L then a sample with the
same plasma sodium concentration but containing 50 g/L (0.05 kg/L) protein is
measured, then the concentration of sodium in plasma water will be:
153
CHAPTER 8
Question Q 8(4)
A plasma contains 140 mmol/L of sodium and 95% water by volume. Neglecting sodium
binding by plasma proteins, calculate the apparent plasma sodium concentration
determined from measurements with an electrode system which responds to water sodium
(a) in undiluted plasma, and (b) in plasma diluted 1 in 20 with water.
Answer Q 8(4)
Amount of water in diluted plasma = Water from plasma + water from diluent
154
BODY FLUIDS AND ELECTROLYTES
which can rearranged to give the following expression for the anion gap:
The anion gap was originally developed as a quality control tool when it was noted that in
most patients the difference between the sodium concentration and the sum of the
chloride and bicarbonate concentrations was always approximately 12 mmol/L.
Sometimes the potassium concentration is included in the calculation. Small deviations
from the reference range for the anion gap (7-16 mmol/L) are usually due to marked
changes in plasma calcium, potassium, phosphate and negatively charged proteins.
However, the principal use of the anion gap is as an aid in the differential diagnosis of
non-respiratory acidosis. A markedly raised anion gap indicates the presence of excess
unmeasured anions of metabolic acids e.g. ketoacids, lactic, salicylic, oxalic (from
metabolism of ethylene glycol) and formic acids (from metabolism of methanol).
155
CHAPTER 8
Question Q 8(5)
The following results were obtained on a young adult in the Accident and Emergency
Department:
Plasma sodium = 140 mmol/L
Plasma chloride = 97 mmol/L
Plasma bicarbonate = 8 mmol/L
Answer Q 8(5)
= 140 - {97 + 8}
= 140 - 105
= 35 mmol/L
ADDITIONAL QUESTIONS
1. Over a 24 h period a patient recovering from intestinal resection receives 2 L of
fluids intravenously and 750 mL orally but does not eat any solids over this
period. The urine output over the same period is 1.25 L and 600 mL of fluid is
lost via a fistula. Is the patient in positive or negative fluid balance and by how
much?
3. A male adult insulin dependent diabetic forgot to take his insulin. His blood
glucose concentration, which was 5 mmol/L, rose to 15 mmol/|L in two hours.
Estimate the effect on his plasma sodium concentration, assuming that no other
water intake nor loss of water from the body takes place during this time,
indicating what assumptions you make.
4. A plasma sample with a total protein content of 70 g/L gave identical sodium
results of 140 mmol/L when measured using either a direct-reading ion-selective
electrode or a flame photometer. What plasma sodium result would you expect
the ion-selective electrode to give with the same plasma sample if its total protein
concentration had been 90 g/L?
156
ENZYMOLOGY
Chapter 9
Enzymology
Enzymes are proteins which catalyse chemical reactions. Enzymes are of interest to the
clinical biochemist for a number of reasons:
• Enzymes are often released from tissues into the circulation as a result of disease
e.g. the release of aspartate aminotransferase from the liver affected by hepatitis.
• Since enzymes are proteins, immunoassay can be used (i.e. mass measurements).
This approach has been used for the assay of the MB isoenzyme of creatine
kinase.
157
CHAPTER 9
Catalytic activity
In routine clinical practice enzymes are usually quantified by measuring their catalytic
activity. The rate of an enzyme reaction is dependent upon many factors (see Fig 9.1)
but, with a few exceptions and provided the conditions of the assay are carefully chosen,
the rate of the reaction is always proportional to the concentration of the enzyme in the
reaction mixture. The rate (or activity) will vary depending upon the analytical
conditions used and over the past few decades considerable effort has been expended by
biochemists to standardise assay conditions so that activity measurements obtained in
individual laboratories are comparable. Reaction conditions have been optimised so that
the highest rate (maximal sensitivity) is obtained and small variations in conditions
(substrate and cofactor concentrations, pH etc) have minimal effect. Whichever assay is
used the rate is proportional to enzyme concentration but the actual rate depends on the
reaction conditions employed.
Enzyme concentration
pH
Ionic strength
Inhibitor/activator concentration
Temperature
158
ENZYMOLOGY
An enzyme catalyses the conversion of its substrate into a product. Therefore the course
of the reaction can be followed by either measuring the disappearance of substrate or
formation of product (Fig 9.2).
Product(s)
Concentration
Substrate(s)
Time
159
CHAPTER 9
If the product of the reaction does not have a physical property (such as absorbance) by
which its appearance can be monitored then either a reagent is added to form a suitable
derivative or a second enzyme is added to convert the product into another compound
which is easily measurable. When a second (or third) enzyme is added in this way the
assay is said to be coupled. It is vital that the concentration of the second enzyme is
present in excess so that product is removed as soon as it is formed i.e. the first enzyme
reaction (the one we wish to measure) is always rate limiting.
There are two approaches which can be used to obtain a rate measurement:
• Monitor the reaction continuously and take a rate measurement over a suitable
time period. These continuous monitoring methods are preferred since it is
possible to evaluate the reaction progress and ensure that a true initial rate
measurement is taken and is constant, avoiding errors due to any lag-phase.
Whichever approach is used, the timing of the measurements is critical. Initially (or
possibly after any lag-phase) the rate of reaction at any given enzyme concentration is
constant. However, as the reaction progresses substrate is consumed and its
concentration falls and eventually a point is reached at which substrate availability
become rate limiting and the rate of the reaction falls i.e. the progress curve becomes
non-linear. This is illustrated in Fig 9.3. Unless a lag-phase is observed, measurement
over the segment 0-A gives a true initial rate whereas measurement over the segments 0-B
or 0-C gives an artificially low result. At point C the substrate is completely exhausted
(or the reaction is at equilibrium) and the reading is actually a measure of substrate rather
than enzyme concentration. Sometimes a lag phase is observed so that the initial rate is
less than optimal (see inset to Fig 9.3). In this situation the optimal rate is given over the
segment A-B not 0-A.
160
ENZYMOLOGY
Concn
0 AB
Time
Concentration
0 A B C
Time
161
CHAPTER 9
King-Armstrong (KA) units were used for alkaline phosphatase (the amount of enzyme
present in 100 mL of serum that will split 1mg of phenol from phenylphosphate in 1
hour). Very soon a plethora of units were in use and attempts were made to standardise
enzyme units. This may seem a rather pointless exercise since although units may be
identical the enzyme activity will not be identical unless the reaction conditions are held
constant. Nevertheless, the Commission on Enzymes of the International Union of
Biochemistry propose that enzyme activity should be expressed in terms of international
units. One international unit (U) is the quantity of enzyme that catalyses the reaction of
1 μmol of substrate per minute. Catalytic concentration is to be expressed in terms of
U/L or mU/L, whichever gives the more convenient numerical value.
The international unit itself may eventually be replaced by a new unit termed the katal
and concentrations expressed as katals per litre (kat/L). One katal is the amount of
enzyme which catalyses the reaction of 1 mol of substrate per second.
When there is some uncertainty about the exact nature of the substrate (e.g. where the
substrate is a macromolecule such as starch or a protein) then units are still expressed as
the amount of a group or reside released per unit time (e.g. glucose units or amino acids
formed per minute).
This involves converting physical measurements (e.g. absorbances) made over timed
interval(s) into substrate concentration units which are then used to derive a rate for the
enzyme catalysed reaction. This rate then needs to be converted to the concentration of
enzyme units in the clinical sample. The following example illustrates the process:
Question Q 9(1)
162
ENZYMOLOGY
Answer Q 9(1)
This is the reverse of the normal reaction. The cofactor NADH absorbs at 340 nm,
whereas absorbance due to NAD+ is negligible. Therefore as the reaction progresses the
absorbance at 340 nm falls at a rate which is equal to the rate of consumption of the
substrate (pyruvate).
The concentration of NADH (c) at any point in time can be calculated from the
absorbance reading (A), the cuvette path length (b) and the molar absorptivity of NADH
(a) using equation Eq 4.4:
Therefore at the first time the absorbance reading is taken (t1 = 30 seconds), A = 0.183,
b = 0.5 cm and a = 6.3 x 103 L/mol/cm so the concentration of NADH (c1) in mol/L
can be calculated as follows:
The same calculation could be performed for the second absorbance reading (when t2 =
60 seconds and A2 = 0.148) to give the concentration (c2) at time t2. The two equations
for the calculation of the concentrations at t1 and t2 are therefore:
At t1: c1 = A1
ab
At t2: c2 = A2
ab
These two expressions can be combined to calculate the change in concentration (∆c)
over the time period t2 – t1 (∆t):
∆c = c1 – c2 = A1 - A2 = (A1 – A2) = ∆A
ab ab ab ab
(Mathematicians use the symbol Δ to denote a difference or change between two values.)
163
CHAPTER 9
This gives the decrease in concentration of substrate over the time period (∆t) in units of
mol/L. Since International Units use concentration expressed as μmol/L, this value must
be multiplied by 1,000,000 (since there are 1,000,000 μmol in a mol):
Division by the time interval (t2 - t1 = ∆t) gives the rate of change in concentration as
μmol/sec/L reaction mixture. Multiplication by 60 converts this rate to μmol/min/L
reaction mixture:
It is usual to express enzyme activity in serum as U/L of serum, not U/L of reaction
mixture. Therefore the dilution of the serum in the reaction mixture needs to be taken
into account. Multiplying by the total reaction volume and dividing by the sample volume
gives:
Substituting:
If a large number of calculations are to be performed for the same assay then it would be
simpler to combine all the terms (except ΔA) to produce a factor which could then be
used to obtain enzyme activity directly i.e. ΔA x Factor (28600) = LDH activity
(U/L).
164
ENZYMOLOGY
1. Divide change in absorbance by the time period (in min) over which the
measurements were taken to give rate i.e. ΔA/min
2. Divide by molar absorptivity (units: L/mol/cm) and cuvette path length (cm)
4. Multiply by total reaction volume (mL) and divide by sample volume (mL)
For an enzyme assay utilizing NADH/NAD as cofactor (monitored at 340 nm) the
formula used is:
The older literature is full of enzyme data expressed in units other than U/L. It is
sometimes useful to convert these values to the corresponding activity in U/L. For
example, King-Armstrong (KA) units were used for many years to report alkaline
phosphatase activity. One KA unit is the amount of enzyme in 100 mL of serum that will
split 1 mg of phenol from phenylphosphate in 1 hour and can be written:
165
CHAPTER 9
Therefore:
It is important to remember that even after converting enzyme activity from one unit to
another, the numerical result will still depend on the reaction conditions used.
Question Q 9(2)
A transaminase result is quoted in the literature as 207 Karmen units. One Karmen unit is
the amount of transaminase that will produce an absorbance change of 0.001 in a 1cm
cuvette at 340 nm (a coupled reaction) in 1 min per 1 mL serum (in a total volume of 3
mL). Assuming the molar absorptivity of NADH is 6.30 x 103 L/mol/cm, express the
transaminase activity as a) international units per L of serum, and b) katals/L.
166
ENZYMOLOGY
Answer Q 9(2)
Therefore:
Therefore:
b) 1 Katal/L = 1 mol/sec/L
To convert to U/L:
Therefore:
167
CHAPTER 9
Figure 9.1 listed factors which influence the rate of an enzyme catalysed reaction i.e.
enzyme activity. These factors are best studied by keeping them all, except the one under
investigation, constant. So far we have only considered the effect of variation of the
amount of enzyme on activity since this is the variable of most interest in clinical
practice. In the absence of complicating factors the rate of an enzyme reaction is directly
proportional to enzyme concentration. However, the relationship between reaction rate
and substrate concentration is a little more complex. If we take the simplest possible
enzyme reaction in which a single substrate (S) binds to enzyme (E) to form an essential
intermediate the enzyme-substrate complex (ES) which decomposes to release free
enzyme and the reaction product (P) this may be represented schematically as:
k+1 k+2
E + S ES E + P
k-1 k-2
The rate constants for the various reactions are denoted k+1, k-1, k+2 and k-2. Note that
rate constants of reverse reactions are given a minus sign. The rate of each reaction is the
product of the molar concentrations of the reactants and the respective rate constant.
Therefore we can write the following equations (in which square brackets denote molar
concentrations) for the rates of formation and decomposition of the enzyme-substrate
complex ES:
A few milliseconds after the enzyme and substrate are mixed [ES] builds up and does not
change provided [S] is in large excess and k+1>>k+2. This condition is called a steady
state in which the rate of formation of ES is balanced by its rate of decomposition so that
[ES] is constant. Therefore the following steady state equation can be written :
168
ENZYMOLOGY
Km = k-1 + k-2
k+1
The total enzyme concentration [E]Total is the sum of the free enzyme and the enzyme-
substrate complex. Therefore we can write the following enzyme conservation equation:
Re-arranging gives [E] = [E]Total - [ES], so that an expression for [E]/[ES] can be
written in terms of concentrations of enzyme components:
169
CHAPTER 9
[E]Total - 1 = Km
[ES] [S]
[E]Total[S] - [S] = Km
[ES]
[E]Total[S] = Km + [S]
[ES]
[ES] = [E]Total[S]
Km + [S]
The rate of formation of product (v) can be expressed as a function of the rate constant
k+2 and the concentration of the enzyme-substrate complex:
v = k+2[E]Total[S]
Km + [S]
K+2 and [E]Total are constant and can be replaced by a single constant called maximal
velocity (Vmax) to give the Michaelis-Menten equation:
Consider the behaviour of this equation at the two extremes of substrate concentration.
When [S] is very low, for example much lower than the Km (i.e. Km >>> [S]), then the
Michaelis-Menten equation approximates to v = Vmax[S]/Km which is of the general form
v = constant x [S]. This is a linear expression so under these conditions the rate is
directly proportional to substrate concentration as illustrated in Fig 9.5. When a reaction
170
ENZYMOLOGY
One special situation worth considering is the significance of the substrate concentration
at half-maximal velocity (i.e. when v = Vmax/2). We can then write:
2 [S] = Km + [S]
[S] = Km
Km ≈ k+1 = [E][S]
k-1 [ES]
171
CHAPTER 9
v Zero-order
v = Vmax
First-order
v = Vmax x [S] Mixture of zero and first-order
Km
v = Vmax[S]
v = Vmax/2 Km + [S]
[S] = Km
[S]
Figure 9.5 Effect of increasing substrate concentration [S] upon the initial
velocity (v) of an enzyme-catalysed reaction. At low substrate
concentrations the reaction follows first order kinetics; at high
substrate concentrations first order kinetcs. At all concentrations the
rate is described by the Michaelis-Menten equation. At half maximal
velocity the substrate concentration is equal to the Michelis-Menten
constant (Km)
Question Q 9(3)
172
ENZYMOLOGY
Answer Q 9(3)
v = Vmax [s]
Km + [s]
v = initial velocity
Vmax = maximal velocity (at infinite substrate concentration)
Km = Michaelis-Menten constant = substrate concentration at half-maximal velocity
[s] = initial molar substrate concentration
v = Vmax 10 Km
Km + 10 Km
Therefore the initial rate is approximately 90 per cent of the maximal rate (Vmax).
Although the Michaelis-Menten equation has been derived for the simplest case of a
single substrate reaction its application is by no means limited to this. The equation can
be written in a more general form:
in which Kmapp and Vmaxapp are not true constants but apparent values for Km and Vmax
which depend on the concentrations of activators, inhibitors, second substrates etc which
are held constant whilst [S] is varied. Variation of these apparent constants with other
parameters depends on the kinetic mechanisms involved. Therefore simple modification
of the Michaelis-Menten equation can be used to study enzyme inhibition, activation, pH
effects and multiple-substrate reactions.
173
CHAPTER 9
In theory it possible to obtain estimates of Km and Vmax from plots of v versus [S] but
deciding when the rate is maximal is difficult since the rate approaches Vmax
asymptotically. To overcome this practical difficulty various graphical solutions have
been proposed.
1 = Km + [S]
v Vmax[S]
1 = Km + [S]
v Vmax[S] Vmax[S]
If the [S] terms are cancelled in the second component of this expression on the right
hand side then this equation can be rewritten in a more useful form:
Therefore a plot of 1/v versus 1/[S] is linear, with slope Km/Vmax and intercept on the 1/v
axis of 1/Vmax (see Fig 9.6a). It can easily be shown that the intercept on the 1/[S] axis is
-1/Km. Double reciprocal plots are easy to interpret and computation of Km and Vmax is
straightforward.
The [S]/v versus [S} plot of Hanes. If the double-reciprocal equation of Lineweaver and
Burk (Eq 9.6) is multiplied throughout by [S] then another linear from is obtained:
Thus a plot of [S]/v versus [S] is linear with slope 1/Vmax and intercepts on the [S]/v and
[S] axes of Km/Vmax and –Km respectively (Fig 9.6b).
174
ENZYMOLOGY
0.08
Intercept
0.04
= -1/K m
0
-0.4 -0.2 0 0.2 0.4 0.6 0.8 1
1/[S]
0.4
Slope = 1/V max
0.3
Intercept
= -K m 0.2
0
-10 -5 0 5 10 15 20 25 30
[S]
Figure 9.6 Double reciprocal plot (a) and [S]/v versus [S] plot (b) for the same set
of data
175
CHAPTER 9
50 Intercept = V max
40
v 30
20 Intercept
Slope = -K m = V max /K m
10
0
0 2 4 6 8 10 12
v /[S]
V max
50 V max
40
30
20
10
0
-25 -20 -15 -10 -5 0 5 10 15 20 25
Km
Figure 9.7 Plot of v versus v/[S] (a) and the direct linear plot (b) for the same
data depicted in Fig 9.6
176
ENZYMOLOGY
The v versus v/[S] plot of Eadie-Hofstee. Division of both sides of the Michaelis-Menten
equation by [S] gives:
v = Vmax
[S] Km + [S]
vKm + v = VMax
[S]
Therefore a plot of v versus v/[S] is linear with slope –Km and v and v/[S] intercepts of
Vmax and Vmax/Km respectively (Fig 9.7a).
The direct linear plot of Eisenthal and Cornish-Bowden. These authors use a rather
unique approach in which the constants Vmax and Km are treated as variables and plotted as
points in observational space whereas values for v and [S] are treated as constants and
plotted as lines in parameter space. This strange concept is probably easier to understand
if the Eadie-Hofstee form of the Michaelis-Menten equation (Eq. 9.5) is re-arranged
slightly to:
177
CHAPTER 9
This equation is now that of a straight line of the form y = ax + b, in which the variables x
and y are now Km and Vmax respectively with a lope of v/[S] and y intercept of v. We are
used to thinking of Km and Vmax being constant with an infinite number of values of v and
[S] which can satisfy the Michaelis-Menten equation. This concept is now reversed with
v and [S] being constant but with an infinite number of values for Km and Vmax which can
satisfy the equation. As before we can calculate the values for the intercepts on the x and
y axes when Vmax is plotted against Km:
Multiplying both sides by [S]/v and gives: Km = - [S]. Therefore the intercept on
the x axis is –[S].
This means that a line joining a pair of values for v and [S] plotted on the y and x axes
respectively if extrapolated must pass through the point in the Vma x- Km space with
coordinates Vmax and Km for that enzyme. A similar argument applies for other pairs of
values for v and [S] for the same enzyme. They must all pass through the point with
coordinates Vmax and Km. In other words they must all intercept at the same point and this
point has the coordinates Vmax and Km.
The procedure is illustrated in Fig 9.7b for the same data as that used in Figs 9.6a and b
and 9.7a. The values for v and [S] are marked on the Vmax and Km axes respectively,
then each pair of values joined by a straight line which is extrapolated into the positive
Vmax-Km quadrant. All the lines intercept at a common point with coordinates Vmax and
Km.
Difficulties often occur in deciding upon the exact intersection point. Frequently several
such points occur as a result of experimental variation (this is analogous to deciding
where to draw a straight line through a series of points on a conventional plot). In the
worse-case scenario if there are n pairs of observations then there is a maximum of 0.5n
(n - 1) possible intersections. The authors recommend that the median of all possible
intersections are used (if three lines intersect at one point the this is treated as three
intersections rather than one in finding the median, if there are four lines intersecting then
this counts as six intersections etc).
178
ENZYMOLOGY
Although the Lineweaver-Burk plot is widely used for kinetic analysis of enzyme
reactions, the use of reciprocals means that small experimental errors can result in large
errors in graphically determined values for Km and Vmax. It has been argued that the
Eadie-Hofstee plot (i.e. v versus v/[S]) results in less error. It is apparent by inspection of
the plots in Figs 9.6 and 9.7, which are all based on the same v and [S] data, that each
method of plotting results in a different “spread” of results e.g. the double reciprocal plot
compresses the points at high substrate concentrations. This pitfall can be overcome by
careful selection of concentration values employed in the experiment. The direct linear
plot overcomes some of these limitations. Nowadays computer statistics packages are
frequently used to fit data directly to the Michaelis-Menten equation.
Enzyme inhibition
Some enzyme inhibitors act irreversibly by forming a covalent bond with an amino acid
residue in the enzyme thereby rendering it inactive. However, most inhibitors bind
reversibly to the enzyme so that an equilibrium is established between the free enzyme
(E) and inhibitor enzyme (EI) complex, the position of which is determined by the
inhibitor constant (Ki) which reflects the affinity of the inhibitor (I) for the free enzyme:
E + I EI Ki = [E][I]
[EI]
Clearly the inhibitor can lower the reaction velocity (v) by either decreasing the
numerator of the Michaelis-Menten equation (i.e. value of the Vmaxapp) or increasing the
denominator (i.e. the value for Kmapp). There are three main types of inhibitors which
exert their effects on v in different ways:
Competitive inhibitors, often structurally related to the natural substrate bind reversibly
with the enzyme at or near the active site. The inhibitor and substrate therefore compete
for the enzyme according to the scheme:
E + S ES E + P
+
I
EI
179
CHAPTER 9
Therefore the inhibitor effectively removes a portion of the enzyme from the reaction.
The enzyme conservation equation therefore becomes:
By substituting [E][I]/Ki for [EI] and grouping the [E] terms this becomes:
which can the be incorporated into the steady state equation for [ES] (Eq 9.1) to derive
the following variation of the Michaelis-Menten equation for competitive inhibition:
Therefore the Km has been increased since it is now multiplied by a number which is
always greater then one, which in turn reduces the value of v. Note that it is possible to
reduce the relative contribution of the Km(1 + [I]/Ki) term by increasing the value of [S].
In other words it is theoretically possible to overcome competitive inhibition at high
substrate concentrations.
Non-competitive inhibitors bind reversibly at areas other than the active site. Binding of
inhibitor and substrate is independent so it is possible not only to form complexes
between enzyme and inhibitor (i.e. EI) but between inhibitor, substrate and enzyme
(i.e. EIS) according to the scheme:
E + S ES E + P
+ +
I I
EI + S EIS
180
ENZYMOLOGY
The EIS complex cannot breakdown to reaction products so again a proportion of the
enzyme becomes unavailable to take part in the reaction. Inhibition cannot be overcome
by increasing substrate concentration since binding at the two sites in independent.
In fact the dissociation constants (Kis) of the EIS and EI complexes are identical:
Ki = [E][I] = [ES][I]
[EI] [EIS]
By substituting [E][I]/Ki for [EI], [ES][I]/Ki for [EIS] and grouping the [E] and [ES]
terms this becomes:
which can the be incorporated into the steady state equation for [ES](Eq 9.1) then used to
derive the following variation of the Michaelis-Menten equation for non-competitive
inhibition:
Therefore the value of Km is unchanged but the Vmax is divided by a number greater than
one so its value and hence the rate of the reaction is reduced.
An uncompetitive inhibitor combines only with the ES complex, not the free enzyme
according to the scheme:
E + S ES E + P
+
I
Ki = [ES][I]
EIS [EIS]
181
CHAPTER 9
0.25
Non-competitive
0.2
1/v Competitive
0.15
Uncompetitive
0.1
Uninhibited
0.05
0
-0.6 -0.4 -0.2 0 0.2 0.4 0.6
1/[S]
Figure 9.8 Double reciprocal plots for an enzyme illustrating the effects of
competitive, non-competitive and uncompetitive inhibition. The inset
shows the effects on the apparent Km and Vmax. For each plot Km =
5 mmol/L, Vmax = 50 μmol/min, Ki = 10 mmol/L and the inhibitor
concentration is 20 mmol/L
182
ENZYMOLOGY
Substituting [EIS] = [ES][I]/Ki and grouping the [ES] terms this becomes:
which can be incorporated into the steady state equation for [ES] (Eq 9.1) then used to
drive the following variation of the Michaelis-menten equation for uncompetitive
inhibition:
As for non-competitive inhibition the Vmax is divided by a number greater than one so its
value and hence the rate of the reaction is reduced. It is interesting to note that the
apparent Km is actually reduced suggesting that the substrate is more avidly bound to the
enzyme in the presence of inhibitor. However this is insufficient to overcome the
reduction in apparent Vmax.
Inhibition data can be transformed to linear equations in the same way as for uninhibited
reactions. Figure 9.8 shows a double reciprocal plot for an enzyme inhibited in three
different ways by the same inhibitor concentration. Simple inspection of the curves
allows identification of the mode of inhibition. Values for Kmapp and Vmaxapp can be
obtained by analogous procedures used for Km and Vmax. Substitution of values for Km
and Vmax into Kmapp and Vmaxapp respectively, together with inhibitor concentration permits
calculation of Ki. Another approach attributed to Dixon is to plot 1/v versus [I] using two
or more substrate concentrations; the best fit lines at each substrate concentration should
intercept at a value of [I] equal to Ki. The preferred approach is to determine values for
Kmapp and Vmaxapp over a wide range of inhibitor concentrations then to use secondary
plots to determine Ki.
183
CHAPTER 9
Question Q 9(4)
8 cuvettes were set up (numbered 1 to 8) each with an optical path length of 1 cm. 0.6
mL of a stock solution of drug A (5 mmol/L) was diluted to 25 mL with buffer, then
used to prepare a series of dilutions from this diluted substrate as follows:
An identical set of dilutions of diluted A was prepared in cuvettes 5 to 8, except that 0.5
mL of a solution of drug C (50 mmol/L) was added at the final stage instead of buffer.
1 mL of enzyme PP solution and 1 mL of a second enzyme (which was not rate limiting
but converts B into a coloured product with a molar absorptivity at 505 nm of 5500
L/mol/cm) was added to each cuvette, the contents mixed then incubated for exactly 5
minutes. The absorbance of each cuvette at 505 nm was measured versus a cuvette
containing distilled water (there was no significant reagent blank).
Cuvette No 1 2 3 4 5 6 7 8
Absorbance 0.400 0.330 0.250 0.167 0.330 0.250 0.167 0.100
184
ENZYMOLOGY
Answer Q 9(4)
Cuvette No: 1 2 3 4 5 6 7 8
Substrate (x 106 mol/L) 20 10 5 2.5 20 10 5 2.5
185
CHAPTER 9
A = a x b x c
c = A = A mol/L
a x b 5,500 x 1
Cuvette No: 1 2 3 4 5 6 7 8
Absorbance 0.400 0.330 0.250 0.167 0.330 0.250 0.167 0.100
v (μmol/min) 14.6 12.0 9.1 6.1 12.0 9.1 6.1 3.6
c) To determine the Km and answer the rest of the questions some graphical
presentation of the data is required. Although not ideal, the double reciprocal
plot is simplest. Cuvettes 1 to 4 are without inhibitor, cuvettes 5 to 8 have the
same substrate concentrations but with inhibitor present.
186
ENZYMOLOGY
0.3
0.25
+ inhibitor
0.2
1/v
0.15
No inhibitor
0.1
0.05
0
-0.25 -0.15 -0.05 0.05 0.15 0.25 0.35 0.45
6
1/[S] ( x 10 L/mol )
From the graph the intercept on the 1/[S] axis for the uninhibited reaction is
approximately - (0.20 x 106) L/mol. This corresponds to -1/Km:
187
CHAPTER 9
d) Drug C lowers the activity of the enzyme by increasing the value of the Km
without altering the Vmax i.e. the double reciprocal plots for the inhibited and
uninhibited reaction intersect on the 1/v axis. Therefore drug C is a competitive
inhibitor of enzyme PP.
e) From the graph the intercept on the 1/[S] axis for the inhibited reaction is
approximately - (0.11 x 106 ) L/mol. This corresponds to -1/Kmapp:
f) The inhibitor constant (Ki) of drug C is considerably higher than the Km for the
substrate A (1.0 x 10-2 mol/L compared to 5.0 x 10-6 mol/L). Both of these
constants are inversely proportional to the affinity of the enzyme PP for the
substrate and inhibitor. Thus the affinity of the enzyme for the substrate, A, is
considerably greater than its affinity for the inhibitor, C. Therefore the effect of
188
ENZYMOLOGY
introducing drug C into the regimen for patients receiving drug A will depend on
the relative concentrations of the two drugs. If their therapeutic concentrations
are similar, or the concentration of A is greater than C, then C will have little
effect on the metabolism of drug A or the optimum dose required to achieve
therapeutic levels of its active metabolite B. If, however, the plasma level of drug
C is considerably higher than that of drug A, then inhibition of the metabolism of
drug A will occur and higher plasma levels of drug A will be achieved, with the
consequence of decreased levels of the active metabolite B. As a result higher
doses of dose A will be required to achieve the same therapeutic result.
If drug C is not only an inhibitor of PP but a substrate for this enzyme then the
metabolism of drug C will also be affected by drug A which will have
consequences on the levels of drug C (and its metabolites) achieved.
ADDITIONAL QUESTIONS
2. An assay for alkaline phosphatase activity involved mixing 0.5 mL of serum with
2.7 mL buffer, allowing temperature to reach equilibrium then starting the
reaction by adding 0.2 mL of substrate (4-nitrophenyl phosphate). The increase in
absorbance in a 1cm cuvette due to the liberation of product (4-nitrophenol) was
0.180 over a 5-minute period. Calculate the alkaline phosphtase activity
expressing the result as a) international units per litre of serum, and b) katals per
litre of serum. Assume that the molar absorptivity of 4-nitrophenol is 1.88 x 104
L/mol/cm.
3. The Somogyi saccharogenic method for the assay of amylase involves measuring
the rate of release of glucose from substrate. One Somgyi unit is the amount of
enzyme catalysing the release of 1 mg of glucose in 30 min per 100 mL serum.
Derive a factor to convert Somogyi units to international units per litre of serum.
189
CHAPTER 9
6. What information can be obtained from the double-reciprocal plot for an enzyme
under the following conditions: a) 1/v = 0 when 1/[S] = -12.5 x 106 L/mol, b)
1/[S] = 0 when 1/v = 5.2 x 106 min.L/mol, c) 1/[S] = 0 when 1/v = 6.5 x 106
min/mol and the slope of the line is 100 min/L?
0.5 33 9
1.0 50 17
2.0 67 29
4.0 80 44
10 91 67
190
ENZYMOLOGY
1 12 33
2 21 50
4 35 67
10 57 83
20 73 91
Stating any assumptions that you make determine the pH at which the enzyme has
greatest affinity for the substrate.
10. The apparent Km and Vmax of an enzyme were measured over a range of inhibitor
concentrations and the following data obtained:
5 10 7.5
10 7 5
15 5 4
20 4 3
191
CHAPTER 9
192
THE BASIS OF STATISTICS
Chapter 10
“If your experiment needs staticism then you ought to have done a better experiment”
In the perfect world if we were to measure the creatinine concentration in a serum sample
a large number of times then we would obtain exactly the same result every time. In
practice, however, the result is not always the same due to analytical imprecision. The
results obtained would vary but only by a small amount, often the same result would be
obtained more than once so that the results would tend to cluster around a particular
value. The value around which results would cluster is not necessarily the true value due
to inherent inaccuracy of the method. Similarly if serum samples were collected from the
same “normal” individual on a number of occasions and the creatinine concentration
measured in each sample then a similar cluster of results would be obtained but the
spread would be much wider due to intra-individual variation being added to the
analytical imprecision. On the other hand if samples were collected from a number of
“normal” individuals then the spread of results would be even wider due to a contribution
from inter-individual variation in addition to intra-individual variation and analytical
imprecision. An important consequence is that if two different results are obtained we
cannot be sure that the change is real since it may be explained by the expected analytical
impression and/or intra-individual variation.
The science of statistics gives us the tools to deal with this random variation due to
analytical imprecision and biological variation in order that we can extract maximum
information from data that we obtain in the clinical laboratory. Nowadays anyone can
use computers (and some pocket calculators) to do statistical calculations. However,
correct interpretation of the statistical parameters produced requires some understanding
of the underlying principles.
193
CHAPTER 10
Figure 10.1a gives the results obtained when creatinine concentration was measured in
sera from 60 “normal” individuals. Simple inspection of the data shows that:
In most areas of statistics an extremely useful first step is to express the results in the
form of a diagram. Depending upon the number of results the data is first grouped into
class intervals of equal size. The interval used should be chosen to make sure that most
intervals contain more than one result. For the creatinine data in Fig 10.1 a class interval
spanning a concentration range of 10 μmol/L ensures that each group contains at least one
result with one of the groups containing as many as sixteen values (Fig 10.1b). If a graph
is plotted with the concentration intervals as the x axis and frequency as the y axis then
the result (Fig 10.1cc) is a frequency distribution or histogram.
Visual inspection of the distribution reveals that the class interval with the highest
number of results is the 70-79 μmol/L group and that there are approximately equal
numbers of results below this group as above it i.e. the overall shape is symmetrical. As
we move further away from this group on either side of the diagram the number of results
in each group diminishes. If we were to join up the peaks of each class interval then the
result would be a continuous bell-shaped curve which mathematicians refer to as a
normal or Gaussian distribution.
1. The peak value. Mathematicians call this the measure of central location.
2. The spread of results (i.e. a measure of the variability of the results) or a measure
of the width of the bell-shaped curve. Mathematicians call this the measure of
dispersion.
194
THE BASIS OF STATISTICS
80 62 91 81 111 75 78 45 91 72 65 50 57 85 103 75 75
63 47 93 77 122 60 76 97 35 54 106 90 80 80 74 41 78
76 115 76 52 87 75 82 64 93 71 79 100 67 68 74 59 69
63 68 65 89 72 82 68 80 83
Interval: 30-39 40-49 50-59 60-69 70-79 80-89 90-99 100-109 110-119 102-129
Frequency: 1 3 5 12 16 11 6 3 2 1
16
14
12
10
0
30 40 50 60 70 80 90 100 110 120 130
Creatinine (μmol/L)
195
CHAPTER 10
The arithmetic mean or average: This simply is the sum of all the individual results in
the series (if the same result is encountered more than once then it is counted more than
once), divided by the number of results. If we denote each individual result in the series
by the symbol x (so that the first is x1, the second x2, etc), the symbol Σ to mean the sum
of all values of the series and n as the number of results, then we can write the following
expression for the mean (m):
The median: This is simply the value such that half of the data points fall above it and a
half below it. In other words if we have 100 results and arrange then in ascending order,
then the value of the fiftieth result is the median.
The mode: The mode is the most frequently occurring result or the class interval
containing the most results.
Measures of dispersion
The standard deviation (SD or s): This is a measure of the average difference of all the
values from the mean. If the mean result (m) is subtracted from an individual result (x)
then the result is the difference or deviation of that result from the mean i.e. (x – m). If
this is done for each data point (i.e. each individual result) and these deviations are added
together, then the result can be expressed mathematically as Σ (x – m). If this value is
divided by the number of results, n, then the result would be expected to be a measure of
the average difference of all the results from the mean. However, this is not the case, the
result comes out at zero. The reason for this is that the normal distribution is symmetrical
with an approximately equal number of results both below and above the mean. The
deviation of a result below the mean is negative, the deviation of a result above the mean
is positive. Therefore the positive deviations cancel the negative deviations so that their
sum is zero. To overcome this problem mathematicians square each deviation so as to
always give a positive result. A positive deviation multiplied by a positive deviation
gives a positive result, as does a negative deviation multiplied by a negative deviation.
If these are then added together then the result is the sum of squares of the deviations,
denoted by the expression:
196
THE BASIS OF STATISTICS
Σ(x – m)2. Mathematicians use this trick in many areas of statistics. If the sum of
squares is divided by the number of data points, n, then the result is a measure of
dispersion known as the variance, which is denoted by the symbol s2. If the square root of
the variance is taken (to allow for taking squares of the deviations in the first place), then
the result is the standard deviation, denoted by the symbol s or SD, a value which is more
easily related to the dispersion of results in the distribution. This simple concept is
complicated by the fact that instead of dividing by n it is customary to divide by n-1. n-1
is known as the degrees of freedom. The reason for this that when the sum of deviations
(or their squares) is calculated, use of the value for the mean restricts the freedom of the
individual values. Suppose we had six results, 1, 2, 3, 4, 5 and 6 (the numbers on a dice).
Their sum is 21 and their mean (21/6) is 3.5. The deviations (x – m) for the first five
values are –2.5, -1.5, -0.5, +0.5, +1.5 and their sum, Σ (x – m), is –2.5. Since the
deviations must add up to zero, it follows that the deviation for the sixth value must be
+2.5. Therefore the sixth value must be the sum of the mean and its deviation i.e.
3.5 + 2.5 = 6. In other words the final value in the series is fixed, cannot vary and so
does not add any useful information Therefore for practical purposes there are only 5
results contributing to the sum of squares. Another way of looking at this is if a dice is
lying with the six side face down then we do not need to turn the dice over to know what
the value on the hidden face is! In practice the larger the number of data points the less
important the difference between the number of values (n) and their degrees of freedom
(n-1); above n=30 this difference is usually ignored. The expressions for variance and
standard deviation are:
The units of standard deviation are the same as the units of the data used in its
calculation. In an attempt to standardise the expression of s, clinical biochemists often
use the term coefficient of variation, denoted cv. This is the standard deviation divided
by the mean; the result is usually expressed as a percentage:
197
CHAPTER 10
The hope was that by expressing imprecision in this way, the same numerical result
would be obtained over the entire concentration range of the assay. However, this is
rarely the case.
The range: This is simply the difference between the highest and the lowest value in the
set of data. This is the least reliable measure of dispersion.
The interquartile range: The data are arranged in ascending order and grouped into
four equal sets (known as quartiles). The middle two sets (comprising the middle fifty
per cent of the data points) form the interquartile range.
Often the histogram of a set of data is not a typical bell-shaped Guassian distribution.
Fig 10.2 shows two ways in which the curve may deviate from normality. In Fig 10.2a,
the two curves are not symmetrical but skewed. Statistical packages often calculate a
parameter called the skew. A skew of 0 indicates no skew, a positive value indicates
skew to the right (curve A) and a negative value skew to the left (curve B). A skewed
distribution can often be converted to a reasonably “normal” distribution by taking
logarithms of the data e.g. the distribution of serum bilirubin concentrations in normal
adults is normally skewed to the right whereas the distribution of the logarithm of
concentration becomes relatively normal.
In Fig 10.2b the curves differ in how “peaked” or “flat” they are; this is known as
kurtosis. Again statistical packages often calculate a value for kurtosis. A truly Guassian
curve has a kurtosis of 3 (some computer programs convert this value to zero).
A discussion of these tests is beyond the scope of this book but may be found in standard
statistical texts.
198
THE BASIS OF STATISTICS
a) Skewness b) Kurtosis
A B
B
A
C
Question Q 10(1)
A laboratory had just changed its method for serum creatinine. To check that there had
been no change in their reference range they analysed serum samples collected from 12
members of staff and obtained the following results (arranged in ascending order): 44,
58, 60, 68, 70, 75, 76, 78, 80, 90, 95, 106 μmol/L. Calculate the mean, variance,
standard deviation and coefficient of variation.
199
CHAPTER 10
Answer Q 10(1)
Construct a table with the individual creatinine concentrations (x) in the first column:
x (x – m) (x – m)2 x2
Add these together to give their sum (Σx). The number of results (n) is 12.
From Σx and n, the mean (m) can be calculated:
Next subtract the mean (m) from each individual value of x so as to give a column for
individual deviations (x – m). Note that the sum of all these, Σ(x –m), is zero and cannot
be used in the calculation of variance. Instead calculate the square of these deviations
i.e. (x – m)2 and enter in the third column. These are then added together to give the
sum of squares of the individual deviations, Σ(x – m)2, referred to by mathematicians as
simply the sum of squares. This value can then be used to calculate the variance:
200
THE BASIS OF STATISTICS
Before the advent of computers or sophisticated pocket calculators it was often easier to
calculate the sum of the squares of individual values (Σx2), then calculate the sum of
squares of the deviations using the identity:
This a question we often try to answer in clinical biochemistry. The creatinine data
quoted in Fig 10.1 and Question Q 10(1) were obtained from normal individuals. We
may want to know if the creatinine result obtained from a patient is “abnormal” i.e. is it
“different” from the reference population? There is no perfect way to answer this
question. Statisticians try and deal with this problem by reformulating the question as
“suppose this result does belong to this normal population what is the likelihood that this
result could have been obtained by pure chance?” If it is improbable that it arose by
chance then it is probably “significantly “ different (in this case abnormal). This begs
the question as to how unlikely the event has to be for the result to be considered different
or belonging to a different population? By convention a probability of less than 1 in 20
(i.e. 0.05 or 5%) is taken as a cut-off indicating that the likelihood of the result not being
abnormal as so low as to be negligible. This value was suggested by the eminent
mathematician Fischer and is not based on any theoretical consideration. Statistics can
only answer the question “how likely it is that an event has occurred by chance”, whether
the difference matters is a subjective one!
201
CHAPTER 10
But how is this probability obtained from the individual result and the data in the
reference population? The reference population when plotted (see Fig 10.1) shows a bell-
shaped curve and its characteristics are partially determined by the peak value (the mean,
m) and the width (standard deviation, s). Mathematicians have shown that the curve can
be described mathematically by the exponential equation:
– (x – m)2
2 s2
y = 1 x e ……….……………… Eq. 10.5
s√2 π
In other words the value of y is a complicated function of both m and s (some forms of
this equation use μ and σ instead of m and s but the subtle difference need not concern us
here). Rest assured that you will never need to manipulate this equation. However, what
we need to know is not the value of y but the probability of obtaining any particular value
of x. This probability is given by the area under the curve if a perpendicular line is drawn
at point x. Since all results for all the population must fall somewhere within the curve
(probability = 1) the total area must equal 1. This area can be calculated by a complex
mathematical function obtained by integrating equation Eq. 10.5. Thus from the values
of x, m and s it is possible to calculate the probability of obtaining a value x; if it is less
than 0.05 then the result is significantly different. It is obviously inconvenient to have to
perform such a complex calculation every time. To get around this difficulty
mathematicians always reduce their data to a “normalized population” in which the mean
is always zero and the standard deviation one. As a result the calculation need only be
done once and is used to generate a statistical table in which the probability of obtaining
any value of x can be easily obtained. The mean is subtracted from the value then
divided by the standard deviation to give a standard score, z:
z (sometimes called the normal deviate, d, or the standard deviation index, SDI) is
therefore the number of standard deviations the value of x is away from the mean. z is
always normally distributed with a mean of zero and standard deviation of one. Such a
normalized Guassian distribution (in which z is the horizontal axis) is shown in Fig 10.3.
The curve has a mean of zero and a standard deviation of one. Since all results of the
population must fall somewhere in the area between the curve and the z axis, the total
area under the curve is the probability (P) of obtaining any value and is one.
202
THE BASIS OF STATISTICS
67%
95%
-2s -s m s 2s
z -2 -1 0 1 2
P 0.025 0.34 0.50 0.84 0.975
Figure 10.3 Normalised Guassian distribution in which the mean is zero and
standard deviation 1. Values of P show the probability of obtaining a
value to the left of the z value i.e. area under the curve to the left of a
perpendicular line drawn at the value for z
The probability of obtaining a result between any two given values is the area under the
curve between the two values. For example, the probability of obtaining a value between
the mean minus one s and the mean plus one s is two-thirds or 0.67 (equal to 67%). The
probability of obtaining a value between the mean minus 2s and the mean plus 2s is 19/20
or 0.95 (equal to 95%). Strictly speaking 1.96s should be used rather than 2s. It follows
that the probability of obtaining a result outside the mean ± 2s (or more correctly mean
± 1.96s) is 1 – 0.95 = 0.05 (or 5%). This is exactly the level of probability which
statisticians regard as significant when deciding whether a particular value belongs to a
given population.
203
CHAPTER 10
Since 5% of values fall within the mean ±1.96s and the normal curve is symmetrical it
follows that 2.5% or results will be below the mean – 1.96s and a further 2.5% will be
above the mean + 1.96s. This is quite an important point since in some situations we
only wish to know if a result is significantly greater than the mean + 1.96s or is
significantly less than the mean – 1.96s. In this case we use the value of z which
excludes 10% of results (P=0.1) since only a half of these (5%) will be greater (or less
than) the range encompassing 90% of the values. The value of z which gives rise to
range which excludes the lowest and highest 5% of results is 1.645. A table of z values
with their corresponding probabilities is given in Fig 10.4.
Figure 10.4 Percentage points of the normal distribution. This table gives the
percentage points most frequently required for significance tests for a
normal variable having zero mean and unit standard deviation. Thus,
the probability of obtaining a departure from the mean of more than
1.96 standard deviations in either direction is 0.05 or 5%
The range of values obtained at any probability level is known as the confidence limits.
The 95% confidence limits of a set of results can be calculated from the corresponding z,
the mean (m) and standard deviation (s) as follows:
Upper limit = m + (z x s)
Lower limit = m - (z x s)
It is common practice in clinical biochemistry to use the 95% confidence limits obtained
for the concentration of an analyte from normal subjects as the reference range. Any
value obtained for a patient outside this range is usually regarded as abnormal. The truth
204
THE BASIS OF STATISTICS
is that the probability of obtaining a result outside of this range is 0.05 or 1 in 20 – we are
simply making a subjective judgment that it is abnormal. If we were to measure this
analyte in twenty healthy individuals then we would expect one of them to have a result
outside of the reference range. Similarly, if we were to measure twenty different analytes
in the same patient then again one of the results would most likely fall outside of the
reference range.
Calculation from the creatinine data for 60 normal individuals depicted in Fig 10.1 gives
a mean of 76 μmol/L and standard deviation of 17 μmol/L. From these figures it is
possible to calculate the 95 % confidence limits (when z = 1.96) as follows:
The chance of obtaining a result outside of these limits is 100 - 95 = 5% (1 in 20) and
if it occurs is probably abnormal. The chance of obtaining a result less than 43 μmol/L is
2.5% (1 in 40), and of obtaining a result greater than 109 μmol/L is also 2.5% (1 in 40). If
we wish to know the chance of obtaining a result greater than 115 μmol/L then the first
thing to do it to calculate the z score:
z = x - m = 115 - 60 = 55 = 3.24
s 17 17
Inspection of the table in Fig 10.4 shows that there is no value of z corresponding exactly
to z = 3.24, but that when z = 3.09 the probability (P value) is 0.002. Therefore the
chance of obtaining a value outside the mean ± 3.24s is slightly less than 0.002.
The chance of obtaining a result greater then mean + 3.24s is one half of this i.e. 0.001
(or 1 in 1000).
Question Q 10(2)
The imprecision of a certain assay for Troponin I yields a coefficient of variation of 13%
between 0.3 and 0.5 µg/L, around the decision point for myocardial infarction of
0.4 µg/L. A result of 0.46 µg/L is obtained on a sample. Assuming that is the true level
of Troponin I, give an estimate of the probability that analysis of that same sample would
give a result below the decision point.
205
CHAPTER 10
Answer Q10(2)
The true result (0.46 μg/L) can be considered as the mean with a coefficient of variation
(cv) of 13%. The first step it to calculate the standard deviation (s). cv, mean (m) and s
are related as shown in Eq 10.4:
cv (%) = s x 100
m
s = cv (%) x m
100
Therefore the analyses of the sample are distributed with a mean of 0.46 µg/L and s
of 0.06 µg/L. We want find out what proportion of results will be below the decision
point of 0.4 µg/L. To do this we need to 'normalize' the data so that the mean is zero and
the SD =1. i.e. calculate the standard deviate -'z':
Therefore the decision point is –1s from the mean. From the table in Fig 10.4 we can see
that the probability of obtaining a result which differs from the mean by more than 1s in
either direction (i.e. when z = 1 or –1) is 0.33.
Therefore the probability of obtaining a result below the decision point is one half of 0.33
i.e. 0.17 (2 sig figs).
206
THE BASIS OF STATISTICS
Another application of the normal distribution is the analysis of quality control data. It is
common practice to include quality control samples into an analytical run to check that
the method is performing to specification. The characteristics of the quality control (QC)
material are first determined by replicate analyses of the material (usually twenty) then
calculating the mean and standard deviation. The same material is then analyzed in each
batch of samples (sometimes more than once). If the method is performing to
specification then the results for the QC material should fall within the 95% confidence
limits (mean ± 2s) most of the time. In fact the distribution of results should belong to
the same normal distribution as when the characteristics were originally determined i.e.
the results should cluster around the mean.
A useful way of plotting the data is to turn the histogram on it’s side and construct a y
axis with horizontal lines representing the mean, 1s, -1s, 2s etc. The results are plotted
along the x axis and should fall on either side of the mean with equal frequency, most
results will fall within the mean ±s, fewer results between the s and 2s limits, very few
between 2s and 3s with only very occasional results outside the 3s limits. This is known
as a Levy-Jennings Chart and an example is shown in Fig 10.4.
The only limitation of using 95% confidence limits (i.e. mean ± 2s) as the only criterion
is that by definition 1 in 20 analytical runs will be rejected. For a multi-channel analyzer
measuring 20 analytes this means that on average one channel will be rejected every run.
In other words this criterion is too sensitive. Westgard has devised a set of criteria, the
Westgard Rules, to improve the power of QC data to detect “real” errors without an
unacceptably high lever of “false rejections”. These rules are based upon the fact that the
probability of obtaining two consecutive results outside the 95% confidence limits is the
product of the individual probabilities that one result is outside these limits and is
considerably lower and so increases the likelihood that the method is out of control. This
idea is extended to four results being between the s and 2s limits and ten results being one
side of the mean etc.
Question Q 10(3)
207
CHAPTER 10
3s
2s x x
s x x x x x
m x x x x x x xx x x
x x x x x
-s
x
-2s
-3s x
Answer Q 10(3)
a) The probability of obtaining a result outside the mean ± 3s range is the P value
corresponding to a z score of 3. An exact value for z = 3 is not given in Fig 10.4
but the nearest value (z = 3.09) can be used as an approximation and corresponds
to a P value of 0.002. The chance of obtaining a result outside the mean ± 3s
range is therefore 1 in 1/0.002 which is 1 in 500. This value will occur so
infrequently that the method is almost certainly out of control, and is known as the
Westgard 13s rule.
a) The probability of obtaining a result outside the mean ± 2s range can again be
obtained by looking up the P value corresponding to z = 2 in Fig 10.4. Again the
exact value for z = 2 is not given but the nearest (z = 1.96) is a good
approximation and corresponds to a P value of 0.05. There is a slight
complication here in that we really wish to know the P value when z is between
±2 and ±3, not simply when it is greater than ±2. To allow for this all we need to
do is to subtract the P value for z = 3 (P3) from the P value for z = 2 (P2) to give
the probability of obtaining a value between 2 and 3 standard deviations from the
mean (P2-3):
208
THE BASIS OF STATISTICS
Furthermore, we need the probability that the result is between the mean +2s and
mean +3s, not simply between the mean ± 2s and mean ± 3s limits on either side
of the mean. Therefore the probability of obtaining a result between the mean
+ 2s and mean +3s limits is one half of 0.048 i.e. 0.024. The likelihood of
obtaining two results between the mean + 2s and mean + 3s limits is the product
of the probability of obtaining each result between these limits:
This is similar to flipping a coin twice. The chance of heads the first time is 0.5
the second time 0.5. Therefore the chance of obtaining heads on both occasions is
0.5 x 0.5 = 0.25. Another way of looking at this is that there are four possible
results, heads and tails, heads and heads, tails and heads and tails and tails: there
are 4 equally likely results but only one of these is heads on both tosses, so that
the probability is 1 in 4 or 0.25).
b) As shown above, the probability (P2s) of obtaining a result outside the mean ± 2s
limits is 0.05. From Fig 10.4 it can be seen that the chance (Ps) of obtaining a
result outside the mean ± s range is 0.33. Therefore the probability (P2s-s) of
obtaining a result between the ± s and ± 2s limits is given by:
The probability of obtaining a result between the mean –s and mean –2s is one
half of this i.e. 0.14. The likelihood of obtaining four results within this range is
obtained by multiplying this probability by itself four times:
209
CHAPTER 10
One of the reasons why statisticians prefer to work with squares of deviations rather than
directly with deviations is that the former are additive. This is particularly true of the
variance i.e. the standard deviation squared (s2) which is calculated directly from the sum
of squares of the deviations about the mean (Σ(x – m)2). If there are two independent
sources of variation (with variances s12 and s22) contributing to a measurement, then the
total variation (with variance stotal2) is described by:
It is important to note that it is the variances only (s2) which are additive, not the standard
deviations (s). If we wish to calculate the combined standard deviation from the
individual standard deviations then these are first squared, added together then the square
root taken of the product:
Similarly coefficients of variation (cvs) are not additive unless they are first squared:
210
THE BASIS OF STATISTICS
Clearly the analytical imprecision is contributing to the overall variation in the patient
results so that the true population variation, that is the biological variation (sbiological2) is
much lower. Since the component variances are additive, and assuming that the
analytical variance is the same at all analyte concentrations encountered in the
population, we can write:
The biological variation can be further subdivided into its components e.g.
intra-individual and inter-individual variations and equation Eq. 10.11 re-written:
Another application of this principle is the analysis of the impression of the steps in an
analytical process on the total performance. For example, if a method involved pipetting
7 mL of reagent, then this could be achieved by either pipetting 5 mL and 2 mL from
separate bulb pipettes or by pipetting 7 mL from a graduated pipette. The combined error
from using separate bulb pipettes could be calculated from stotal = √ (s5mL2 + s2mL2) and
compared with the variance obtained from using the 7 mL graduated pipette.
Question Q 10(4)
211
CHAPTER 10
Answer Q10(4)
The first step is to calculate the overall standard deviation (sTotal) and mean (m) from the
reference range obtained with the original method (with analytical cv of 5%).
The 95% reference limits incorporates the mean ± 2s, i.e. spans 4s units
Rearranging, s = cv (%) x m
100
The measured variation will reflect both the biological and analytical variations. Since it
is variances and not standard deviations which are additive, then the square of the total
standard deviation (sTotal) is equal to the sum of the squares of both the biological
(sBiological) and analytical (sAnalytical) standard deviations.:
132 = (sBiological) 2 + 52
212
THE BASIS OF STATISTICS
(sBiological) 2 = 132 - 52
= 169 - 25 = 144
and the new sTotal can be calculated (assuming biological variation remains unchanged):
= 100 - 50 = 50 mmol/L
213
CHAPTER 10
When monitoring patients it is helpful to know by how much the concentration of any
given analyte has to change before the change is clinically significant. To answer this
question allowance has to made for the effects of both analytical and within-individual
variation. One way to address this problem would be to analyse each sample a number of
times and compare their means using a suitable statistical test. In day-to-day practice we
do not have this luxury, only single measurements. However, all laboratories should
have some idea of the total variability for each of their analytes (which includes both
analytical and within-individual variation).
Suppose the result for an analysis is x1 on the first occasion and x2 on the second
occasion, then we could calculate the difference (x1 – x2). We could treat the value for (x1
– x2) as a variable which forms a normal Guassian distribution. In other words, if the
analysis of the two samples was repeated a large number of times then a histogram could
be constructed with frequency plotted against (x1 – x2). If the two results (x1 and x2 are
not significantly different) then their difference (x1 – x2) should be within the 95%
confidence limits of a distribution with mean of zero and their combined standard
deviation (s1,2). In other words the difference in results (x1 – x2) can be normalized to give
a value for z if it is divided by s12:
z = (x1 – x2) - 0
s1,2
The distribution would have a mean of zero with a standard deviation of 1. If there was
no significant difference between the results obtained on the two occasions then the peak
of the histogram (i.e. the mean value for (x1 – x2) would be zero). For a value to be
significantly different from the mean (in this case zero) at a probability level of 5% (i.e. P
= 0.05) the value for z would need to be 1.96. Therefore for the difference (x1 – x2) to be
significantly different from zero we substitute z = 1.96 into the above expression:
The value for s is actually the combined standard deviations for the two measurements (s1
and s2). As seen in the previous section, the combined value for s when two results are
combined (i.e. added or subtracted) is the square root of the sum of their squares:
214
THE BASIS OF STATISTICS
However, if the value for s is the same at both concentrations then this expression
simplifies to:
Therefore for a change in a result to be significant at the 5% level of probability the two
results must differ by at least 2.8s. It is important that the value for s is the same at both
concentrations.
Question Q 10(5)
While trying to follow the National Service Framework guidelines for coronary heart
disease a doctor prescribed a statin to lower the cholesterol of a patient with coronary
heart disease. The patient's original cholesterol level was 5.8 mmol/L and at the next
visit the doctor was delighted to find that it was just below the target level of 5.0 mmol/L
at 4.9 mmol/L and discharged the patient. The patient, a statistician, was less sure the
treatment had been responsible. Given that the physiological coefficient of variation for
cholesterol is 6% and the analytical coefficient of variation is 3%, calculate the least
significant change (at p<0.05) in cholesterol as a percentage at his original level, and
determine whether the second measurement was significantly different from the first.
215
CHAPTER 10
Answer Q 10(5)
The total CV is the square root of the sum of the squares of the physiological and
analytical CVs:
= √ (32 + 62 ) = √ (9 + 36 ) = √ 45 = 6.7%
Substitute CV = 6.7% and the original level (5.8 mmol/L) as the mean:
For two results to be significantly different (at p <0.05) they have to be at least
2.8 standard deviations apart (2.8s).
Next calculate the difference between the first and second measurement as a percentage
of the first measurement:
which is less than 18.8% so that the change in cholesterol is not statistically significant.
216
THE BASIS OF STATISTICS
ADDITIONAL QUESTIONS
Total protein (g/L): 70, 68, 71, 65, 68, 70, 73, 69, 75, 74, 69, 71
3. Calculate the least significant difference for a change in cholesterol if the intra-
individual coefficient of variation for cholesterol is 4.7% and the analytical
coefficient of variation, 2.4%. A patient was changed from Atorvastatin 80 mg to
Rosuvastatin 40 mg and the total cholesterol fell from 6.9 to 5.9 mmol/L.
Calculate the percentage change in cholesterol and state whether this is
significant.
4. Your on-call laboratory service uses 30 different methods, each of which has a
1% probability of failing QC criteria during the course of a night. Assuming that
QC of any method is independent of that of the other methods, what is the
probability that on any one night all methods will pass the QC criteria?
217
CHAPTER 10
5. You attempt to derive a reference range for TSH for an ethnic minority
population. The first 10 samples give the following results:
Result n
Between 0.5 and 1.49 5
Between 1.5 and 2.49 3
Between 2.5 and 3.49 0
Between 3.5 and 4.49 1
Between 4.5 and 5.49 1
On the basis of these results, what range of TSH values would encompass 95% of
the ethnic minority population?
6. You are required to pipette a 9ml volume and have available a 10 ml graduated
pipette which has a 2% CV associated with it’s delivery volume and 5 and 2 ml
volumetric pipettes each of which has a 1%CV associated with their delivery
volumes. What is the error of pipetting a 9 mL volume, expressed as plus/minus
mL volume?
7. It has been suggested that a proposed analytical goal for an analyte is that the
between batch analytical coefficient of variation should not exceed one half of
the “true biological” inter-individual coefficient of variation. Calculate the
percentage “expansion” of the measured reference range over the true biological
reference range when this analytical goal is exactly met.
218
ANALYSIS OF MEANS AND VARIANCES
Chapter 11
Sometimes we do not want to ask the question “is a single result significantly different
from a given population” but “is the mean of a set of results significantly different from
the mean of another set”. The two problems are approached in a similar manner but
difficulties arise because the value of the mean is influenced by the number of results
used in its calculation.
Consider the data in Fig 10.1 for creatinine results obtained with sera from sixty normal
individuals (mean = 76 μmol/L; standard deviation = 17 μmol/L). Suppose we were to
take two of these results (n=2) at random and calculate their mean, then repeat this
process a large number of times. The results for the means could then be plotted in the
form of a histogram similar to the individual results in Fig 10.1. Such a histogram is
called the sampling distribution of the mean. The peak value (mean) would be the same
but there would be one important difference: the distribution of results would be much
narrower and the value of its standard deviation would be lower. The standard deviation
of the sampling distribution of the mean is called the standard error of the mean (SEm) to
distinguish it from the standard deviation of individual results. This process could be
repeated by taking three results (n=3) at random, repeating this process a large number of
times and plotting the sampling distribution of the mean in a similar manner. The mean
of this new distribution would be unchanged but the distribution of results would be less
and their standard deviation (i.e. their standard error) would be lower then when the
means of two results were calculated. This process could be repeated with increasing
sample size (n) and we would find that as n increases the standard error is reduced.
An example is shown in Fig 11.1.
219
CHAPTER 11
n = 10
n=5
n=2
n=1
Figure 11.1 Effect of sample size (n) on the sampling distribution of the mean
Mathematicians have calculated that the standard error (SEm) is in fact equal to the
standard deviation (s) divided by the square root of the number of results (n) used in its
calculation:
Provided the data follow a Guassian distribution the values for the mean (values of m) of
samples of size n will be distributed with the peak of the bell shaped curve having the
overall true mean (which we shall call μ ) and a standard deviation equal to the standard
error of the mean (SEm).
Just as with single measurements, the data can be “normalized” to produce an overall
mean of zero and a standard error of one by calculating the z value:
220
ANALYSIS OF MEANS AND VARIANCES
In order to test whether the mean (m) of a sample size n is significantly different from a
hypothetical mean μ of a Guassian distribution first calculate the z (or t) value:
then look up the corresponding probability (P value) in tables of z (see Fig 10.4). It is
important to note that this approach is only valid if the standard error calculated from the
sample (i.e. s/√n) is a reasonable estimate of the true standard error of the mean. This in
unlikely to be the case unless the value for n is relatively large (greater than 30). William
Gossett, who published under the pen-name “Student” noted that in small samples, the
sample s underestimates the population s. To get around this problem the “t-distribution”
was introduced, which is similar to a normal z-distribution in being symmetrical about a
mean of zero and is bell-shaped, but differs in that it is flatter (more dispersed) and its
dispersion varies according to the size of the sample. The larger the value of n, the more
closely a t-distribution resembles a z-distribution. Fortunately statisticians have
calculated tables of t for us (portion shown in Fig 11.2) and all that is required is to read
off the P-value for the corresponding value of both t and the degrees of freedom (equal to
n – 1).
Degrees Value of P
of
freedom 0.10 0.05 0.02 0.01 0.002 0.001
…. ……. …… …… ……. …….. …….
…. …… …… ……. ……. …….. ……..
6 1.943 2.447 3.143 3.707 5.208 5.959
7 1.895 2.365 2.998 3.499 4.785 5.408
8 1.860 2.306 2.896 3.355 4.501 5.041
9 1.833 2.262 2.821 3.250 4.297 4.781
10 1.812 2.228 2.764 3.169 4.144 4.587
….. …… ……. …… ……. ……. ……
30 1.697 2.042 2.457 2.750 3.385 3.646
221
CHAPTER 11
More often we wish to compare two groups of samples (samples 1 and 2), each of which
has its own mean (m1 and m2), standard deviation (s1 and s2) and sample size (n1 and n2).
Thus the mean of each group of samples has its own standard error (s1/√n1 and s2/√n2). In
order to calculate a z (or t) value we need to know the combined standard error (SEm1,2)
i.e. the standard error of the difference between estimates of the two means (m1-m2). This
combined standard error is calculated in the same was as we calculate the combined
standard deviation of biological and analytical variation. Like standard deviations,
standard errors are not additive but their squares are (and since we are dealing with
squares the signs are always positive):
= s1 2 + S22 = s1 2 + s2 2
(√n1)2 (√ n2)2 n1 n2
Therefore dividing the difference between the means by their combined standard error
gives the corresponding z (or t value):
s1 2 + s22
n1 n2
222
ANALYSIS OF MEANS AND VARIANCES
If the values for n1 and n2 are greater than 30, then the value for P is obtained from tables
of z. If however, the value of n1 and/or n2 is less than 30 then the value for P is obtained
from tables of t and the degrees of freedom (DF) calculated from the expression:
For the special case where s1 and s2 are equal, simplified versions of these formulae can
be used. However, it is first necessary to carry out a variance ratio test to see if this is
indeed the case so it is probably simplest to stick with only one formula that can be used
whatever the relative magnitudes of the individual variances.
Question Q 11(1)
In January a laboratory analysed a quality control sample for sodium 10 times and
obtained a mean result of 150 mmol/L with a standard deviation of 4 mmol/L.
In February the same sample was analysed 10 times and gave a mean of 154 mmol/L
with a standard deviation of 2 mmol/L. Has there been a significant change in
performance between January and February?
223
CHAPTER 11
Answer Q 11(1)
t = m1 - m2
s1 2 + s22
n1 n2
= -4 = -4 = -4 = -2.84
√ ( 1.6 + 0.4 ) √2 1.41
Since n1 and n2 are small (less than 30), the degrees of freedom (DF) is calculated using
Eq. 10.21:
DF = (s12/n1 + s22/n2)2
[(s12/n1)2/(n1 – 1)] + [(s22/n2)2/(n2 – 1)]
From tables of t it can be seen that for 13 degrees of freedom, at the 5% level of
probability t should be outside the limits -2.16 to +2.16 to be statistically significant. In
fact a t value of -2.84 is also significant at the 2% level of probability. Therefore we can
conclude that there has been a change in performance between January and February.
224
ANALYSIS OF MEANS AND VARIANCES
For each pair of results, one value is subtracted from the other to give the difference (d).
For n pairs of results there will therefore be n values for d. We can calculate the mean
(md) and standard deviation (sd) for these values of d. If there is no difference between
the two sets of results then the average value of d would be zero. To test whether there is
a statistically significant difference of md from zero we calculate t in which we are
comparing the value for md with zero assuming that values for md are normally
distributed with a standard error of sd/√n:
t = md ………………………………. Eq 11.7
sd /√n
Question Q 11(2)
It is suspected that the glucose results obtained with near patient testing (NPT) device on
the ward are positively biased. One of the investigations into the problem involves
analyzing a series of blood specimens on both the NPT device (A) and an analyzer in the
laboratory which measures whole blood glucose (B), with the following results:
Specimen No 1 2 3 4 5 6 7 8 9 10
Glucose (mmol/L) NPT (A) 4.5 6.8 3.2 5.8 8.9 9.5 4.8 7.3 5.1 7.8
Glucose (mmol/L) lab (B) 4.2 7.0 2.8 5.6 8.7 9.7 4.9 6.8 4.6 7.7
225
CHAPTER 11
Answer Q 11(2)
The variabilities of the results in groups A and B are due to differing glucose
concentrations in the specimens and to the analytical variation between the instruments.
Therefore a standard t-test comparing the means of both sets of results would be
inappropriate for comparing the analytical performance of method B with method A. As
the data are paired, i.e. the same samples were assayed by both instruments, a paired t-test
can be used.
A B d d2 d - md (d - md)2
If there is no bias then the differences between each pair of results (d) would be very
small and the average would be very close to zero. A paired t-test is used to compare the
mean difference (i.e. the mean of d) with a hypothetical value of zero taking into account
the standard error of the values of d. The mean and standard error of the difference
(between values of d) is calculated in the usual way:
226
ANALYSIS OF MEANS AND VARIANCES
= √ [( 0.93 - 0.289 ) / 9]
Next calculate t:
t = md
sd/√n
From tables of t, for 9 (i.e. n-1) degrees of freedom the probability of obtaining a t value
of 1.99 is greater than 0.05. Therefore, the mean difference (0.17) is NOT significantly
different to zero at the 5 per cent level of probability so the data does NOT demonstrate
any bias between the two methods.
227
CHAPTER 11
Therefore F = 42 = 16 = 4.0
22 4
The next step is to look up the probability of obtaining this value for F from tables of F.
Unlike tables of t, the columns refer to the degrees of freedom of s1 and the rows to the
degrees of freedom of s2. Therefore, there is a separate table for each level of probability
(typically 5 and 1%). The degrees of freedom are n –1 for each variance.
DF1 …. 7 8 9 10 12 15 ….
DF2
…. …. …. …. …. …. …. …. ….
7 …. 5.59 4.74 4.35 4.12 3.97 3.87 ….
8 …. 5.32 4.46 4.07 3.84 3.69 3.58 ….
9 …. 5.12 4.26 3.86 3.63 3.48 3.37 ….
10 …. 4.96 4.10 3.71 3.48 3.33 3.22 ….
11 …. 4.84 3.98 3.59 3.36 3.20 3.09 ….
12 …. 4.75 3.89 3.49 6.26 3.11 3.00 ….
…. …. …. …. …. …. …. …. ….
Figure 11.3 Portion of a table of variance ratio (F = s12/s22) values, for which
degrees of freedom are DF1 and DF2 for s1 and s2 respectively (where
s1 > s2), corresponding to P = 0.025 (2.5%)
228
ANALYSIS OF MEANS AND VARIANCES
An alternative to comparing means of two sets of data is to analyse the variability that
exists within the data. Consider the following two sets of data obtained by analyzing the
same QC sample five times in each of two analytical runs (A and B):
A B
6.5 6.0
6.2 5.8
6.8 5.4
5.8 5.6
6.3 5.9
Normally the data would be analysed by calculating the mean and variances of data sets
A and B separately then applying a t-test. ANOVA involves calculating the variance of
the combined data, but in three separate ways:
Between groups variance The variation between results within each group is eliminated
by substituting each group mean for the individual results. In this example group A
would consist of 5 results each of 6.34 and group B of 5 results each of 5.74 with an
overall mean of 6.04.
+ (5.74 - 6.04)2 ] / (n -1 )
Because we have substituted means for individual results, n is only 2 so that n-1 =1.
229
CHAPTER 11
Within groups variance This is the variance for the individual results calculated using
the appropriate group (A or B) mean, but combining both sets of data:
+ (5.9 - 5.74)2 ] / (n - 2)
Although there were 10 results initially, two means were used in the calculation, so that
there are n - 2 = 10 - 2 = 8 degrees of freedom. Evaluation of the above expression
gives a within groups variance of 0.10.
Total variance This is the combined variance of the two groups using the overall mean
value (in this case 6.04):
(5.9 - 6.04)2] / (n - 1)
There are 10 results initially, one mean was used in the calculation, so that there are n - 1
= 10 - 1 = 9 degrees of freedom. Evaluation of the above expression gives a total
variance of 0.19.
The null hypothesis which is used is that if the two sets of data are from the same
population then the between groups variance will not be significantly different from the
within groups (residual) variance. This can be tested by calculation of the variance ratio:
230
ANALYSIS OF MEANS AND VARIANCES
In this example, F is 0.90/0.10 = 9.0 with 1 and 8 degrees of freedom. From tables of F
at the 5% level of probability the F value would be 5.32. Since the value obtained is
higher than this then the two sets of data are significantly different at the 5% level of
probability (i.e. P <0.05). ANOVA for two sets of data is rarely used since it is much
easier to compare the means directly with a t-test. In fact the P value obtained by
ANOVA is exactly the same as that obtained with a t-test.
The value of ANOVA comes into its own when comparing more than two sets of data.
For example, if we had four sets of data we wished to compare, A, B, C and D then one
option would be to carry out t-test between each possible combination of data. The
various combinations are: A-B, A-C, A-D, B-C, B, D and C-D making six t-tests in all.
Clearly it would be simpler to first carry out an ANOVA to se if a difference exists
between any of the groups of data. There is another reason for using ANOVA in
preference to multiple t-tests. If we are looking for a difference which is significant at the
5% level and no significant difference really exists between two sets of data then a
significant value for t will be obtained on five occasions out of a hundred by chance alone
i.e. a false positive rate of 5% will be obtained. If numerous t-tests are carried out then
the incidence of false positives is even higher.
An underlying assumption when carrying out ANOVA is that the variances of the
individual groups are homogeneous i.e. they are not significantly different from each
other. This can be confirmed by first carrying out a variance ratio test on the variances of
the two groups which have the highest and lowest variance. If the variances are not
homogeneous then the problem can often be overcome by first transforming the data.
A difficulty which often arises is that the size of the groups are not equal i.e. some of the
data is missing. Computer packages, which are usually used to perform these
calculations nowadays, have the facility to correct for missing data.
If ANOVA does not reveal a significant difference between the groups then further
statistical analysis is not usually necessary. However, if a difference is demonstrated then
ANOVA cannot tell us which group(s) is/are significantly different from the rest. This
can only be done by carrying out appropriate t-tests – although simple inspection of the
data usually suggest which group(s) is/are likely to be different.
It is common practice to first calculate sums of squares rather then variances. This is
because sums of squares are additive e.g. the between groups and within groups sums of
squares should add up to the total – a useful check on the calculations (a similar check
can be applied to degrees of freedom). In fact it is only necessary to calculate two of the
sums of squares e.g. the total and within groups, then obtain the between groups sum of
squares by subtraction.
231
CHAPTER 11
1. Let u = number of groups of data and v = the number of data points in each group.
Arrange the data (values of x) into a table with u columns and v rows.
2. At the foot of each column enter values for ∑x, n, mean, ∑x2, (∑x)2/n and sum of
squares [ i.e. ∑x2 – (∑x)2/n ] for each group.
Group No
1 2 3 4 … u SUM
3. Calculate the totals, for each group, of ∑x, ∑x2 and (∑x2)/n and enter in a new
column to the right. Complete the following table using these values:
4. Divide the between groups mean square by the within groups mean square to
obtain F and look up its P value in tables of F with u - 1 and u(v - 1) degrees of
freedom.
232
ANALYSIS OF MEANS AND VARIANCES
The example discussed above in which the data was divided into several sets is known as
one-way analysis of variance. This simple concept can be extended to two-way,
three-way etc analysis of variance. For example if we were conducting a study to
compare various drug treatments with a group of patients receiving each treatment then
we would need to carry out a one-way ANOVA. If however, the patients were divided
into males and females then the creation of these extra groups would require a two-way
ANOVA. Details of these techniques can be found in statistics textbooks, but nowadays
computer packages are usually used to perform these calculations.
Question Q11(3)
Buffer
A B C D
Is there a significant difference in the measured activity between any of the four buffers?
233
CHAPTER 11
Answer Q11(3)
There are two methods available for calculation of the between groups, within groups and
total sums of squares:
First calculate the mean of each column and the overall mean:
The between groups sum of squares is calculated from the group means and overall
means:
The within groups sum of squares is calculated from the individual results and the group
means:
234
ANALYSIS OF MEANS AND VARIANCES
The total sum of squares is calculated from the individual results and the overall mean:
Total sum of squares = (175 - 174.7) + (160 - 174.7) + ……. (170 - 174.7)
Buffer
A B C D
235
CHAPTER 11
Whichever method was used to calculate the sum of squares, the procedure to calculate
the F value is the same:
From tables the probability of obtaining an F value of greater than 2.84 (for 3 and 36
degrees of freedom) is 0.05 (5 per cent). Therefore the data are not homogeneous i.e. at
least one of the groups of data are significantly different to the rest. ANOVA cannot tell
us which group(s) is/are different, t-tests must be performed on paired groups of data.
236
ANALYSIS OF MEANS AND VARIANCES
FURTHER QUESTIONS
1. The following analytical results were obtained on the same QC sample: 109, 91,
105, 112, 90, 115, 89, 113, 93, 94. Calculate the mean, standard deviation and
standard error of the mean.
2. Two laboratories measured sodium in the same plasma sample ten times. One
laboratory obtained a mean of 145 mmol/L with an SD of 3 mmol; the other
obtained a mean of 147 mmol/L with an SD of 2 mmol/L. Do the laboratories
differ in their bias or imprecision?
A B
6.8 7.2
4.2 4.5
5.0 4.8
5.6 5.9
8.5 8.7
2.9 2.8
4.8 4.9
7.6 8.1
6.5 6.4
5.0 5.2
237
CHAPTER 11
Lab
A B C D
238
CORRELATION AND REGRESSION
Chapter 12
During graphical analysis of laboratory data the question often arises as to whether or not
there is a valid relationship between variables and, if there is, where the line of best fit
should be drawn? The statistical techniques of correlation and regression seek to answer
these questions. Provided there is a linear relationship between two variables then the
best fit will be a straight line. Often the data is best described by a curve in which case
more advanced techniques are needed which are beyond the scope of this book.
Fortunately it is often possible to transform non-linear data (e.g. by taking logarithms or
reciprocals) to a reasonably straight line which can be fitted by linear techniques.
239
CHAPTER 12
30
a) Plot of: y = 2x + 5
25
20
y Slope = 2
15
10
x intercept
= -2.5
5 y intercept = 5
-4 -3 -2 -1 0 1 2 3 4 5 6 7 8 9 10 11
x
25 y intercept = 25
20
y
15 Slope = - 2
10
x intercept = 12.5
0
-2 -1 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15
Figure 12.1 Plots of two equations showing the significance of the slope and
intercept
240
CORRELATION AND REGRESSION
Where x and y are the independent and dependent variables respectively and a and b are
constants. b is the slope of the line i.e. the rate of change of y with x. b can be
determined by drawing a right-angled triangle with the line through the data as the
hypoteneuse and measuring the vertical and horizontal sides and dividing the former by
the latter. Alternatively the angle between the line through the data with the base of the
triangle can be measured and its tangent calculated. It is important to take into account
the scales for x and y. a is a constant and represents the intercept on the y axis if the line
through the data is extrapolated. If a has a value of zero then the line passes through the
origin i.e. where the x and y axes intercept.
Figure 12.1 shows a plot of two sets of data of y against x. In both cases the x values are
1, 2, 3,…..10. In (a) the values of y are calculated according to the equation y = 2x + 5.
Therefore the slope of the line is 2 i.e. when x increases by a value of 1, y increases by a
value of 2. The value of the y intercept is 5. This is the value of y when x = 0 in which
case the expression is y = (2 x 0) + 5 which simplifies to y = 5. When y = 0 however,
the expression becomes 0 = 2x + 5 which can be rearranged to 2x = -5 so that x = -5/2
= -2.5 which is the intercept on the x axis.
In (b) the values for y are calculated according to the equation y = -2x + 25. Therefore
the slope of the line is -2 i.e. as x increases by 1, y decreases by a value of 2. The value
of the y intercept is 25. This is the value of y when x = 0 in which case the expression is y
= (-2 x 0) + 25 which simplifies to y = 25. When y = 0 however, the expression becomes
0 = -2x + 25 which can be rearranged to 2x = 25 so that x = 25/2 = 12.5 which is the
intercept on the x axis.
241
CHAPTER 12
35 mx
30 (x - m x )
A B
25 (x - m x )(y - m y ) (x - m x )(y - m y )
(y - m y )
Negative Positive
20
y
15 my
C D
10
(x - m x )(y - m y ) (x - m x )(y - m y )
5
Positive Negative
0
0 5 10 15 20 25 30
x
Figure 12.2 Computation of (x – mx) and (y – my) for a single point (part of a set of
data). The intersecting axes representing the means (mx and my)
divide the graph area into 4 quadrants in which the product (x – mx)
(y – my) is either positive or negative
The idea behind the correlation coefficient is that it should measure the degree of
association between x and y. The measure used is the sum of the products of the
individual deviations of x from its mean and of y from its mean. Figure 12.2 shows the
mode of calculation of both deviations for a single point (which is only one of a series of
values for x and y). A vertical axis is drawn to represent the mean of the values of x (mx)
and a horizontal axis to represent the mean of all the values of y (my). The intersection of
these two axes always falls on the best straight-line fit to the data. The distance of the
observed value of x (10) from the mean of all values of x (20) i.e. (x – mx), is 10 – 20 =
-10. The distance of the observed value of y (30) from the mean of all values of y (15)
i.e. (y – my) is 30 – 15 = 15. Their product, (x – mx)(y – my) is therefore –10 x 15 = -150.
242
CORRELATION AND REGRESSION
Note that the intersecting axes of mx and my divide the area of the plot into four quadrants
– A, B, C and D. In quadrants A and D the product of the deviations is always negative,
whereas in quadrants B and C it is always positive. The product of each pair of
deviations may be thought of as a “turning moment” about the intersection point of the mx
and my axes. In fact r is often called Pearson’s product-moment coefficient. If the sum of
all the values of (x – mx)(y – my) is divided by the number of data points (or more
specifically the number of data points minus one) then the resulting parameter is known
as the covariance of x and y (cxy):
Division by the square root of the products of the individual variances of x and y:
corrects for the total variability in the data and yields an expression for Pearson’s
correlation coefficient (r):
r = cxy
√ sx2.sy2
243
CHAPTER 12
The way in which the products of the two deviations affect r is illustrated in Figs 12.3 to
12.6. These figures all use the same values for x (i.e. 5, 10, 15 and 20) and y (10, 20, 30
and 40) but differ in the way each value of y is matched with a value of x. Since all the
values for x and y are the same it follows that ∑(x – mx)2 and ∑(y – my)2, and therefore
√ [∑(x – mx)2∑(y – my)2] are also identical. However, the value of ∑(x – mx)(y- my) varies
according to the pairing of x and y. This is not surprising since the product (x – mx)
(y – my) will not only depend upon the value of y but the value of x with which it is
associated.
In Fig 12.3 the values of x and y are matched so that all the data points fall on a straight
line described by the relationship y = 2x. All data points fall into quadrants B and C so
that each value of (x – mx)(y – my) and hence r is positive. In fact the values for
∑(x – mx)(y – my) and √ [∑(x – mx)2∑(y – my)2] are identical (= 250) so that r = +1.
However, in Fig 12.4 the values for x and y are matched in such a way that the data points
all fall on a straight line described by y = 50 – 2x. All the points fall into quadrants A and
D so that each value of (x – mx)(y – my) and hence r is negative. The value for
∑(x – mx)(y – my) is –250 whereas the value for √[∑(x - mx)2∑(y – my)2] is +250 so that
their ratio (r) is -1.
In fig 12.5 the values of x and y are matched in such a way that there is no linear
relationship between x and y. All four data points fall into different quadrants giving two
negative and two positive results for (x – mx)(y – my) which cancel each other exactly.
Therefore the value for ∑(x – mx)(y – my), and hence r, is zero.
Figure 12.6 shows the situation more commonly encountered where x and y are matched
in such a way that the data can be described by a linear expression (in this case y = 2x)
but with none of the points falling on the fitted line. All of the points still fall into
quadrants B and C but the matching of the values of x and y still results in lower values
for (x – mx)(y – my) giving a total of 150. Hence r is 150/250 = 0.6.
244
CORRELATION AND REGRESSION
mx
45
40
35
30
25 my
y
20
15
10
0
0 5 10 15 20 25
x
mx = ∑x = 50 = 12.5 ; my = ∑y = 100 = 25
n 4 n 4
Figure 12.3 A set of four data points which fall exactly on a straight line described
by the relationship y = 2x yielding a correlation coefficient (r) of one,
showing the method for calculation of deviations and sums of squares
and products
245
CHAPTER 12
45 mx
40
35
30
25 my
y
20
15
10
0
0 5 10 15 20 25
x
mx = ∑x = 50 = 12.5; my = ∑y = 100 = 25
n 4 n 4
Figure 12.4 A set of four data points which fall exactly on a straight line described
by the relationship y = 50 - 2x yielding a correlation coefficient (r) of
minus one, showing the method for calculation of deviations and sums
of squares and products
246
CORRELATION AND REGRESSION
45
mx
40
35
30
25 my
y
20
15
10
0
0 5 10 15 20 25
x
mx = ∑x = 50 = 12.5; my = ∑y = 100 = 25
n 4 n 4
Figure 12.5 The same set of data used in Figs 12.3 and 12.4 but paired in such a
way as to yield no correlation (r = 0), showing the method for
calculation of deviations and sums of squares and products. Note that
each data point falls into a different quadrant
247
CHAPTER 12
45
mx
40
35
30
25 my
y
20
15
10
0
0 5 10 15 20 25
x
mx = ∑x = 50 = 12.5; my = ∑y = 100 = 25
n 4 n 4
Figure 12.6 The same set of four data points used in Fig 12.3 to but with the
y values interchanged to give a poor correlation (r = 0.6) yet still
fitting the function y = 2x, showing the method for calculation of
deviations and sums of squares and products. Note that all four data
points still fall into the positive quadrants
248
CORRELATION AND REGRESSION
The probability (P) of obtaining this value for t can be obtained from standard tables of t
where there are n – 2 degrees of freedom. Alternatively tables of r are available from
which the value of P can be read directly for any number of degrees of freedom.
A significant correlation only means that there is an association between x and y, it does
NOT necessarily mean that a change in x causes a change in y. In other words
correlation does not equal causation.
Another way of analysing comparison data is to use the analysis of variance approach
described in chapter 11. For each pair of values of x and y there is a value for y (which
we shall call yfit) which falls exactly on the line of best fit (of course if the x,y data point
happens to fall on the line of best fit then y = yfit). The sum of squares (SS) for the
regression is the sum of the differences of yfit from the horizontal axis described by my:
249
CHAPTER 12
This tells us how far the predicted values differ from the overall mean (analogous to the
between sum of squares used in chapter 11).
The residual sum of squares reflects the difference between the original data and the
fitted line:
These sum of squares can be used to calculated mean squares and hence a value for the
variance ratio (F) which can in turn be used to test the hypothesis that the line of best fit
is significantly different from the horizontal axis. Alternatively the sum of squares can be
used to calculate the coefficient of determination (R2) which is simply the proportion of
the total variance described by regression:
R2 = SSregression
SSregression + SSresidual
Linear regression
250
CORRELATION AND REGRESSION
Since x is the independent variable it is assumed to be without error. Most values for y
will not fall on the line of best fit due to inherent imprecision of y. However, the linear
relationship between x and y predicts the expected value of y (which we shall call yfit) for
any given value of x:
yfit = bx + a
The deviation of the observed value for y from the regression line is (y – yfit), which is
also known as the residual (e), and can be either positive or negative depending upon
whether the observed value falls above or below the line (see Fig 12.7):
e = y – yfit = y - bx - a
y – yfit = 0
y = yfit x
y x Regression line
y - yfit
yfit = b.x + a
yfit = b.x + a
y - yfit
y x
x x x
Figure 12.7 Regression line determined for three values of the dependent variable
(x,x and x) and the corresponding values for the independent variable
(y,y and y). The regression line is drawn (from calculated values for
slope b and intercept a) such that the sum of the squares of the
residuals ( Σ(y – yfit)2 = (y – yfit)2 + (y – yfit)2 ) + (y – yfit)2 is a minimum
251
CHAPTER 12
As in other situations these positive and negative residuals cancel but the sum of their
squares always gives a positive value which is a measure of the overall residuals of the
observations from the line. The problem is to derive values for a and b such that
∑(y – yfit)2 is a minimum (hence this calculation is also known as the method of least
squares). Mathematicians deal with this problem by equating the two derivatives of
∑(y – yfit)2 with respect to both a and b, to zero, then solving the resulting simultaneous
equations for a and b. The solution for the slope of the line (b) is given by the
expression:
The slope of the regression line (b) is also known as the regression coefficient. A similar
expression can be derived to determine a. It is simpler, however, to use the fact that the
line must pass through the intersection point of the means of x and y, so that equation
12.1 becomes my = b.mx + a. Simple rearrangement, with substitution of the value
for b enables determination of the value of a:
The regression process assumes that the distribution of residuals (y – yfit) about the
regression line is Guassian i.e. that there are approximately equal numbers of
observations each side of the line, that the residuals are independent of the value of x and
that most observations are close to the line with relatively few a large distance form it.
The standard deviation of the residuals (sres, sometimes known as syx) is a valuable
indicator (the lower the better) of the goodness of fit of the data to a straight line:
252
CORRELATION AND REGRESSION
Having obtained a value for the slope (b) the question often arises as to whether or not it
is significant i.e. whether it is really different from zero (in which there would be no
relationship between x and y). This question can be answered by dividing the difference
between b and 0 (actually b) by the standard error of the estimate of b so as to obtain a
corresponding value for t:
where t has n-2 degrees of freedom. If n is greater than 30 then the probability can be
obtained from tables of z. Ninety-five percent confidence limits for the slope are given
by b ± 1.96 t.
Question Q 12(1)
The following results for total calcium and albumin were obtained for a series of serum
samples:
1 23 1.95
2 26 2.20
3 30 2.10
4 33 2.25
5 36 2.22
6 40 2.35
7 44 2.32
8 48 2.40
9 52 2.52
Is there a significant linear relationship between serum total calcium and albumin?
Derive an expression to “correct” serum calcium to a “normal” albumin concentration of
40 g/L.
253
CHAPTER 12
Answer Q 12(1)
2.6
2.5
2.4
Calcium (mmol/L)
2.3
2.2
2.1
1.9
1.8
1.7
20 25 30 35 40 45 50 55
Album in (g/L)
x x2 y y2 xy
254
CORRELATION AND REGRESSION
r = Σxy – (ΣxΣy/n)
√ {[Σx2 – (Σx)2/n] [Σy2 – (Σy)2/n]}
= 761.54 - 749.21
√ { [13034 – 12247] [46.059 – 45.833]}
= 12.33 = 12.33
√ {787 x 0.226} √177.86
= 12.33 = 0.925
13.336
From tables of r, the probability of obtaining a value of 0.925 for n-2 degrees of freedom
(7) is approximately 0.001 (i.e. 0.1%). Therefore the correlation is highly significant.
(Alternatively t can be calculated using Eq 12.5 and its P value obtained from tables of
t for 7 degrees of freedom.) Furthermore:
R2 = 0.9252 = 0.856
The slope of the regression line of y upon x can be obtained using Eq. 12.7:
b = Σxy - (ΣxΣy/n)
Σx2 - (Σx)2/n
255
CHAPTER 12
From tables when t = 6.5 for 7 degrees of freedom, P<0.001. Therefore the regression
coefficient is significantly different from zero.
The value for the intercept (a) can be obtained by substituting for b, mx and my in
Eq. 12.8:
y = 0.0157 x + 1.68
256
CORRELATION AND REGRESSION
To draw the regression line calculate the value of y for two carefully chosen values of x
(say 20 and 55 g/L) using the regression equation, plot the points on a graph of y versus x
and join them up:
2.6
2.5
Measured Ca (mmol/L)
2.4
2.3
2.2
Regression line
2.1
of y upon x
2
1.9
1.8
1.7
20 25 30 35 40 45 50 55
Albumin (g/L)
The regression equation shows that for each increase in albumin by 1 g/L, the measured
calcium also increases by 0.0157 mmol/L. To “correct” a measured calcium
concentration to the value expected if the albumin was “normal” (40 g/L) the difference
between the measured albumin and 40 g/L is multiplied by 0.0157 then added to the
measured calcium:
If the measured albumin is greater than 40 g/L, then the expression 0.0157 (40 –
measured albumin) becomes negative so that it is subtracted from the measured calcium.
257
CHAPTER 12
Sometimes it is not clear which is the independent variable. In this case a regression of
both y upon x and of x upon y can be performed. Two different regression lines are
obtained which intercept at the intersection of the two means (mx and my).
Deming regression Ideally if neither x nor y can be identified as the independent variable
then the best solution is to carry out a regression using the method of Deming. This
involves calculating a regression line such that the sum of squares of residuals between
the data points and lines drawn perpendicular to the regression line is minimal (see Fig
12.8 (c)). The process is rather complicated but computer programmes are available to
perform the calculations.
Regression through the origin If the nature of the problem dictates that the regression
line must pass through the origin then there is a simple technique to make sure this is so.
Two points are needed to draw a straight line. The origin will be one and the other is at
the intersection of the two means (see Fig 12.8 (d)).
b = my = Σy/n = Σy
mx Σx/n Σx
Weighted regression The underlying assumption with linear regression is that the
standard deviation of y is constant throughout the range of values. This is often not true
and techniques exist for weighting each value to allow for variations in imprecision.
Multiple regression Sometimes we are dealing with more that one independent
variable. In this situation the partial regression coefficients between pairs of variables are
calculated. These techniques are beyond the scope of this book.
Non-linear regression Although non-linear data can often be transformed then analysed
by linear regression this is not always the case. Non-linear methods are available but
again are beyond the scope of this book.
258
CORRELATION AND REGRESSION
y y
00
0 x x
a) Regression of y upon x
c) “Deming” regression d) Regression through the origin
yMean x
y x
y
x xMean
0 0
x x
Figure 12.8 Modes of regression. In each case the sum of the squares of the
distance from the line ( ) is minimized in the direction
shown
259
CHAPTER 12
Method comparisons are usually carried out to determine if two methods yield the same
results with patient samples (o rare sufficiently similar for routine purposes). The
correlation coefficient is unhelpful since it only indicates if there is a relationship
between the two sets of results, not whether the results are comparable; furthermore it
would be surprising if the correlation was not significant between two methods designed
to measure the same analyte. Simple regression methods rely on the assumption that the
independent variable is determined without error – an assumption which is rarely true.
Regression methods do however, provide the relationship between the two methods.
Since two methods are measuring the same analyte, the goal is determine whether or not
the two sets of results are significantly different. One way to do this is by the paired t-test
(see Chapter 11). However, if the two sets of results are identical then the relationship
between them should be described by a straight line which passes through the origin and
has a slope of one (i.e. the data is best described by the expression y = x). Visual
assessment of the data can be made if the two sets of results (x and y) are plotted against
each other and the line y = x drawn (Fig 12.9). Altman and Bland took this approach one
step further and plotted the absolute difference d ( equal to y – x) against the mean of
each pair of data, (x + y)/2. If there is no significant difference between the two methods
then the data should scatter evenly about a horizontal line with a value of zero (Fig 12.9);
in other words the mean difference (md) should be close to zero.
Calculation of the standard deviation of the differences gives a measure of the scatter
about the line.
The 95% confidence limits of the mean difference are known as the “95% limits of
agreement”:
260
CORRELATION AND REGRESSION
25
a)
20
Method B
15
10
Line of equivalence
Method B = Method A
5
0
0 5 10 15 20 25
Method A
1.2
1 b)
d = Method B - Method A
0.8
0.6
0.4
0.2
0
-0.2 0 5 10 15 20 25
-0.4
-0.6
-0.8
-1 Mean = (Method B + Method A)/2
261
CHAPTER 12
A t-test can be applied to determine if the mean difference is significantly different from
zero:
These calculations are identical to those of the paired t-test (Chapter 11).
It is important to carefully inspect the data to ensure that the differences are normally
distributed about their mean throughout the range of results. For example, if there is a
significant slope in the y versus x plot of the data then the difference plot will show an
even more exaggerated change in d with concentration, which can be removed if the
percentage difference is plotted instead of the absolute difference.
ADDITIONAL QUESTIONS
1. Regression analysis of results using new standards (y) against old standards (x)
showed a linear relationship. The regression coefficient (slope) was 1.10 and the
intercept on the y axis 1.0 mmol/L. Calculate the results which would be expected
using new standards for the analysis of old standards containing (a) 15 mmol/L
and (b) 150 mmol/L.
2. A laboratory changed its method for the assay of serum alkaline phosphatase
activity. Assay of a selection of patient’s samples by both methods yielded the
following data:
ALP (Old method), IU/L: 50 350 700 100 1500 2000 420 1200
ALP (New method), IU/L: 40 190 350 90 750 1500 280 600
262
CORRELATION AND REGRESSION
Drug dosage (mg/kg body wt): 50 100 150 200 250 300 350 400
Prolactin (IU/L) 750 1500 350 400 2000 1250 500 1800
Do these data show a linear relationship between drug dosage and serum prolactin
concentration?
263
CHAPTER 12
264
CLINICAL UTILITY OF LABORATORY TESTS
Chapter 13
A laboratory test for a specific disease may yield either a positive or a negative result.
This may be achieved either by a qualitative test (in which the result is either positive or
negative) or a quantitative result which is either above (positive) or below (negative) a
selected “cut-off” value. In the ideal world all patients with the disease in question, and
only these patients, would give a positive result. Furthermore, all patients without the
disease, and only these patients, would produce negative results. In other words there
would be no false positives or false negatives. However, in reality a proportion of both
false positives (positive results in patients without disease) and false negatives (negative
results in patients with disease) are always obtained. The classification of positive and
negative results is shown in Fig 13.1. Correct interpretation of laboratory results demands
an appreciation of the likelihood of the result identifying the presence (or absence) of
disease.
Total TP + FP TN + FN TP + FN + TN +FP
265
CHAPTER 13
The sensitivity of a test is the proportion of patients with disease that are identified by the
test. The number with disease giving a positive result is the true positives (TP) and the
total number with disease is the sum of the true positives and false negatives (TP + FN):
The specificity of a test is the proportion of patents without disease that are identified by
the test. The number without disease giving a negative result is the true negatives (TN)
and the total number without disease is the sum of the true negatives and false positives
(TN + FP):
Sometimes these values are multiplied by 100 to give results for sensitivity and
specificity as percentages.
The efficiency of a test is the proportion of all results which are true results:
266
CLINICAL UTILITY OF LABORATORY TESTS
Determinations of sensitivity and specificity, like any other biological measurement, are
subject to error. Consider a test with a true sensitivity of 0.50 (50%) determined on a
group of 50 patients with a particular disease. If the sensitivity was repeatedly
determined on groups of 50 patients with the disease then the sensitivity would not come
out at exactly 0.5 every time but would cluster around this value in a similar way to the
mean determined for a continuous variable. Situations where there are only two possible
outcomes (positive or negative) belong to a special distribution called the binomial
distribution. Fortunately if the sample size (n) is large enough (>30) then the binomial
approximates to a normal distribution, and this can be extremely useful.
Question Q 13(1)
A test for a certain disease gave a 5% false positive rate and a 2% false negative rate.
What is the sensitivity and specificity of the test?
267
CHAPTER 13
Answer Q13(1)
If the false negative rate is 2% then this means that for every 100 patients with disease,
2 false negatives will be obtained, the remainder will give positive results.
If the false positive rate is 5% then this means that for every 100 patients without disease
5 false positive results will be obtained, the remainder will give negative results.
Predictive values
Sensitivity and specificity define the performance of a test when applied to populations of
individuals who either have (sensitivity) or do not have (specificity) the disease in
question. In practice tests are applied to populations made up of a mixture of subjects
with or without disease. The proportion of the two populations (i.e. the prevalence of
disease in the population being tested) can have a profound effect on the predictive value
of a test. The predictive value of a test is the probability that a subject with a positive
result has the disease (positive predictive value, denoted PV+) or the probability that a
subject with a negative result does not have the disease (negative predictive value
denoted PV-).
268
CLINICAL UTILITY OF LABORATORY TESTS
Consider the example in question Q 13.1 where there is a false positive rate of 5%. In a
population in which a half of all individuals have disease (i.e. the prevalence of disease is
0.5 or 50%) it is possible to calculate the number of each possible outcome of the test
given that the sensitivity is 0.98 and specificity 0.95:
If prevalence is 0.5, then proportion of patients with disease (TP + FN) = 0.5.
In other words:
Similarly if prevalence is 0.5, then proportion without disease (TN + FP) = 0.5
269
CHAPTER 13
In general:
Since TP = 0.49 and FP = 0.025 the proportion of patients with a positive result who have
disease (i.e. the positive predictive value) is
Consider a population consisting of 10 patients with and 90 patients without, disease (i.e.
prevalence = 0.1 = TP + FN).
Since we are dealing with proportions all groups must add up to one
270
CLINICAL UTILITY OF LABORATORY TESTS
The proportion of true positives has fallen from 0.49 to 0.098 and false positives risen
from 0.025 to 0.045. Therefore, the proportion of positive results which are due to
disease (i.e. the positive predictive value) is:
Therefore only 0.685 (or 68.5%) of positive results are due to the presence of disease.
Continuation of this process for other prevalences yields the data in Fig 13.2.
Prevalence TP FP PV+
Figure 13.2 Effect of disease prevalence upon the number of true positives (TP),
false positives (FP) and positive predictive value (PV+) for a test with
a sensitivity of 98% and specificity of 95% applied to a population of
10,000 subjects
Therefore when the prevalence is very low the number of false positives exceed the true
positives.
Question Q13(2)
In a cancer clinic where the prevalence of ovarian malignancy is 40%, a tumour marker
has a specificity of 88% and a sensitivity of 92%. Calculate the predictive value of a
positive test result.
271
CHAPTER 13
Question Q13(2)
To solve this type. of problem the calculations can be based on actual numbers of results,
percentages or proportions of one. It is often simplest to work with proportions.
The next task is to determine values for TP, FN, FP and TN. We are given values for
sensitivity and specificity so can write:
Since the prevalence is 0.4 and is equal to (TP + FN), and (1 – prevalence) is 0.6 and is
equal to (TN + FP), both of these expressions can be re-written:
These expressions are then rearranged and solved for TP and TN:
272
CLINICAL UTILITY OF LABORATORY TESTS
Since prevalence = (TP + FN) and (1 – prevalence) = (TN + FP), values for FN and
FP can be obtained by subtraction of TP and TN from the corresponding totals in the last
column.
The predictive value of a positive test (PV+) is then obtained by substitution of TP and
FP into equation Eq 13.7:
Question Q13(3)
A man has a PSA of 5 μg/L. 22% of patients with benign prostatic hypertrophy and 38%
of patients with prostatic cancer have concentrations of PSA between 4.1 and 10 μg/L.
What is the positive predictive value for a diagnosis of cancer of the result for this man in
this range, if the prevalence of cancer in his age group is 5% and benign prostatic
hypertrophy is 20%? Assume 2% of patients without any prostatic pathology have a PSA
>4.1 μg/L.
273
CHAPTER 13
Answer Q13(3)
This question differs from Q13(2) in that there are three instead of two groups of patients.
However, there are only two groups as far as the disease in question (prostatic cancer) is
concerned – those with cancer and those without. The only difference is that the group
without cancer is made up of two populations:
In order to calculate the predictive values first calculate the individual positive and
negative results for each of the three groups. This it is easier if everything is converted
to proportions of one rather than working with percentages.
The normal group (i.e. those with neither BPH or CAP) consists of the remaining patients
Normals FP TN 0.75
274
CLINICAL UTILITY OF LABORATORY TESTS
For the group without either disease we are told that 2% of patients have raised PSA
(i.e. false positives). Therefore FP = 2% = 2/100 = 0.02
275
CHAPTER 13
The positive predictive value (PV+) is obtained by substituting the proportion of true
positives and false positives into Eq 13.7, remembering that the false positives are made
up of two groups – those with BPH and those with neither BPH nor CAP:
PV+ = TPCAP
(TPCAP + TPBPH + FP)
The positive predictive value (the likelihood of disease when a test result is positive) is
markedly influenced by the prevalence of disease in the population being tested. The
prevalence of any disease is very low in the general population, becomes higher in a
population with symptoms of the disease in question and highest in patients referred by a
general practitioner to a hospital specialist.
This concept is of greatest significance when screening a well population for the presence
of a rare disease such as bowel cancer. False positives may be obtained due to
interference in the screening test used, dietary factors and bleeding from other sources.
As a result the predictive value of a positive screening test is low so that only a small
portion of patients with positive results will have bowel cancer. Detection of bowel
cancer then relies on a secondary test (such as colonoscopy). However, the screening test
is still valuable in that it enables selection of a sub-population of individuals with a higher
prevalence of disease which warrants the expensive and unpleasant secondary test.
On the other hand a false negative will be obtained if the tumour is not bleeding when the
screening test is carried out. When designing a screening program the incidence of false
positives and false negatives must be carefully balanced so as to achieve optimal
detection of disease with minimum cost and morbidity.
276
CLINICAL UTILITY OF LABORATORY TESTS
Although calculations of sensitivity, specificity etc are simple with a qualitative test
which yields either a positive or negative result, the majority of diagnostic tests are of a
quantitative nature yielding a result which is a continuous variable. The quantitative
result is converted into a qualitative one by using a decision level (or cut-off point).
Values above this level are classified as “positive” and below it as “negative”. This
provides an opportunity to manipulate the performance of a test by adjustment of the
decision level.
Figure 13.3a shows the result for an analyte which is able to distinguish patients with a
particular disease from healthy individuals. The distribution of results for each set of
patients is roughly Guassian, but the two distributions overlap. In this overlap area it is
impossible to use the test to distinguish patients with the disease from those who have
not. If a decision level at point “A” is used then there are no false negatives but as many
as a half of those without disease would yield false positive results (i.e. the sensitivity
would be virtually one but with a specificity of only 0.5). A decision level of “B”
produces fewer false positives but at the expense of missing a small number with disease.
Using “C” or “D” even fewer false positives are obtained but false negatives become
more frequent. At level “E” there are no false positives but at the expense of missing
about one half of patients with the disease.
A useful way of looking at the effect of changing decision levels is to plot the sensitivity
(y-axis) versus 1 – specificity (x-axis) at each decision level (see Fig 13.3b). The x-axis
therefore represents the false positive rate and the y-axis the true positive rate. For a
perfect test, the resulting receiver operator characteristic (ROC) curve would extend
from the lower left to the upper left then to the upper right. It is generally accepted that
for a test of no diagnostic value the curve would be a diagonal line from the origin with a
slope of one. When comparing the performance of different tests it is helpful to calculate
the area under the ROC curve. Other things being equal, the test with the highest area is
superior.
Although examination of the ROC curve is often helpful, usually other considerations
(the importance of avoiding false positives versus false negatives, the nature and cost of
any follow-up tests etc) determine which decision point to use.
277
CHAPTER 13
No
Analyte value
A B C D E
(b)
1
0.9 D E
C
0.8
0.7 B
sensitivity
0.6
0.5 A
0.4
0.3
0.2
0.1
0
0 0.2 0.4 0.6 0.8 1
1 - specificity
Figure 13.3 Effect of variation of decision level (A,B, C, D and E) upon the
performance of a diagnostic test. a) Distribution of results from
healthy and diseased individuals; b) Same data plotted as a
receiver operator characteristic (ROC) curve
278
CLINICAL UTILITY OF LABORATORY TESTS
What the clinician really needs to know is the probability that a patient with a given test
result has the disease in question. To answer this question two parameters are required:
• The prevalence of the disease in the population to which the patient belongs
The likelihood ratio positive (LR+) is defined as the ratio between the probability of
finding a positive test in the presence of disease and the probability of a positive test in
the absence of disease. The probability of a positive test in the presence of disease is
simply the sensitivity of the test. The probability of finding a negative test in the absence
of disease is the specificity so that the probability of a positive test when disease is absent
is simply (1 – specificity). Therefore LR+ can be calculated directly from the sensitivity
and specificity of the test:
A similar expression can be derived for likelihood ratio of a negative test (LR-):
When comparing several different tests it is often helful to calculate the diagnostic odds
ratio, which is simply the ratio of LR+ to LR-, which can be simplified to:
279
CHAPTER 13
The likelihood ratio positive is useful in converting the pre-test odds (the likelihood that
disease is present before the test is carried out) to the post-test odds. In the absence of
any data about the patient’s individual risk factors, the pre-test odds is the prevalence of
the particular disease in the population to which he/she belongs. For example if the
prevalence of disease is 0.2 (20%) then a group of 100 patients will contain 20 with
disease and 80 without disease. Therefore the odds of disease being present are given by
the ratio of the number of patients with disease to the number without i.e. 20/80 = 0.25.
In other words for each patient without disease there will 0.25 patients with disease or
rather for every 4 patients without disease there will be one with the disease or the odds
against disease being present are 4:1. In general:
Multiplication of the pre-test odds by the likelihood ratio positive gives the post-test
odds:
Post-test odds can be converted back to the probability of the patient having disease using
the expression:
280
CLINICAL UTILITY OF LABORATORY TESTS
Provided reliable data is available on the prevalence of the disease and the diagnostic
performance of the laboratory test then calculation of post-test probability is the best way
of presenting laboratory data to the clinician. However, one problem that is not addressed
is the effect of the magnitude of the increase in the analyte above the decision point. A
result which is several times the decision value often is more likely to be associated with
disease than a result which is only just above this value.
• Multiple tests. Provided the likelihood ratios of several different tests are known
then it sometimes possible to combine them to give a more reliable post-test
probability of disease than for either test on its own. An example of this is the
triple test to screen for Down’s syndrome.
Question Q 13(4)
281
CHAPTER 13
Answer Q 13(4)
If the prevalence is 10% then for a sample of 500 individuals 50 will have the disease and
450 will be healthy. Use this data to set up a 2x2 contingency table:
Diseased 45 FN 50
Healthy 40 TN 450
Subtract the positive test from the corresponding total to give to give the numbers of
negative results:
Diseased 45 5 50
Sensitivity = TP = 45 = 0.90
(TP + FN) 50
a) Calculate the likelihood ratio positive (LR+) from the sensitivity and specificity
(Eq13.13):
282
CLINICAL UTILITY OF LABORATORY TESTS
b) First calculate the pre-test odds from the disease prevalence (Eq 13.16):
Prevalence = 50 = 0.1
500
= 0.111 x 10 = 1.11
283
CHAPTER 13
ADDITIONAL QUESTIONS
1. A test for a particular disease has a sensitivity of 95% and a specificity of 95%.
Calculate the predictive value of both a positive and a negative test result in a
population in which the prevalence of the disease is:
a) 1 in 2
b) 1 in 5000
2. The table shows data from two urinary screening tests for the detection of
phaeochromocytoma.
a) a positive result?
b) a negative result?
284
CLINICAL UTILITY OF LABORATORY TESTS
4. A proposed diagnostic serological test for coeliac disease was evaluated in 200
consecutive patients referred to a paediatric gastroenterology service in whom the
condition was suspected clinically. The test result was compared with the
diagnosis as established by biopsy, withdrawal of gluten and response to
re-challenge. On this basis 76 children had the condition of whom only 64 gave a
positive test result: 10 positive test results occurred in children who were shown
not to have coeliac disease. Calculate the sensitivity and specificity of the test and
the predictive value of a positive result.
7. A two-stage sequential test strategy is used to screen for a rare inherited disease.
The prevalence of the disease is 0.0005. The initial test has a sensitivity of 98%
and specificity of 95%, the follow-up test a sensitivity of 95% and specificity of
99%. What is the probability of a patient with a positive result for the follow-up
test having the disease?
285
CHAPTER 13
286
STATISTICAL POWER
Chapter 14
Statistical power
Power is the ability of a statistical test to detect a specified effect with a given probability.
For example, comparison of the means of two sets of data involves calculation of the
number of standard errors by which they differ (i.e. the z or t value) then looking up the
P value of this statistic in appropriate tables. Fig 14.1 shows the sampling distribution of
the mean for two sets of data. In Fig 14.1a it is apparent that the two distributions
overlap considerably. However, increasing the sample size from 10 to 50 lowers the
standard deviation of these distributions (i.e. the standard error of the means) so that the
distributions are less widely spread but without any shift of the overall mean values. This
is exactly what would be predicted from Fig 11.1. As a consequence the overlap between
the distributions has been eliminated. Since the standard error of each mean is given by
s/√n it is not surprising that increasing the value of n reduces the standard error
(i.e. spread) of each distribution so reducing the degree of overlap.
The likelihood of demonstrating a significant difference between the means of two sets of
data increases as the degree of overlap is reduced. The degree of overlap and hence the
chance of detecting a difference (i.e. power) is in turn determined by four factors:
287
CHAPTER 14
a) n = 10 A B
b) n = 50
Figure 14.1 Sampling distributions of the means for two sets of data (A and B)
showing the effect of sample size (n) on the overlap between the two
distributions: a) small sample size (n = 10) resulting in large standard
errors of the means and overlap between both distributions, and b)
larger sample size (n = 50) resulting in smaller standard errors of the
means and virtually no overlap between the distributions. Note that
the overall means of A and B remain unchanged
288
STATISTICAL POWER
Figure 14.2 shows the frequency distribution of the same data but using the difference
between observed values for the means (i.e. mean B minus mean A) expressed as the
number of standard deviations from the overall mean (i.e. the z or t value). The blue
curve shows the distribution which would be obtained if there was no true difference
between the overall means of A and B; this is identical to the normalized Guassian curve
(with a mean of zero and standard deviation of one) used to generate values of P for
given values of z (or t). For any individual set of data the observed value of the mean is
compared with a decision level (DL, which is usually z = 1.96 and equivalent to a
probability of 0.05 – or 0.025 if a single sided z-test is performed) to decide whether or
not to accept the null hypothesis (Ho) that no real difference exists and the observed data
could have arisen by chance. The probability used for the decision level (the probability
that the value for z could have arisen by chance) is denoted as alpha (α) – or α/2 for a
single sided test.
There is however a risk in relying on the null hypothesisalone. If we set alpha at 0.05
then this means that a value for z of greater than 1.96 will be used to reject the null
hypothesis and we then accept that the means of the two sets of data are probably
different (i.e. belong to a different frequency distribution). However, by chance, z will be
greater than 1.96 (and P less than 0.05) on approximately one occasion in twenty even if
no real difference exists. Therefore the chance of a false positive (i.e. accepting that there
is a difference between the two means even when in fact none exists) is 1 in 20 (i.e. 0.05).
Rejection of the null hypothesis when it is true is known as a type I error, the chance of it
occurring is alpha (α). If a lower cut off point for P (i.e. α) is used then the risk of a type
I error is reduced. The chance of a type I error not occurring is (1 – α). It is important to
realize that rejection of the null hypothesis does not prove that the samples came from
different populations – it simply means that we have failed to prove that they do not come
from the same population.
The alternative hypothesis (H1) is that the two sets of data belong to different populations
i.e. the true difference between their means is not zero. Figure14.2 shows the distribution
for the difference between the means plotted as z values in red. Therefore if the data
belong to different populations then their values cluster around a value other than zero so
that the curve shifts to the right (or to the left if there is a negative difference). The
decision level cuts the H1 curve at a different point and divides the curve into two
portions. The segment to the left, denoted as beta (β), represents the chance of rejecting
the alternative hypothesis when it is true (or not rejecting the H0 hypothesis when it is
false) and is known as a type II error. The difference (1 – β) is the probability of
rejecting the null hypothesis when it is false, and is known as the power of the study.
289
CHAPTER 14
Area = (1 – α)
Area = α/2
Area = β
Area = (1 – β) = power
Figure 14.2 Frequency distributions for the null (H0) and alternative (H1)
hypotheses showing the significance of alpha (α) and beta (β)
290
STATISTICAL POWER
The relationship between type I and type II errors is shown in Fig 14.3.
Ideally the power should be one (or 100%) so that the null hypothesis will always be
rejected whenever it is false. In practice such a high power is never achieved as there is
always a small risk of a type II error. Fortunately, the power of a test can be improved by
manipulating several key factors. First it is necessary to derive a relationship between
these factors.
Consider two sets of overlapping data, A and B, with means mA and mB and a common
standard deviation (s) and sample number (n). mB is greater than mA. The distance of the
decision level (DL) from mA is (DL - mA) and defines the value of α. If this distance is
divided by the standard error (s/√ n) then the corresponding value for α expressed as
standard errors from the mean (zα) is:
Zα = DL - mA = (DL - mA) x √ n
s/√ n s
Similarly the distance of DL from mB is (mB - DL) and defines the value of β. If this
distance is also divided by the standard error (s/√ n) then the corresponding value for β
expressed as standard errors from the mean (zβ) is:
zβ = mB - DL = (mB - DL) x √ n
s/√ n s
291
CHAPTER 14
zα + zβ = (DL - mA + mB - DL) √ n
s
zα + zβ = (mB - mA) √ n
s
A more useful form of this expression can be obtained by rearrangement to give a value
for √ n:
√n = s (zα + zβ)
∆
• Evaluation of published data. Quite often authors publish a P value without any
discussion of the power of their study. This is particularly important when no
significant difference is found since the power of the study may be too low to
stand a reasonable chance of detecting any true difference which may exist.
Evidence based reviews often demand some estimate of the power of published
studies cited. Values for n and s and the individual means (and hence ∆) are
292
STATISTICAL POWER
• usually given so it is possible to insert these into Eq 14.1 to obtain the value for
(zα + zβ). The P value can be converted to zα (by dividing it by s/√ n) then zβ
obtained by subtraction. zβ can be converted to β (by multiplying by √ n/s) then
subtracted from one to give the power. It is generally accepted that a “good”
power should be at least 80%.
The above principles are also applicable to other study designs but unfortunately the
mathematics can become quite complex. Fortunately tables (and computer packages) are
available for the common study designs so direct calculation is rarely required. Note that
to simplify matters these tables often use the effect size (ES), which is the ratio of the
difference between the groups to s (i.e. ES = ∆/s).
Question Q 14.1
A study into the efficacy of a new cholesterol lowering drug involved measuring serum
cholesterol in 30 subjects both before and after administration of the drug. Using a
decision level of 5% the authors concluded that there was no effect of the drug upon
serum cholesterol concentrations (z = 0.97, P >0.1). The mean initial serum cholesterol
concentration was 7.0 mmol/L (SD = 2.0 mmol/L). After 4 weeks of treatment with the
drug the mean serum cholesterol was 6.5 mmol/L (SD= 2.0 mmol/L). Calculate a) the
power of their study, and b) the sample size needed to achieve a power of 90%.
293
CHAPTER 14
Answer Q 14(1)
zα + zβ = ∆ √n
s
∆ = difference between the means for the two groups = 6.5-7.0 = - 0.5 mmol/L
n = number of subjects in the study = 30
s = standard deviation = 2.0 mmol/L
Substitution of these values for ∆, n and s into this equation allows evaluation of
the sum of the z values for α and β:
Since the probability (P) used as a decision level in this study is 0.05, the
corresponding z value (obtainable from tables) is 1.96. Therefore, α = 0.05 and
zα = - 1.96 (since we are using the negative part of the distribution) allowing
calculation of zβ:
From tables, the value for β (i.e. proportion of total area under the curve)
corresponding to zβ = 0.59 is 0.28.
Therefore the study has a 72% chance of detected a change of 0.5 mmol/L at a
decision level of 5% probability.
294
STATISTICAL POWER
b) The sample size required to achieve the desired power can be calculated from
Eq. 14.2:
n = [ s ( zα + zβ ) / ∆ ] 2
Therefore (1 – β) = 0.9
From tables the corresponding z value (remembering that we are dealing with a
one-tailed z-test) is 1.28.
Substitute (zα+ zβ), s and ∆ into the above equation and solve for n:
n = [ (2.0 x 3.24)/-0.5]2
= 12.962 = 168
Therefore at least 168 subjects will need to be studied in order to achieve a 90%
chance of detecting a change in serum cholesterol of 0.5 mmol/L at the 5% level
of probability.
295
CHAPTER 14
FURTHER QUESTIONS
1. A study into the effect of nutritional supplements on patients with Crohn’s disease
involved measuring serum albumin both before and after supplementation for a
four-week period. During this period the mean serum albumin level increased
from 25 g/L to 30 g/L. The study involved 40 patients with a standard deviation
for albumin concentration of 10 g/L. What is the power of this study to detect a
5 g/L change in serum albumin at the 5% level of probability?
296
MISCELLANEOUS TOPICS
Chapter 15
Miscellaneous topics
Recovery experiments
In most instances the base material will already contain some of the analyte in question
but this amount is not usually known with certainty. Therefore both the base material and
the spiked material are analysed. The amount recovered is calculated by subtraction of
the “base result” from the “spiked result” so that the above expression becomes:
Recovery (%) = (spiked result - base result) x 100 ………….. Eq. 15.1
spike added
Allowance must be made for dilution of the spike by the base material and vice versa.
297
CHAPTER 15
Sometimes recovery is calculated in a different way which assumes that the expected
result for the spiked sample is equal to the sum of the base result and the spike added:
This approach is fundamentally flawed and should NOT be used. The expected
contribution of analyte in the base sample is not the same as its measured value (unless
the recovery of the method is exactly 100%). The expected result for the analyte in the
base sample is not known and therefore the expected result for the spiked base cannot be
calculated and the above equation does not give a valid recovery value. Only the value
for the “spike” is accurately known.
Most assays in routine use have recoveries in the range 90-100% with imprecisions of the
order of 5%. Imprecision of the assay can easily lead to unreliable estimates of recovery
and can be minimized by:
• Spiking the base material with concentrations similar to the endogenous analyte
concentration.
Sometimes it is useful to carry out recovery experiments with a range of spiked values
and using several different base materials.
Question Q 15(1)
0.1 mL of a 50 mmol/L glucose solution was mixed with 0.9 mL of the same plasma
sample, then 0.1 mL of the mixture taken through the assay. The absorbance was 0.300.
Calculate the recovery of the method.
298
MISCELLANEOUS TOPICS
Answer Q 15(1)
Rearranging:
Subtract the blank (water used as sample) absorbance from both standard and plasma
readings and substitute into the above equation:
However, by mixing the diluted glucose solution with the plasma sample the base
concentration and the spiked concentration has been diluted and must first be calculated:
299
CHAPTER 15
Glucose added = Concn of glucose solution (mmol/L) x Volume of glucose added (mL)
[Vol of glucose added (mL) + Volume of plasma (mL)]
= 50 x 0.1 = 5 mmol/L
(0.9 + 0.1)
Next calculate the measured glucose in the spiked plasma sample – remembering to
subtract the reagent blank:
= (0.300 – 0.080) x 10
(0.320 – 0.080)
Calculate the recovery from the measured glucose concentration in the spiked plasma
(9.17 mmol/L), the base plasma sample (4.5 mmol/L) and the glucose added (5 mmol/L):
Recovery (%) =
300
MISCELLANEOUS TOPICS
Frequently tumour markers are measured following surgical resection of the tumour to
provide evidence that removal of the tumour is complete. If the tumour has been
completely removed then the concentration of the tumour marker should reflect the value
expected from the natural decay of the compound present at the time of surgery. If the
concentration has not fallen to this level then it is likely that the marker is still being
produced by residual tumour tissue (or from secondaries).
Tumour markers are usually cleared exponentially as are most drugs (see Chapter 7).
If
then as was shown in Chapter 7, a linear expression can be derived relating these
variables:
Provided the rate constant kd is known then the concentration at any given time can be
calculated or the time taken to reach a specified concentration estimated. The latter may
be particularly useful in predicting the time when the concentration of tumour marker
should be below the upper reference limit.
There are ways of manipulating this equation to give an expression that is simpler to
apply in common situations. Subtraction of one logarithm from another is the same thing
as calculating the logarithm of their ratios:
Where CR is the ratio of the concentration at time t (CPt) to the initial concentration
(Cp0). Eq. 7.7 can then be re-written as:
301
CHAPTER 15
Further simplification is possible. In Chapter 7 it was shown that the elimination rate
constant is related to the half-life (t1/2):
If the time (t) is expressed as the number of half lives (N) so that N = t/t½ then an even
simpler expression is produced:
If logarithms to the base 10 are used then 0.693 is divided by 2.303 (since ln CR = 2.303
log10 CR) and the expression becomes:
Using this expression it is quite simple to determine the number of half-lives required for
a given change in concentration ratio or if the absolute time period is known then the
half-life of the tumour marker can be determined.
Question Q 15(2)
A tumour marker X is used to guide a decision on chemotherapy after the resection of the
main tumour mass. The concentration decays exponentially. If the half-life of the
tumour marker is less than 75 hours, then this is indicative of tumour clearance and
chemotherapy is withheld. If the half-life is greater than this, it indicates that residual
disease is present and chemotherapy is indicated. The precision of the assay is such that
measurements can be safely made at a precisely timed interval of more than 36 hours
from two or more days after surgery.
The level of X at 50 hours post surgery is 1756 ng/L and at 94 hours it is 1050 ng/L.
Calculate the half-life and indicate whether you can say with confidence whether
chemotherapy needs to be given.
302
MISCELLANEOUS TOPICS
Answer Q 15(2)
1. Using Eq 7.7:
The 1st sample (collected at 50 h) can be regarded as the initial sample. Therefore:
ln 1050 = ln 1756 - 44 kd
6.96 = 7.47 - 44 kd
Use Eq. 7.9 to covert the elimination rate constant (kd) to the half life (t½):
Since the half-life is less than 75 h, the time interval is 44 h and the 1st sample was
taken at least 48 h after tumour removal we can conclude that chemotherapy can
be withheld.
303
CHAPTER 15
2. Using Eq 15.5
log10 CR = - 0.30 N
Thus the concentration fell from 1756 ng/L to 1050 ng/L in 0.744 half-lives OR
44 h. This information can be used to calculate the half-life:
Radioactive decay
Radioactive decay follows first order kinetics and therefore obeys the same mathematical
laws as the clearance of the drug (or elimination of a tumour marker discussed in the
preceding section). Equations Eq. 7.7 and 15.5 can be used if units of radioactivity are
substituted for concentration.
Question Q 15(3)
If the half life of a radionucleotide is 20 hours at the end of how many complete days will
the activity have fallen to less than 2% of the initial value?
304
MISCELLANEOUS TOPICS
Answer Q 15(3)
log10 CR = - 0.30 N
Rearrangement gives:
N = - log10 CR
0.30
Since the activity falls from 100% to 2% of the initial value, then these figures can be
treated as concentrations Cp0 and Cpt respectively:
CR = Cpt = 2 = 0.02
Cp0 100
Therefore the activity reached 2% of the initial value after 5.67 half lives have elapsed.
Multiplication of 5.64 by the half-life (20 h) gives the total time period (t) in hours.
t = 5.67 x 20 = 113.4
Therefore 5 complete days must elapse before the activity falls below 2% of the initial
value.
305
CHAPTER 15
Exponential growth
The mathematics of exponential growth are the same as exponential decay or elimination
except that concentration increases with time. The elimination constant (kd) is replaced
by the specific growth rate (k) and the term -kd.t becomes positive (+k.t). As a
consequence Eq. 7.7 and Eq. 15.5 become:
The half-life (t½) is replaced by doubling time (td) so that Eq. 7.9 can be re-written:
Question Q 15(4)
A woman had a beta hCG concentration measured at 265 IU/L and 11 days later,
following some abdominal pain, it was 820 IU/L. Assuming hCG rises exponentially in
early pregnancy, what has been the doubling time over this period? What is the
significance of the result you obtain?
306
MISCELLANEOUS TOPICS
Answer Q 15(4)
log10 CR = 0.30 N
CR can be calculated taking the initial concentration (Cp0) as 265 IU/L, and the Cpt as
820 IU/L:
The doubling time (td) can be calculated from N (1.63) and the time period between
measurements (t = 11 days) using Eq. 15.8:
td = t = 11 = 6.7 days
N 1.63
The normal doubling time for hCG during early pregnancy is approximately 2 days.
Therefore this result is consistent with ectopic pregnancy.
307
CHAPTER 15
Urea has the formula CO(NH2)2 so that each molecule contains two atoms of nitrogen.
Therefore each mmol of urea contains 2 mmol of nitrogen and since the atomic weight of
nitrogen is 14, this constitutes 2 x 14 = 28 mg of nitrogen. Division by 1000 converts
this figure to g of nitrogen:
It has been advocated that a figure of 20% should be added to nitrogen excretion to allow
for other urinary losses and a further 2 g/day added to allow for losses by other routes.
Subtraction of this figure from the nitrogen intake gives the nitrogen balance:
Question Q 15(5)
308
MISCELLANEOUS TOPICS
Answer Q15(5)
Correcting for other urinary losses (+20%) and other routes of nitrogen excretion (+2 g)
gives a revised value for nitrogen excretion:
Acid secretion studies are less frequently performed nowadays, the main remaining
application is in the follow-up of a raised plasma gastrin result. The documentation of a
raised basal acid output (BAO) in gastric juice provides strong evidence that a high
plasma gastrin concentration is caused by Zollinger-Ellison syndrome. After an overnight
fast gastric juice is collected over a timed period (usually 30 min), its volume measured
and an aliquot titrated with standardised sodium hydroxide:
309
CHAPTER 15
There has been considerable debate over the years as to which pH to use as the endpoint
since secreted hydrochloric acid may be buffered by gastric proteins.
At the equivalence point the total amount of acid in the gastric fluid aliquot is equal to the
amount of sodium hydroxide added. The total amount of acid or alkali is equal to the
volume used multiplied by its concentration. Therefore, we can write:
Eq. 15.11 can be rearranged to determine M1 which can be converted to the total acid
secreted and finally the rate of acid secretion.
Question Q 15(6)
During a test of gastric acid secretion, 28mL of gastric juice was aspirated over a single
30 min period from a fasting patient before the administration of pentagastrin.
The volume of 0.1 M sodium hydroxide solution required to titrate a 5 mL aliquot of the
gastric juice to pH 7.4 was 14 mL. Calculate the basal acid secretion rate in mmol/h.
310
MISCELLANEOUS TOPICS
Answer Q 15(6)
Since the answer is required in mmol/h it is easiest to work in mmol from the outset.
Use Eq. 15.11:
M1 x V1 = M2 x V2
Substitute for V1, M2 and V2 in Eq. 15.11 and solve for M1:
M1 x 5 = 100 x 14
Multiply the acid concentration (280 mmol/L) by the gastric fluid volume in litres (28 mL
divided by 1000) to give the total acid output in millimols:
Multiply by 2 (since gastric fluid was collected over 30 min) to obtain the secretion rate
in mmol/h:
311
CHAPTER 15
Internal standardisation
Chromatographic techniques are often subject to variability (e.g. due to instability of the
detector) or to unpredictable losses (due to the preliminary sample preparation which may
involve extraction and/or derivatization). Addition of an internal standard, which is a
compound chemically related to the analyte with similar properties but resolvable by
chromatography, can be used to minimize these problems. Instead of plotting the peak
height or area of the standard against standard concentration, the ratio of peak height (or
area) of the standard to that of the internal standard is used. An example is shown in
Figure 15.1 in which both methods of plotting the standard curve are used. The same
amount of internal standard is also added to each of the unknown samples, the peak
height or (area) ratio similarly calculated and the results obtained by interpolation with
the standard curve.
The underlying assumption of this approach is that the internal standard behaves exactly
as the analyte being measured i.e. the ratio of analyte to internal standard remains
constant throughout the various stages of the analytical process.
Question Q 15(7)
312
MISCELLANEOUS TOPICS
(a) 2 mg/L
2
3 mg/L
S
i 1 mg/L
g 1 1mg/L
n
a 1 mg/L
l
1 mg/L
0 2 4 0 2 4 0 2 4
Retention time (min)
(b)
Area x
or
peak area
ratio x x
x
xx
0 1 2 3
313
CHAPTER 15
Answer Q 15(7)
First calculate the peak area ratio (PAR) of the phenylalanine peak to that of the internal
standard (NMP) for the standard and patient:
314
MISCELLANEOUS TOPICS
Consider an ideal population in which there is a gene locus with two alleles A and a with
gene frequencies p and q i.e:
Male gametes
A a
(p) (q)
A (p) AA Aa
Female (p2) (pq)
gametes
a (q) Aa aa
(pq) (q2)
Figure 15.2 Punnetts square showing the allele frequencies and resulting genotype
frequencies for a two allele system (A and a with frequencies p and q )
in the first generation
315
CHAPTER 15
Therefore the recombination of A and a gametes from both parents results in three
genotypes – AA, Aa and aa with frequencies p2, 2pq and q2. Since the frequencies of all
genotypes must add up to 1 the following useful expressions can be written:
Subsequent mating of this population to produce a second generation can only produce
offspring with genotypes AA, Aa or aa. The frequencies can be calculated from the
frequencies of the possible mating combinations:
For example AA can arise from the combinations AA x AA, AA x Aa and Aa x Aa. The
frequencies can be derived from the frequencies derived after the first cross as follows:
AA x Aa can give rise to AA in two ways: A from the male or A from the female.
The total initial frequency of Aa is 2pq (one pq from males and one pq from
females). Therefore the frequency of AA in this cross is p2 x 2pq = 2p3q
Aa x Aa. The initial frequency of each of these is 2pq, so that the frequency of the
A gamete from each parent is pq . Therefore frequency of AA in the second cross
= pq x pq = p2q2
Adding the frequency of Aa offspring gives p4 + 2p3q + p2q2, which can also be
written as factors with p2 outside the bracket: p2 (p2 + 2pq + q2). Since we know
that (p2 + 2pq + q2) = 1 (Eq. 15.14), the frequency of AA remains at p2.
Repeating this process for all the possible matings in the second generation gives the data
in Fig 15.3. The frequency of the various genotypes after this second mating remains
unchanged. This calculation could be repeated for further generations but the frequencies
of genotypes AA, Aa and aa would always remain at p2, 2pq and q2.
316
MISCELLANEOUS TOPICS
Frequency of offspring
Mating type Frequency
AA Aa aa
AA x AA p4 p4 - -
AA x Aa 4p3q 2p3q 2p3q -
Aa x Aa 4p2q2 p2q2 2p2q2 p2q2
AA x aa 2p2q2 - 2p2q2 -
Aa x aa 4pq3 - 2pq3 2pq3
aa x aa q4 - - q4
Figure 15.3 Frequency of various types of offspring due to different matings in the
second generation
This principle was discovered by Godfrey Hardy and Wilhelm Weinverg and is known
as the Hardy-Weinberg equilibrium:
317
CHAPTER 15
Identity of p and q for a population depends on the phenotypic expression of the alleles:
2. Complete dominance i.e. both homozygotes (AA) and heterozygotes (Aa) express
the phenotype. Therefore, the proportion of individuals without this phenotype
gives the value of q2. q is the square root of this value. Subtraction of q from 1
gives p.
3. Autosomal recessive i.e. only those individuals homozygous for the recessive
gene (aa) express the phenotype. Therefore the proportion of individuals with this
phenotype represent the q2 frequency. q is the square root of this value.
Subtraction of q from 1 gives p.
Question Q 15(8)
318
MISCELLANEOUS TOPICS
Answer Q 15(8)
q2 = 1 in 10,000 = 1 = 0.0001
10,000
q = √ 0.0001 = 0.01
p = 1 - q = 1 - 0.01 = 0.99
When data can be broken down into a set of compartments (i.e. grouped frequencies), for
each of which there is an observed number (O) and an expected (E) number of
individuals, the goodness-of fit- (Χ2) is given by:
Χ2 =
Σ ( O - E )2
E
…………… Eq. 15.14
319
CHAPTER 15
Care must be taken to use the correct number of degrees of freedom. For a contingency
table of r rows and c columns there are (r - 1)(c - 1) degrees of freedom. If the data is
only grouped into k classes then there are (k – 1) degrees of freedom. The probability of
obtaining any value of Χ2 can be obtained by looking up its P value in tables of Χ2.
In general, every additional restraint removes one degree of freedom.
The Χ2 test is extremely useful in genetics to assess whether the distribution of genotypes
fits the predicted pattern. For example if the observed and expected (i.e. predicted)
frequencies were:
AA 80 100
Aa 240 200
Aa 80 100
Χ2 = Σ (O - E )2
E
= (80 - 100)2 + (240 - 200)2
100 200
+ (80 - 100)2
100
From tables of Χ2, the P value for a Χ2 value of 16 with 2 degrees of freedom is less than
0.001. Therefore the observed frequencies do not fit with the predicted model.
It is very important to appreciate that the Χ2 test should ONLY be used for grouped
frequencies. This is a frequently abused test. In spite of the fact that Eq. 15.14 uses the
terms “observed” and “expected” results, it should NOT be used evaluate data from
method comparison studies.
Question Q 15(9)
320
MISCELLANEOUS TOPICS
Answer Q 15(9)
Let the dominant gene for the disorder be A and the recessive gene a. As the inheritance
of the disease is autosomal recessive only the homozygous recessive genotype (aa)
expresses the disease.
Since the total must equal 1, the incidence of the homozygous dominant genotype
(which does not express disease nor have carrier status) can be calculated:
= 1 - 0.0101 = 0.9899
Genotype AA Aa aa
Observed frequency 0.9899 0.01 0.0001
Since p + q = 1, p = 1 - q
321
CHAPTER 15
Using these values of p and q the expected frequencies can be calculated as follows:
If all the data is tabulated then Χ2 can be calculated at the same time:
Χ2 is the sum of all the values in the final column = 0.0050 (2 sig figs)
From tables, the value of P for Χ2 = 0.0050 is somewhere between 0.05 and 0.95.
Therefore there is no significant difference between the observed and expected values so
that the data fit with the Hardy-Weinberg equilibrium.
322
MISCELLANEOUS TOPICS
Many assays in laboratory medicine utilise the property of the analyte in question to bind
specifically and reversibly to a protein. More often than not this protein is an antibody
and so these techniques are known as immunoassays. The antibody (Ab) binds to the
analyte in question, the antigen (Ag) to form the antibody-antigen complex (AbAg)
according to the equilibrium:
k1
Ab + Ag AbAg
k-1
K = k1 = [AbAg]
k-1 [Ab] [Ag]
where
k1 = the rate constant for the forward reaction
k-1 = the rate constant of the reverse reaction
K = the binding constant for the overall reaction
If antigen is labelled in some way (Ag*) and if the label does not interfere with binding
then both the labelled and unlabelled antigen compete for the binding sites according to
the scheme:
Ab + Ag* AbAg*
+
Ab
AbAg
This is analogous to the scheme for competitive inhibition of an enzyme except that the
antibody-antigen complexes do not break down to produce reaction products.
323
CHAPTER 15
Since the labelled and unlabelled antigen compete for the same binding sites it follows
that if the biological sample containing unlabelled antigen and a reagent with labelled
antigen are both added to a solution containing the antibody then the proportion of
AbAg* and AbAg formed will reflect the proportion of Ag* and Ag in the reaction
mixture. In fact the concentration of AbAg* will be inversely proportional to the
concentration of Ag in the sample. This is the basic principle of all competitive binding
assays. A plethora of techniques have been developed which differ in the nature of the
label used, whether separation of bound from free label is necessary (and the technique
used to achieve this), the nature of the binding protein etc. Additionally non-competitive
assays have been developed.
In order to express the count rate (B) as a percentage of the maximum binding (B0) the
total binding (TB) is determined using a standard containing zero concentration of analyte
so that there is no competition for binding sites. In practice there is always a small
amount of non-specific binding (NSB) of label to the walls of the reaction tube. NSB is
assessed by setting up a tube containing label but no sample or antibody which is then
carried through the entire assay. The count rate for the NSB tube is subtracted from that
of the TB tube in order to obtain a total binding value:
324
MISCELLANEOUS TOPICS
%
Bound
A
Log concentration
Figure 15.4 Schematic diagram for the dose response curve of a typical
radioimmunoassay. The area between A and B is the analytically
useful range of the assay
325
CHAPTER 15
The NSB counts are also subtracted from the count rate for each standard and sample
before expression as a percentage of B0:
It is customary to set up a tube containing label alone which is not taken through the
entire assay procedure but used to obtain a measure of the total count rate (TC). This,
together with the B0 result is a useful quality control tool to ensure that the antibody is
able to bind a reasonable amount of label.
Question Q 15(10)
The following data were obtained for a plasma cortisol radioimmunoassay using second
antibody separation. Calculate the cortisol concentration in the serum sample.
326
MISCELLANEOUS TOPICS
Answer Q 15(10)
First identify tubes to be used for total counts (TC), non-specific binding (NSB) and total
binding (TB).
TC is the tube which contains label and nothing else (it is not used in the calculation of
results). This is the first tube, therefore TC = 12,500 cpm.
NSB is the tube which contains label, buffer and second antibody and is taken through the
entire assay procedure. This is the second tube so NSB = 200 cpm.
TB is the tube which contains everything except the analyte in question (i.e. buffer
instead of sample). This is the third tube, therefore TB = 8,050 cpm.
Maximum binding (B0) when analyte is absent is obtained by subtraction of NSB from
TB:
The NSB is also subtracted from each standard or sample count rate before it is expressed
as a ratio to B0 (See Eq. 15.16). This is best laid out in tabular form:
TC 12,500
NSB 200
TB 8,050 7,850 100
Standard 50 nmol/L 1.70 4,250 4,050 51.6
“ 100 “ 2.00 3,720 3,520 44.8
“ 200 “ 2.30 3,190 2,990 38.1
“ 400 “ 2.60 2,675 2,475 31.5
“ 800 “ 2.90 2,120 1,920 24.5
“ 1600 “ 3.20 1,600 1,400 17.8
“ 3200 “ 3.51 1,065 865 11.0
Serum 2,490 2,290 29.2
327
CHAPTER 15
60
40
B /B 0 (%)
30
20
10
0
1.5 2 2.5 3 3.5 4
Log cortisol conentration
Note that the value for B0 (100%) cannot be plotted since the log of zero (concentration
for TB) has no meaning.
328
MISCELLANEOUS TOPICS
ADDITIONAL QUESTIONS
4. A radioisotope has a half-life of 21 days. How long will it take for the activity to
fall to 10% of the initial value?
329
CHAPTER 15
7. A 30 min basal gastric secretion sample (total volume 27 mL) required 2.5 mL of
0.1 M NaOH to titrate 5 mL of the material to pH 7.4. Calculate the basal acid
secretion rate in mmol/h.
8. A five day faecal fat collection was homogenised and diluted to 1500 mL.
A 10 mL aliquot of the homogenate was subjected to hydrolysis and the
fatty acids were extracted. The volume of 0.05 M sodium hydroxide required to
effect neutralisation was 48 mL. Calculate the fat excretion in mmol/24 h.
A 0.65
B 0.35
330
MISCELLANEOUS TOPICS
12. The following data were obtained for a digoxin radioimmunoassay employing
PEG precipitation of the primary antibody. The assay was performed in duplicate.
Calculate the digoxin concentration in the serum sample.
331
CHAPTER 15
332
ANSWERS TO FURTHER QUESTIONS
Appendix I
Chapter 1 Chapter 2
1. a) 1.25 g/L 1. 7g
b) 250 mmol/L 2 1453 mmol/L
c) 0.000236 μmol/L 3. 0.25 g
d) 1600 ng/mL 4. 100 mL
2. a) 6.7 mmol/L 5. 101
b) 2.0 mmol/L 6. 8.8 mL
c) 7.5 mmol/L 7. Potassium = 0.022 mol/L
d) 58 μmol/L Sodium = 0.57 mol/L
3. a) 360 mg/100 mL Chloride = 0.59 mol/L
b) 6.4 mEq/L 8. 950 mL
c) 86 mg% 9. 5.2 mmol/L
d) 2.8 mg% 10. Phosphate = 100 mmol/L
4. a) 1.5 mmol/L 40 mL needed
b) 12.5 μmol/L
c) 2.5 g/L
d) 3.25 x 10-3 μmol/L
5. a) 0.30 mmol
b) 25 μmol/min/250 mL
6. 3.0 x 10-10 mol/L
333
APPENDIX I
Chapter 3 Chapter 4
1. 0.86 1. 207 mL
2. 36 to 45 nmol/L 2. a) 0.022
3. 25:1 b) 0.125
4. 6.80 c) 0.301
5. Na2CO3 = 7.58 g d) 0.602
NaHCO3 = 2.39 g e) 1
6. 39.8 g sodium lactate f) 2
0.22 mL lactic acid 3. a) 79 %
7. 49 mmol b) 56 %
8. 4.65 c) 32 %
d) 18 %
e) 10 %
f) 1 %
4. 7.35 L.mol-1.cm-1
5. 0.069
6. 153 nmol/g dry wt
7. 68 %
16.7 L.mol-1.cm-1
8. NADH = 53.5 μmol/L
NAD = 23.7 μmol/L
9. 97 %
10. Serum creat = 75 μmol/L
Urine creat = 7.5 mmol/L
11. Linear to 15 mmol/L
12.5 mmol/L
334
ANSWERS TO FURTHER QUESTIONS
Chapter 5 Chapter 7
1. 15.7 mmol/24 h 1. 40 h
2. 14.3 mmol/12 h 2. a) 39 L
3. Clearance = 170 mL/min b) 20 h
? >24h collection 3. 1.25 mg/L (total body water)
4. Filtration = 1584 mg/24 h 3.75 mg/L (ECF only)
? tubular reabsorption 4. a) 1.5 h
5. 500 mL/min? b) 59 L
? tubular secretion 5. 2.0 h (for 100 nmol/L)
6. 36 mmol/L 2.9 h (for 75 nmol/L)
7. Increased Na excretion 6. 6.6 h
by 192 mmol/24 h 7. 400 mg
8. 28 g Na (or 71 g NaCl)
9. 0.069 (= 6.9%)
10. 40 mL/min/1.73 m2
11. 9 mmol/L glomerular filtrate
12. 0.51 mL/min
13. GFR = 39 mL/min/ 1.73 m2
Clearance = 28 mL/min
? incomplete urine collection
Failure to correct clearance to
body surface area
Tubular secretion of creatinine
Chapter 6 Chapter 8
335
APPENDIX I
Chapter 9 Chapter 11
Chapter 10 Chapter 12
336
ANSWERS TO FURTHER QUESTIONS
Chapter 13 Chapter 15
Chapter 14
337
APPENDIX I
338
WORKED ANSWERS TO FURTHER QUESTIONS
Chapter 1
1. Convert the following: a) 125 mg% to g/L; b) 0.25 mol/L to mmol/L; c) 0.236
nmol/L to μmol/L; d) 1.6 mg/L to ng/mL.
One nmol = 1.0 x 10-9 mol, one μmol = 1.0 x 10-6 mol.
Therefore one μmol = 1.0 x 103 nmol = 1000 nmol
Division of 1 nmol/L by 100 converts to μmol/L
339
CALCULATIONS IN LABORATORY MEDICINE – A. DEACON
2. Convert the following concentrations to “SI” units: a) plasma glucose 120 mg%;
b) serum calcium 4.0 mEq/L; c) BUN 21 mg%; d) Serum creatinine 0.66 mg%.
340
WORKED ANSWERS TO FURTHER QUESTIONS
The formula for urea is CO(NH2)2. Therefore each mol of urea contains one
mol of nitrogen (N2).
341
CALCULATIONS IN LABORATORY MEDICINE – A. DEACON
342
WORKED ANSWERS TO FURTHER QUESTIONS
Since the formula of urea is CO(NH2)2, each mol contains 1 mol of molecular
nitrogen (N2). The atomic weight of nitrogen is 14, so its molecular weight
(N2) is 2 x 14 = 28. Therefore, multiplication of urea in mmol% by 28 gives
the BUN in mg%.
343
CALCULATIONS IN LABORATORY MEDICINE – A. DEACON
‘M’ is an abbreviation for mol/L. There are 1000 mmol in one mol.
344
WORKED ANSWERS TO FURTHER QUESTIONS
Another way of looking at it is that the power minus 3 means that we must
move the decimal point 3 places to the left (as opposed to a positive power
which would have meant moving it 3 places to the right). Moving the decimal
point one pace to the left gives 0.15, 2 places gives 0.015 and 3 places gives
0.0015.
In other words whenever we see a molar concentration with the term ‘ x 10-3,
it is really the same as the concentration in mmol/L.
since the 10-3 and 103 cancel (i.e. we move the decimal point 3 places to the
left then back 3 places to the right to the original position).
Since 10-5 x 106 = 101 (which is simply 10) the calculation becomes:
This is the same as moving the decimal point 5 places to the left, then 6 places
345
CALCULATIONS IN LABORATORY MEDICINE – A. DEACON
Division by 1000 converts from mg/L to g/L (since there are 1000 mg in one
g)
Another way of doing this is that we first move the decimal point 2 places to
the right (the same as multiplying by 102), a further one place to the right to
convert from 100 mL to 100 mL (making 3 moves to the right altogether).
A further 3 moves to the left (the same as dividing by 1000 to convert from
mg to g) takes us back to the starting position and an answer of 2.5 g/L.
Therefore, 3.25 x 10-6 mmol/L = 3.25 x 10-6 x 103 = 3.25 x 10-3 μmol/L
(The same result is obtained by moving the decimal point 6 places to the left
then back 3 places to the right).
346
WORKED ANSWERS TO FURTHER QUESTIONS
5. After incubation of an enzyme with substrate for 30 min the concentration of product
in the reaction mixture was 3.00 x 10-3 M. a) How many mmol of product would be
present in 100 mL of the reaction mixture; and b) what is the rate of formation of
product in 250 mL of reaction mixture expressed as μmol/min?
If the concentration in the reaction mixture is 3.00 x 10-3 M then each litre
contains 3.0 x 10-3 mol of product.
Since this amount of product was formed over 30 min, division by 30 gives
the amount which would be formed in one minute.
347
CALCULATIONS IN LABORATORY MEDICINE – A. DEACON
NB 10-3 x 106 = 103 i.e. move decimal point 3 places to the left,
then 6 places to the right making 3 places to the right overall, which is the
same as multiplying by 1000.
6. An acid dissociates in solution to give its conjugate base and hydrogen ions.
Calculate the dissociation constant if urine contains 0.1 M of undissociated acid,
25 x 10-5 mol/L of its conjugate base and 120 nmol/L of hydrogen ions? NB the
dissociation constant is the product of the concentrations of conjugate base and
hydrogen ions divided by the concentration of undissociated acid.
Before calculating the dissociation constant (K) convert all concentrations to the same
units. It doesn’t really matter which units but conventionally molar concentrations
(mol/L) are used.
0.1 M undissociated acid is the same as 0.1 mol/L of acid, which can also be written
as 1.0 x 10-1 mol/L.
There are 1,000,000,000 (i.e. 109) nmol in one mol so that 120 nmol/L hydrogen ions
can be written 120 x 10-9 mol/L or more conveniently 1.20 x 10-7 mol/L (moving the
decimal point 2 places to the left and decreasing the power of 10 by 2 from -9 to -7).
348
WORKED ANSWERS TO FURTHER QUESTIONS
The expression for the dissociation constant, with molar concentrations in square
brackets [] can be written:
(mol/L) x (mol/l)
(mol/L)
One set of (mol/) above the line cancels with (mol/L) below the line leaving mol/L as
the units.
K = 2.5 x 10-4 x 1.20 x 10-7 = 2.5 x 1.20 x 10-10 = 3.0 x 10-10 mol/L
1.0 x 10-1
349
CALCULATIONS IN LABORATORY MEDICINE – A. DEACON
Chapter 2
And 100 mL must conation 100 times this amount, so that the weight required to
make 100mL of 70 g/L albumin
= 70 x 100 = 7g
1,000
Another way of looking at this is that 100 mL is one tenth of 1 L (i.e. 1,000 mL)
so that one tenths of the amount present in l L is required.
350
WORKED ANSWERS TO FURTHER QUESTIONS
Since each molecule of NaCl dissociates to give one ion of Na+, this is also the
concentration of sodium ions in mmol/L.
Calcium carbonate has the formula CaCO3 so that each mol contains 1 mol of
calcium. Therefore, the standard solution will need to contain 5.0 mmol/L of
CaCO3.
500 mL of 5.0 mmol/L will contain 100 x 5.0 g CaCO3 (1L = 1,000mL)
1,000 x 2
351
CALCULATIONS IN LABORATORY MEDICINE – A. DEACON
4. A solution contains 5% sucrose. How much of this solution would you dilute to
prepare 500 mL of % sucrose?
The total amount of sucrose (as opposed to concentration) remains the same after
dilution. The amount of sucrose in a given volume of solution is equal to the
volume multiplied by concentration. Therefore the following expression can be
written:
The units for volume and concentration must be the same on both sides of the
equation.
Another way to do this is that the final concentration (1%) is one fifth of the
initial value (5%) so that the 5% sucrose solution has to be diluted 5-fold.
The volume required is 500 mL so that one fifth of this, 100 mL has to be diluted.
Both volumes must be expressed in the same units. Multiplication of the volume
of water (5 mL) by 1000 gives its volume in μL (5,000 μL) since 1 mL =
1,000 μL.
The total volume of diluted urine is the sum of the volumes of urine and water:
352
WORKED ANSWERS TO FURTHER QUESTIONS
The dilution is the number of times the urine was diluted which is the final
volume divided by the initial volume:
6. Concentrated sulphuric acid (SG 1.84) is 96% by weight H2SO4. Calculate the
volume of concentrated acid required to prepare 1 L of 0.1M H2SO4.
Since the sulphuric acid is only 96% pure this weight must be multiplied by
100/96:
Volume = Weight
Specific gravity
353
CALCULATIONS IN LABORATORY MEDICINE – A. DEACON
We are told that the specific gravity of H2SO4 is 1.16 so that 1 mL weighs 1.16 g.
The volume which weighs 102.08 g is calculated as follows:
For potassium:
Final K+ (mol/L) x Final vol (mL) = Initial K+ (mol/L) x Initial vol (mL)
354
WORKED ANSWERS TO FURTHER QUESTIONS
150
For sodium:
Final Na+ (mol/L) x Final vol (mL) = Initial Na+ (mol/L) x Initial vol (mL)
For chloride:
Final Cl- (mol/L) x Final vol (mL) = [Initial Cl- from KCl (mol/L) x
vol KCl (mL)] + [Initial Cl- from NaCl (mol/L) x vol NaCl (L)]
50 ] + [ 0.855 x 100 ]
= 3.355 + 85.5
150
= 88.855
150
355
CALCULATIONS IN LABORATORY MEDICINE – A. DEACON
Initial vol (mL) x Initial concn (%) = Final vol (mL) x Final concn (%)
N.B. The expected volume of water to be added to the 95% ethanol (950 – 650 =
300 mL) will be insufficient because mixing an alcohol with water always results
in some contraction of the total volume. Therefore further water should be added
until a final volume of 950 mL is reached.
356
WORKED ANSWERS TO FURTHER QUESTIONS
As a short cut the target concentration (0.2 mol/L) could be simply multiplied by
the ratio of the molecular weight of NaH2PO4.2H2O to that of NaH2PO4:
Initial concn (mol/L) x Initial vol (mL) = Final concn (mol/L) x Final vol (mL)
0.26 x 5 = Final concn (mol/L) x 250
Multiplication by 1,000 (since there are 1,000 mmol in a mol) converts this
concentration to mmol/L:
357
CALCULATIONS IN LABORATORY MEDICINE – A. DEACON
10. Solution A contains 12.0 g of anhydrous sodium dihydrogen phosphate per litre.
What is the phosphate concentration expressed as mmol/L? What volume of
solution A needs to be diluted to 1 L to give a phosphate concentration of
4 mmol/L.
Initial vol (mL) x Initial conc (mmol/L) = Final vol (mL) x Final conc (mmol/L)
358
WORKED ANSWERS TO FURTHER QUESTIONS
Chapter 3
1. What is the pH of 0.5 per cent (w/v) hydrochloric acid (assume complete
dissociation, atomic weight Cl = 35.5)?
pH = - log10 [H+]
2. The reference range for blood pH is often quoted as 7.35 – 7.45. Express this
range in terms of nannomoles of hydrogen ion per litre.
pH = - log10 [H+]
Substitute pH = 7.35:
359
CALCULATIONS IN LABORATORY MEDICINE – A. DEACON
Substitute pH = 7.45
3. If the pH of urine is 6.0 and of blood 7.40, what is the gradient of hydrogen ion
concentrations across the tubular cell walls?
Another way of approaching this problem is to use the fact that the ration of two
values is equal to the antilog of the difference between their logarithms.
360
WORKED ANSWERS TO FURTHER QUESTIONS
= antiolg10 1.4
= 25 (2 sig figs)
H2PO4- ↔ HPO42- + H+
[Na2HPO4] = 12.85 x 10
1,000 x MW
[NaH2PO4] = 6.88 x 10
1,000 x MW
361
CALCULATIONS IN LABORATORY MEDICINE – A. DEACON
HCO3- → H+ + CO32-
Since the total concentration of both bicarbonate and carbonate in the buffer is 0.2
mol/L:
[CO32-] = 2.51
0.2 - [CO32-]
362
WORKED ANSWERS TO FURTHER QUESTIONS
MW NaHCO3 = 23 + 1 + 12 + (3 x 16) = 84
363
CALCULATIONS IN LABORATORY MEDICINE – A. DEACON
LactH → Lact- + H+
The concentration is not given but we are told that the solution must be isotonic.
Assuming physiological osmolarity is 285 mmol/L:
Rearranging:
[Lact-] = 3467
285 - 2 [Lact-]
364
WORKED ANSWERS TO FURTHER QUESTIONS
365
CALCULATIONS IN LABORATORY MEDICINE – A. DEACON
7. A 24h urine collection has a pH of 5.5 and total phosphate content of 65 mmol. If
the arterial pH is 7.40 and the pKa for phosphate is 6.80, how many millimoles of
hydrogen ion are excreted as titratable acidity using HPO42- as buffer?
The reaction occurring when secreted hydrogen ions are buffered by phosphate in
the glomerular filtrate is:
HPO42- + H+ → H2PO4-
Calculate the ratio of the two phosphate ions in fresh glomerular filtrate (i.e. pH =
7.4):
366
WORKED ANSWERS TO FURTHER QUESTIONS
65 = [HPO42-] + [H2PO4-]
[H2PO4-] = 65 - [HPO42-]
[HPO42-] = 3.98
65 - [HPO42-]
Repeat this procedure for acidified glomerular filtrate i.e. urine pH = 5.5
[HPO42-] = 0.050
65 - [HPO42-]
367
CALCULATIONS IN LABORATORY MEDICINE – A. DEACON
8. A buffer solution (pH 4.74) contains acetic acid (0.1 mol/L) and sodium acetate
(0.1 mol/L) i.e. it is a 0.2M acetate buffer. Calculate the pH after addition of 4
mL of 0.025 M hydrochloric acid to 10 mL of the buffer.
HAc → H+ + Ac-
Calculate the adjusted cocnetrations of Ac- and HAc, and substitute into the
Henderson-Haseelbalch equation (using pKa = 4.74) then solve for pH:
368
WORKED ANSWERS TO FURTHER QUESTIONS
Addition of HCl to this buffer converts some of the acetate ions to acetic acid:
Ac- + H+ → HAc
Allowance must be made for the dilution resulting from mixing 10 mL buffer with
4 mL HCl (total volume = 14 mL):
Similarly:
369
CALCULATIONS IN LABORATORY MEDICINE – A. DEACON
Chapter 4
x = 15.5 + 0.155 x
0.23
370
WORKED ANSWERS TO FURTHER QUESTIONS
0.075 x = 15.5
a) 95 b) 75 c) 50 d) 25 e) 10 f) 1
If Io is the intensity of incident light and I the intensity of transmitted light, then:
A = 2 - log10 %T
All that is required is to substitute values for %T into this expression to obtain A:
371
CALCULATIONS IN LABORATORY MEDICINE – A. DEACON
a) %T = 95
b) %T = 75
c) %T = 50
d) %T = 25
e) %T = 10
A = 2 - log10 10 = 2 - 1.000 = 1
f) %T = 1
A = 2 - log10 1 = 2 - 0 = 2
372
WORKED ANSWERS TO FURTHER QUESTIONS
A = 2 - log10 %T
A + log10 %T = 2
log10 %T = 2 - A
%T = antilog10 (2 - A)
Therefore substitute values for A into this expression then evaluate %T:
a) A = 0.1
b) A = 0.25
c) A = 0.50
d) A = 0.75
e) A = 1.00
f) A = 2.00
%T = antilog10 (2 - 2) = antilog10 0 = 1%
373
CALCULATIONS IN LABORATORY MEDICINE – A. DEACON
A = 2 - log10 %T
= 2 - log10 18.4
= 2 - 1.2648
Use this absorbance to calculate molar absorptivitiy using the Beer-Lambert Law:
A = abc
Since the question asks for calculation of molar absorptivity, the concentration
must be divided by 1,000 to convert it to mol/L (1 mol = 1,000 mmol):
0.735 = a x 1 x 0.1
a = 0.735 = 7.35
1 x 0.1
374
WORKED ANSWERS TO FURTHER QUESTIONS
The units can be derived by entering the individual units into the same equation
(remembering that absorbance is the logarithm of a ratio so has no units):
A = 2 - log10 %T
= 2 - log10 45
= 2 - 1.6532
= 0.3468
375
CALCULATIONS IN LABORATORY MEDICINE – A. DEACON
A = abc
The answer is required in nmol not mol so this value must be multiplied by 109
(since 1 mol = 109 nmol):
Since the faecal sample produced 4.5 mL of extract, the amount of porphyrin in
the sample is obtained by dividing by 1,000 (to convert from nmol/L to
nmol/mL), the multiplying by the volume of extract (4.5 mL):
376
WORKED ANSWERS TO FURTHER QUESTIONS
Porphyrin (nmol/g fresh stool) = 5.729 = 76.4 nmol/g fresh wt (3 sig figs)
0.075
To express this result on a dry weight basis, multiply by the fresh weight of faeces
used for the dry weight determination then divide by its dry weight:
Porphyrin content = 76.4 x 0.250 = 153 nmol/g dry faeces (3 sig figs)
0.125
A = 2 - log10 %T
A = 2 - log10 75
= 2 - 1.875
= 0.125
Provided the substance obeys Beer’s Law over the range of concentrations (i.e.
absorbance is directly proportional to concentration), then the absorbance of a
different concentration (4 g/L) can be calculated from the relationship:
Absorbance2 = Absorbance1
Concentration 2 concentration1
377
CALCULATIONS IN LABORATORY MEDICINE – A. DEACON
A = abc
378
WORKED ANSWERS TO FURTHER QUESTIONS
8. The absorbances of a solution containing NAD and NADH in a 1cm light path
cuvette were 0.337 at 340 nm and 1.23 at 260 nm. The molar extinction
coefficients are:
Both NAD and NADH absorb at the two wavelengths used (260 nm and 340 nm).
Absorbances are additive, therefore at either wavelength:
Therefore for each wavelength equations can be set up relating measured total
absorbance to the sums of the individual absorbances of NAD and NADH:
At 340 nm: 0.337 = 1.0 x 10-3 [NAD] + 6.3 x 103 [NADH] .................... (i)
At 260 nm: 1.23 = 1.8 x 104 [NAD] + 1.5 x 104 [NADH] ................... (ii)
These form a pair of simultaneous equations which can be solved for [NAD] and
[NADH] in the usual manner. However, solving a set of simultaneous equations
can be a lengthy process. Therefore we should look for approximations and short
cuts. In this particular example it is possible to considerably simplify the
calculation. The molar extinction coefficient of NAD at 340 nm is much lower
than that of NADH (by a factor of approx. 10-6) so that the contribution of NAD
to absorbance at this wavelength can be ignored. Equation (i) can then be
simplified to:
379
CALCULATIONS IN LABORATORY MEDICINE – A. DEACON
[NAD] can be calculated by substituting [NADH] = 5.35 x 10-5 into equation (ii):
Given that the molar absorptivity of bilirubin under these conditions is 6.07 x 104,
calculate the percentage purity of the bilirubin.
A = a x b x c
380
WORKED ANSWERS TO FURTHER QUESTIONS
Use this concentration of the final solution to calculate the bilirubin content of the
weighed bilirubin:
The final solution was prepared by diluting 200 µL (i.e. 0.2 mL) of stock to
250 mL
Convert to wt of bilirubin:
381
CALCULATIONS IN LABORATORY MEDICINE – A. DEACON
10. A method for creatinine determination based on the Jaffe reaction involved
mixing 0.1 mL of sample with 2.5 mL alkaline picrate reagent, incubating for 10
min at room temperature, then measuring the absorbance at 530 nm in a 1-cm
cuvette in a spectrophotometer set to read zero using a cuvette containing
distilled water. The following readings were obtained:
Calculate the creatinine concentration in the serum (in μmol/L) and urine (in
mmol/L).
First subtract the reagent blank (i.e. the reading obtained when using water as
sample) from each absorbance reading:
Absorbance Corrected
Absorbance
Concentration of unknown =
For serum:
382
WORKED ANSWERS TO FURTHER QUESTIONS
For urine the calculation is the same except that the result must be multiplied by
50 to allow for the predilution of the sample prior to assay, then divided by 1,000
to convert from μmol/L to mmol/L (1 mmol = 1,000 μmol):
11. A standard curve for a plasma glucose method was set up by preparing a series of
dilutions of a stock glucose standard (containing 50 mmol glucose/L) and
measuring the absorbance at 500 nm in a 1 cm cuvette using a blank with zero
glucose concentration to zero the instrument. The following readings were
obtained:
Glucose (mmol.L): 5 10 15 20 25 30
Absorbance: 0.102 0.203 0.305 0.375 0.410 0.432
Does the method obey Beer’s Law? What glucose concentration corresponds to
an absorbance reading of 0.250?
Plot the absorbance (vertical scale) against the standard concentration (horizontal
scale) including the zero as a point (since the blank was used to zero the
instrument):
0.5
0.45
Glucose standard curve
0.4
0.35
Absorbance
0.3
0.25
0.2
0.15
0.1
0.05
0
0 5 10 15 20 25 30 35
Glucose (mmol/L)
383
CALCULATIONS IN LABORATORY MEDICINE – A. DEACON
Inspection of the curve shows that the method only obeys Beer’s Law up to a
concentration of 15 mmol/L (when absorbance = 0305).
384
WORKED ANSWERS TO FURTHER QUESTIONS
Chapter 5
First convert the filtration rate from mL/min to L/12 h. Multiply by 60 (to convert
from min to h), then by 12 (to convert from h to 12 h) and finally divide by 1,000
(to convert from mL to L):
385
CALCULATIONS IN LABORATORY MEDICINE – A. DEACON
3. A urine collection (volume 3.2 L) was handed in by a patient which he said he had
collected over the previous day. Calculate the creatinine clearance given that the
urine was found to have a creatinine concentration of 7.2 mmol/L. The plasma
creatinine concentration taken during the collection was 94 µmol/L. Give the
most likely cause for this result.
Creatinine clearance(mL/min) =
Urine flow rate = 3.2 L/24 h = 3.2 L/h = 3.2 L/min = 3.2 x 1,000 mL/min
24 24 x 60 24 x 60
This creatinine clearance is a little high. The most likely cause is that the 24
collection was made over a longer period than 24 h – perhaps the bladder was not
emptied at the start of the collection period (or if emptied it was added to the
collection instead of being discarded).
386
WORKED ANSWERS TO FURTHER QUESTIONS
First calculate the total amount of the compound filtered over a 24 h period (based
on the assumption that the compound is freely filtered at the glomerulus):
The rate of excretion (316.8 mg/24 h) is much less than this suggesting either that
the compound is either not freely filtered at the glomerulus or considerable
amounts are reabsorbed from the filtrate.
N.B. The urine volume was not used in this calculation. Another approach (which
would utilize urine volume) would be to calculate the clearance of the compound
(which comes out at 22 mL/min) then compare it with the GFR.
5. A subject with a GFR of 100 mL/min was infused with a 'drug' X at a rate of
100 µmol/min and the plasma concentration reached a steady state value of
200 µmol/L. It is known that this drug is not metabolized or excreted by organs
other than the kidney. What is the clearance of this drug? Comment on the
result.
When a steady state is reached the rate of excretion is equal to the rate of infusion
and the plasma concentration reaches a constant value.
387
CALCULATIONS IN LABORATORY MEDICINE – A. DEACON
The clearance of the drug far exceeds the GFR suggesting that the mode of
excretion is predominantly tubular secretion.
6. A patient who is severely water depleted and excreted only 100 mL of urine in the
last 6 hours was a short time before, found to have a creatinine clearance of 100
mL/min with a plasma creatinine concentration of 100 µmol/L. If renal function
has remained unchanged what concentration of creatinine would you expect to
find in the latest 100 mL (6 h collection) specimen of urine?
This question involves calculating the urinary excretion when the plasma
concentration and clearance is known. The expression for clearance is:
Clearance (mL/min) =
388
WORKED ANSWERS TO FURTHER QUESTIONS
If the amount reabsorbed decreases by 1% then the amount excreted in the urine
will increase by 1% of that filtered:
389
CALCULATIONS IN LABORATORY MEDICINE – A. DEACON
8. The following data were obtained for a hypertensive patient on a low sodium diet:
If the renal tubules reabsorb 90% of filtered sodium, how many grams of
sodium are excreted in the same 24 h period?
Substitute these values to obtain the urine sodium concentration N.B units must be
the same so convert plasma creatinine (μmol/L) to mmol/L by dividing it by
1,000.
Convert to g/24 h:
Na (g/24 h) = Na (mol/24 h) x MW
390
WORKED ANSWERS TO FURTHER QUESTIONS
Another approach to this problem would be to first calculate the GFR from the
creatinine results, then use this to calculate the Na filtered etc.
9. The following results were obtained in a 20 year old male admitted after a car
crash and found to be oliguric:
Urine Na 90 mmol/L
Creatinine 2.4 mmol/L
Osmolality 200 mOsm/kg
391
CALCULATIONS IN LABORATORY MEDICINE – A. DEACON
10. A 45 year old lady has a body weight of 56 kg and a height of 155 cm. If her
plasma creatinine is 150 μmol/L estimate her GFR expressing the result as
mL/min/1.73 m2.
Since body weight is given, the Cockcroft-Gault formula for females can be used:
= 36 mL/min
Next calculate the patient’s body surface area (A) using the body weight in kg (W)
and height in cm (H):
= antilog10 0.188
= 1.54 m2
= 36 x 1.73
1.54
= 40 mL mL/min/1.73 m2
Alternatively the abbreviated MDRD formula can be used (height not required).
392
WORKED ANSWERS TO FURTHER QUESTIONS
11. Calculate the tubular maximum reabsorptive capacity (Tm/GFR) for glucose from
the following data:
All units must be the same so first correct plasma creatinine to mmol/L:
This is the fraction of filtered glucose which is NOT reabsorbed by the tubules.
The fraction reabsorbed (TR) is next calculated:
TR = 1 - FE = 1 - 0.1 = 0.9
To convert this reabsorption fraction to the absolute amount reabsorbed (i.e. The
Tm/GFR), multiply by the plasma concentration:
= 0.9 x 10
393
CALCULATIONS IN LABORATORY MEDICINE – A. DEACON
12 A 6 h urine collection (volume 800 mL) has an osmolality of 200 mOsm/kg. If the
plasma osmolality is 260 mOsm/kg calculate the free water clearance in mL/min.
Cosm = Uosm x V
Posm
The free water clearance (Cwater) is the difference between the urine flow rate and
the osmolar clearance:
Cwater = V - Cosm
Calculate the GFR for a 57 year old Caucasian women whose serum creatinine is
130 μmol/L, and her creatinine clearance, given that a 24 h urine collection with
a volume of 1.1 L had a creatinine concentration of 4.7 mmol/L.
394
WORKED ANSWERS TO FURTHER QUESTIONS
First calculate the GFR using the abbreviated MDRD formula by substituting
values for serum creatinine (130 μmol/L) and age (57 y) – remembering to
multiply by 0.742 since the patient is female:
= 186 x antilog10 [-1.154 x log10 1.471] x antilog10 [-0.203 x log10 57] x 0.742
= 28 mL/min
395
CALCULATIONS IN LABORATORY MEDICINE – A. DEACON
There are several possible reasons for the discrepancy between the derived GFR and
the calculated clearance:
• Inaccuracy in the timed urine collection. This is potentially the greatest source
of error. Although the 24 h volume of 1.1 L seems reasonable the calculated
creatinine excretion seems low (1.1 x 4.7 = 5.2 mmol/24 h) - unless the lady has
a very low muscle mass – suggesting that the collection is incomplete.
• Failure to correct the creatinine clearance for body surface area (this would
require knowledge of weight and height). However, the MDRD formula does not
take into account individual variation in body surface area either, but just assumes
an average value based on the patient’s age and sex.
396
WORKED ANSWERS TO FURTHER QUESTIONS
Chapter 6
First calculate the osmolalities due to glucose and sodium chloride individually.
AW C = 12, therefore C6 = 6 x 12 = 72
AW H = 1, therefore H12 = 12 x 1 = 12
AW O = 16, therefore O6 = 6 x 16 = 96
MW = 180
Final concentration (after mixing with an equal volume of 5% glucose) is one half
of this i.e. 4.5 g/L. MW of NaCl = 23 + 35.5 = 58.5.
Sodium chloride dissociates to give two osmotically active species – Na+ and Cl-
397
CALCULATIONS IN LABORATORY MEDICINE – A. DEACON
= 139 + 154
= 549 + 308
= 857 mOsm/kg
398
WORKED ANSWERS TO FURTHER QUESTIONS
3. A patient was mistakenly given 500 mL 20% mannitol (C6H14O6) intended for the
patient in the next bed instead of the same volume of normal (0.9%) saline.
Calculate the extra osmolal load given over that which would have resulted from
isotonic saline.
AW C = 12, therefore C6 = 6 x 12 = 72
AW H = 1, therefore H14 = 14 x 1 = 14
AW O = 16, therefore O6 = 6 x 16 = 96
MW = 182
(factor of 2 introduced since NaCl dissociates into 2 ions (Na+ and Cl-).
399
CALCULATIONS IN LABORATORY MEDICINE – A. DEACON
= 550 - 154
= 396 mOsm
AW C = 12, therefore C2 = 2 x 12 = 24
AW H = 1, therefore H6 = 6 x 1 = 6
AW O = 16, therefore O = 1 x 16 = 16
MW = 46
400
WORKED ANSWERS TO FURTHER QUESTIONS
5. A 45-year old man is brought to casualty following a fit. He had been working
alone late in a garage, when he was found by the security guard who called an
ambulance. On admission, he has a large bruise on the left temple and is semi-
comatose, he smells of alcohol. The admitting team request urea and electrolytes,
glucose and an alcohol and blood gas estimation and arrange an urgent CT scan.
The results are as follows:
The CT scan does not show any bony injury or evidence of intracranial bleed.
The neurological registrar is called and asks for an osmolal gap to help provide a
quick estimation of whether there is a possibility that other toxic substances
present in the garage, such as antifreeze, have been taken in any quantity.
= 330 - 278
= 52 mOsm/kg
401
CALCULATIONS IN LABORATORY MEDICINE – A. DEACON
AW C = 12, therefore C2 = 2 x 12 = 24
AW H = 1, therefore H6 = 6 x 1 = 6
AW O = 16, therefore O = 1 x 16 = 16
MW = 46
402
WORKED ANSWERS TO FURTHER QUESTIONS
Chapter 7
Substitute these values into the rate equation and solve for t:
ln 20 = ln 50 - 0.023.t
403
CALCULATIONS IN LABORATORY MEDICINE – A. DEACON
2. A 15 year old boy presents to casualty following a convulsion. It turns out that he
had swallowed 30 of his mother’s lithium tablets about 10 hours previously.
On admission his lithium concentration is 4.1 mmol/L. A decision needs to be
made whether to haemodialyse him to reduce the lithium concentration. As this is
not going to be available quickly, the physicians want to know how long he will
have toxic levels just with endogenous clearance. Estimate the following,
indicating clearly any assumptions you have made:
a) The likely volume of distribution of the lithium at this stage in the situation,
given a body weight of 65 kg.
b) How long it will be before his lithium concentration drops to the relatively
safe level of 1.5 mmol/L below which toxicity is unlikely, given a clearance of
0.03 L/h/kg.
404
WORKED ANSWERS TO FURTHER QUESTIONS
t = time taken (in hours) to reach the “safe” level of 1.5 mmol/L
The clearance of the drug is given as 0.03 L/h/kg. Multiply by the patient’s
weight to give the total clearance:
The elimination rate constant (kd) can be calculated from the clearance (Cl)
and the volume of distribution (Vd):
405
CALCULATIONS IN LABORATORY MEDICINE – A. DEACON
Assuming distribution throughout total body water, the Vd = total body water vol:
= 80 x 60 = 48 L
100
If drug is only distributed throughout the ECF, the Vd must be adjusted. ECF is
normally 20% of body wt.
= 80 x 20 = 16 L
100
Alternatively, since a third of body water is in the ECF, the drug level will be
3 times higher.
406
WORKED ANSWERS TO FURTHER QUESTIONS
Time mg/L
2.5h 32
5h 10
7.5h 3
a) Assuming the clearance of the drug follows first order elimination kinetics then
the data should be described by the expression:
5
ln C0
4.5
Elimination curve
4
3.5
3
ln C
2.5
2
slope = - kd
1.5
0.5
0
0 1 2 3 4 5 6 7 8
407
CALCULATIONS IN LABORATORY MEDICINE – A. DEACON
This plot clearly demonstrates that elimination of the drug follows first order
kinetics so that Cp0 and kd could be determined directly from the graph.
Alternatively any 2 values can be substituted into the rate equation and solved
for kd:
Therefore: ln 10 = ln 32 - kd.2.5
b) First calculate the initial concentration (Cp0) using one other value (e.g.
2.5 h = 32 mg/L as Cpt and t = 2.5 h) and the value for kd:
408
WORKED ANSWERS TO FURTHER QUESTIONS
5. The plasma concentration of a drug is found to be 200 nmol/L at 9.00 am. It’s
elimination follows first order kinetics with a rate constant is 0.34/h. Calculate
the times at which the plasma concentrations will reach 100 nmol/L and
75 nmol/L.
ln 75 = ln 200 - 0.34.t
409
CALCULATIONS IN LABORATORY MEDICINE – A. DEACON
Substitute these values into the rate equation and solve for t:
ln 10 = ln 100 - 0.347.t
410
WORKED ANSWERS TO FURTHER QUESTIONS
7. The SHO decides to treat a patient (weight 80 kg) with intravenous theophylline
(salt factor = 0.8). What loading dose would you recommend in order to achieve
a theophylline level of 12 mg/L given a volume of distribution of 0.5 L/kg and an
initial plasma theophylline level of 4 mg/L?
= 80 x 0.5 = 40 L
LD = 40 x (12 - 4)
0.8
= 40 x 8
0.8
= 400 mg
411
CALCULATIONS IN LABORATORY MEDICINE – A. DEACON
8. A patient, (body weight 55 kg) unable to take oral medication, had been receiving
intravenous valproate for a number of days and achieved an average steady state
level of 75 mg/L. After regaining consciousness the doctors wished to change to
an oral twice daily regimen. In order to maintain the same average steady state
concentration what dose would you recommend. Assume a clearance of
10 mL/h/kg, a bioavailability of 0.7 and a salt factor of 0.85.
Where:
First correct the clearance for the body weight and express it in litres (to be
compatible with the drug concentration which is given in mg/L):
= 10 x 55 = 0.55 L/h
1,000
= 832 mg
412
WORKED ANSWERS TO FURTHER QUESTIONS
Chapter 8
Draw up a table of fluid gains and losses then calculate the total of each. Assume
a value of 400 mL per day for net insensible losses.
Fluid balance (mL) = Net fluid intake (mL) - Net fluid loss (mL)
= 2,750 - 2,250
= 500 mL
413
CALCULATIONS IN LABORATORY MEDICINE – A. DEACON
The average adult male has a body water content of approximately 60%. If the
body water deficit is x L, then the initial body water content can be calculated:
= 5100 + 60x
100
= 51 + 0.6x
Assuming a normal initial osmolality (say 285 mOsm/kg) the total amount (in
mOsm) of osmotically active species present in the body can be calculated:
= 14,535 + 171x
On presentation his body weight is 85 kg. Assuming the total amount of solutes
in the body is unchanged, then the body water volume can be calculated from the
current osmolality:
414
WORKED ANSWERS TO FURTHER QUESTIONS
301x = 1989
x = 1989
301
= 6.6 L
If it is assumed that the change in body wt is neglible (or that the initial body
water was the same as for an average 70 kg adult) then a simpler calculation
(using Eq. 8.3) can be used and gives a slightly different result which may be
adequate as a rough guide in clinical practice:
= 42 - [12000]
324
= 42 - 37
= 5L
415
CALCULATIONS IN LABORATORY MEDICINE – A. DEACON
3 A male adult insulin dependent diabetic forgot to take his insulin. His blood
glucose concentration, which was 5 mmol/L, rose to 15 mmol/|L in two hours.
Estimate the effect on his plasma sodium concentration, assuming that no other
water intake nor loss of water from the body takes place during this time, indicating
what assumptions you make.
• That it is plasma glucose which is measured rather than whole blood glucose.
• That the plasma glucose has equilibrated with interstitial fluid so that it’s
concentration in the extracellular fluid (ECF) is the same as in plasma.
• That there is negligible change in the concentrations of solutes other than glucose,
sodium and chloride.
• That the ratio of ICF:ECF volumes is 2 (i.e. ECF = 14L, ICF = 28L for average
adult male) and that the total body water is that of an average male i.e. 42 L
The effect of an increase in plasma (and hence ECF) glucose is to raise plasma (and
ECF) osmolarity. The body will retain water (stimulation of thirst increases intake
and stimulation of ADH reduces renal loss) until osmotic equilibrium is restored. If
there is a plentiful supply of water then the plasma osmolarity is returned to normal
and since the plasma glucose has risen by 10 mmol/L the plasma sodium must have
fallen by 10/2 = 5 mmol/L. However, this question states that there is no net loss or
gain of body water. Therefore, water will move, by osmosis, from the ICF
compartment (iso-osmolar) to the ECF (now hyper-osmolar) until osmotic
equilibrium is established. Since movement of water from the ICF leads to an
increase in ICF osmolarity, the movement of water is restricted and at equilibrium the
ECF will reach a value somewhere in-between normality and the original value i.e.
the osmotic load is shared between the ECF and ICF compartments, both of which
become hyperosmolar.
416
WORKED ANSWERS TO FURTHER QUESTIONS
= 10 x 14 = 140 mmol
(a slight underestimate since there has been a small expansion in ECF vol)
At equilibrium, the rise in osmolarity (which is the same in the ECF and ICF) is
given by:
Since the plasma osmolarity has risen by 3.33 mOsmol/L and the plasma glucose
by 10 mmol/L then the concentration of NaCl which has been displaced by
glucose is
417
CALCULATIONS IN LABORATORY MEDICINE – A. DEACON
4. A plasma sample with a total protein content of 70 g/L gave identical sodium
results of 140 mmol/L when measured using either a direct-reading ion-selective
electrode or a flame photometer. What plasma sodium result would you expect
the ion-selective electrode to give with the same plasma sample if its total protein
concentration had been 90 g/L?
There are two ways in which the ISE reading can be converted to the same as that
obtained by flame photometry (140 mmol/L):
418
WORKED ANSWERS TO FURTHER QUESTIONS
Chapter 9
ΔA = a.b.Δc
419
CALCULATIONS IN LABORATORY MEDICINE – A. DEACON
Multiply by 1,000,000 to convert the concentration units from mol to μmol (1 mol
= 1,000,000 μmol):
The final step is to convert the activity to μmol/min/L serum. In the assay 100 μL
of serum was mixed with 2.7 mL buffer and 100 μl of substrate.
2. An assay for alkaline phosphatase activity involved mixing 0.05 mL of serum with
2.7 mL buffer, allowing temperature to reach equilibrium then starting the
reaction by adding 0.2 mL of substrate (4-nitrophenyl phosphate). The increase
in absorbance in a 1cm cuvette due to the liberation of product (4-nitrophenol)
was 0.180 over a 5-minute period. Calculate the alkaline phosphtase activity
expressing the result as a) international units per litre of serum, and b) katals per
litre of serum. Assume that the molar absorptivity of 4-nitrophenol is 1.88 x 104
L/mol/cm.
a) One international, unit is the amount of enzyme which liberates one μmol
of product per minute. Therefore to calculate the alk phos activity in IU/L
serum the following steps are involved:
420
WORKED ANSWERS TO FURTHER QUESTIONS
ΔA = a.b.Δc
b = pathlength of cuvette = 1 cm
Serum = 0.05 mL
Buffer = 2.70 mL
Substrate = 0.20 mL
Total = 2.95 mL
= 113 IU/L
421
CALCULATIONS IN LABORATORY MEDICINE – A. DEACON
3. The Somogyi saccharogenic method for the assay of amylase involves measuring
the rate of release of glucose from substrate. One Somogyi unit is the amount of
enzyme catalysing the release of 1 mg of glucose in 30 min per 100 mL serum.
Derive a factor to convert Somogyi units to international units per litre of serum.
AW C = 12, therefore C6 = 6 x 12 = 72
AW H = 1, therefore H12 = 12 x 1 = 12
AW O = 16, therefore O6 = 6 x 16 = 96
MW = 180
422
WORKED ANSWERS TO FURTHER QUESTIONS
= x x 1.85
423
CALCULATIONS IN LABORATORY MEDICINE – A. DEACON
ΔA = a.b.Δc
424
WORKED ANSWERS TO FURTHER QUESTIONS
v = Vmax [S]
Km + [S]
6. What information can be obtained from the double-reciprocal plot for an enzyme
under the following conditions: a) 1/v = 0 when 1/[S] = -12.5 x 106 L/mol, b)
1/[S] = 0 when 1/v = 5.2 x 106 min/mol, c) 1/[S] = 0 when 1/v = 6.5 x 106
min/mol and the slope of the line is 100 min/L?
425
CALCULATIONS IN LABORATORY MEDICINE – A. DEACON
Km = 100
1.5 x 10-7
426
WORKED ANSWERS TO FURTHER QUESTIONS
1 = 1 = 103 L/mol
[S] 10-3 mol/L
1 = 1 = 106 min/mol
v 10-6 mol/min
427
CALCULATIONS IN LABORATORY MEDICINE – A. DEACON
0.5 33 9
1.0 50 17
2.0 67 29
4.0 80 44
10 91 67
The first step is to plot the data. Any linear transformation of the Michaelis-
Meneten equation can be used but the double-reciprocal plot is probably the
simplest. Calculated reciprocals are:
1/[S] 1/v
L/mmol No inhibitor Mucic acid
428
WORKED ANSWERS TO FURTHER QUESTIONS
0.12
0.1
1/v + 1.0 x 10-4 mol/L mucic acid
0.08
0.06
0.04
0.02 No inhibitor
0
-1.5 -1 -0.5 0 0.5 1 1.5 2 2.5
1/[S] L/mmol
Since the lines cross on the 1/v axis the type of inhibition is competitive.
429
CALCULATIONS IN LABORATORY MEDICINE – A. DEACON
1 12 33
2 21 50
4 35 67
10 57 83
20 73 91
Stating any assumptions that you make determine the pH at which the enzyme has
greatest affinity for the substrate.
430
WORKED ANSWERS TO FURTHER QUESTIONS
1/[S] 1/v
L/mmol pH 7.4 pH 5.5
1 0.083 0.030
0.5 0.048 0.020
0.25 0.029 0.015
0.1 0.018 0.012
0.05 0.014 0.011
0.09
0.08
1/ v 0.07
0.06
pH 7.4
0.05
0.04
0.03
0.02
pH 5.5
0.01
0
-1 -0.5 0 0.5 1
1/[S] L/mmol
431
CALCULATIONS IN LABORATORY MEDICINE – A. DEACON
Assuming that equilibrium conditions apply (i.e. that k+1>>K+2) then the Km is the
dissociation constant of the enzyme-substrate complex and is inversely
proportional to the affinity of the enzyme for the substrate. The Km is lower at
pH 5.5 than at pH 7.4. Therefore the enzyme has greatest affinity for its
substrate at pH 5.5.
10. The apparent Km and Vmax of an enzyme were measured over a range of inhibitor
concentrations and the following data obtained:
5 10 7.5
10 7 5
15 5 4
20 4 3
432
WORKED ANSWERS TO FURTHER QUESTIONS
The value for Ki can be obtained from secondary plots of either 1/Km or
1/Vmax versus [I].
The relationship between Kmapp and [I] for an uncompetitive inhibitor is:
Kmapp = Km
(1 + [I]/Ki)
1 = (1 + [I]/Ki)
Km app
Km
1 = 1 x [I] + 1
Km app
KiKm Km
When 1/Kmapp = 0:
0 = 1 x [I] + 1
KiKm Km
- 1 = [I]
Km KiKm
Km Ki = - [I]
Km
Cancelling Km:
Ki = - [I]
433
CALCULATIONS IN LABORATORY MEDICINE – A. DEACON
[I] (mmol/L) 5 10 15 20
1/Kmapp (L/mmol) 0.1 0.14 0.2 0.25
0.3
0.25
app
1/K m L/mmol
0.2
0.15
0.1
0.05
0
-10 -5 0 5 10 15 20 25
[ I ] mmol/L
Vmaxapp = Vmax
(1 + [I]/Ki)
434
WORKED ANSWERS TO FURTHER QUESTIONS
Inversion gives:
1 = (1 + [I]/Ki)
Vmaxapp Vmax
1 = 1 x [I] + 1
Vmaxapp
KiVmax Vmax
When 1/Vmaxapp = 0:
0 = 1 x [I] + 1
KiVmax Vmax
- 1 = [I]
Vmax KiVmax
- KiVmax = [I]
Vmax
Ki = - [I]
Calculating 1/Vmax:
[I] mmol/L: 5 10 15 20
1/Vmaxapp 0.13 0.20 0.25 0.33
435
CALCULATIONS IN LABORATORY MEDICINE – A. DEACON
0.35
0.3
0.25
app
1/Vmax
0.2
0.15
0.1
0.05
0
-5 0 5 10 15 20 25
[ I ] mmol/L
The Ki s from the two plots do not agree exactly but the value is approximately
4 x 10-3 mol/L. This is due to errors inherent in manually constructing the plots
and reading off the values of the intercepts.
436
WORKED ANSWERS TO FURTHER QUESTIONS
Chapter 10
Total protein (g/L): 70, 68, 71, 65, 68, 70, 73, 69, 75, 74, 69, 71
Construct a table with columns for protein result (x) and x2, then obtain the sum of
the results in each column:
Result (x) x2
70 4900
68 4624
71 5041
65 4225
68 4624
70 4900
73 5329
69 4761
75 5625
74 5476
69 4761
71 5041
437
CALCULATIONS IN LABORATORY MEDICINE – A. DEACON
= 59,307 - 8432
12
= 59,307 - 59,221
= 86 g/L
Mean - (z x s) to mean + (z x s)
Substituting mean = 70.25 g/L and s = 2.80 g/L gives the 95% confidence limits:
438
WORKED ANSWERS TO FURTHER QUESTIONS
The mean will be the mean of the upper and lower limits:
The upper and lower reference limits will span 4 standard deviations:
From tables of z, the value of P when z is equal to 2.6 is 0.002. Therefore, 0.002
of results fall outside the range: mean ± 65 nmol/L and a half of these (0.001)
will be greater than 165 nmol/L.
= 10 results
439
CALCULATIONS IN LABORATORY MEDICINE – A. DEACON
3. Calculate the least significant difference for a change in cholesterol if the intra-
individual coefficient of variation for cholesterol is 4.7% and the analytical
coefficient of variation, 2.4%. A patient was changed from Atorvastatin 80 mg to
Rosuvastatin 40 mg and the total cholesterol fell from 6.9 to 5.9 mmol/L.
Calculate the percentage change in cholesterol and state whether this is
significant.
= √ (2.42 + 4.72)
= √ (5.76 + 22.09)
= √ 27.85
= 5.28 %
CV (%) = s x 100
m
440
WORKED ANSWERS TO FURTHER QUESTIONS
= 1.0 x 100
6.9
= 14.5%
Since this percentage change is not greater than 14.8%, the change is not quite
statistically significant at the 5% level of probability.
4. Your on-call laboratory service uses 30 different methods, each of which has a
1% probability of failing QC criteria during the course of a night. Assuming that
QC of any method is independent of that of the other methods, what is the
probability that on any one night all methods will pass the QC criteria?
Thus if a coin is tossed once the probability of ‘heads’ is 0.5. If the coin is tossed
again then the probability of it landing ‘heads’ on both occasions is 0.5 x 0.5 =
0.25. Similarly if the probability of one channel passing QC is 0.99, then the
probability of two channels passing is 0.99 x 0.99 = 0.98. The chance of three
different channels passing is given by 0.99 x 0.99 x 0.99 = 0.97 i.e. (0.99)3.
441
CALCULATIONS IN LABORATORY MEDICINE – A. DEACON
If your calculator does not have the facility to calculate x y then the result can be
easily calculated using logs:
= 30 x 0.00436
= - 0.131
5. You attempt to derive a reference range for TSH for an ethnic minority
population. The first 10 samples give the following results:
Result n
Between 0.5 and 1.49 5
Between 1.5 and 2.49 3
Between 2.5 and 3.49 0
Between 3.5 and 4.49 1
Between 4.5 and 5.49 1
On the basis of these results, what range of TSH values would encompass 95% of
the ethnic minority population?
442
WORKED ANSWERS TO FURTHER QUESTIONS
1. The individual results are not given, only the number of results falling into each
class interval. The easiest way to deal with this is to assume that the results fall in
the middle of the range i.e. there are 5 results within the range 0.5 to 1.49 so
assume there are 5 results of the mid-point value (1.0 mU/L), similarly there are 3
samples with a value of 2 mU/L. Using this approach 10 individual results are
produced which can be processed in the usual way.
2. The data are obviously skewed and do not form a Gaussian distribution. This can
be overcome to some extent by taking logarithms (to the base 10) of the results
then calculating the mean, SD and 95% confidence limits in the usual way.
Taking antilogarithms of the confidence limits then gives the reference range.
1.0 0 0
1.0 0 0
1.0 0 0
1.0 0 0
1.0 0 0
2.0 0.301 0.0906
2.0 0.301 0.0906
2.0 0.301 0.0906
4.0 0.602 0.3624
5.0 0.699 0.4886
n = 10 ∑x = 2.204 ∑ x2 = 1.123
s = √ 0.0708 = 0.266
443
CALCULATIONS IN LABORATORY MEDICINE – A. DEACON
Alternatively the mean and s can be calculated directly on most modern pocket
calculators. The 95% confidence are given by mean – 1.96 s to mean + 1.96 s
= -0.301 to 0.741 (these values are logs and so do NOT have units)
Taking antilogs (to the base 10) gives the 95% confidence limits in mU TSH/L:
Although the original data may have been expressed to one or two decimal places,
this information has been lost by grouping the data into class intervals. Therefore
it would be more correct to quote a reference range of less then 6 mU/L.
6. You are required to pipette a 9ml volume and have available a 10 ml graduated
pipette which has a 2%CV associated with it’s delivery volume and 5 and 2 ml
volumetric pipettes each of which has a 1% CV associated with their delivery
volumes. What is the error of pipetting a 9 mL volume, expressed as plus/minus
mL volume?
CV (%) = s x 100 =
m
Therefore: s = CV (%) x m
100
444
WORKED ANSWERS TO FURTHER QUESTIONS
s = 2 x 9 = 18 = 0.18 mL
100 100
s = 1 x 5 = 5 = 0.05 mL
100 100
s = 1 x 2 = 2 = 0.02 mL
100 100
= √ 0.0033
= 0.0574 mL
445
CALCULATIONS IN LABORATORY MEDICINE – A. DEACON
7. It has been suggested that a proposed analytical goal for an analyte is that the
between batch analytical coefficient of variation should not exceed one half of the
“true biological” inter-individual coefficient of variation. Calculate the
percentage “expansion” of the measured reference range over the true biological
reference range when this analytical goal is exactly met.
The relationship between the overall variation, analytical variation and biological
variation is:
Both the analytical and biological CV’s share the same mean.
Substitute this value for the analytical CV so as to obtain the total CV expressed in
terms of the biological CV:
= √ (1.25 x CVBiological2)
= 1.118 CVBiological
Therefore biological reference range spans 4 CVs and the total reference range
spans 4 x 1.118 CVBiological = 4.47 CVBiological
446
WORKED ANSWERS TO FURTHER QUESTIONS
Chapter 11
1. The following analytical results were obtained on the same QC sample: 109, 91,
105, 112, 90, 115, 89, 113, 93, 94. Calculate the mean, standard deviation and
standard error of the mean.
Construct a table with columns for result (x) and x2, then obtain the sum of the
results in each column:
x x2
109 11,881
91 8,281
105 11,025
112 12,544
90 8,100
115 13,225
89 7,921
113 12,769
93 8,649
94 8,836
n = 10
= 103,231 - 10112
10
447
CALCULATIONS IN LABORATORY MEDICINE – A. DEACON
= 103,231 - 102,212
= 1019
2. Two laboratories measured sodium in the same plasma sample ten times. One
laboratory obtained a mean of 145 mmol/L with an SD of 3 mmol/L; the other
obtained a mean of 147 mmol/L with an SD of 2 mmol/L. Do the laboratories
differ in their bias or imprecision?
t = m1 - m2
√ (s12/n + s22/n)
= 145 - 147
√ (32/10 + 22/10)
448
WORKED ANSWERS TO FURTHER QUESTIONS
= -2
√ (0.9 + 0.4)
= -2 = -2 = - 1.75
√ 1.3 1.14
DF = (s12/n1 + s22/n2)2
[(s1 /n1) /(n1 – 1)] + [(s22/n2)2/(n2 – 1)]
2 2
= (0.9 + 0.4)2
0.92/9 + 0.42/9
= 1.32
0.09 + 0.018
= 1.69
0.108
= 15.6
From tables when t = 1.75 with 16 degrees of freedom, P = 0.10. Therefore there
is no significant difference between the means of the two set of results i.e.
no evidence of bias.
F = s1 2 = 32 = 9 = 2.25
s2 2 22 4
From tables when F = 3.18 (with 9 degrees of freedom for both variances),
P = 0.05. Therefore there is no significant difference between the two variances.
i.e. no evidence of difference in imprecision.
449
CALCULATIONS IN LABORATORY MEDICINE – A. DEACON
The mean is the average of the upper and lower reference limit:
The reference limits span 4s units so that that s is a quarter of the range:
= s = 25 = 25 = 8.33 nmol/L
√n √9 3
Calculate t for 9 results with m = 125 nmol/L, population mean (μ) = 100 nmol/L
and SEm = 8.33 nmol/L:
From tables, for t = 3.00 with 8 degrees of freedom P = approx. 0.02. Therefore
0.02 of results fall outside of the range mean ± 25 nmol/L and a half of these
results, 0.01, will be greater than 125 nmol/L.
450
WORKED ANSWERS TO FURTHER QUESTIONS
A B
6.8 7.2
4.2 4.5
5.0 4.8
5.6 5.9
8.5 8.7
2.9 2.8
4.8 4.9
7.6 8.1
6.5 6.4
5.0 5.2
Since these are paired samples the results should be compared using the paired
t-test. Construct a table with the individual differences between each pair of
results (d = A – B), d2, the difference between each d and the overall mean (md)
for all the values of d (i.e. d – md) and their squares i.e. (d - md)2.
A B d d2 d - md (d - md)2
451
CALCULATIONS IN LABORATORY MEDICINE – A. DEACON
Paired t = md
sd/√n
= √ [0.592/9]
= √ 0.0658
= 0.256
Paired t = md = md x √n
sd /√n sd
452
WORKED ANSWERS TO FURTHER QUESTIONS
Lab
A B C D
Calculate n, ∑x, m, ∑x2, (∑x)2/n and ∑x2 - (∑x)2/n for each lab:
Lab
A B C D
453
CALCULATIONS IN LABORATORY MEDICINE – A. DEACON
= 2242.735 - 299.52/40
= 2242.735 - 2242.5063
= 0.2287
= 2243.91 - 2242.735
= 1.175
= 2243.91 - 299.52/40
= 2243.91 - 2242.5063
= 1.4037
From tables the probability of obtaining an F value greater than 2.84 (for 3 and 40
degrees of freedom) is 0.05. Therefore the data is homogeneous and there is no
evidence for bias between the four laboratories.
454
WORKED ANSWERS TO FURTHER QUESTIONS
Chapter 12
1. Regression analysis of results using new standards (y) against old standards (x)
showed a linear relationship. The regression coefficient (slope) was 1.10 and the
intercept on the y axis 1.0 mmol/L. Calculate the results which would be expected
using new standards for the analysis of old standards containing (a) 15 mmol/L
and (b) 150 mmol/L.
a) Regression equation for new standards (y) upon old standards (x):
y = 1.10 x + 1.0
y = 1.10 x 15 + 1.0
= 16.5 + 1.0
= 17.5 mmol/L
b) Substitute old standard containing 150 mmol/L for x then solve for y:
= 165 + 1.0
= 166 mmol/L
2. A laboratory changed its method for the assay of serum alkaline phosphatase
activity. Assay of a selection of patient’s samples by both methods yielded the
following data:
ALP (Old method), IU/L: 50 350 700 100 1500 2000 420 1200
ALP (New method), IU/L: 40 190 350 90 750 1500 280 600
455
CALCULATIONS IN LABORATORY MEDICINE – A. DEACON
The first step is to check that there is a linear relationship between the two
methods. This is best done by plotting the results using the new method (y-axis)
against those obtained using the old method (x-axis):
1200
New ALP (IU/L)
1000
Line of best fit
800 drawn by eye
600
400
200
0
0 500 1000 1500 2000 2500
Old ALP (IU/L)
The data appear to fit a straight line so linear regression analysis is appropriate.
x y x2 y2 xy
= ∑xy - (∑x∑y/n)
∑x2 - (∑x)2/n
456
WORKED ANSWERS TO FURTHER QUESTIONS
= 5,285,100 - 3,002,000
8,491,400 - 4,992,800
= 2,283,100
3,498,600
The value for the intercept (a) can be obtained by substituting the slope (b), the
mean of x for x and the mean of y for y into the linear expression y = bx + a,
then solving for a:
= 475 - 516
= - 41
457
CALCULATIONS IN LABORATORY MEDICINE – A. DEACON
Drug dosage (mg/kg body wt): 50 100 150 200 250 300 350 400
Prolactin (IU/L) 750 1500 350 400 2000 1250 500 1800
Do these data show a linear relationship between drug dosage and serum
prolactin concentration?
The first step is to plot the data with prolactin as the y-axis and drug dosage as the
x-axis:
2000
Prolactin (IU/L)
1500
1000
500
0
0 50 100 150 200 250 300 350 400 450
Drug dose (mg/Kg body wt)
458
WORKED ANSWERS TO FURTHER QUESTIONS
x x2 y y2 xy
n = 8
r = ∑xy - (∑x∑y/n)
√ {[∑x - (∑x)2/n] [∑y2 - (∑y)2/n]}
2
= 2,090,000 - 1,923,750
√ { [510,000 - 405,000] [12,147,500 - 9,137,813]}
= 166,250
√ {105,000 x 3,009,687}
= 166,250
562,154
From tables, for r = 0.30 with 7 degrees of freedom, P >0.1. Therefore there is no
significant correlation between drug dosage and serum prolactin.
459
CALCULATIONS IN LABORATORY MEDICINE – A. DEACON
No evidence is presented that the relationship between the two variables is linear.
Correlation analysis is not the best approach to comparing two analytical methods
– as they both measure the same analyte it would be surprising if there were no
correlation. Analysis of difference plots would be more appropriate.
The standard deviation of the residual (sres or syx) is not given – this is the best
indicator of the goodness of fit of the data to the regression line.
460
WORKED ANSWERS TO FURTHER QUESTIONS
Chapter 13
1. A test for a particular disease has a sensitivity of 95% and a specificity of 95%.
Calculate the predictive value of both a positive and a negative test result in a
population in which the prevalence of the disease is:
a) 1 in 2
b) 1 in 5000
The next task is determine values for TP, FN, FP and TN using the stated
sensitivity and specificity:
Sensitivity = TP = 0.95
TP + FN
461
CALCULATIONS IN LABORATORY MEDICINE – A. DEACON
Specificity = TN = 0.95
TN + FP
These values are then used to calculate positive and negative predictive values:
462
WORKED ANSWERS TO FURTHER QUESTIONS
463
CALCULATIONS IN LABORATORY MEDICINE – A. DEACON
2. The table shows data from two urinary screening tests for the detection of
phaeochromocytoma.
First calculate the proportion of false negatives i.e. use the sensitivity
expressed as a proportion (0.967) rather than percentage (96.7%) and the
prevalence calculated as follows:
Sensitivity = TP
TP + FN
464
WORKED ANSWERS TO FURTHER QUESTIONS
Sensitivity = TP = 0.967
0.005
Since TP + FN = 0.005
Specificity = TN = 0.98
TN + FP
Specificity = TN = 0.98
0.995
FP = (1 – prevalence) - TN
465
CALCULATIONS IN LABORATORY MEDICINE – A. DEACON
= 1990
c) Probably the best way to decide which is the best test is to calculate the
positive and negative predictive values for each test:
For VMA:
For metanephrines:
466
WORKED ANSWERS TO FURTHER QUESTIONS
To summarize:
Although the VMA test produces less false positives (i.e. higher PV+) this is
achieved at the expense of missing approximately 1 in 3 (FN/prevalence =
0.33) patients with phaeochromocytoma. Although the phaeochromocytoma
produces more false positives (i.e. lower PV+) this is achieved without
missing any cases of phaeochromocytoma (i.e. no false negatives). On
balance total metanephrines is the better test.
3. A new laboratory test has a sensitivity of 85% and a specificity of 90%. The
incidence of disease in a population considered at risk is 0.10. What is the
predictive value of
a) a positive result?
b) a negative result?
Total TP + FP TN + FN 1
467
CALCULATIONS IN LABORATORY MEDICINE – A. DEACON
468
WORKED ANSWERS TO FURTHER QUESTIONS
4. A proposed diagnostic serological test for coeliac disease was evaluated in 200
consecutive patients referred to a paediatric gastroenterology service in whom
the condition was suspected clinically. The test result was compared with the
diagnosis as established by biopsy, withdrawal of gluten and response to
re-challenge. On this basis 76 children had the condition of whom only 64 gave a
positive test result: 10 positive test results occurred in children who were shown
not to have coeliac disease. Calculate the sensitivity and specificity of the test
and the predictive value of a positive result.
Total TP + FP TN + FN 1
Fill in this table working with the data given in the question:
469
CALCULATIONS IN LABORATORY MEDICINE – A. DEACON
N.B. In a question like this when the sensitivity and specificity is not given and
actual numbers of results are supplied it is probably easiest to work with absolute
numbers rather than proportions.
Set up a 2 x 2 contingency table then fill in the gaps using a prevalence of 0.4,
sensitivity of 0.92 and specificity of 0.88:
Total TP + FP TN + FN 1
470
WORKED ANSWERS TO FURTHER QUESTIONS
471
CALCULATIONS IN LABORATORY MEDICINE – A. DEACON
Total TP + FP TN + FN 1
Complete the table using a prevalence of 5% = 0.05 which with a total of 400
individuals gives a prevalence in absolute numbers of 0.05 x 400 = 20.
472
WORKED ANSWERS TO FURTHER QUESTIONS
= 20
380
Sensitivity = TP = 15 = 0.75
TP + FN 20
= 0.053 x 9.4
= 0.50
(1 + 0.50)
= 0.50
1.50
= 0.33
473
CALCULATIONS IN LABORATORY MEDICINE – A. DEACON
7. A two-stage sequential test strategy is used to screen for a rare inherited disease.
The prevalence of the disease is 0.0005. The initial test has a sensitivity of 98%
and specificity of 95%, the follow-up test a sensitivity of 95% and specificity of
99%. What is the probability of a patient with a positive result for the follow-up
test having the disease?
If the prevalence of disease is 0.0005 then the pre-test odds can be calculated:
Pre-test odds x likelihood ratio (1st test) x likelihood ratio (2nd test)
= 0.931
1.931
474
WORKED ANSWERS TO FURTHER QUESTIONS
Chapter 14
1. A study into the effect of nutritional supplements on patients with Crohn’s disease
involved measuring serum albumin both before and after supplementation for a
four week period. During this period the mean serum albumin level increased
from 25 g/L to 30 g/L. The study involved 40 patients with a standard deviation
for albumin concentration of 10 g/L. What is the power of this study to detect a
5 g/L change in serum albumin at the 5% level of probability?
zα + zβ = ∆√n
s
zα + zβ = 5 √ 40 = 5 x 6.32 = 3.16
10 10
Since the probability (P) used as a decision level is 0.05, the corresponding
z value (obtainable from tables) is 1.96 (the question only requires detection of a
change – which could be either positive or negative – so both sides of the
distribution are being used). Therefore, α = 0.05 and zα is 1.96. Substitute this
value for zα and solve for zβ:
From tables of z, the value for β (i.e. proportion of area under the curve)
corresponding a to zβ of 1.20 is 0.1151 (single sided probability).
475
CALCULATIONS IN LABORATORY MEDICINE – A. DEACON
= 6.75 - 7.5
= - 0.75 mmol/L
From tables the corresponding z value (i.e. zβ) is 1.28 (one sided value).
The decision level used is a probability of 0.05 (α) with a corresponding z value
for one side of the distribution (since we are required to detect a decrease in
cholesterol) (zα) of 1.64.
= 9.732
= 95 (2 sig figs)
476
WORKED ANSWERS TO FURTHER QUESTIONS
Chapter 15
Allowance must be made for dilution of both the sample and standard when they
are mixed – since only 0.5 mL of the mixture is used for the assay.
= 138.3 x 100
146.7
477
CALCULATIONS IN LABORATORY MEDICINE – A. DEACON
Calculate the expected concentrations in the mixture from the urine and the
standard separately:
= 4,684 x 100
5,000
478
WORKED ANSWERS TO FURTHER QUESTIONS
Assuming the clearance of AFP follows first-order kinetics the rate equation is:
kd = elimination rate constant which can be calculated from the half-life (t½):
479
CALCULATIONS IN LABORATORY MEDICINE – A. DEACON
4. A radioisotope has a half-life of 21 days. How long will it take for the activity to
fall to 10% of the initial value?
ln At = ln A0 - kd.t
kd = decay constant which can be calculated from the given half-life (t½):
ln 0.1 = ln 1 - 0.033.t
-2.303 = 0 - 0.033.t
0.033.t = 2.303
t = 2.303
0.033
log10 AR = - 0.30.N
480
WORKED ANSWERS TO FURTHER QUESTIONS
N = 1 = 3.333 half-lives
0.30
ln 10 = ln 1 + 0.3465.t
2.303 = 0 + 0.3465.t
0.3465.t = 2.303
481
CALCULATIONS IN LABORATORY MEDICINE – A. DEACON
log10 CR = 0.30 N
log10 10 = 0.30 N
1 = 0.30 N
N = 1 = 3.333
0.30
= 3.333 x 2
= 6.7 days
= 580 x 28
1,000
= 16.24 g/24 g
= 11.8 - 16.24
= - 4.44 g/24h
482
WORKED ANSWERS TO FURTHER QUESTIONS
If 20% is added to the urinary excretion to allow for other urinary losses and a
further 2 g/day added to allow for losses by other routes then the nitrogen
excretion becomes:
= 19.49 + 2.0
= 21.49 g/24 h
7. A 30 min basal gastric secretion sample (total volume 27 mL) required 2.5 mL of
0.1 M NaOH to titrate 5 mL of the material to pH 7.4. Calculate the basal acid
secretion rate in mmol/h.
M1 x V1 = M2 x V2
M1 x 5 = 0.1 x 2.5
M1 = 0.1 x 2.5
5
= 0.05 M
483
CALCULATIONS IN LABORATORY MEDICINE – A. DEACON
Divide by 1,000 to give the acid output per mL of gastric fluid then mutiply by the
total volume of gastric fluid collected (27 mL) to obtain the total output of acid:
Since the gastric fluid was collected over 30 min, multiply this result by 2 to
obtain the amount of HCl secreted in 1 h:
= 2.7 mmol/h
8. A five-day faecal fat collection was homogenised and diluted to 1500 mL. A 10
mL aliquot of the homogenate was subjected to hydrolysis and the fatty acids were
extracted. The volume of 0.05 M sodium hydroxide required to effect
neutralisation was 48 mL. Calculate the fat excretion in mmol/24 h.
M1 x V1 = M2 x V2
M1 x 10 = 0.05 x 48
484
WORKED ANSWERS TO FURTHER QUESTIONS
Multiply by the total volume (in litres) of the homogenate to obtain the total fatty
acid output over the 5-day collection period:
Fatty acid output = 240 x 1.50 = 360 mmol fatty acid/5 days
Assuming that all the fatty acids were liberated from triglyceride then division by
3 gives the total fat output:
485
CALCULATIONS IN LABORATORY MEDICINE – A. DEACON
Divide the drug peak area by the internal standard peak area to give the peak
height ratio (PHR) for both standard and patient:
PHRPatient = PHRStandard
ConcentrationPatient ConcentrationStandard
Substitute the PHR values and standard concentration to obtain the drug
concentration in the patient sample:
3.75 = 4.00
ConcentrationPatient 200
486
WORKED ANSWERS TO FURTHER QUESTIONS
A 0.65
B 0.35
If the conditions for the Hardy-Weinberg equilibrium are met then the frequencies
of the three genotypes are:
AA = p2 = 0.652 = 0.4225
BB = q2 = 0.352 = 0.1225
487
CALCULATIONS IN LABORATORY MEDICINE – A. DEACON
Let the dominant gene be A and the recessive gene a. As the inheritance of the
disease is autosomal recessive only the homozygous recessive genotype (aa)
expresses the disease.
Since the total of all frequencies must equal 1, the frequency of the remaining
homozygous dominant genotype, AA (which does not express disease nor have
carrier status) can be calculated by difference:
= 1 - 0.0204
= 0.9796
Genotype AA Aa aa
Observed frequency 0.9796 0.020 0.00040
Since p + q = 1
p = 1 - q = 1 - 0.020 = 0.98
488
WORKED ANSWERS TO FURTHER QUESTIONS
Using these values for p and q the frequencies of the other two genotypes can be
calculated:
X2 = ∑ (O - E)2/E
aa 0.0004 0.0004 0 0 0
X2 is the sum of all the values in the final column = 0.010 (2 sig figs)
From tables, the value for P when X2 = 0.010 is somewhere between 0.95 and
0.99. Therefore there is no significant difference between the observed and
expected frequencies so that the data fit the Hardy-Weinberg equilibrium.
489
CALCULATIONS IN LABORATORY MEDICINE – A. DEACON
12. The following data were obtained for a digoxin radioimmunoassay employing
PEG precipitation of the primary antibody. The assay was performed in duplicate.
Calculate the digoxin concentration in the serum sample.
Calculate the mean for each pair of duplicates then B/B0 (%) using the formula:
490
WORKED ANSWERS TO FURTHER QUESTIONS
Sample Conc log conc Duplicate cpm Mean cpm Mean – NSB B/B0 (%)
100
90
Digoxin calibration curve
80
70
60
50
40
B /B 0 (%)
30
20
10
0
-0.8 -0.6 -0.4 -0.2 0 0.2 0.4 0.6 0.8
log10 conc
From calibration curve log10 conc when serum B/B0 (%) = 0.21
491
INDEX
A
Absorbance measurements and analyte Correction of GFR for body surface area,
concentrations, 59 87-88
Absorption laws in spectrometry, 52-53 Correlation and regression, 239
Absorptivity, 60 Correlation coefficients, 241-244
Acid base, pH and buffers, 27-49 Creatinine clearance, 82-83
Analysis of variance (ANOVA), 229-233 calculation from plasma creatinine, 89-
Analytical imprecision, 193 Cumulative drug concentrations, 130
Antilogarithms, 29-30
Altman-Bland plots, 260-262 D
Dealing with mixtures in spectrometry, 69-71
B Deming regression, 258-259
Basis of statistics, 193-215 Dilutions
Beer’s law, 52 dealing with dilutions generally, 18-21
Beer-Lambert equation, 57 doubling dilutions, 23-24
Bioavailability, 113 linear dilutions, 22-23
Biological variability, 210-211 preparing a series of, 22-24
Blood urea nitrogen (BUN), 10 Dissociation constant of water, 28
Body surface area, 87 Distribution of body water, 139-140
Body fluids and electrolytes, 139-155
calculation of fluid balance, 141-142 E
composition of body fluids, 139-140 Eadie- Hofstee plot, 175-176
distribution of body water, 139-140 Effect of hyperglycaemia on plasma sodium,
effect of hyperglycaemia on plasma 147-148
sodium, 147-148 Eisenthal and Cornish-Bowden plot, 175-177
estimation of fluid losses, 141-146 Elimination rate constant, 118
extracellular fluid, 139-140 Enzymology, 157-189
intracellular fluid, 139-140 calculating enzyme activity, 161-162
Bouger’s law, 52 catalytic activity, 157-158
Buffers enzyme inhibition, 178-182
bicarbonate buffer system, 41 graphical solutions for Michaelis-Menton
calculations, 35 equation, 173-178
definitions, 34 interconversion of enzyme units, 164-165
quantitative properties, 38 measurement of activity, 157-160
urinary, 46 Michaelis-Menton equation, 167-172
Bunsen solubility coefficient, 42 Equivalent weights,
Estimation of fluid losses, 141-146
C Exponential growth, 306-307
Calculation of fluid balance, 141-142 Expression of degree of acidity/alkalinity,
Calibration curves in spectrometry, 62-68 28-29
Catalytic activity, 157-158 Extracellular fluid, 139-140
Clearance and GFR, 80-81
Clearance of a drug, 116-118 F
Clinical utility of laboratory tests, 265-283 Factors affecting enzyme actiity, 158-159
Cockcroft-Gault formula, 89-91 Fractional excretion, 94-98
Coefficient of variation, 197-198 Free water clearance, 99-100
Competitive binding immunoassays, 323-328
Composition of body fluids, 139-140
Correcting for purity, 16
492
INDEX
G N
Gaussian distribution, 194-195 Naperian logarithms, 29
Glomerular filtration rate (GFR), 77-93 Nitrogen balance, 308-309
correcting for body surface area, 87-88 Numbers and logarithms, 30
Graphical solutions for Michaelis-Menton
equation, 173-178 O
Odds and likelihood ratios, 279-280
H Optical density, 53
Hanes plots, 173-174 Osmolal gap, 108-111
Hardy-Weinberg equilibrium, 315-322 Osmolality, 105-111
Henderson-Hasselbalch equation, 35-38 Osmolar clearance, 99-100
Henry’s law, 42 Osmotic pressure, 105
Hydrogen ion gradient in the kidney, 46
P
I Partial pressure of gaseous phase, 42
Internal standardisation in chromatography, Pascals, 42
312-314 Pearson correlation coefficient, 241-244
Inter- and intraindividual variation, 193-195 pH, definition and use, 31-33
Interconversion of enzyme units, 164-165 Pharmacokinetics, 113-135
Interconversion of mass and SI units, bioavailability, 113
7-8, 14 clearance of a drug, 116
Ionic product of water, 28 elimination rate constant, 118
Ion-specific electrodes, 152-154 practical applications, 124-135
salt conversion factor, 113
K volume of distribution, 113
Karmen units, 165-166 Population genetics, 314
Katals, 166 Predictive
King-Armstrong units, 164-165 values, 268-272
Kurtosis, 198-199 Prefixes for powers of 10, 3
Preparing a series of dilutions, 22-24
Preparing solutions from liquids, 17
L Preparing solutions from solids, 13
Laboratory manipulations, 13-24
Progress curves of an enzyme catalysed
Lambert’s law, 52
reaction, 159
Levy-Jennings QC charts, 207-208
Punnett’s square, 315
Linear dilutions, 22-23
Linear regression, 250-257
Lineweaver-Burk double reciprocal plot, R
173-174 Radioactive decay, 304-305
Loading dose of drugs, 124, 128 Rate of filtration, 79-81
Logarithms, 28-31 Rate of urinary excretion, 79-81
Receivor operator characteristic (ROC) curves,
276-278
M Recovery experiments, 297-299
Maintenance dose of drugs, 126
Relationship between clearance,
Measures of tubular function, 94-95
plasmaconcentration and urinary excretion,
Method comparison, 260
84-85
Michaelis-Menton equation, 135, 167-172
Renal function, 77-103
graphical solutions for, 173-178
Renal threshold, 96-98
Modification of Diet in Renal Disease
(MDRD) equations, 92-93
493
INDEX
S
Salt conversion factor, 113 probability points of a normal distribution,
Sensitivity and specificity, 266-268 204
SI units, 1 ROC curves, 276-278
Significant figures, 4-5 sensitivity and specificity, 266-267
Solubility coefficient, 43 skewness of data, 198-199
Specific gravity, 17 standard error, 219-220
Spectrophotometry, 51-73 statistical power, 287
absorbance, 53 tests for populations, 201
absortion laws, 52 t-tests, 219-227
basic principles, 51-52 variance ratio (F-test), 227-228
calibration curves, 62-68 variances, 210-211
transmittance of light, 51-55 Westgard rules, 207-208
Standard deviation, 197-198 Steady state of a drug, 129
Standard error of the mean (SEM), 219-221
Statistics T
Altman-Bland plots, 260-262 Titratable acidity, 46, 309-311
analysis of means and variance, 219-236 Total acid excretion, 46
analysis of variance (ANOVA), 229-233 t-tests, 219-227
basis of, 193-215 Tubular maximum, 94-99
coefficient of variation (CV), 196-198 Tubular reabsorption, 95
comparison of means, 219-236 Tumour marker kinetics, 301-303
correlation coefficients, 241-244
correlation and regression, 239 U
Gaussian or normal distribution, 194-195 Units and their manipulation, 1-12
in QC, 206-208 Units for expressing enzyme activity, 160
kurtosis, 198-199 Urinary buffers, 46
Levy-Jennings QC charts, 207-208
linear regression, 250-257
measures of central location, 196
V
Variance ratio (F-test), 227-228
measures of dispersion, 196-197
Volume of distribution,
median and interquartile range,
method comparison, 260
null W
hypothesis, 289-292 Westgard rules, 207-208
odds and likelihood ratios, 279-280
other modes of regression, 258-259 Z
predictive values, 268-272 Zollinger-Ellison syndrome, 309
presentation and descrition of laboratory
data, 194-195
494
Allan Deacon BSc PhD FRCPath was, until
his retirement in 2006, Consultant Clinical
Scientist in the Clinical Biochemistry
Department at Bedford Hospital.