0% found this document useful (0 votes)

83 views60 pages

Intro to Biostatistics Course

The document is a course syllabus for an introductory biostatistics class that covers topics like descriptive statistics, statistical inference, hypothesis testing, and statistical software. The class aims to teach students how to apply basic statistical procedures and analyze primary biological literature. Evaluation will include exams, quizzes, and a final exam assessing students' understanding of statistics and ability to apply concepts using statistical software. The syllabus outlines expectations for class attendance, computer and cell phone use, academic integrity, and provides information on textbooks and software.

Uploaded by

pearl ikebuaku

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

83 views60 pages

Intro to Biostatistics Course

Uploaded by

pearl ikebuaku

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 60

1

Biostatistics
www.harding.edu/plummer/biostats/biostats.pdf

Spring 2018

Introduction Descriptive Graph Inferential Hypothesis Hypothesis Practice

to Statistics Statistics Construction Statistics Testing I Testing II Problems
Will I use Statistical Faculty Statistical Advanced Protocol
Grades
this stuff? Tables Interests Tests Stat Tests Sheet

Course Description
An introductory computer-based statistics course that includes instruction in SYSTAT. Topics
covered include populations and samples, variables, probability distributions, descriptive statistics,
statistical inference, and hypothesis testing. Included are selected parametric and non-parametric tests
for examining differences in means, variances, and frequencies as well as correlation, regression, and
tests of independence.
Emphasis is given to practical matters such as how to choose appropriate analyses and how to
interpret results, both statistically and biologically. High school algebra is the only math background
you need. Biostats is a practical application course - to learn it, you have to do it. Failing to apply
statistical concepts and procedures on a regular basis will diminish your chances of understanding the
material and earning the grade you desire.

What we have to learn to do, we have to learn by doing. – Aristotle

Student Learning Outcomes – By the end of the semester you will be able to:
 understand how science and statistics interact
 apply basic statistical procedures using professional statistical software
 read and understand primary biological literature

Textbooks and Software

 Primary text - www.harding.edu/plummer/biostats/biostats.pdf
 Supplementary text - www.khanacademy.org/math/statistics-probability
 Primary software - SYSTAT (provided on computers in S161 and S182).
 Student software - MYSTAT (free student version of SYSTAT; download at www.systat.com)

Evaluation
Exam 1 20% Exams 1-3 are comprehensive and consist of Content (scantron/short answer
50%) and Practical (SYSTAT problems/graphing 50%) sections. An extra
Exam 2 20%
point may be earned on each exam if you are present in class when feedback is
Exam 3 20% given on your graded exams. Exam study guides
Quizzes 20% ~10 announced quizzes and exercises
The final exam is a comprehensive scantron exam taken during the regularly
Final 20% scheduled final exam period. Unlike Exams 1-3, you will not use a computer on
Exam the final for any task; this includes SYSTAT. Exam study guides
2

Classroom Policies
 Computer resources that may be viewed during lecture include the course website, SYSTAT, and
your M-drive. All other uses (e.g., social notworking sites such as Facebook, Twitter, Instagram,
email, blogs, sports news, pictures of your girl/boyfriend, etc.) are off limits during lecture.
 Cell phone use during lecture is prohibited. If you must send or receive a text or call during
lecture time, please excuse yourself from the classroom and take it to the hallway.
 Regular class attendance is necessary to do well in this course. Excessive unexcused absences will
be handled on an individual basis. An official HU class excuse or prior arrangements with the
instructor is necessary to be excused from an exam.
 Cheating in all its forms is inconsistent with Christian faith and practice and will result in
sanctions up to and including dismissal from the class with a failing grade. Instances of
dishonesty will be handled according to the procedures delineated in the Harding University
catalog.
 The visual appearance or use of any unapproved electronic device during an exam will be
interpreted as cheating and will result in a zero for that exam.
 In accordance with the official Time Management Policy of the University, you are expected to
spend two hours outside of class for each credit hour spent in class each week. That amounts to six
additional hours per week, two of which are imposed on you in conjunction with regular class
time.
 THE ONLINE BIOSTATS LECTURE NOTES ARE NOT COMPLETE SOURCES OF INFORMATION
FOR EXAMS. In general, students are responsible for anything discussed in class.

My Responsibilities
Because, as your teacher, I have a substantial responsibility to you and to the Lord (James 3:1), I
promise my best effort to you in Biol. 254. I pray that my lectures will be clear, my expectations
reasonable, and my exams vigorous, thorough, challenging, and fair. I also pray that your grade will
reflect both your ability and your preparation. Finally, I hope that you will learn something
substantive in my class regardless of what you think about the subject matter. For further insight into
my teaching philosophy, click here - Good luck!

Misc.
 You will need a personal Dropbox account. Data files for the course are available in a shared Dropbox
folder called “Student Biostats.” You should download these files to your M-drive.
 Statements on academic dishonesty, teaching evolution, and students with disabilities

WILL I EVER USE THIS STUFF?

You may be thinking (and perhaps hoping?) you’ll never have to use statistics. The
reality is that if you become a professional of any kind, you will very likely use statistics
according to at least one of the three objectives of this course. Do yourself a favor and
read the unsolicited testimonials from former biostats students.
3

Introduction to Statistics
Home

The relationship of science and statistics

A. Process of science represents an interplay of ideas and data (ID); BC ideas only
1. Use data to make inferences (decisions) relative to ideas [=analyze data]
2. Statistics is a tool that assists decision-making
3. Great increase in use of statistics
 basic science: American Naturalist
 applied science: “Evidence-based medicine (EBM) is an approach to medical
practice intended to optimize decision-making by emphasizing the use of evidence
from well-designed and carefully-conducted research” (Wikipedia); Textbook
4. Manufacturing-based society service/information-based society
 great need for quantitative methods of making decisions using information available
 e.g., demo quantitative decision-making with scatterplot

B. Levels of organization within biology and the relative use of statistics

Community biology -Greater need for stats

Population biology -More uncertainty
Organism biology -Less understanding
Organ biology -More variation
Cell biology -More factors (=more
Molecular biology complexity)

How do we know what we know? - mechanics of the process of science

5. R.B. Fischer - “Science is what scientists do when they’re working.”
6. What DO scientists do when they are working?
7. Several processes: HYPOTHETICO-DEDUCTIVE (IF-THEN) MODEL

OBSERVATION HYPOTHESIS PREDICTION DESIGN TEST 

IF  THEN -observation
-experiment

COLLECT DATA ANALYZE DATA CONCLUSION COMMUNICATION

(statistics) (probability) (talks, publications)

REFINE AND GENERALIZE HYPOTHESIS THEORY (= model)

Absolute certainty is a privilege of uneducated minds-and fanatics.

It is, for scientific folk, an unattainable ideal. - C. J. Keyser
4

8. Advancement comes by disproving false hypos; "proof" in science means disproving

false hypotheses
 A. Einstein - “No amount of experimentation can ever prove me right; a single
experiment can at any time prove me wrong.”
 D. Hull - “The scientific method does not guarantee that you are right; it guarantees
that if you are wrong, someone will find it out.”
 S. Connery - “Isn’t that what science is all about,...eliminating possibilities?” (video)
9. “proof” is tentative - most models have historically been either discarded or radically
modified - no reason to believe that it will be different in the future
 scientific models are pragmatic (useful) - if model works, use it!
 scientific “truth” - not necessarily “TRUTH”

What constitutes the study of “statistics?” (often misunderstood)

-e.g., “There are three kinds of lies: lies, damn lies, and statistics.” -B. Disraeli

What Statistics Is Really About

Population Sample
numerical properties = numerical properties =
“parameters” “statistics”
“Error”
The population is what we want to The sample is what we use to understand the
understand. population.
-Descriptive stats: describe data in the sample
-Inferential stats: infer from sample to
population
5

Descriptive Statistics
Home
Statistical Basics
A. Definitions
1. variable - characteristics that may differ (vary) among individuals
a. measured
b. derived (non-measured); derived from measured variables
c. dependent/response variable vs. independent/predictor variable
2. data - values of variables for individuals (singular datum)
3. case/observation - an individual; symbolize: x1, x2, ...xn (n=sample size)

B. Collection of data
1. population - all individuals of a defined universe (= whatever we say it is!)
2. sample - subset of population; used to make inferences regarding the population
3. statistical error - difference between the real population value and the estimates (from
sample data) of the population value
4. randomness - all individuals have equal probability of being sampled
5. independence - value of one case does not affect the value of other cases

C. Scales of measurement and variable types

1. Categorical scale (Nominal)
a. values not quantitative or ranked; no mathematical or value relationship
b. mutually exclusive categories (e.g., male/female)
c. 1 variable type: categorical
2. Ranked scale (Ordinal)
a. relative differences (e.g., greater than/less than)
b. no mathematical relationship between values (e.g., small/medium/large; highly
active/active/not active)
c. 1 variable type: ranked
3. Ratio scale
a. mathematically defined distance between values; quantitative
b. absolute zero point (e.g., mass)
c. 2 variable types:
 Discrete - may assume only certain values within given range (e.g., 1, 2, 3, 4)
 Continuous - may assume any value within given range (e.g., 1.0, 2.34, 2.344)
d. may convert ratio data to ranked/categorical data (but not vice versa)
4. Interval scale
a. mathematically defined distance between values; quantitative
b. arbitrary zero point (e.g., Celsius temperature scale)
c. 2 variable types:
 Discrete - may assume only certain values within given range (e.g., 1, 2, 3, 4)
 Continuous - may assume any value within given range (e.g., 1.0, 2.0, 2.34,
2.344, etc.)
d. may convert interval data to ranked/categorical data (but not vice versa)

D. Identify variables and measurement scale (variable ID practice)

6
E. SYSTAT Demo
1. windows (output, data, graph); menus
2. data files
 columns (variables); numerical vs. string (categorical) variables (e.g., SEX vs. SEX$)
 rows (values of variables [cases, observations, sample size])
3. creating data files (entering and editing data)
 raw data file (stacked [=indexed] vs. unstacked data
4. opening existing data files (.SYZ files)
5. graphing frequency distributions (GraphHistogram)
6. creating frequency tables (AnalyzeOne-Way Frequency Tables;
(AnalyzeTables Two-Way)
7. calculating an average (AnalyzeBasic Statistics)
8. selecting cases (DataSelect Cases)
9. analyze by groups (DataBy Groups); groups = categories
10. transforming data (DataTransformLet and DataTransformIf..,Then Let

Introduction to SYSTAT
Prepare a SYSTAT data file using the data below. These data are measurements taken from
10 specimens of spiny guanotzits from Arkansas and Missouri. The variables are: collection
locality (categorical), length of body (continuous), sex (categorical), weight of body
(continuous), amount of pigment on the lower jaw (ranked), and number of scales on the chin
(discrete).
Case 1 2 3 4 5 6 7 8 9 10
Locality AR AR MO MO MO AR AR MO AR MO
Length (mm) 22.5 21.4 20.8 20.6 19.8 20.1 22.3 21.7 20.4 21.1
Sex m m f f f f m f m f
Weight (g) 333 298 401 257 21 30 478 400 35 288
Pigment 4 5 5 3 2 1 1 5 4 5
No. scales 23 22 14 26 9 21 17 12 15 12

Name your data file first.syz (the file extension .syz identifies a SYSTAT data file). After you
finish entering the data, proofread the file to make sure that the data are correct, edit if
necessary, save the file and close it. Reopen the file and use it to learn the following menus
and functions:
 File Menu (New, Open, Save, Save As, Print, Exit)
 Edit Menu (Undo, Cut, Copy, Paste, Copy Graph, Delete, Options)
 Data Menu (Variable properties, Transform [Let and If - Then Let], By Groups, Select Cases)
 Graph Menu (Histogram)
 Analyze Menu (One-Way Frequency Tables, Basic Statistics, Tables)
6
Exercises
1. calculate the average guanotzit weight (254.1g) 5 0.5
Proportion per Bar

2. calculate the average guanotzit weight separately for 4 0.4

Count

males and females (m=286.0g; f=232.8g) 3 0.3

3. calculate the average weight for guanotzits from 2 0.2

Arkansas (234.8g) 1 0.1

4. draw a histogram of guanotzit lengths
0 0.0
5. transform weight to the common logarithm of weight 19 20 21
LEN
22 23
7

(case 1: 333.0 to 2.522)

6. create the new variable USE$ and let its value (“yes” or “no”) be
determined by a combination of values of the variables SEX$
and LOC$. Example: If SEX$=”m” and LOC$=”AR”… Then LOC$ by NOSCALES
Let USE$=”yes.” Notice that the variable USE$ is a derived 9 12 14 15 17 21 22 23 26 Total
variable, not a measured variable AR 0 0 0 1 1 1 1 1 0 5
7. how many quanotzits from Missouri were measured? (n=5) M 12 1 0 0 0 0 0 1 5
8. determine the number of guanotzits by scale number and state O

********************************************

Description of Data (from a frequency distribution)

A. Descriptive statistics
1. measures of central tendency
a. mode - most frequent class (of frequency distribution)
b. median (ordinal or ratio/interval data) - middle class
c. mean (ratio/interval data) = “average”; x/n
d. weighted mean (ratio/interval data) - fx/n; used when cases have different
levels of importance (weights); e.g., grade point average

2. measures of dispersion - describe the amount that each

observation is likely to vary from the mean/median
a. maximum, minimum (range): sensitive to
extreme values
b. interquartile range: (quartiles, middle 50%
of observations (Q3 – Q1; difference
between 25th and 75th percentiles)
c. sum of squares(SS): (x -x)2
d. variance: SS/n
e. standard deviation: √variance

3. symbols for statistics (sample) and parameters (population)

Parameter Statistic
Mean  = x/n x = x/n (=“x-bar”)
Variance 2 = (x-) /n
2
s2 = (x-x)2/n-1
Standard Deviation  = √2 s = √(s2) (=“SD”)

4. coefficient of variation (CV)

-expresses SD as a percent of the mean a. CV = (SD/x) 100
-used to compare relative variation in one variable between groups with different means
Example:
mean SD CV Note that group 2 is relatively
Group 1 14.2 2.5 17.6 more variable despite a greater
Group 2 7.2 1.8 25.0 SD in group 1.
8
B. Calculating Descriptive Statistics (mean ± SD)

1. calculate descriptive statistics from raw data file; AnalyzeBasic Statistics

-Use CAVESALYS.SYZ (Sanders)

QUESTION: What are the descriptive statistics of snout-vent-length for female salamanders
collected in Arkansas?

2. calculate descriptive statistics from frequency distribution

 Step 1: DataCase WeightingBy Frequency
 Step 2: AnalyzeBasic Statistics

Value of No. times

Variable observed
0 7
1 24
2 93
3 99
4 24
Total 247

C. How to report sample means (must include a measure of error)

a. Text (example)
b. Tables (example)
c. Graphs (error bars; example)

-how reduce variance? ((x -x)/n)

-what limits n? (availability, money, time)

You now have sufficient knowledge to begin the Graph Construction Exercise on p15.
9

Probability Distributions (expected probabilities associated with all possible outcomes)

 cannot know if experimental result is due to chance alone unless we know

what the expected is (hypothesis testing - basis for much of much of this
course!)
 basic question: How well does an observed frequency distribution fit an expected
frequency distribution? (goodness of fit - GOF)

Discrete probability distribution – Binomial (mutually exclusive categories;

either/or); e.g., male/female, red/white, red/not red

Probability Basics
Example: 1 coin toss- possibilities: 1H, 1T
a. probabilities: no. ways an event (H or T) can
occur /total no events (2) possible; “division”
rule; 1H [1/2] = 0.5; 1T [1/2] = 0.5
b. add all possibilities = 1 [0.5 + 0.5 = 1]
c. probability distribution shape

Example: 2 coin toss- possibilities: 2H, 1H1T, 1T1H, 2T

(mutually exclusive, independent events)
a. probabilities:
1) simultaneous events (“and” rule, multiply): 2H [0.5 x
0.5] = 0.25; 2T [0.5 x 0.5] = 0.25
2) alternative events (“or” rule, add): 2HT [0.5 x 0.5] +
[0.5 x 0.5] = 0.5
b. add all probabilities [0.25 + 0.5 + 0.25 = 1]
c. probability distribution shape

Binomial Distribution
1. formula: P(x) = (n!/(x!(n-x)!))pxq(n-x)
-no need to memorize the formula but you must be able to recognize the formula and
each of its terms
2. terms
P = probability of the number of
occurrences of the event of interest
p = probability of event of interest =
head (”success”)
q = probability of other event (1-p) =
not head (”failure”)
n = number of “simultaneous” events (trials)
x = number of occurrences of the event of interest

3. binomial shape determined by values of n and p

10
EXAMPLE: A reproductive physiologist counted the number of males in 247 litters of 4
siblings each in a species of Dimetrodon (Table). Do these data support the hypothesis of sex
being determined by a XX, XY system as occurs in mammals?

No. males Observed Expected

Observed frequency frequency
0 7
1 24
2 93
3 99
4 24
Total 247 247

Based on the theory of sex determination in mammals (equal chance of being male or female),
calculate the expected frequencies for the number of males in these litters.

P(x) = (n!/(x!(n-x)!))pxq(n-x) Expected No. Expected

proportion litters number(frequency)
prop (0 males) = (4!/(0!(4-0)!)) x 0.50 x 0.5(4-0) = 0.0625 x 247 = 15.438
prop (1 male) = (4!/(1!(4-1)!)) x 0.51 x 0.5(4-1) = 0.2500 x 247 = 61.750
prop (2 males) = (4!/(2!(4-2)!)) x 0.52 x 0.5(4-2) = 0.3750 x 247 = 92.625
prop (3 males) = (4!/(3!(4-3)!)) x 0.53 x 0.5(4-3) = 0.2500 x 247 = 61.750
prop (4 males) = (4!/(4!(4-4)!)) x 0.54 x 0.5(4-4) = 0.0625 x 247 = 15.438
Total 1.00 247

SYSTAT calculation of expected frequencies (UtilitiesProbability CalculatorUnivariate

Discrete)

Question: Is the sex of Dimetredon determined by a mechanism similar to that of mammals?

Expect 1:1. Compare observed with expected.

No. males Observed Expected Conclusion: because of

Observed frequency frequency the large deviations
0 7 15.438 between the expected
1 24 61.750 and observed numbers,
2 93 92.625 we reject the idea of
3 99 61.750 there being equal
4 24 15.438 chances of having equal
Total 247 247 sexes.

So, what determines sex in Dimetredon?

Importance of sample size for observed data (1 coin example, compare to theoretical)
 IF observed = norm coin, THEN the larger the n, the closer we approximate expected
conversely, THEN the smaller the n, the more we deviate from expected
11

Exercise: Binomial Distribution

Assuming that the sex of hatchling turtles is determined by a particular combination of
chromosomes as in mammals (i.e., an XX, XY system), fill in the expected frequencies
below:

Data are number of male hatchlings emerging from 84 nests of kaw turtles (kaw turtles
always lay 6 eggs per nest).

No. Males Observed Expected Compare the observed and

Observed No. Nests No. Nests expected frequencies. Do these
0 4 data support the hypothesis that
1 7 sex of hatchlings is genetically
2 15 determined? (yes or no)
3 24 Support your conclusion.
4 22
5 7
6 5 ans: exp- 1.310, 7.875, 19.688, 26.250, 19.688,
7.875, 1.310
Total 84 84

Discrete probability distribution - Poisson (expected distribution for rare and random events)
1. Poisson: = 2 (2/= 1) - distribution defined by mean only; low value (rare
events; e.g., recapture rates, bacterial viruses infecting bacteria)
2. Poisson formula: P(x) = (x xe-x)/x!
-Students: no need to memorize the formula
but you must be able to recognize the formula
and each of its terms
3. terms
-P = probability of the number of
occurrences of the event of interest
- x = mean occurrence of event of interest
- e = mathematical constant (=2.71828)
- x = number of occurrences of
the event of interest
6. Poisson shape determined byx
Example: An ecologist counted the number of maple seedlings in 100 quadrats

No. Obs. No. Exp. No.

Plants Quadrats Quadrats
0 35
1 28
2 15
3 10
4 7
5 5
Total 100 100
12
Using the mean calculated from the observed frequency distribution of maple seedlings per quadrat
in the table (x = 1.41), calculate the expected frequencies assuming that occurring in a quadrat is a
random event.

Expected Expected
proportion number (frequency)
prop (0 seedlings) = (1.410e-1.41)/0! = 0.244 x 100 = 24.41
prop (1 seedling) = (1.411e-1.41)/1! = 0.344 x 100 = 34.42
-etc.

SYSTAT calculation of expected frequencies (UtilitiesProbability CalculatorUnivariate

Discrete)

Question: Do seedlings occur randomly in quadrats?

No. Obs. No. Exp. No. Conclusions:

Plants quadrats Quadrats 1. Is it rare? (mean=1.41)
0 35 24.41
1 28 34.42 2. Is it random?
2 15 24.27 a. compare obs and exp
3 10 11.41 distributions
4 7 4.02 b. calculate variance/mean
5 5 1.11 ratio (2.18/1.41=1.55)
Total 100 100

Exercise: Poisson Distribution

Assuming that being killed by a horse is a rare and random event, fill in the expected frequencies
below.

Men killed by being kicked by a horse in the Prussian Army Corps.

No. killed/ Observed Expected ans: exp- 108.67, 66.29, 20.22, 4.11, 0.63
yr/corps Number Number
0 109
1 65 x = (ans: 0.610)

2 22
3 3 s2 = (ans: 0.611)

4 1
Total 200 200 s2/x = (ans: 1.002)

Compare the observed and expected frequencies.

Do these data support the hypothesis that the chance of being killed by a horse in the
Prussian Army Corps is a rare and random event? Support your conclusion.
13

Exercise: Testing Your Concept of Randomness

1. draw 100 dots on the 10x10 grid on the next page (keep your eyes open, try to place dots
randomly
2. count the number of cells with different numbers of dots
3. create a frequency table of your data
4. calculate the mean and variance of the number of dots per cell

mean = variance =

5. calculate the variance/mean ratio =

6. interpret: ratio = 1 (random); ratio <1 (evenly spaced); ratio >1 (clumped)

7. Application: patterns of distribution in space reflect biological processes; for example, disease
spread and behavioral/ecological interactions
14
15

Graph Construction
Home

In this exercise, you will learn to construct five basic graphs used by biologists. The rules for
graph construction presented here will apply to all graphs you construct during the semester. As
you finish graphs, copy and paste each image to a Word file named graphexercise, add the caption,
and save. There are three parts to the exercise:
1. You will reproduce 5 finished graphs given to you;
2. You will be given data and asked to construct 5 appropriate graphs;
3. You will find an example of each of the 5 graph types in the primary literature.

A. Basic graph types

1. Histogram (GraphHistogram) - plots the frequency (counts/proportions/percentages) of
occurrence as a bar on the Y-axis against a variable on the X-axis
2. Bar (GraphBar) - plots the mean and error bars of a variable as a bar on the Y-axis against
a categorical variable on the X-axis
3. Dot (GraphSummary ChartsDot) - plots the mean and error bars of a variable as a
symbol on the Y-axis against a categorical variable on the X-axis
4. Box Plot (GraphBox Plot) – plots the median and quartiles of a variable on the Y-axis
against a categorical variable on the X-axis
5. Scatterplot (GraphScatterplot) – plots cases of one variable on the Y-axis against cases of
another variable on the X-axis

Requirements of all graphs

 The Y variable is always read before the X variable. For example, “plot Y against X”, “plot Y by X”,
and “Y is regressed against X”. For this class, X is never plotted against Y.
 Essential graph elements: axes (Y, X), axis labels (with units of measurement, if applicable), ticks, tick
labels, caption
 Elements essential for specific graph types: bars, symbols, error bars, data points, line, linear smoother
 Each graph must be self-explanatory and be able to stand alone (figure captions are considered part of
the graph). Captions should be descriptive, not interpretative.
 Non-standard abbreviations must be defined.
 Graphs displaying means (Bar, Dot) must portray the mean, error bars, and sample size for each mean.

B. Graph reproduction - Reproduce each graph (1-5) illustrated below. Read the description of
each data file before beginning. Copy and paste your SYSTAT output into a Word file named
graphexercise, add captions, and save.

1. HISTOGRAM - A SYSTAT Histogram plots the frequency (counts/proportions/percentages)

of a single variable. Duplicate the Histogram below. Note axis titles, axis ranges, data
plotted, bar fill, etc. The data are in RANDOM.SYZ (Plummer).
16
100

90
0.10
80

Number of Captures

Proportion per Bar

70 0.08
60

50 0.06

40
0.04
30

20
0.02
10

0 0.00
1300 1400 1500 1600 1700 1800 1900 2000
Location (m)
Fig. 1. The distribution of captures of green snakes according to location.

2. BAR - A SYSTAT Bar graph plots the mean of one variable against another variable.
Duplicate the BAR graph below. Note bar fill, axis titles, error bars, data plotted, etc. The
data are in MOUSEDIET.SYZ (Cooper).

300

250
BODY MASS (g)

200

150

100
5K-96 AIN-cas AIN-spi P5001
DIET

Fig. 2. The relationship of mean body mass and diet in laboratory mice fed
different diets. Plotted are mean  1 SD. Sample sizes are: 5K-96, n=34; AIN-
cas, n=35; AIN-spi, n=32; P5001, n=42.
17

3. DOT - A SYSTAT Dot graph plots the mean of one variable against a discrete or categorical
variable. Duplicate the Dot graph below. Note symbols, error bars, fill, axis titles, axis
ranges, data plotted, etc. The data are in WORMSURVIVE.SYZ (JMGoy).

NO. UNIMPAIRED MOVEMENT

-10
0 1 2 3 4 5 6 7
TRIAL DAY
Fig. 3. Mean number of C. elegans exhibiting unimpaired movement according to trial
day. Plotted are mean  1 SD. Sample sizes are day 1, n=48; day 2, n=51; day 3,
n=49; day 4, n=15; day 5, n=7; day 6, n=2.

4. BOX – A SYSTAT Box Plot plots the quartiles of one variable against a discrete or
categorical variable. Duplicate the Box Plot below. Note symbols, axis titles, axis ranges,
selected data plotted, etc. The data are in CAVESALYS.SYZ (Sanders).

Fig. 4. Box plot of the body lengths of female Eurycea lucifuga captured in Arkansas
and Kentucky caves in February and March. Plotted are the median (horizontal line),
the 25th and 75th quartiles (box) and the maximum and minimum values (whiskers).
18
5. SCATTERPLOT - A SYSTAT Scatterplot plots individual cases of one variable against
another variable. Duplicate the scatterplot below. Note symbols, axis titles, axis ranges,
selected data plotted, etc. The data are in LONOKE.SYZ (Plummer).

800

600
BODY WEIGHT (g)

400

200

0
50 60 70 80 90 100
SNOUT-VENT LENGTH (cm)
Fig. 5. The relationship of body weight and snout-vent length in 99 adult (=individuals
>50 cm SVL) male diamondback water snakes.

C. Graph construction: Construct an appropriate graph for each of the following problems and
save in your graphexercise file.

6. Use the following data on bill lengths (mm) of 42 belted kingfishers to construct a graph (Fig.
6) that plots the median and other quartiles separately for males, females, and the sexes
combined (3 groups).

males: 48.1, 47.7, 48.0, 50.6, 50.8, 49.9, 49.3, 50.8, 46.9, 49.9, 48.8,
47.5, 48.2, 51.0, 48.8, 52.0, 51.8, 51.0, 50.1, 47.7, 49.9
females: 53.8, 59.2, 52.3, 59.3, 56.5, 56.2, 55.6, 57.7, 52.5, 47.8,
51.5, 55.8, 57.5, 56.8, 47.0, 50.4, 58.0, 61.2, 56.5, 59.3, 59.2

For graphs 7-10, use the data file LONOKE.SYZ (Plummer).

7. Construct a graph (Fig. 7) that plots cases of weight against length for snakes collected in
ponds #53 and #54. Indicate sample size.
8. Construct a graph (Fig. 8) that illustrates the mean body weight for each sex. Restrict cases to
snakes ≥30 and ≤90 cm SVL. You can more easily make the X-axis readable by creating a
derived variable with this transform: IF sex=1 THEN LET sex$=”male”
9. Construct a graph (Fig. 9) that illustrates the frequency of female snakes captured in minnow
ponds by snout-vent length. Indicate sample size.
10. Transform variable WGT with common logarithms. Construct a graph (Fig. 10) that plots
cases of the transformed variable against SVL. Indicate sample size.
19

D. Literature Graphs: The third part of this exercise consists of finding an example of each of the
five graph types in primary literature papers.

What is the Primary Literature?—Journals (evidence-based science; ID)

1. Original research written by the researcher
2. Peer reviewed
3. Publishing process
4. Some useful working categories
a. First tier—Science, Nature
 Broad subject content
 Publish only the best of the best
 Papers usually report a major advance in the
field
b. Second tier—Proceedings of the National Academy
of Sciences, Ecology, Cell
 Content frequently has restricted subject areas
 Publish most of the top papers in that subject
area
 Reject many technically sound papers if they do not advance our knowledge
sufficiently
c. Third tier—Journal of Herpetology, American Midland Naturalist, Journal of Immunology
 Content limited in subject area and/or geographical coverage
 Publish the bulk of papers in the subject area
 Most technically sound papers are accepted even if they do not dramatically advance
our knowledge

Structure of a Primary Literature Paper

1. Abstract
-provides an overview of the paper
2. Introduction
-provides a theoretical framework for the study
-provides an overview of what is already known
-clearly states the question and why it is important
3. Materials and Methods
-provides details of the experimental design
-provides details about how the data were collected and analyzed (including statistical
analysis)
4. Results
-provides a textual description of the results of analyses
-provides tables and/or graphs showing quantitative and statistical results of analyses
5. Discussion
-compares the results to what was previously reported in the primary literature
-points out how the results either strengthen or weaken current theoretical models
-if appropriate, makes suggestions on how theoretical models should be modified
-highlights questions in need of further research
20
6. Literature Cited
-contains the full citation for every paper cited in the text. Does not contain citations that
are not cited in the text

As you locate an example of each graph in the literature, download a digital copy, insert into
graphexercise, and save in order - Fig. 11 Histogram, Fig. 12 Bar, Fig. 13 Dot, Fig. 14 Box,
and Fig. 15 Scatterplot. Make sure to include the caption. Under each graph caption, type the
citation of the paper where you found the graph. Proper citation format is: last name, initials,
initials, last name, and initials, last name. year. title. journal volume:pages. Here’s an
example;

Harless, M.L., A.D. Walde, and D.K. Delaney. 2010. Sampling considerations for improving
home range estimates of desert tortoises: effects of estimator, sampling regime, and sex.
Herpetological Conservation and Biology 5:374-387.

Note: Histogram, Bar, Dot, Box, and Scatterplot are names given to particular graphs by SYSTAT.
You may find different names in other statistical software and in the literature; for example, a
histogram may be called a frequency distribution or a bar graph. Don’t let that confuse you! You
should be skilled enough to quickly determine the type of graph just by looking and applying your
knowledge. For example, ask yourself what statistic is plotted on the graph; is it frequencies, means,
medians, or individual cases?

Turn in a printed copy of graphexercise on the due date. Print two graphs per page. Do
not separate the graphs from their respective captions.

How to Search Primary Literature (Google Scholar; Library)

Inferential Statistics
Home

The Normal distribution

-very important frequency distribution for 2 reasons:

A. Data that are influenced by many small and unrelated random effects are approximately
normally distributed (math: Fuzzy Central Limit Theorem); extremely widespread and
common in nature
B. Forms the conceptual basis of a large number of statistical procedures - one of the most
important theoretical distributions in statistics
C. Properties
1. formula: 1/(2)exp(-(x-)2/22)
2. students – no need to memorize the formula but you must
be able to recognize it
3. shape determined by mean and SD
4. symetrical around the mean (mean=mode=median)

5. x1SD = approx. 68% of cases; 2SD = approx 95%

D. Standard normal distribution

1. many different “normal” distributions
2. standardize any normal distribution (directly compare)
3. express individual cases in terms of SND; z = (x -x)/s;
“z-score”
4. z-score = distance from mean in standard deviation units;
e.g., z = 1 (=1SD greater than the mean)
5. Areas of normal curve (Tables)

E. Testing observed data for normality; SYSTAT output (TREAT.SYZ, EGGWGT)

1. qualitative: Probability plot (GraphDistribution PlotsProbability Plot): DEMO
2. quantitative: Kolmogorov-Smirnov Test: DEMO
3. SYSTAT path: AnalyzeNonparametric TestsOne-sample KS (Enter selected variable
and Lilliefors distribution)
 hypothesis: frequency distribution of EGGWGT is normally distributed
 test statistic, probability
22
 if probability <0.05, reject the hypothesis; conclusion: EGGWGT distribution is not
normally distributed (=”skewed”)
 if probability >0.05, cannot reject the hypothesis; conclusion: EGGWGT distribution
is normally distributed
____________________________

Exercise: practice SYSTAT Probability Plot and One-sample KS Test using the variable
H2OOUT from file DLWMEANS.SYZ. Note that H2OOUT is not normally distributed
(skewed)

-Data transformation has the potential to normalize non-normal data)

1. Data transformations - many procedures in statistics assume that data are normally
distributed. If data are not normally distributed, one can transform the data to another
measurement scale in an effort to normalize them. Deciding which transformation to use is
entirely practical, i.e., the “right” transformation is whatever makes the data normally
distributed. Trial-and-error applications of various transformations may be necessary to
determine which will work. However, some transformations work better in some situations
than in others. Examples of transformations commonly used in biology are the logarithmic,
arcsine, and square-root transformations.

 the logarithmic transformation is useful in a wide variety of situations and is by far the
most commonly used transformation in biology
 the arcsine (inverse sine) transformation is used specifically when data are in the form of
proportions or percentages
 the square-root transformation is used specifically when data are in the form of counts

2. Transform the variable H2OOUT with common logarithms and retest for normality with both
Probability Plot and KS. Note that the SYSTAT designation for common logs is L10
(always use common logs in Biol. 254). After transformation, the new variable
L10H2OOUT should now be normal

Always create a NEW variable name for the transformed variable!

Statistical inference - draw conclusions regarding populations based on analysis of samples

from those populations

Population Sample
(numerical properties= parameters) “Error” (numerical properties = statistics)

The population is what we want to The sample is what we use to understand the
understand. population.
 Descriptive stats: describe data in the sample
 Inferential stats: infer from sample to population

1. Two major categories of statistical inference

a. Estimate parameters (e.g., , σ)
b. Test hypotheses (infer population from sample)
2. The foundation for both concepts is the Sampling Distributioin
a. take repeated samples from population
b. examine distribution of sample means
3. Two major predictions of the Central Limit Theorem regarding sampling distributions
a. Means of samples from a normally distributed population will be normally
distributed
 mean of means = x/n
 SD of means (=standard error of mean, SE or SEM); SE = SD/√n
b. Means of samples from a non-normally distributed population will be normally
distributed if n is sufficiently large (required n is proportional to amount of
variation)

Simulation: Rice University Virtual Stats Lab

Estimation of parameters
1. How well does the sample mean (x) estimate the population mean (µ)?
a. in a normally distributed population, 95% of the cases lie betweenx - 1.96 SD andx
+ 1.96 SD
b. in a normal sampling distribution, 95% of the means lie betweenx - 1.96 SE and x
+ 1.96 SE
c. interpretation: 95% chance that population mean is enclosed within these limits (95%
confidence limits)

Absolute certainty is a privilege of uneducated minds-and fanatics.

It is, for scientific folk, an unattainable ideal. - C. J. Keyser

d. problem: sampling distributions of means may depart from normality if sample size is
small (central limit theorem)
e. solution: use distribution that adjusts for sample size - Student’s t-distribution (shape
determined by 3 characteristics):
24

 mean, SD, df
 areas of curve that
exclude a given
proportion of the
distribution vary
with n (Tables)
 at infinity df, t0.05 =
1.96 as in normal
distribution

f. to calculate 95% CLs using a t-distribution, replace 1.96 with value from t-table
 UL: mean + (t[0.05, n-1]) x SE
 LL: mean - (t[0.05, n-1]) x SE

g. examples: calculate 95% CLs for these sample means:

 x = 4.7, SD = 0.27, N = 25 95% CI = 4.58 – 4.81 (higher n; narrower CLs)
 x = 4.7, SD = 0.27, N = 7 95% CI = 4.45 – 4.95 (lower n; broader CLs)

2. 95% CL in the public media: GPS accuracy, political polls, church surveys

3. How to report sample means

 x ± SD - provides idea of how much variation there is in the
data but does not provide information on how well statisticx
estimates parameter µ
  x (95% CLs) provides information on how wellx estimates
 and if two means are significantly different from each other
 x ± 1SE (most common way of reporting means in text,
tables, and graphs)
25

Differences in means: graphic methods for ‘informed guessing’ whether means are
statistically different

Fig. 1. (A) Feces production (x 1000) in

juvenile and adult green snakes by month.
(B) Feces production (x1000) in adult male
Caution! The most common and female green snakes by month.
way of reporting descriptive Plotted are means ± 2 SE.
statistics in the literature is
mean ± 1 SE; Proper inter-
pretation requires that you VS.
visually double the value
of the SE to get 95% CLs.

Fig. 1. (A) Feces production (x 1000) in

juvenile and adult green snakes by month.
(B) Feces production (x1000) in adult male
and female green snakes by month.
Plotted are means ± 1 SE.

To properly interpret graphs displaying descriptive statistics, you must know what the
error bars represent! (info found in the figure caption or in the M&M)
___________________________________

II. Hypothesis testing

A. Scientific hypothesis testing (sci_method)
1. Scientific method (ID; science begins when we try to explain observations (hypothesis)
2. Primary attributes of a good hypothesis
a. if it is correct, then it will explain what has been observed (consistent with
observations)
b. if it is false, it can be shown to be false (falsifiable)
3. Cannot prove a true hypothesis; science advances by disproving false hypotheses
4. Process of hypothesis testing
a. if-then logic (IF the hypothesis is true, THEN this should be the result); MP?
26
b. if testing results in something other than expected outcome, we reject hypothesis and
look for a better explanation
B. Statistical hypothesis testing - similar procedure
1. State hypothesis such that there are only 2 possible outcomes, e.g.,
a. HA: A  B (cannot test directly) = research
[alternative] hypothesis
b. H0: A = B (if false; assume HA by default) = null
hypothesis

2. Example 1: compare case with known population

H0: case is from population
HA: case is not from population
What is the probability that the null hypothesis is true?
-if low, the research hypothesis is more likely true

Population Sample
3. Example 2: compare sample mean with known µ = 568 x = 598
population
 SYSTAT: Analyze->Hypothesis Testing->Mean-
>one-sample t-test

a. SYSTAT (onesamplet.syz):x = 598; SD = 70.3;

n = 30
b. assume population mean is known [µ= 568]

c. H0:x = µ; HA:x  µ
d. calculate (SYSTAT); one-sample t-test; test statistic, tcalc = 2.31
e. determine probability by comparing tcalc to ttab (tabled value; df=29; Tables); P =
between 0.02 and 0.05)
f. at P=0.05 (alpha level); ttab = 2.045 (critical value)
g. tcalc (2.31) is greater than ttab (2.045), therefore P<0.05
h. two explanations for obtaining a high t value (2.31)
 null hypothesis is true; sample mean differed by chance alone (unlikely)
 null hypothesis is false (more likely)
i. 1-sample t-test: rarely done in science… Why?

4. Example 3: compare two sample means

(populations unknown - common question in many
areas of biology)
 Hypothesis Testing 1 (next lecture section)
27

C. Writing null hypotheses for parametric difference tests and their nonparametric counterparts
(does not include tests of frequencies or tests of relationships): required components

1. indicator (H0)
2. parameter (e.g., µ, 2)
3. variable (e.g., length, mass)
4. group (e.g., sex, color); for questions of differences between independent data only
(no grouping variable for dependent data)
5. relational operator (e.g., =, ≥, ≤)

-groups are designated by being enclosed in parentheses

-examples: independent: H0: µlength(males) = µlength(females)
dependent: H0: µbeforelength = µafterlength

D. Two-tailed vs. one-tailed hypotheses

1. two-tailed research hypothesis: HA: A ≠ μB (non-directional)
-null hypothesis (opposite of HA:): H0: A = μB

2. one-tailed research hypothesis: HA: A < μB (directional)

-null hypothesis (opposite of HA:): H0: A ≥ μB

3. one-tail: use only 1/2 of distribution (divide probability by 2)

4. how know if one-tail or two-tail? read question carefully

III. Statistical decision-making

1. researchers set alpha level before statistical test is performed (usually 0.05)
2. onesample.syz example: what would happen if you changed alpha to 0.01 after the test was
done? (Tables; tcalc (2.31) < ttab (2.756; P>0.01)
3. possible to reject or not reject null hypothesis with the same set of data! Which one is
“true?” (two types of errors)
TRUTH TABLE The real world; H0 is actually:
TRUE FALSE
Your analysis; you say true Correct Type II error
that H0 is: false Type I error Correct

 type I error (rejecting a true null hypothesis); fixed value set by scientific community
(P=0.05); make mistake 1 out of 20 times
 type II error (failure to reject a false null hypothesis); can be minimized by:
1. increasing sample size
2. choosing the most powerful test (power = probability of rejecting a false null
hypothesis); minimum power of 80% generally necessary for an acceptable
biological conclusion when you cannot reject the null hypothesis
 Why not reduce probability of type I error? – increases probability of type II error
 Alpha set at 0.05 because it represents a compromise between making type I and type
II errors
 SYSTAT - how to calculate power or to determine minimum sample size needed for
a specific power level (Utilities->Power Analysis->specific test)
28

4. Medical application of Truth Table – Diagnostic Testing Outcomes

Disease present Disease absent
(pregnant) (not pregnant)
Test positive (cannot True positive False positive
reject H0; pregnant) (Type II error)
Test negative False negative True negative
(reject H0; not pregnant) (Type I error)

I II

Reporting significance levels (definition of “significant” = H0 has been rejected)

a. conventional method (non-exact probability from statistical table)

 nonsignificant = P>0.05 = ns Statistical decisions are always
 significant = P0.05 = * made at the P0.05 level.
 highly significant = P0.01 = **
 very highly significant = P0.001 = ***

Absolute certainty is a privilege of uneducated minds-and fanatics.

It is, for scientific folk, an unattainable ideal. - C. J. Keyser

b. modern method (exact probability from computer calculation)

c. both methods are correct, so students may use either method in Biol. 254
d. “Statistically significant” is one of those phrases scientists would love to have a
chance to take back and rename. “Significant” suggests importance; but the test of
statistical significance, developed by the British statistician R.A. Fisher, doesn’t
measure the importance or size of an effect; only whether we are able to
distinguish it, using our keenest statistical tools, from zero. “Statistically noticeable”
or statistically discernable” would be much better.” -Mathematician Jordan Ellenberg
e. if you are talking science, avoid using the non-qualified term “significant” in a non-
statistical context

5. Why is it incorrect to “accept” a null hypothesis?

a. it implies that the null hypothesis has been proven true (NO!); the null hypothesis is
only assumed true
b. legal analogy: defendant is assumed innocent until proven guilty (jury decisions:
“guilty” or “not guilty”)
29

c. modern experimental design was developed by Ronald Fisher (1930s). “…it should
be noted that the null hypothesis is never proved or established, but is possibly
disproved in the course of experimentation.”

IV. Statistical Software (usually found toward the end of M&M in primary literature
papers)
 SAS (no. 1 statistical software for scientists); high learning curve
 SYSTAT
 Minitab
 SPSS
 many others (http://en.wikipedia.org/wiki/Comparison_of_statistical_packages)
 Excel is not recommended for inferential statistical analysis.
30

STATISTICAL TESTS Parametric Nonparametric Assumptions of parametric

Home -more power, more -less power, fewer tests
assumptions assumptions  Data are randomly
sampled and independent
Goodness-of-fit (except dependent
(GOF) designed tests( = repeated
Frequencies -----  Chi-square measures)
 Kolmogorov-  Data are measured on
Smirnov (KS) ratio or interval scale
 Data (or residuals in
Variances Bartlett’s Levene’s ANOVA and regression)
are normally distributed
t- tests
for each group
 Independent Mann-Whitney  For questions regarding
Differences samples t means, the variances
2 Means
 Paired samples t Wilcoxon (assumes among groups (or
(assumes data are data are dependent) residuals in ANOVA and
dependent) regression) are
Analysis of Variance Kruskal-Wallis homogeneous
 One-way ANOVA Post-hoc pairwise
comparisons (Dwass-
 Post-hoc pairwise Assumptions of non-
>2 Means Steel-Critchlow- parametric tests
comparisons
(Tukey)
Fligner; DSCF)  Data are randomly
sampled and independent
 Two-way ANOVA (except dependent
designed tests)
Test of Independence
(contingency table ______________________
Frequencies ----- analysis)
 Chi-square
Relationships  Fisher Exact Test
Tests covered on Exam II
Pearson correlation Spearman correlation
Variables/
Cases
Linear Regression -----

How does one know which test is appropriate?

 Read question carefully; make sure you understand what the question is asking
 Look for key words in the question: difference, differ, same as, more/less than, relationship, association,
correlation, linked
 A “v” word, (vary, variance, variation) will be present in the question for differences in variances
 If a “v” word does not appear in a difference question and question does not concern frequencies, assume
question concerns means
 “Affect” and “effect” can be used in both difference and relationship questions. You must understand their
use in context; for example, it likely is a difference question if there is a grouping variable present.
31

Protocol for hypothesis testing - fill in each blank; write "NA” for questions that are not applicable.
Home

A. Justify test used [2].

1. What are the variables? [.2] ____________________________________________________

2. What is the respective measurement scale of each variable? [.2] ______________________

3. Is the question about differences or relationships? [.2] ______________________________

a. If a difference question, does it concern means, variances, or frequencies? [.2] _______

b. If a relationship question, does it concern variables or frequencies? [.2] _____________

4. To determine if a parametric test can be used, ask these questions:

a. Means: If you think the appropriate test is a parametric test of differences in means

-are the data independent or dependent? [.2] ______________________________________

-is each group/variable normally distributed? [.2] Y/N_; probs ____

-are the variances homogeneous? [.2] Y/N_; prob. _____

b. Variances: If you think the appropriate test is a parametric test of differences in variances,

-is each group normally distributed? [.2] Y/N_; probs.____

c. Variables: If you think the appropriate test is a parametric test of relationships between variables,

-are the residuals or each variable normally distributed? [.2] Y/N_______; probs__________

B. State research hypothesis(es) [0]. HA: _________________________________________

C. State null hypothesis(es) [2]. H0: _____________________________________________

(variables must match answers in A1)

D. What is the most appropriate test? [1] _________________________________________

(an incorrect answer limits further points)

E. Execute test(s) and identify and state value of each test statistic [2]. _________________
(an incorrect answer limits further points)

F. State probability of each test statistic [1]. __________________________________________

G. State reject or cannot reject for each null hypothesis [1]. _________________________

H. Concisely state a biological conclusion for each test [1].

Hypothesis Testing 1
Home

Frequencies: Goodness-of-Fit StatTests

1. Test whether an observed frequency distribution fits an expected frequency distribution
2. One variable, mutually exclusive categories, each frequency occurs in one category, no cell has an
expected frequency <5 (must pool categories if violated), no proportions or percentages
3. Null hypothesis: H0: Ovar = Evar
4. Test statistic (χ2) and probability source: Calculator/Statistical Table

-calculation: χ2 = ((O-E)2/E); reading a chi-square table (Tables)

5. Probability models used for determining expected frequencies
 The equal probability model occurs if all categories are equally likely. The expected number of
outcomes for each category is n / no. categories.
 The unequal probability model occurs if there are several categories with unequal probabilities.
The expected number of outcomes for each category is np1, np2, ..etc.
 The binomial distribution model occurs if there are two possible outcomes for any item, with a
constant probability of success with repeated independent encounters of subjects. To calculate the
expected number of outcomes in n experiments, multiply the binomial probabilities by n.
 The Poisson distribution model is used as a probability model for events that occur randomly. To
calculate the expected number of outcomes in n experiments, multiply the Poisson probabilities by
n.
8. df: extrinsic hypothesis (theoretical): df = no. categories – 1
df: intrinsic hypothesis (empirical; e.g., estimating the mean from the data): df = no.
categories – 2
9. Examples:
 Question 1: Is the sex ratio of Wood Ducks skewed? (equal probability model; extrinsic)
 Question 2: Do Rough Green Snakes prefer a particular kind of tree when sleeping? (unequal
probability model; intrinsic); pic)
 Question 3: Do the sample data fit a binomial distribution? (Binomial model; extrinsic; PP#36)
 Question 4: Are seedlings randomly distributed among quadrats? (Poisson model; intrinsic; PP
#60)

Example problems
1. Two purple-flowered pea plants, both heterozygous for flower color, were crossed, resulting in 78
purple-flowered offspring and 22 white-flowered offspring. Question: Does this outcome differ from
the expected 3:1 ratio of purple-flowered to white-flowered offspring? (Protocol link)

2. The data below are number of juvenile manatees killed by boats in Florida. Question: Are males and
females equally susceptible to being killed by boats? (Protocol link)
no. males killed (1985-1995): 206
no. females killed (1985-1995): 127
33

Frequencies: Test of Independence (=test of association) StatTests

1. Test whether the frequencies of two categorical variables are independent (unrelated)
2. Two categorical variables, each frequency occurs in multiple mutually exclusive categories, no
proportions or percentages, no cell has an expected frequency of <5 (Systat will inform you of
violations)
3. Null hypothesis: H0: row var independent of column var
4. Test statistic (X2) and probability source: Systat/Systat
5. SYSTAT path: AnalyzeTablesTwo-Way (enter row and column variables)

6. Question: Is habitat dependent on (related to) sex?

SYSTAT output: (GINMOVE.SYZ; Plummer)

Frequencies
HAB$ (rows) by SEX$ (columns)

F M Total
+----------------+
P | 480 420 | 900
R| 2 25 | 27
+----------------+
Total 482 445 927

Test statistic Value DF Prob

Pearson Chi-square 22.1511 1.0000 0.000

6. Frequency table data - start with table (no raw data)

a. example 1 – Question: Is there an association between the hemoglobin S allele and
resistance to malaria?

Did not
contract Contracted
malaria malaria
Heterozygotes 1 14
Homozygotes 13 2

SYSTAT output:
Frequencies
MALARIA$ (rows) by GENES$ (columns)

het hom Total

+----------------+
n| 1 13 | 14
y | 14 2 | 16
+----------------+
Total 15 15 30

Test statistic Value DF Prob

Pearson Chi-square 19.286 1.000 0.000
34

b. example 2 – Question: Is the frequency of breaking bones independent of taking calcium

supplements? (supplements)

Example problems
1. The following data are frequency of rabies in skunks collected from three geographic areas.
Question: Is the incidence of rabies dependent on geographic area? (Protocol link)

With Without
Area Rabies Rabies
Ozarks 14 29
Ouachitas 12 38
Delta 11 35

2. The following data are frequency of individuals with different hair colors according to sex.
Question: Is human hair color dependent on sex? (Protocol link)

sex black brown blond red

male 32 43 16 9
female 55 65 64 16

_________________________

Frequencies: Fisher Exact Test StatTests

1. Test whether the frequencies of two categorical variables are independent; 2 x 2 table only
2. Two categorical variables, each frequency occurs in multiple mutually exclusive categories, no
proportions or percentages; no minimum expected cell frequency
3. Null hypothesis: H0: row var independent of column var
4. Calculates probability directly; no intermediate test statistic
5. SYSTAT path: AnalyzeTablesTwo-Way (check Fisher’s Exact Test in Measures, enter row and
column variables)

6. Question: Is phenotype independent of genotype?

Measures of Association for genetics$ and malaria$

genetics(rows) by
malaria(columns)
n y Total Test Statistic Value df p-Value
het 14 1 15
hom 2 13 15 Fisher Exact 0.0000
Test (Two-Tail)
Total 16 14 30
35

Variances: Bartlett’s and Levene’s Tests StatTests

1. Test whether sample variances are from the same population (=homogeneous)
2. Bartlett’s is sensitive to departures from normality (not robust)
3. Null hypothesis: H0: 2var(group a) = 2var(group b) = 2var(group c), etc.
4. Test statistic for Bartlett’s test (χ2) and Levene’s test (F) and probability source: Systat/Systat
5. SYSTAT path: AnalyzeHypothesis TestingVarianceEquality of Several Variances (enter
dependent and grouping variables)

6. Question: Does variation in total absorbance differ between concentrations?

SYSTAT output: (ABSORBANCE.SYZ; Moore)

-Equality of Several Variances

Variable CONC N Mean Variance Median

ABSORB_TOT 8 6.000 0.476 0.020 0.467
16 6.000 0.412 0.052 0.449

Bartlett's Test
Variable Chi-Square df p-Value
ABSORB_TOT 1.004 1.000 0.316

Levene's Test - *For Levene’s, use the F-ratio based on the median.
Variable F-Ratio df p-Value
ABSORB_TOT Based on Mean 1.173 1, 10 0.304
Based on Median 1.045 1, 10 0.331

Example problems
1. The following data are systolic blood pressure in two breeds of domestic cats. Question: Does
variation in pressure (mm/Hg) differ between Siamese and Mynx cats? (Protocol link)
Siamese:122, 138, 129, 152, 149, 166, 110, 114, 155, 136, 189, 145, 129, 115, 144, 134
Mynx: 129, 128, 109, 115, 108, 116, 125, 124, 117, 132, 111, 113, 127

2. Three different methods were used to determine the dissolved oxygen content of lake water. Each of
the three methods was applied to a sample of water six times, with the following results. Question:
Do the three methods yielded equally variable results? (Protocol link)
method 1 method 2 method 3
10.96 10.88 10.73
10.77 10.71 10.79
10.90 10.88 10.78
10.69 10.86 10.82
10.87 10.70 10.88
10.60 10.89 10.81

3. The following data are growth rate (g/d) in newborn rats fed four different diets. Question: Is growth
rate equally variable among diets? (Protocol link)
diet A: 1.6, 1.9, 0.9, 1.1, 1.5, 1.0, 1.8, 1.6 diet C: 0.8, 0.9, 0.5, 0.6, 0.7, 0.5, 0.9, 0.8
diet B: 2.5, 2.0, 2.8, 2.6, 2.6, 2.9, 1.9, 2.1 diet D: 1.0, 1.1, 0.7, 0.8, 0.9, 0.7, 1.1, 1.0
36

4. The following data are number of moths caught during the night by four different trap types.
Question: Is there a difference in the variance of trap effectiveness? (Protocol link)
Trap type 1: 41, 34, 33, 36, 40, 25, 31, 37, 34, 30, 38
Trap type 2: 52, 55, 62, 56, 64, 56, 56, 55
Trap type 3: 25, 33, 34, 37, 41, 34, 40, 36
Trap type 4: 36, 41, 33, 28, 34, 40, 27, 37

REVIEW
Graphic methods for ‘informed guessing’ whether means are statistically
different (not a substitute for a formal statistical test)

Fig. 1. (A) Feces production (x 1000) in juvenile and adult

green snakes by month. (B) Feces production (x1000) in adult
male and female green snakes by month. Plotted are
means ± 2 SE.

VS.
Fig. 1. (A) Feces production (x 1000) in juvenile and adult
green snakes by month. (B) Feces production (x1000) in adult
male and female green snakes by month. Plotted are
means ± 1 SE.

What is the message of

this image? Is there
anything wrong with
how it is portrayed?
37

Means: Independent samples t-test StatTests

1. Test whether two sample means are from the same population
2. Powerful, robust (in literature =“Students” t-test”; William Gossett 1904)
3. Null hypothesis: H0: var (group a) = var (group b)
4. Test statistic (t; absolute value) and probability source: Systat/Systat
5. SYSTAT path: AnalyzeHypothesis TestingMeanTwo-Sample t-test (enter dependent and
grouping variables)
6. Calculate power if you cannot reject H0

Question: Do IAA levels differ between the wild type and triple mutants in the 4D germination
treatments?
a. H0:  IAA(WS) =  IAA(ILR/IAR/ILL)
b. pooled variance t (“regular” t-test - assumes homogeneous variances); use this one
c. separate variance t (“approximate” t-test - does not assume homogeneous variances)
__________________________________________________

SYSTAT output: (DRAMPEY.SYZ; Rampey)

Variable PLANT$ N Mean Standard

Deviation
IAA ILR/IAR/ILL 3.000 11.133 3.350
WS 3.000 21.000 3.378

Separate Variance
Variable PLANT$ Mean Difference 95.00% Confidence Interval t df p-Value
Lower Limit Upper Limit
IAA ILR/IAR/ILL -9.867 -17.493 -2.240 -3.592 4.000 0.023
WS

Pooled Variance
Variable PLANT$ Mean Difference 95.00% Confidence Interval t df p-Value
Lower Limit Upper Limit
IAA ILR/IAR/ILL -9.867 -17.493 -2.241 -3.592 4.000 0.023
WS

:1
Example problems
1. The effect of copper sulfate on the mucus cells in the gill filaments of a species of fish was
investigated. The number of mucus cells per square micron in the gill filaments of untreated fish and
in fish exposed for 24 hours to copper sulfate (mg/l) was as follows. Question: Does exposure to
copper sulfate affect the number of mucus cells in these fish? (Protocol link)
untreated: 16, 17, 12, 18, 11, 18, 12, 15, 16, 14, 18, 12
exposed: 8, 10, 12, 13, 14, 6, 5, 7, 10, 11, 9, 8

2. A species of bacterium was grown with either glucose or sucrose as a carbon source. After a period of
incubation, the number of cells (X 106) was determined. Question: Is there a difference in growth
rate of the bacterium between the two carbon sources? (Protocol link)
glucose: 6.3, 5.7, 6.8, 6.1, 5.2
sucrose: 5.8, 6.2, 6.0, 5.1, 5.8
38

Means: Mann-Whitney StatTests

1. Test whether two sample means are from the same population
2. Null hypothesis: H0: var (group a) = var (group b) (technically testing differences in medians)
3. Test statistic (U) and probability source: Systat/Systat (if provided an outside answer, may need
to convert test statistic (U’=n1n2-U)
4. SYSTAT path: AnalyzeNonparametric TestsKruskal-Wallis (enter dependent and grouping
variables)
5.
6. Question: Does weight differ between the sexes?

SYSTAT output: (LONOKE.SYZ; Plummer)

Mann-Whitney U Test for female length within range as male
<50) length.

The categorical values encountered during processing are

Variables Levels
SEX (2 levels) 1.000 2.000

Dependent Variable WGT

Grouping Variable SEX

Group Count Rank Sum

1 27 912.000
2 44 1,644.000

Mann-Whitney U Test Statistic : 534.000

p-Value : 0.477
Chi-Square Approximation : 0.505
df : 1

Example problems
1. Twenty people were randomly assigned to two groups of ten each. One group viewed a hairy spider,
and the other group viewed a similar but nonhairy spider. Each person was asked to score the spider
she or he viewed on a ranked scariness scale from 1 to 10 (10 being the most scary). The results are
below. Question: Do people find hairy spiders scarier than nonhairy spiders? (Protocol link).
hairy: 10, 8, 7, 9, 9, 10, 9, 9, 5, 8
nonhairy: 7, 6, 8, 6, 1, 5, 4, 5, 6, 3

2. The mass (g) of random samples of adult male tuatara from two localities in New Zealand are given
below. Question: Do animals from locality A differ in mean mass from locality B? (Protocol link)
loc A: 510, 773, 840, 505, 765, 780, 235, 790, 440, 435, 815, 460, 690
loc B: 650, 600, 600, 575, 452, 320, 660
39

Means: Paired samples t-test StatTests

1. Test whether two sample means are from the same population
2. Each individual is measured twice or selected pairs are matched (“repeated measures”); more
powerful than independent t-test (reduced error variance); robust; Exercise in Twins, NASA Twins
3. Data must be in an unstacked format
4. Null hypothesis: H0: var1 = var2 (no grouping variable)
5. Test statistic (t) and probability source: Systat/Systat
6. SYSTAT path: AnalyzeHypothesis TestingMeanPaired t-test (enter paired variables)
7. Calculate power if you cannot reject H0

Question: Does early field metabolic rate differ from late field metabolic rate?

SYSTAT output: (DLWMEANS.SYZ; Plummer)

Paired samples t test on EARLYFMR vs LATEFMR with 6 cases

Mean EARLYFMR = 0.1552

Mean LATEFMR = 0.1268
Mean Difference = 0.0283 95.00% CI = -0.0580 to
0.1147
SD Difference = 0.0823 t = 0.8437
DF = 5 Prob = 0.4373

How to stack dependent data files for testing equality of variances

1. manual stacking (create grouping variable)
2. SYSTAT stacking (DataReshapeStack)

Example problems
1. Brucella abortus antibody titers (pfc/106 cells) in 15 turkeys were measured before and after a period
of stress. Question: Did stress decrease antibody titer in these turkeys? (Protocol link)
turkey no.: 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15
before stress: 20 18 19 18 17 14 17 10 13 16 20 17 16 19 8
after stress: 17 14 16 19 14 18 8 10 12 15 8 6 17 5 3

2. Male hoop snakes, upon encountering one another, may engage in a protracted ritualized combat
behavior until one establishes himself as dominant over the other. Six males were tested in the
presence of a female and again in the absence of a female. Whether each male was tested first with or
without a female was randomly determined. The results in interaction time (min.) are below.
Question: Do these encounters last longer in the presence of a female? (Protocol link)
snake no.: 1 2 3 4 5 6
w/o female: 10 15 8 30 1 80
w/ female: 59 35 70 65 43 90
40

Means: Wilcoxon (Chap. 9) StatTests

1. Test whether two sample means are from the same population
2. Each individual is measured twice or selected pairs are matched (repeated measures)
3. Data must be in an unstacked format
4. Null hypothesis: H0: var1 = var2 (no grouping variable)
5. Test statistic (Z) and probability source: Systat/Systat
6. SYSTAT path: AnalyzeNonparametric TestsWilcoxon (enter paired variables)

7. Question: Does field metabolic rate differ between early and late measurements?

SYSTAT output: (DLWMEANS.SYZ; Plummer)

Wilcoxon Signed Ranks Test Results
Counts of differences (row variable greater than column)
EARLYFMR LATEFMR
EARLYFMR 0 4
LATEFMR 2 0
Z = (Sum of signed ranks)/square root(sum of squared ranks)
EARLYFMR LATEFMR
EARLYFMR 0.0
LATEFMR -0.3145 0.0

Two-sided probabilities using normal approximation

EARLYFMR LATEFMR
EARLYFMR 1.0000
LATEFMR 0.7532 1.0000

Example problems
1. The wattle thickness (mm) of 10 randomly selected chickens was measured before and after treatment
with PHA. Question: Does treatment with PHA affect wattle thickness? (Protocol link)

Chicken no. 1 2 3 4 5 6 7 8 9 10
pretreatment 1.05 1.01 0.78 0.98 0.81 0.95 1.00 0.83 0.78 1.05
posttreatment 3.48 5.02 5.37 5.45 5.37 3.92 6.54 3.42 3.72 3.25

2. Ten young men were asked to rate their feeling of well-being on a scale of 1 (worst) to 10 (best)
before and after taking an experimental drug. Question: Does the drug increase a person’s sense of
well-being? (Protocol link)
individual no.: 1 2 3 4 5 6 7 8 9 10
before drug: 5 8 2 7 5 2 9 3 9 6
after drug: 7 9 1 9 5 9 9 9 10 7

You are responsible for knowing how to work all the Practice Problems concerning differences
in frequencies, association of frequencies, and differences in variances and two means
(Goodness-of-Fit, Test of Independence, Fisher’s Exact Test,, Bartlett’s, Levene’s, Independent
Samples t-test, Paired Samples t-test, Mann-Whitney, Wilcoxon). Exam problems will be taken
directly or modified from Example and Practice Problems.
41

Hypothesis Testing 2
Home

Analysis of Variance StatTests

1. ANOVA – important part of experimental design (Fisher 1935); extremely common in the literature
2. Goal is to partition the sources of natural variability for any given system
-total variability = source1 + source2 + source3, etc. (additive)
3. Also permits measurement of interaction (e.g., drug interaction); source1 x source2 (not additive)
4. Many different ANOVA models; e.g.,
 One-way ANOVA (1 dependent variable, 1 independent variable)
 Two-way ANOVA (1 dependent variable, 2 independent variables)
 Analysis of Covariance; ANCOVA (1 dep, 1 indep, 1 covariate) – at end of course if enough time

One-way ANOVA
1. Test whether sample means are from the same population
2. Powerful and robust
3. Null hypothesis: H0: var(group1) = var(group2) = var(group3), etc.
4. Why not use multiple t-tests? – “The problem of multiple comparisons”

1 2 1 2 3
4 means = 30%
5% Type I error 15% 5 means = 50%

5. Partition total variation into between-group and within-group (“error”) variation

 between group: variation due to being part of a certain group (treatment)
 error variation: all variation not due to being in that group
6. Calculate ratio of between-groups variance/within-groups variance (F-ratio; test statistic)
 F-ratio relatively large when treatment accounts for significant variation
7. Determine probability; compare F-ratio with F-distribution (shape determined by 2 separate dfs)
 numerator (no. treatments – 1)
 denominator (no. observations in all groups - no. groups)
8. Test statistic (F) and probability source: Systat/Systat

REVIEW: Required components of a null hypothesis for questions of differences in means or variances.
1. Indicator (H0)
2. Parameter (e.g., µ, 2)
3. Variable (e.g., length, mass)
4. Group (e.g., sex, color); for questions of differences between independent data only (no grouping
variable for dependent data). Groups are designated by being enclosed in parentheses.
5. Relational operator (e.g., =, ≥, ≤)
Examples:
-independent: H0: µlength(males) = µlength(females)
-dependent: H0: µbeforelength = µafterlength
42

9. SYSTAT path: AnalyzeANOVAEstimate Model (enter dependent and grouping[=factor]

variables)Options (KS, Levene)
10. Calculate power if you cannot reject H0

SYSTAT output: (TREAT.SYZ; Plummer); treat.ppt

Categorical values encountered during processing are: CLUTNO (24 levels)
1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,24,25

Dep Var: EGGWGT N: 245 Multiple R: 0.7090 Squared multiple R: 0.5027 Normality and homogeneity
assumptions are tested after
Analysis of Variance ANOVA with the residuals
Source Sum-of-Squares DF Mean-Square F-Ratio P (=difference between
CLUTNO 334.2372 23 14.5321 9.7115 0.000 observed value and value
Error 330.6987 221 1.4964 predicted by the model)

Example Problems
1. Random samples of a certain species of zooplankton were collected from five lakes and their selenium content (ppm) was
determined. Was there a difference among lakes with respect to selenium content? (Protocol link)

lake A: 23, 30, 28, 32, 35, 27, 30, 32

lake B: 34, 42, 39, 40, 38, 41, 40, 39
lake C: 15, 18, 12, 10, 8, 16, 20, 19
lake D: 18, 15, 9, 12, 10, 17, 10, 12
lake E: 25, 20, 22, 18, 30, 22, 20, 19

2. The following data are amount of food (kg) consumed per day by adult deer at different times of the year. Test the null
hypothesis that food consumption was the same for all the months tested. (Protocol link)

February May August November

4.7 4.6 4.8 4.9
4.9 4.4 4.7 5.2
5.0 4.3 4.6 5.4
4.8 4.4 4.4 5.1
4.7 4.1 4.7 5.6
4.2 4.8
_____________________________________

After significant ANOVA: Which means are different from which other means?
Post hoc pairwise tests counteract the problem of maintaining an alpha level of 0.05 for multiple
comparisons; many different post hoc tests

1. Example: Tukey test

2. Test statistic (Difference) and probability source: Systat/Systat
3. SYSTAT path: AnalyzeANOVAPairwise comparisonsTukey (add group)
4. MYSTAT path: not available
43

SYSTAT output: (TREAT.SYZ; Plummer, select clutno<6); treat.ppt

Categorical values encountered during processing are:
CLUTNO (5 levels) - 1, 2, 3, 4, 5

Dep Var: EGGWGT N: 63 Multiple R: 0.5851 Squared multiple R: 0.3423

Analysis of Variance
Source Sum-of-Squares DF Mean-Square F-Ratio P
CLUTNO 166.0769 4 41.5192 7.5471 0.0001
Error 319.0783 58 5.5014

Post Hoc Test of EGGWGT

Using least squares means.
Tukey's Honestly-Significant-Difference Test
CLUTNO(i) CLUTNO(j) Difference p-Value 95% Confidence Interval
Lower Upper
1 2 -0.4850 0.9911 -3.4991 2.5291
1 3 1.2625 0.7430 -1.6643 4.1893
Note that each Tukey comparison
1 4 -0.3904 0.9957 -3.3171 2.5364 in the matrix, e.g., 1 vs. 5, 2 vs.
1 5 3.6288 0.0070 0.7378 6.5199 5, etc., is a separate statistical
2 3 1.7475 0.3321 -0.8504 4.3454 test. Each test requires its own
2 4 0.0946 1.0000 -2.5032 2.6925 null hypothesis, test statistic,
2 5 4.1138 0.0003 1.5562 6.6714 probability, and conclusion.
3 4 -1.6529 0.3478 -4.1488 0.8431
3 5 2.3663 0.0639 -0.0877 4.8203
4 5 4.0192 0.0002 1.5652 6.4732

Example Problems

1. In a study of snake hibernation, fifteen pythons of similar size and age were randomly assigned to three groups. One
group was treated with drug A, one group with drug B, and the third group was not treated. Their systolic blood pressure
(mmHg) was measured 24 hours after administration of the treatments. Do the drugs affect blood pressure? If so, do they
have similar effects? (Protocol link)
control: 130, 135, 132, 128, 130
drug A: 118, 120, 125, 119, 121
drug B: 105, 110, 98, 106, 105

2. Fourteen hucksters were assigned at random to one of three experimental groups and fed a different diet for six months.
Use the following data on huckster mass (kg) at the end of the experiment to determine if diet affected body size. Which
diet produced the heaviest hucksters? (Protocol link)

diet 1 diet 2 diet 3

60.8 68.7 102.6
57.0 67.7 102.1
65.0 74.0 100.2
58.6 66.3 96.5
61.7 69.8
_______________________________________
44

Kruskal_Wallis Test
1. Test whether three or more sample SYSTAT output: (TREAT.SYZ; Plummer, select clutno<8); treat.ppt
means are from the same population Categorical values encountered during processing are:
2. Non-parametric counterpart to one-way CLUTNO (7 levels)
ANOVA 1, 2, 3, 4, 5, 6, 7
3. Null hypothesis: H0: var (group1) =
var(group2) = var(group3), etc. Kruskal-Wallis One-Way Analysis of Variance for 89 cases
Dependent variable is EGGWGT
4. Test statistic (H) and probability
Grouping variable is CLUTNO
source: Systat/Systat
5. SYSTAT path: Group Count Rank Sum
AnalyzeNonparametric 1 8 374.0000
testsKruskal-Wallis (enter dependent 2 12 731.5000
and grouping (=factor) variables) 3 14 245.0000
4 14 833.5000
5 15 490.0000
Dwass-Steel-Chritchlow-Fligner
Test for All Pairwise Comparisons 6 9 720.0000
7 17 611.0000
Group(i) Group(j) Statistic p-Value
Kruskal-Wallis Test Statistic [H] = 46.9358
1 2 7.8558 0.0000
Probability is 0.0000 assuming Chi-square distribution with 6 DF
1 3 1.2552 0.9745
1 4 9.8964 0.0000
1 5 5.8438 0.0007
1 6 6.1237 0.0003  For post hoc pairwise comparisons after significant KW
1 7 6.5521 0.0001
2 3 -4.1468 0.0524 Dwass-Steel-Critchlow-Fligner Test (DSCF)
2 4 0.9094 0.9954
2 5 0.8282 0.9972
etc.

Example Problems
1. Twenty-four freshwater clams were randomly assigned to four groups of six each. One group was placed in deionized
water, one group was placed in a solution of 0.5 mM sodium sulfate, and one group was placed in a solution of 0.74 mM
sodium chloride. At the end of a specified time period, blood potassium levels (M K+) were determined. Did treatment
affect blood potassium levels? (Protocol link)

pond water: 0.518, 0.523, 0.499, 0.502, 0.520, 0.507

deionized water: 0.308, 0.385, 0.301, 0.390, 0.307, 0.371
sodium sulfate: 0.393, 0.415, 0.351, 0.390, 0.385, 0.397
sodium chloride: 0.383, 0.405, 0.398, 0.352, 0.381, 0.407

2. An entomologist interested in the vertical distribution of a fly species collected the following data on numbers of flies (no.
flies/m3) from each of tree different vegetation layers. Use these data to test the hypothesis that fly abundance was the
same in all three vegetation layers. (Protocol link)
herbs shrubs trees
14.0 8.4 6.9
12.1 5.1 7.3
5.6 5.5 5.8
6.2 6.6 4.1
12.2 6.3 5.4
45

Two-way ANOVA - factorial design; 2 independent variables (=factors)

1. Test whether sample means are from the same population; access interaction between independent
variables
2. Powerful and robust; test assumptions with residuals
3. Test statistic (F) and probability source: Systat/Systat
4. Null hypotheses:
 H0: var(group1) = var(group2) = var(group3), etc. (for each main effect)
 H0: no interaction among factors (interaction = the extent to which the effects of one factor
differ according to the levels of another factor; synergism or antagonism)
5. SYSTAT path: AnalyzeANOVAEstimate Model (enter dependent variable and >1 grouping
variable)

Example 1:
SYSTAT output; MOUSEDIET.SYZ; Cooper

Variables Levels
DIET$ (4 levels) 5K-96 AIN-cas AIN-spi P5001
NPDOSE$ (2 levels) 0 2000

Dependent Variable BODWGT

N 143
Multiple R 0.783
Squared Multiple R 0.614

Analysis of Variance
Source Type III SS df Mean Squares F-ratio p-value
DIET$ 66249.445 3 22083.148 47.793 0.000 *Note there are 3 separate
NPDOSE$ 29869.989 1 29869.989 64.645 0.000 hypotheses tested
DIET$*NPDOSE$ 1538.354 3 512.785 1.110 0.347
Error 62378.033 135 462.060

350
Conclusions
 Diet explains a significant amount of Interaction Plot
300
variation in body weight. Body weight
is greater in mice with the P 5001 diet.
BODWGT

 NPdose explains a significant amount 250

of variation in body weight. Body

weight is greater in mice not receiving 200

NPdose.
 There is no interaction between diet 150
NPDOSE$

and NPdose. Body weight responds 0

2000
the same to diet and NPdose. 100
5K-96 AIN-cas AIN-spi P5001
DIET$
46

Example 2: effect of diet and stress on weight gain in mice

SYSTAT output: (dietstress.syz)

Variables Levels
DIET$ (2 levels) c j
STRESS$ (2 levels) h l
170
Dependent Variable WGTGAIN
Interaction Plot
N 32 160
Multiple R 0.844
Squared Multiple R 0.712 150

WGTGAIN
140
Analysis of Variance
Source Type III SS df Mean Squares F-ratio p-value
130
DIET$ 1568.000 1 1568.000 32.449 0.000
STRESS$ 1458.000 1 1458.000 30.173 0.000 120
DIET$
DIET$*STRESS$ 312.500 1 312.500 6.467 0.017 c
j
Error 1353.000 28 48.321 110
h l
STRESS$

How affect? Conclusions

 Diet explains a significant amount of variation in weight gain. Mice with junk food diets gain more
weight than mice with regular diets.
 Stress explains a significant amount of variation in weight gain. Mice experiencing high stress gain
more weight than mice experiencing low stress.
 The interaction between diet and stress explains significant variation in weight gain. Weight gain
caused by a junk food diet is exacerbated (i.e. made worse) by high stress. Or stated from another
perspective, the weight gain caused by high stress is exacerbated by a junk food diet.

Example Problems

1. Use USOPHEO.SYZ; Plummer to determine if body size is affected by sex and/or location. Read the description of the data
file before proceeding. (Protocol link)

2. Qualime epithelial cancer is hypothesized to result from either genotype or several environmental factors that vary by
season. To address this hypothesis, use the data below on QSA level (g/g; the diagnostic test indicator of qualime
cancer) that were collected on 20 individuals in different seasons. (Protocol link)
QSA Genotype Season QSA Genotype Season QSA Genotype Season QSA Genotype Season
478 ZZ Winter 425 ZW Summer 428 ZZ Summer 466 ZW Winter
538 ZZ Winter 467 ZW Summer 478 ZZ Summer 522 ZW Winter
502 ZZ Winter 444 ZW Summer 455 ZZ Summer 489 ZW Winter
496 ZZ Winter 438 ZW Summer 446 ZZ Summer 475 ZW Winter
483 ZZ Winter 431 ZW Summer 432 ZZ Summer 501 ZW Winter

3. Work practice problem #56. Why is it a one-way rather than a two-way ANOVA? You will have to create a derived variable
to work the problem. There are two ways to do this: (1) enter the derived variable directly on the SYSTAT data sheet or
(2) enter all of the data shown and use TRANSFORM If.., Then Let to create the derived variable. You likely will need
to review how to create derived variables.
47

Correlation
 correlation analysis is a test of association that makes no assumption about a cause-and-effect
relationship (i.e., there is no dependent and independent variable)
 addresses two questions
- does an association exist between two variables?
- if the association exits, what is its strength (effect)?
 requires that both variables be normally distributed random variables

Pearson correlation StatTests

1. Test whether the cases of two variables are correlated (positive or negative)
2. Linear relationships only
3. Null hypothesis: assume no relationship; H0: var1,var2 = 0 (Note there is no grouping variable,
just two ratio or interval variables)
4. Test statistic (correlation coefficient, r (varies from -1 to +1; measure of strength) and probability
source: Systat/Systat
5. r2 (coefficient of determination) - proportion of variation in one variable that is explained by variation
in the other variable (r2 is not a test statistic)
6. SYSTAT path: AnalyzeCorrelationSimple (enter variables; Continuous Data)
7. Calculate power if you cannot reject H0

SYSTAT output (AMPHIBIANS.SYS; Mills); assume normality

for purposes of demonstration only

Number of Non-Missing Cases: 40

Means
BUFO SPECIES
1.5750 2.5000

Pearson Correlation
Matrix
BUFO SPECIES
BUFO 1.0000
SPECIES 0.6198 1.0000

Matrix of Bonferroni
Probabilities
BUFO SPECIES
BUFO 0.0000
SPECIES 0.0000 0.0000
48

Bonferroni probability correction (counteracts the “The problem of multiple comparisons“); reduces
chances of making a Type 1 error (= “false negative” in the medical literature)

SYSTAT output: (AMPHIBIANS.SYZ; Mills); assume normality for purposes of

demonstration only

Number of Non-Missing Cases: 40

Means
BUFO RASP HYLA INDIVIDUALS SPECIES
1.5750 1.7750 0.7500 5.1750 2.5000

Pearson Correlation Matrix

BUFO RASP HYLA INDIVIDUALS SPECIES
BUFO 1.0000
RASP 0.2408 1.0000
HYLA 0.1034 0.2000 1.0000
INDIVIDUALS 0.7103 0.7239 0.5245 1.0000
SPECIES 0.6198 0.5630 0.4854 0.8761 1.0000

Matrix of Bonferroni Probabilities

BUFO RASP HYLA INDIVIDUALS SPECIES
BUFO 0.0000
RASP 1.0000 0.0000
HYLA 1.0000 1.0000 0.0000
INDIVIDUALS 0.0000 0.0000 0.0051 0.0000
SPECIES 0.0002 0.0016 0.0150 0.0000 0.0000

Example problems
1. Use the following data on wing length (cm) and tail length (cm) in cowbirds to determine if there is a relationship between
the two variables. (Protocol link)

Wing 10.4 10.8 11.1 10.2 10.3 10.2 10.7 10.45 10.8 11.2 10.6
Tail 7.4 7.6 7.9 7.2 7.4 7.1 7.4 7.2 7.8 7.7 7.8

2. Use the following data taken from crabs to determine if there is a relationship between weight of gills (g) and weight of
body (g) and between weight of thoracic shield (g) and weight of body. (Protocol link)

Body 159 179 100 45 384 230 100 320 80 220 320
Gill 14.4 15.2 11.3 2.5 22.7 14.9 11.4 15.81 4.19 15.39 17.25
Thorax 80.5 85.2 49.9 21.1 195.3 111.5 56.6 156.1 39.0 108.91 160.1

Spearman correlation StatTests

1. Test whether the cases of two variables are correlated
2. Linear relationships only
3. Null hypothesis: H0: svar1, var2 = 0 (Note there is no grouping variable, just two ratio, interval, or
ranked variables)
49

4. Test statistic (rs) and probability source: Systat/Statistical Table

5. SYSTAT path: AnalyzeCorrelationSimple (enter variables; Rank Order Data)

SYSTAT output: (AMPHIBIANS.SYZ; Mills)

Number of Non-Missing Cases: 40

Spearman Correlation Matrix
BUFO RASP HYLA GACA NOVI INDIVIDUALS SPECIES Spearman probabilities
BUFO 1.0000 are not available in
RASP 0.3113 1.0000 SYSTAT; must get
HYLA 0.2886 0.3879 1.0000 probabilities from a
GACA 0.3407 0.3682 0.1901 1.0000 Spearman Table
NOVI 0.2314 0.0436 0.3467 -0.0526 1.0000
INDIVIDUALS 0.7264 0.7678 0.5804 0.3506 0.2044 1.0000
SPECIES 0.7512 0.6482 0.6438 0.3226 0.3001 0.9173 1.0000

Example problems
1. The following data are ranked scores for ten students who took both a math and a biology aptitude examination. Is
there a relationship between math and biology aptitude scores for these students? (Protocol link)
Math 53 45 72 78 53 63 86 98 59 71
Biology 83 37 41 84 56 85 77 87 70 59

2. Test the following data to determine if there is a relationship between the total length of aphid stem mothers and the
mean thorax length of their parthenogenetic offspring. (Protocol link)
Mother 8.7 8.5 9.4 10.0 6.3 7.8 11.9 6.5 6.6 10.6
offspring 5.95 5.65 6.00 5.70 4.40 5.53 6.00 4.18 6.15 5.93

_________________________________________________________

Correlation vs. causation

1. Earlier: alcoholics in FL vs HU grads; spurious correlations
2. sometimes results from a common correlation with 3rd variable (e.g., B correlated with C because
both B&C are functionally correlated with A); Cause and effect

Regression analysis is a test of association that

 assumes a cause-and-effect relationship between an independent and dependent variable
 is used to address the same basic questions as correlation analysis (with one important additional
question), but from the perspective of cause-and-effect
- does the independent variable explain significant variation in the dependent variable?
- how strong is the explanatory power of the independent variable?
- what is the mathematical relationship between the variables? (i.e., what is the mathematical
equation that describes the relationship?)
 requires that the dependent variable be a normally distributed random variable. The independent
variable may be controlled or selected and thus may not be a normally distributed random
variable.
50

Regression (Chap. 14) StatTests

1. Test whether the cases of one variable are functionally (mathematically) related to the cases of
another variable (i.e., can be predicted from)
2. Linear relationships only
3. Normality assumptions are analyzed with residuals after the regression analysis; robust
4. Null hypothesis: H0: yvar, xvar = 0 (Note there is no grouping variable, just two ratio or
interval variables)
5. Test statistic (F-ratio) and probability source: Systat/Systat
6. SYSTAT path: AnalyzeRegressionLinearLeast Squares (enter dependent and independent
variables; enter KS on options tab)

Procedure
a. Fit regression line (least squares method; minimize (residuals2)
b. Test for significance of slope
c. Write the regression equation (general form Y = a (intercept) + b (slope) X
-do NOT use math format (y = mx + b)
d. Add regression statistics and variable names

SYSTAT output: (SHRIMP.SYZ; Goy)

Output format
 Regression statistics: intercept (=constant), slope (=regression coefficient); standard error
 ANOVA table (test statistic, probability)
 KS test of assumptions

Dependent Variable EGGNO

N 68
Multiple R 0.7763 < Regression
Squared Multiple R 0.6027 statistics
Adjusted Squared Multiple R 0.5967
Standard Error of Estimate 1142.1881

Regression Coefficients B = (X'X)-1X'Y

Effect Coefficient Standard Error Std. Tolerance t p-Value
intercept
intercep Coefficient
t
CONSTANT -4914.5822 683.9281 0.0000 . -7.1858 0.0000
FEMLEN 561.5867 56.1225 0.7763 1.0000 10.0065 0.0000
slope
slope

Analysis of Variance
Source SS df Mean Squares F-Ratio p-Value
Regression 1.3063E+008 1 1.3063E+008 100.1291 0.0000  Test statistic and
Residual 8.6103E+007 66 1304593.7089 probability

Test for Normality

Test Statistic p-Value
K-S Test (Lilliefors) 0.0775 0.3660  KS test of normality assumption for residuals
51

The regression equation from the above analysis and represented on the
graph is:
EGGNO = -4914.6 + 561.6 FEMLEN
In the regression equation, note that 'X' and 'Y' are replaced with the specific
variables in question, i.e., FEMLEN and EGGNO. Also note that the
dependent variable, EGGNO, is plotted on the Y axis, and the independent
variable, FEMLEN, is plotted on the X axis. Another way of stating this is,
"EGGNO is plotted against FEMLEN", or "EGGNO is regressed on
FEMLEN."

8000
A regression plot is a SYSTAT
7000 Scatterplot with a linear smoother.
6000

5000
EGGNO

4000

3000

2000

1000

0
5 10 15 20
FEMLEN

Example problems
1. The following data are rate of oxygen consumption (ml/g/hr) in crows at different temperatures (C). Does
temperature affect oxygen consumption in crows? Determine the equation for predicting oxygen consumption from
temperature. (Protocol link)

temp -18 -15 -10 -5 0 5 10 19

oxygen 5.2 4.7 4.5 3.6 3.4 3.1 2.7 1.8

2. Use the following data on mean adult body weight (mg) and larval density (no./mm 3) of fruit flies to determine if there
is a functional relationship between adult body mass and the density at which it was reared. Determine the equation
for predicting body weight from larval density. (Protocol link)

density 1 3 5 6 10 20 40
weight 1.356 1.356 1.284 1.252 0.989 0.664 0.475
52

Extrapolation: linear regressions are statistically valid only within limits of the data (independent
variable, X); beyond data - do not know if relationship is linear

A regression of tooth size on actual body length for the living Carcharodon carcharias indicates by
extrapolation (assuming continued linearity) that C. megalodon was “only” 13 m (43 ft) in length!

Model building in regression (goal is to build a better model by increasing r2; results in
more accurate prediction)

Data transformation
1. SYSTAT e.g.: calibrate transmitters; DEMO
2. Linear vs. log10 data regressions - note increase in r2 and linearity with log transformation

3500 3.6

3000 3.5

3.4
LOGPI

2500
PI

3.3
2000
3.2
1500 3.1

1000 3.0
0 10 20 30 40 0 10 20 30 40
TEMP TEMP
53

Dep Var: PI N: 7 Multiple R: 0.989 Squared multiple R: 0.978

Effect Coefficient Std Error Std Coef Tolerance t P

CONSTANT 3172.273 97.857 0.000 . 32.417 0.000
TEMP -65.363 4.390 -0.989 1.000 -14.888 0.000

Analysis of Variance
Source Sum-of-Squares df Mean-Square F-ratio P
Regression 3518606.514 1 3518606.514 221.638 0.000
Residual 79377.200 5 15875.440

Dep Var: LPI N: 7 Multiple R: 1.000 Squared multiple R: 0.999

Effect Coefficient Std Error Std Coef Tolerance t P

CONSTANT 3.540 0.004 0.000 . 834.735 0.000
TEMP -0.015 0.000 -1.000 1.000 -78.645 0.000

Analysis of Variance
Source Sum-of-Squares df Mean-Square F-ratio P
Regression 0.184 1 0.184 6185.068 0.000
Residual 0.000 5 0.000

Predicting dependent variable Y from independent variable X

A. Linear (Y, X) equations: Y = a + bX

Example 1: using the regression equation Y = 14.5 + 2.56X, predict Y when X = 63

Y = 14.5 + 2.56(63) = 175.78

____________________________________________
Example 2: inverse prediction (predict X from Y); Y = 14.5 + 2.56X; by algebraic manipulation
Y-14.5 = 2.56X; (Y-14.5)/2.56 = X

predict X when Y = 175.78:

X = (175.78-14.5)/2.56 = 63

B. Semilog (logY, X) equations: log Y = log a + bX (must take the inverse log of
log Y to get final answer on linear scale)
Example: using the regression equation log Y = 1.42234 +0.047560X, predict Y when X = 12.1

logY = 1.42234 + 0.047560(12.1) = 1.99782 (calculate regression coefficients and

answer to at least 5 decimal places); inverse log 1.99782 = 99.49

Note that the intercept (1.42234) is a log value (i.e., log a = 1.42234). You must not take the log of
this value when calculating log Y; that would be the equivalent of taking the log of a log!
54

C. Log-log (logY, logX) and exponential equations: log Y = log a + b(log X); Y = aXb

Example 1 (logarithmic form): using the regression equation log Y = 2.53403 + 0.72000(log X),
predict Y when X = 1.98

log Y = 2.53403 + 0.72000 (log 1.98) = 2.74763 (calculate regression coefficients and

answer to at least 5 decimal places)

inverse log 2.74763 = 559.28

The most common form of the log-log regression equation, and one that is much easier to use is the
exponential form:

log Y = log a + b(log X) = log a + log xb ; take inverse logs: Y = aXb (exponential form)

 Example 2 (exponential form): using the regression equation Y = 342X0.720, predict Y when

X = 1.98; *note that 342 = the inverse log of 2.53403

Y = 342(1.980.720) = 559.28

Examples of important uses of exponential regressions in biology

1. Ecology: species-area curves (Isle Biogeography Theory)

Common slope in some (0.3)

-West Indian snakes: S = 1.19A0.33
-Galapagos land plants: S = 28.6A0.32
-Sierra Nevada mammals: S = 1.18A0.32
55

2. Morphology: effects of scaling; e.g., brain size

Physiology: effects of scaling; e.g., metabolic rate and body mass

Advanced statistical procedures commonly seen in the literature

Home

1. Analysis of Covariance (ANCOVA)

 Method of comparing regression lines: eg,
-marsupials: MR = 0.409 M0.75
-eutherians MR = 0.676 M0.75 (>60% higher)
 Detect differences among means of two or more groups when the dependent variable is affected
by a third (continuous) variable (=covariate)
 A covariate adds unwanted variability to the dependent variable. ANCOVA removes that
variability and yields least squares means (means adjusted for the covariate effect)
 ANCOVA combines the use of both ANOVA and regression methods

Example1: A common belief is that men are stronger than women. Is this belief due to men being
bigger or are men actually stronger when compared to women of similar body size? Test this question
on data from a sample of healthy young adults (stronger.syz). The variables are sex, lean body mass,
and a measure of strength called “slow, right extensor knee peak torque.”

Add Example problems

2. Circular statistics (Raleigh Test) – techniques for data measured on an angular scale. Angular scales
are circular in nature, have no designated zero, and the designation of high and low values is arbitrary.
For example, 0 and 360 point to the same direction.
3. Principal component analysis (PCA) - variable reduction technique that describe variability among
multiple observed variables in terms of a lower number of non-measured derived variables
57

4. MANOVA (multivariate analysis of

variance) – a generalized form of ANOVA
in which there are two or more independent
and/or two or more dependent variables.
MANOVA assesses main effects and
possible interactions among the dependent
variables and among the independent
variables
5. Repeated measures ANOVA – each
individual is measured ≥ two times
6. Logistic regression – regression with a
binary dependent variable (e.g.,
presence/absence
7. Non-linear regression
8. Multiple regression – regression with >1
independent variable (Fig. 2)
58

Statistical Tables
Home
59
60

L1 Introduction and Basic Concepts
No ratings yet
L1 Introduction and Basic Concepts
25 pages
1.1 What Is Biostatistics?
No ratings yet
1.1 What Is Biostatistics?
3 pages
1 Introduction
No ratings yet
1 Introduction
14 pages
Biostatistics Syllabus
No ratings yet
Biostatistics Syllabus
2 pages
Biology Assignment
No ratings yet
Biology Assignment
9 pages
BIOMEDE 503 - Lecture 1 - 20220106 - Class Introduction
No ratings yet
BIOMEDE 503 - Lecture 1 - 20220106 - Class Introduction
28 pages
BIOL 2060 W22 Course Outline
No ratings yet
BIOL 2060 W22 Course Outline
7 pages
BioInteractive Math & Stats in Biology
No ratings yet
BioInteractive Math & Stats in Biology
42 pages
Introduction To Biostatistics Student Lecture Notes
100% (2)
Introduction To Biostatistics Student Lecture Notes
130 pages
Bio Stat Lec 1
No ratings yet
Bio Stat Lec 1
2 pages
Introduction to Biostatistics Basics
No ratings yet
Introduction to Biostatistics Basics
52 pages
Practice of Statistics in The Life Sciences 4th Edition Brigitte Baldi PDF Download
No ratings yet
Practice of Statistics in The Life Sciences 4th Edition Brigitte Baldi PDF Download
133 pages
Introduction To Biostatistics
No ratings yet
Introduction To Biostatistics
3 pages
Biostat Manual
100% (1)
Biostat Manual
97 pages
Biostatistics Kitabu
No ratings yet
Biostatistics Kitabu
97 pages
810-Article Text-1612-1-10-20161231
No ratings yet
810-Article Text-1612-1-10-20161231
5 pages
Intuitive Biostatistics A Nonmathematica
No ratings yet
Intuitive Biostatistics A Nonmathematica
605 pages
HST 190 Introduction To Biostatistics 2019
No ratings yet
HST 190 Introduction To Biostatistics 2019
3 pages
BTS1
No ratings yet
BTS1
3 pages
Biost 1
No ratings yet
Biost 1
59 pages
(Ebook PDF) Statistics For The Life Sciences 5th Global Edition PDF Download
100% (7)
(Ebook PDF) Statistics For The Life Sciences 5th Global Edition PDF Download
46 pages
Week 1 Intro Biostat
No ratings yet
Week 1 Intro Biostat
60 pages
SPPH 400 Trad2022
No ratings yet
SPPH 400 Trad2022
6 pages
BIOS 201 Course Book Fall Sem AY 2021-2022
No ratings yet
BIOS 201 Course Book Fall Sem AY 2021-2022
14 pages
Lec 1
No ratings yet
Lec 1
3 pages
Biostatistics I 2020-1 AH PDF
No ratings yet
Biostatistics I 2020-1 AH PDF
13 pages
Expect The Unexpected PDF
No ratings yet
Expect The Unexpected PDF
315 pages
18BBT0272 VL2019205001339 Da
No ratings yet
18BBT0272 VL2019205001339 Da
8 pages
Introduction To Biostatistics Syllabus
No ratings yet
Introduction To Biostatistics Syllabus
8 pages
Introductoin To Biostatistics (1st and 2nd Lec)
No ratings yet
Introductoin To Biostatistics (1st and 2nd Lec)
47 pages
Intuitive Biostatistics: A Nonmathematical Guide To Statistical Thinking 4th Edition Harvey Motulsky
No ratings yet
Intuitive Biostatistics: A Nonmathematical Guide To Statistical Thinking 4th Edition Harvey Motulsky
51 pages
Intuitive Biostatistics & Normality Test & Sample PDF
100% (11)
Intuitive Biostatistics & Normality Test & Sample PDF
605 pages
0214 Lecture Notes
No ratings yet
0214 Lecture Notes
316 pages
Biostatistics Manual
No ratings yet
Biostatistics Manual
95 pages
Biostatics c1-2
No ratings yet
Biostatics c1-2
81 pages
00seminarinstatistics Feb2022 Vgarga 221007125112 E1c74264
No ratings yet
00seminarinstatistics Feb2022 Vgarga 221007125112 E1c74264
68 pages
SBR Fall 2023 Syllabus Section 1
No ratings yet
SBR Fall 2023 Syllabus Section 1
9 pages
Statistics in Medicine Syllabus
No ratings yet
Statistics in Medicine Syllabus
2 pages
Module 1 - 2 - 3
No ratings yet
Module 1 - 2 - 3
45 pages
Lecture 1 NSU
No ratings yet
Lecture 1 NSU
68 pages
Fundamental Biostatistics Dillon Jones
No ratings yet
Fundamental Biostatistics Dillon Jones
68 pages
Biostatistics - Course Syllabus Spring 2022-2023
No ratings yet
Biostatistics - Course Syllabus Spring 2022-2023
8 pages
Intuitive Biostatistics: A Nonmathematical Guide To Statistical Thinking. ISBN 0190643560, 978-0190643560
100% (22)
Intuitive Biostatistics: A Nonmathematical Guide To Statistical Thinking. ISBN 0190643560, 978-0190643560
23 pages
Syllabus
No ratings yet
Syllabus
5 pages
Chapter 1 Introduction To Biostat
No ratings yet
Chapter 1 Introduction To Biostat
62 pages
Bma3116biostatistics 1
100% (1)
Bma3116biostatistics 1
76 pages
Intuitive Biostatistics A Nonmathematical Guide To Statistical Thinking, 4th Edition Complete Chapter Download
No ratings yet
Intuitive Biostatistics A Nonmathematical Guide To Statistical Thinking, 4th Edition Complete Chapter Download
14 pages
(Ebook PDF) Biological Science, Third Canadian Edition 3rd Edition Download
100% (1)
(Ebook PDF) Biological Science, Third Canadian Edition 3rd Edition Download
53 pages
Biostatistics Lecture Notes 1
No ratings yet
Biostatistics Lecture Notes 1
18 pages
Basi Concepts
No ratings yet
Basi Concepts
32 pages
Practice of Statistics in The Life Sciences Brigitte Baldi PDF Download
No ratings yet
Practice of Statistics in The Life Sciences Brigitte Baldi PDF Download
82 pages
HHSM ZG513 Course Handout
No ratings yet
HHSM ZG513 Course Handout
5 pages
Lecture 1
No ratings yet
Lecture 1
47 pages
(Ebook) Statistics Explained: An Introductory Guide For Life Scientists by Steve McKillup ISBN 9781107005518, 1107005515 Available All Format
No ratings yet
(Ebook) Statistics Explained: An Introductory Guide For Life Scientists by Steve McKillup ISBN 9781107005518, 1107005515 Available All Format
97 pages
Bio-Statistics Lecture#01 Suit
No ratings yet
Bio-Statistics Lecture#01 Suit
30 pages
Lecture Notes For Degree Students Class - 1
No ratings yet
Lecture Notes For Degree Students Class - 1
9 pages
BIOSTAT501
No ratings yet
BIOSTAT501
12 pages
Be Careful When All Is Well
No ratings yet
Be Careful When All Is Well
2 pages
Intro to Pharmacology Course
No ratings yet
Intro to Pharmacology Course
10 pages
CubeRootFormulaMeaning, Table, Formulas, SolvedExamples 1710827181576
No ratings yet
CubeRootFormulaMeaning, Table, Formulas, SolvedExamples 1710827181576
7 pages
Full Text
No ratings yet
Full Text
53 pages
Geometric Series Formula
No ratings yet
Geometric Series Formula
8 pages
The Seven Habits of A Godly Life
100% (2)
The Seven Habits of A Godly Life
2 pages
ConfidenceIntervalFormulaMeaning, Calculation, SolvedExamples 1710827614746
No ratings yet
ConfidenceIntervalFormulaMeaning, Calculation, SolvedExamples 1710827614746
8 pages
C2CStegosaurusBlanketPattern-PixelCrochet 1709270724151
100% (2)
C2CStegosaurusBlanketPattern-PixelCrochet 1709270724151
10 pages
The Courage To Say No
No ratings yet
The Courage To Say No
2 pages
3.2: Truth Tables and Propositions Generated by A Set
No ratings yet
3.2: Truth Tables and Propositions Generated by A Set
3 pages
OvercomingHPAAxisSuppression (FormerlyKnownasAdrenalFatigue) BodyLogicMD 1699524238527
No ratings yet
OvercomingHPAAxisSuppression (FormerlyKnownasAdrenalFatigue) BodyLogicMD 1699524238527
11 pages
Collodion, U. S. P.: 1. Product Identification
No ratings yet
Collodion, U. S. P.: 1. Product Identification
8 pages
Sichuan BBQ Spice Mix Recipe
No ratings yet
Sichuan BBQ Spice Mix Recipe
2 pages
Facing A Satanic Attack
No ratings yet
Facing A Satanic Attack
2 pages
DefinitionandPreparationofSyrups, Elixirs, LinimentsandLotionsPharmaguideline 1708131932947
No ratings yet
DefinitionandPreparationofSyrups, Elixirs, LinimentsandLotionsPharmaguideline 1708131932947
3 pages
Yoked With Jesus
100% (1)
Yoked With Jesus
2 pages
C2CBrachiosaurusBlanketPattern-PixelCrochet 1709270790860
100% (1)
C2CBrachiosaurusBlanketPattern-PixelCrochet 1709270790860
10 pages
Aspen Blanket: Created by Cocoabe Designs
No ratings yet
Aspen Blanket: Created by Cocoabe Designs
3 pages
TheRecommendedExpectedWeightOfCatfish Fashion Nigeria - 1711426462532
No ratings yet
TheRecommendedExpectedWeightOfCatfish Fashion Nigeria - 1711426462532
2 pages
TangentFormulaTangentFunctions, Formulas, SolvedExamples 1710827247707
No ratings yet
TangentFormulaTangentFunctions, Formulas, SolvedExamples 1710827247707
7 pages
HLB Scale and Surfactant Properties
No ratings yet
HLB Scale and Surfactant Properties
2 pages
adsLawsofMatrixAlgebra 1708555811229
No ratings yet
adsLawsofMatrixAlgebra 1708555811229
2 pages
C2COceanWavesCrochetPattern-PixelCrochet 1709271121898
100% (3)
C2COceanWavesCrochetPattern-PixelCrochet 1709271121898
9 pages
DIYFringeStatementEarrings CrochetEarringsPattern PersiaLou - 1705154648880
No ratings yet
DIYFringeStatementEarrings CrochetEarringsPattern PersiaLou - 1705154648880
30 pages
313pm - 20.epra Journals 12122
No ratings yet
313pm - 20.epra Journals 12122
9 pages
5.3LawsofMatrixAlgebra-MathematicsLibreTexts 1708554879233
No ratings yet
5.3LawsofMatrixAlgebra-MathematicsLibreTexts 1708554879233
2 pages
EHR Perceptions
No ratings yet
EHR Perceptions
27 pages
ChordLengthFormulaMeaning, Properties, SolvedExamples 1710827687134
No ratings yet
ChordLengthFormulaMeaning, Properties, SolvedExamples 1710827687134
7 pages
Catfish Farming Guide for Nigerians
No ratings yet
Catfish Farming Guide for Nigerians
14 pages
Crochet Wallflower Pattern Guide
No ratings yet
Crochet Wallflower Pattern Guide
29 pages
Graphs and Situations Practice
No ratings yet
Graphs and Situations Practice
12 pages
Study Guide Test 1
No ratings yet
Study Guide Test 1
1 page
Decaying Winter
No ratings yet
Decaying Winter
9 pages
El3356 0010
No ratings yet
El3356 0010
3 pages
SAS B.Inggris
No ratings yet
SAS B.Inggris
4 pages
Control Systems Compensators Guide
No ratings yet
Control Systems Compensators Guide
2 pages
Methodological Rigour Within A Qualitative Framework
No ratings yet
Methodological Rigour Within A Qualitative Framework
9 pages
Overall Dimensions and Mounting: Solar Water Pump Controller Mu - G3 Solar Mu - G5 Solar Mu - G7.5 Solar Mu - G10 Solar
No ratings yet
Overall Dimensions and Mounting: Solar Water Pump Controller Mu - G3 Solar Mu - G5 Solar Mu - G7.5 Solar Mu - G10 Solar
2 pages
P Area
No ratings yet
P Area
4 pages
Understanding Elevated Copper Levels in Used Oil Samples
No ratings yet
Understanding Elevated Copper Levels in Used Oil Samples
3 pages
Adjusting Brightness and Contrast
No ratings yet
Adjusting Brightness and Contrast
5 pages
Math8 Q4 Week3 Hybrid Version1 1
No ratings yet
Math8 Q4 Week3 Hybrid Version1 1
13 pages
FEMSnap User Guide
No ratings yet
FEMSnap User Guide
10 pages
ASQ PracticeTest CSSGB v2021-06-02 by Lyanna 86q
100% (3)
ASQ PracticeTest CSSGB v2021-06-02 by Lyanna 86q
34 pages
Bridge
No ratings yet
Bridge
2 pages
Business Statistics With Solutions in R (Mustapha Abiodun Akinkunmi)
No ratings yet
Business Statistics With Solutions in R (Mustapha Abiodun Akinkunmi)
278 pages
Wiki Landau Lifitz
No ratings yet
Wiki Landau Lifitz
3 pages
Nasa
No ratings yet
Nasa
36 pages
Lionrock 45 kVA LRC45X-60Hz
No ratings yet
Lionrock 45 kVA LRC45X-60Hz
4 pages
Sol hw5 2 PDF
No ratings yet
Sol hw5 2 PDF
6 pages
Cads RC V8 4
0% (1)
Cads RC V8 4
202 pages
Tja 1040
No ratings yet
Tja 1040
22 pages
Operator Portal Tech Stack
No ratings yet
Operator Portal Tech Stack
2 pages
DSC-7 Introduction To Business Analytics Sol Chapter
No ratings yet
DSC-7 Introduction To Business Analytics Sol Chapter
101 pages
FP1 Chapter 1 PDF
No ratings yet
FP1 Chapter 1 PDF
31 pages
Process For Obtaining Honey Form Husk Coffe
No ratings yet
Process For Obtaining Honey Form Husk Coffe
10 pages
For K 0,1, ..... ..,9
No ratings yet
For K 0,1, ..... ..,9
2 pages
HTML Notes by Manthan
No ratings yet
HTML Notes by Manthan
9 pages
Projectiles: Mujungu Herbert
No ratings yet
Projectiles: Mujungu Herbert
22 pages