100% found this document useful (1 vote)

146 views8 pages

Probable Error Mean

This document summarizes William Sealy Gosset's seminal 1908 paper "The Probable Error of a Mean", published under the pseudonym "Student". [1] Gosset developed new statistical methods for analyzing small sample sizes while working as a brewer. He derived the t distribution to estimate sampling error when sample variances are used to approximate population variances. [2] Gosset validated his theoretical work through early use of simulation. His work paved the way for modern statistical inference and small-sample research.

Uploaded by

Raviteja Kvns

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

100% found this document useful (1 vote)

146 views8 pages

Probable Error Mean

Uploaded by

Raviteja Kvns

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 8

Probable Error of a Mean, The ("Student") [Biometrika, 6, 1-25]

Initially appreciated by only a handful of brewers and statisticians, “The Probable Error

of a Mean” is now, 100 years later, universally acclaimed as a classic by statisticians and

behavioral scientists alike. Written by William Sealy Gosset under the pseudonym

“Student”, its publication paved the way for the statistical era that continues today, one

focused on how best to draw inferences about large populations from small samples of

data.

Gosset and “Student”

Schooled in mathematics and chemistry, Gosset was hired by Arthur Guinness, Son, &

Co., Ltd. to apply recent innovations in the field of statistics to the business of brewing

beer. As a brewer, Gosset analyzed how agricultural and brewing parameters (e.g., the

type of barley used) affected crop yields and, in his words, the “behavior of beer”.

Because of the cost and time associated with growing crops and brewing beer, Gosset and

his fellow “experimental” brewers could not afford to gather the large amounts of data

typically gathered by statisticians of their era. Statisticians, however, had not yet

developed accurate inferential methods for working with small samples of data, requiring

Gosset to develop methods of his own. With the approval of his employer, Gosset spent a

year (1906-1907) in Karl Pearson’s biometric laboratory, developing “The Probable Error

of a Mean” as well as “Probable Error of a Correlation Coefficient”.

The most immediately striking aspect of “The Probable Error of a Mean” is its
pseudonymous author: “Student”. Why would a statistician require anonymity? The

answer to this question came publicly in 1930, when fellow statistician Harold Hotelling

revealed that “Student” was Gosset, and that his anonymity came at the request of his

employer, a “large dublin Brewery”. At the time, Guinness considered its use of statistics

a trade secret and forbade its employees from publishing their work. Only after

negotiations with his supervisors was Gosset able to publish his work, agreeing to neither

use his real name nor publish proprietary data.

The Problem: Estimating Sampling Error

As its title implies, “The Probable Error of a Mean” focuses primarily on determining the

likelihood that a sample mean approximates the mean of the population from which it

was drawn. The “probable error” of a mean, like its standard error, is a specific estimate

of the dispersion of its sampling distribution, and was used commonly at the start of the

20th century. Estimating this dispersion was then and remains today a foundational step

of statistical inference: To draw inference about a population parameter from a sampled

mean (or, in the case of null hypothesis significance testing, infer the probability that a

that a certain population would yield a sampled mean as extreme as the obtained value),

one must first specify the sampling distribution of the mean. The Central Limit Theorem

provides the basis for parametrically specifying this sampling distribution, but does so in

terms of population variance. In nearly all research, however, both population mean and

variance are unknown. To specify the sampling distribution of the mean, therefore,

researchers must use the sample variance.

Gosset confronted this problem with using sample variance to estimate the

sampling distribution of the mean, namely that there is error associated with sample

variance. And because the sampling distribution of the variance is positively skewed, this

error is more likely to result in the underestimation than the overestimation of population

variance (even when using an unbiased estimator of population variance). Furthermore,

this error, like the error associated with sampled means, increases as sample size

decreases, presenting a particular (and arguably exclusive) problem for small sample

researchers such as Gosset. To draw inferences about population means from sampled

data, Gosset could not – as large sample researchers did – simply calculate a standard z

statistic and rely on a unit normal table to find the corresponding p values. The unit

normal table does not account for either the estimation of population variance or the fact

that the error in this estimate depends on sample size. This limitation inspired Gosset to

write “The Probable Error of a Mean” in a self-described effort to 1) determine at what

point sample sizes become so small that the above method of normal approximation

becomes invalid, and 2) develop a set of valid probability tables for small samples sizes.

The Solution: “z”

To accomplish these twin goals, Gosset derived the sampling distribution of a new

statistic he called “z”. He defined z as the deviation of the mean of a sample ( X ) from

the mean of a population (u) divided by the standard deviation of the sample (s),
!
or (X " u) / s . In his original paper, Gosset calculated s with the denominator n (leading to

a biased estimate of population variance, s 2 ) rather than the unbiased n "1, likely in
!

! !
response to Karl Pearson’s famous attitude that “only naughty brewers take n so small

that the difference is not of the order of the probable error!’’ To determine the sampling

distribution of z, Gosset first needed to determine the sampling distribution of s. To do so,

he derived the first four moments of s 2 , which allowed him to make an informed guess

concerning its distribution (and the distribution of s). Next, he demonstrated that X and s
!
were uncorrelated, presumably in an effort to show their independence. This
!
independence – in conjunction with equations to describe the distribution of s – allowed

Gosset to derive the distribution of z.

This first portion of “The Probable Error of a Mean” is noteworthy for its

speculative, incomplete, and yet ultimately correct conclusions. Gosset failed to offer a

formal mathematical derivation for the sampling distribution of s, despite the fact that,

unbeknownst to him, such a proof had been published 30 years earlier by the German

statistician Friedrich Robert Helmert. Nor was Gosset able to prove that the sampling

distributions of s 2 and X were completely independent of each other. Nevertheless,

Gosset was correct on both counts, as well as his ensuing derivation of the sampling
! ! z, leading many to note that his statistical intuition more than compensated
distribution of

for his admitted mathematical shortcomings.

Pioneering Use of Simulation

“The Probable Error of a Mean” documents more than Gosset’s informed speculation,

however: it presents one of the first examples of simulation in the field of statistics.

Gosset used simulation to estimate the sampling distribution of z non-parametrically, and

then compared this result to his parametrically-derived distribution. Concordance

between the two sampling distributions, he argued, would confirm the validity of his

parametric equations.

To conduct his simulation, he relied on a biometric database of height and finger

measurements collected by British police from 3000 incarcerated criminals; this database

served as his statistical population. Gosset randomly ordered the data – written

individually on pieces of cardboard – then segregated them into 750 samples of 4

measurements each (i.e., n = 4). For every sample, he calculated z for height and finger

length, then compared these two z distributions with the curves he expected from his

parametric equations. In both cases, the empirical and theoretical distributions did not

differ significantly, thus offering evidence that Gosset’s preceding equations were

correct.

Tables and Examples

Gosset dedicated the final portion of “The Probable Error of a Mean” to tabled

probability values for z and illustrative examples of their implementation. To construct

the tables, he integrated over the z distributions (for sample sizes of 4-10) to calculate the

probability of obtaining certain z values or smaller. For purposes of comparison, he also

provided the p-values obtained via the normal approximation to reveal the degree of error

in such approximation. The cumbersome nature of these calculations deterred Gosset

from providing a more extensive table.

In further testament to his applied perspective, Gosset concluded the main text of
“The Probable Error of a Mean” by applying his statistical innovation to four sets of

actual experimental data. In the first and most famous example, Gosset analyzed data

from a 1904 experiment that examined the soporific effects of two different drugs. In this

experiment, researchers had measured how long patients (n = 15) slept after treatment

with each of two drugs and a drug-free baseline. To determine whether the drugs helped

patients sleep, Gosset tested the mean change in sleep for each of the drug conditions

(compared to the baseline) against a null (i.e., zero) population mean. To test whether one

drug was more effect than the other drug, he tested the mean difference in their change

values against a null population mean. All three of these tests – as well as the tests used

in the three subsequent examples – correspond to modern-day one-sample t tests (or

equivalent paired t tests).

Postscript: From z to t

With few exceptions over nearly twenty years following its publication, “The Probable

Error of a Mean” was neither celebrated nor appreciated. In fact, when providing a

expanded copy of the “Student” z tables to the then little known statistician Ronald Fisher

in 1922, Gosset remarked that Fisher was “the only man that’s ever likely to use them!”

Fisher ultimately disproved this gloomy prediction by championing Gosset’s work and

literally transforming it into a foundation of modern statistical practice.

Fisher’s contribution to Gosset’s statistics was threefold. First, in 1912 and at the

young age of 22, he used complex n-dimensional geometry (that neither Gosset nor

Pearson could understand) to prove Gosset’s equations for the z distribution. Second, he
extended and embedded Gosset’s work into a unified framework for testing the

significance of means, mean differences, correlation coefficients, and regression

coefficients. In the process of achieving this unified framework (based centrally on the

concept of degrees of freedom), Fisher made his third contribution to Gosset’s work: he

multiplied z by n "1 , transforming it into the famous t statistic that now inhabits every

introductory statistics textbook.

!During Fisher’s popularization, revision, and extension of the work featured in

“The Probable Error of a Mean”, he corresponded closely with Gosset. In fact, Gosset is

responsible for naming the t statistic, as well as calculating a set of probability tables for

the new t distributions. Despite Gosset’s view of himself as a humble brewer, Fisher

considered him a statistical pioneer whose work had not yet received the recognition it

deserved.

Historical Impact

The world of research has changed greatly in a century, from a time when only “naughty

brewers” gathered data from samples sizes not measures in hundreds, to an era

characterized from small sample research. “The Probable Error of a Mean” marked the

beginning of serious statistical inquiry into small sample inference, and its contents today

underlie behavioral science’s most frequently used statistical tests. Gosset’s efforts to

derive an exact test of statistical significance for such samples (as opposed to one based

on a normal approximation) may have lacked in mathematical completeness, but their

relevance, correctness and timeliness shaped scientific history.

Samuel T. Moulton

(Independent Samples), t Test (One Sample), t Test (Paired Samples)

Fisher, R. A. (1925). Applications of ‘Student’s’ distribution. Metron, 5, 90-104.

Pearson, E. S. (1939). ‘Student’ as a statistician. Biometrika, 30, 210-250.

Boland, P. J. (1984). A biographical glimpse of William Sealy Gosset. American

Statistician, 38, 179-183.

Box, J. F. (1981). Gosset, Fisher, and the t-distribution. American Statistician, 35, 61-66.

Eisenhart, C. (1979). Transition from Student’s z to Student’s t. American Statistician,

33, 6-10.

Hanley, J. A., Julien, M., & Moodie, E. E. M. (2008). Student's z, t, and s: What if Gosset

had R? American Statistician, 62, 64-69.

Lehmann, E. L. (1999). "Student" and small-sample theory. Statistical Science, 14, 418-

426.

Pearson, E. S. (1968). Studies in history of probability and statistics XX: Some early

correspondence between W. S. Gosset, R. A. Fisher, and K. Pearson with note and

comments. Biometrika, 55, 445-457.

Zabell, S. L. (2008). On Student's 1908 article - "The Probable Error of a Mean". Journal

of the American Statistical Association, 103, 1-7.

HanleyJulienMoodie Student TAS2008
No ratings yet
HanleyJulienMoodie Student TAS2008
6 pages
Gosset Fisher and T Distribution
No ratings yet
Gosset Fisher and T Distribution
6 pages
History and Origin of Statistics
67% (3)
History and Origin of Statistics
11 pages
Variances Normal Distributions Degrees of Freedom Analysis of Variance (ANOVA)
No ratings yet
Variances Normal Distributions Degrees of Freedom Analysis of Variance (ANOVA)
2 pages
William S Gosset (1876-1937) : The Application of The Law of Error To The Work of The Brewery (1904
No ratings yet
William S Gosset (1876-1937) : The Application of The Law of Error To The Work of The Brewery (1904
12 pages
Sawilowsky 2002
No ratings yet
Sawilowsky 2002
13 pages
5.module 5
No ratings yet
5.module 5
9 pages
Degreesoffreedom 170927142450
No ratings yet
Degreesoffreedom 170927142450
9 pages
Add Math Folio
0% (1)
Add Math Folio
30 pages
Statistics Homework Analysis
No ratings yet
Statistics Homework Analysis
6 pages
SM-2 Basic Statistics
No ratings yet
SM-2 Basic Statistics
35 pages
12.0 PP 293 294 Conclusion
No ratings yet
12.0 PP 293 294 Conclusion
2 pages
Dear MR Gosset
No ratings yet
Dear MR Gosset
8 pages
Molecular Genetics Exam
No ratings yet
Molecular Genetics Exam
3 pages
Ronald Aylmer Fisher
100% (1)
Ronald Aylmer Fisher
3 pages
Deciphering Cryptographic Messages, Containing Detailed Discussions On Statistics. (1) It
No ratings yet
Deciphering Cryptographic Messages, Containing Detailed Discussions On Statistics. (1) It
4 pages
Testing of Hypothesis
100% (1)
Testing of Hypothesis
20 pages
Statistical Hypothesis Testing Guide
0% (1)
Statistical Hypothesis Testing Guide
19 pages
Lecture Notes 17 Calculus in The Eighteenth Century
No ratings yet
Lecture Notes 17 Calculus in The Eighteenth Century
25 pages
Statisticians
No ratings yet
Statisticians
4 pages
C.2.4 Q3 Probability Theory
No ratings yet
C.2.4 Q3 Probability Theory
3 pages
Assignment 4 Area Under The Stanard Normal Curve
No ratings yet
Assignment 4 Area Under The Stanard Normal Curve
7 pages
Stats P1 (End of Feb)
No ratings yet
Stats P1 (End of Feb)
5 pages
Statistics for Computer Science
No ratings yet
Statistics for Computer Science
4 pages
Probability & Statistics Concepts
No ratings yet
Probability & Statistics Concepts
19 pages
Single Sample Hypothesis Testing
No ratings yet
Single Sample Hypothesis Testing
6 pages
History of Statistics for Scholars
No ratings yet
History of Statistics for Scholars
4 pages
Evolution of Statistics Through Time
0% (1)
Evolution of Statistics Through Time
4 pages
01 RSD Exercises
No ratings yet
01 RSD Exercises
5 pages
Biografia de William Sealy Gosset
No ratings yet
Biografia de William Sealy Gosset
2 pages
P&S U V Practice
No ratings yet
P&S U V Practice
4 pages
Lecture 4 Probability Distributions
No ratings yet
Lecture 4 Probability Distributions
4 pages
History and Nature of Statistics
No ratings yet
History and Nature of Statistics
8 pages
Wsheet 6
No ratings yet
Wsheet 6
7 pages
Nov 2024 p1 (1 of 3) Stats (Last Supper)
No ratings yet
Nov 2024 p1 (1 of 3) Stats (Last Supper)
6 pages
Hypothesis Testing (Discrete & Continuous Variables)
No ratings yet
Hypothesis Testing (Discrete & Continuous Variables)
5 pages
Deber 1 en Ingles
No ratings yet
Deber 1 en Ingles
16 pages
Hypothesis Testing II
No ratings yet
Hypothesis Testing II
98 pages
Statistics Theory (Soyaib)
No ratings yet
Statistics Theory (Soyaib)
13 pages
Hypothesis Testing Guide
No ratings yet
Hypothesis Testing Guide
6 pages
Econometrics
No ratings yet
Econometrics
20 pages
Uncertainties in Measurement
No ratings yet
Uncertainties in Measurement
7 pages
Statistics - Important Notes
No ratings yet
Statistics - Important Notes
2 pages
Nov 2024 p2 (1 of 3) Stats (Last Supper)
No ratings yet
Nov 2024 p2 (1 of 3) Stats (Last Supper)
7 pages
Sampling Distributions Problems
No ratings yet
Sampling Distributions Problems
4 pages
Doc-20250914-Wa0004 250914 212506
No ratings yet
Doc-20250914-Wa0004 250914 212506
12 pages
Z-Test PP
No ratings yet
Z-Test PP
9 pages
Statistics (Spot) (Volume 2)
No ratings yet
Statistics (Spot) (Volume 2)
106 pages
ss1 Probability
No ratings yet
ss1 Probability
37 pages
Assignment Stats 2
No ratings yet
Assignment Stats 2
7 pages
Sampling Distribution of Mean
No ratings yet
Sampling Distribution of Mean
6 pages
07 - Chapter 1
No ratings yet
07 - Chapter 1
8 pages
Exercise 2 Exam1practice Sa
No ratings yet
Exercise 2 Exam1practice Sa
11 pages
Extra Math QN
No ratings yet
Extra Math QN
11 pages
William Sealy Gosset
No ratings yet
William Sealy Gosset
4 pages
Google
No ratings yet
Google
170 pages
Central Limit Theorem Explained
No ratings yet
Central Limit Theorem Explained
4 pages
s3 Hypothesis-2018
No ratings yet
s3 Hypothesis-2018
16 pages
Skewness: Paul Von Hippel, Ohio State University, USA
No ratings yet
Skewness: Paul Von Hippel, Ohio State University, USA
4 pages
3.2 Power Point 2
No ratings yet
3.2 Power Point 2
35 pages
Learning Guide Week4 EDF 202 Statistics in Education
No ratings yet
Learning Guide Week4 EDF 202 Statistics in Education
8 pages
Sma 2103 Probability and Statistics-Print Ready
No ratings yet
Sma 2103 Probability and Statistics-Print Ready
4 pages
Rethinking The Effects of Performance Expectancy and Effort Expectancy On New Technology Adoption - Evidence From Moroccan Nursing Students
No ratings yet
Rethinking The Effects of Performance Expectancy and Effort Expectancy On New Technology Adoption - Evidence From Moroccan Nursing Students
9 pages
Laboratory Report - Experiment No. 4
No ratings yet
Laboratory Report - Experiment No. 4
24 pages
LGT Stochastic Mapping
No ratings yet
LGT Stochastic Mapping
11 pages
Aldo Febriananto Kurniawan 22010112140205 Lap - Kti Bab Vii
No ratings yet
Aldo Febriananto Kurniawan 22010112140205 Lap - Kti Bab Vii
19 pages
Monte Carlo Simulation for CER Model
No ratings yet
Monte Carlo Simulation for CER Model
52 pages
Probability and Statistics MATH-361 (3-0) : Instructor: Sophia Siddique
No ratings yet
Probability and Statistics MATH-361 (3-0) : Instructor: Sophia Siddique
22 pages
Certification Guide On SAS Big Data Preparation, Statistics, and Visual Exploration (A00-220) Professional Exam
100% (1)
Certification Guide On SAS Big Data Preparation, Statistics, and Visual Exploration (A00-220) Professional Exam
15 pages
Ator (1999) Statistical Inference in Behavior Analysis - Enviromental Determinants
No ratings yet
Ator (1999) Statistical Inference in Behavior Analysis - Enviromental Determinants
5 pages
AGE 301 - NOTE - A-1
No ratings yet
AGE 301 - NOTE - A-1
8 pages
Assignment2 MGSC5125 Fall24
No ratings yet
Assignment2 MGSC5125 Fall24
3 pages
Noack Online Material - AnswersFullVersion
No ratings yet
Noack Online Material - AnswersFullVersion
51 pages
Unit 4 - Final
No ratings yet
Unit 4 - Final
28 pages
BSAMT
No ratings yet
BSAMT
19 pages
Course Paper On Regression Analysis of Gold Prices
33% (3)
Course Paper On Regression Analysis of Gold Prices
16 pages
Regression Examples
No ratings yet
Regression Examples
26 pages
Session 3
No ratings yet
Session 3
92 pages
Statistics and Probability Formulas
No ratings yet
Statistics and Probability Formulas
10 pages
Correlation and Regression Analyses
No ratings yet
Correlation and Regression Analyses
8 pages
Instructional Material Sample ES209
No ratings yet
Instructional Material Sample ES209
17 pages
Data Analysis Course: Time Series Analysis & Forecasting (Version-1)
No ratings yet
Data Analysis Course: Time Series Analysis & Forecasting (Version-1)
43 pages
Multiple Regression Analysis Guide
No ratings yet
Multiple Regression Analysis Guide
11 pages
Dummy Variable Models Guide
100% (1)
Dummy Variable Models Guide
16 pages
Economic Rate of Return Using Multiple Regression
No ratings yet
Economic Rate of Return Using Multiple Regression
12 pages
ES N 3.3.5 Student Activity, Statistics, Inferential Marbles PISD2425
No ratings yet
ES N 3.3.5 Student Activity, Statistics, Inferential Marbles PISD2425
6 pages
ML Unit 3
No ratings yet
ML Unit 3
17 pages
Demand Forecasting - Exercises - Solution
100% (1)
Demand Forecasting - Exercises - Solution
29 pages
Statistics I: Parameter Estimation, Part II
No ratings yet
Statistics I: Parameter Estimation, Part II
22 pages

Probable Error Mean

Uploaded by

Probable Error Mean

Uploaded by

Probable Error of a Mean, The ("Student") [Biometrika, 6, 1-25]

Gosset and “Student”

of a Mean” as well as “Probable Error of a Correlation Coefficient”.

use his real name nor publish proprietary data.

The Problem: Estimating Sampling Error

of statistical inference: To draw inference about a population parameter from a sampled

researchers must use the sample variance.

variance (even when using an unbiased estimator of population variance). Furthermore,

write “The Probable Error of a Mean” in a self-described effort to 1) determine at what

The Solution: “z”

distribution of z, Gosset first needed to determine the sampling distribution of s. To do so,

Gosset to derive the distribution of z.

distributions of s 2 and X were completely independent of each other. Nevertheless,

for his admitted mathematical shortcomings.

Pioneering Use of Simulation

Gosset used simulation to estimate the sampling distribution of z non-parametrically, and

To conduct his simulation, he relied on a biometric database of height and finger

individually on pieces of cardboard – then segregated them into 750 samples of 4

Tables and Examples

probability values for z and illustrative examples of their implementation. To construct

probability of obtaining certain z values or smaller. For purposes of comparison, he also

in such approximation. The cumbersome nature of these calculations deterred Gosset

from providing a more extensive table.

in the three subsequent examples – correspond to modern-day one-sample t tests (or

equivalent paired t tests).

literally transforming it into a foundation of modern statistical practice.

significance of means, mean differences, correlation coefficients, and regression

introductory statistics textbook.

on a normal approximation) may have lacked in mathematical completeness, but their

relevance, correctness and timeliness shaped scientific history.

See also Central Limit Theorem, Distribution, Nonparametric Statistics, Sampling

(Independent Samples), t Test (One Sample), t Test (Paired Samples)

Fisher, R. A. (1925). Applications of ‘Student’s’ distribution. Metron, 5, 90-104.

Pearson, E. S. (1939). ‘Student’ as a statistician. Biometrika, 30, 210-250.

Boland, P. J. (1984). A biographical glimpse of William Sealy Gosset. American

Statistician, 38, 179-183.

Eisenhart, C. (1979). Transition from Student’s z to Student’s t. American Statistician,

had R? American Statistician, 62, 64-69.

correspondence between W. S. Gosset, R. A. Fisher, and K. Pearson with note and

comments. Biometrika, 55, 445-457.

of the American Statistical Association, 103, 1-7.

You might also like