
Random Sampling, Statistics, and Estimators

statistics.tex, Feb. 11, 2003

In general, we assume the rv of interest, $X$, has some known distribution, $f_X(x)$, in the population of interest, but we do not know the values of the parameters in that distribution. That is, we assume that in the population of interest, $X$ has some known distribution. This is a very strong assumption. Given that we know, by assumption, the form of the distribution, the problem is to estimate the values of its parameters. We do this by taking a sample from the population of interest and using the information in the sample to estimate the values of the population parameters. Hereafter, I will use the term population to refer to the population of interest. Estimation always starts by defining the population of interest. If one wants to emphasize the parameters in $f_X(x)$, one might write it $f_X(x; \theta)$, where $\theta$ is the vector of parameters.

Samples and random samples


We would like our sample to be a random sample from $f_X(x)$.

Definition: Define a sample of size $n$ as $x_1, x_2, \ldots, x_n$, where $x_j$ is the $j$th observation in the sample (MGB 223). Note that a sample is a vector of random variables with some joint distribution.

Definition: The sample $x_1, x_2, \ldots, x_n$ is a random sample from $f_X(x)$ if
\[
f_{X_1, X_2, \ldots, X_n}(x_1, x_2, \ldots, x_n) = f_X(x_1)\, f_X(x_2) \cdots f_X(x_n),
\]
where $f_{X_1, X_2, \ldots, X_n}(x_1, x_2, \ldots, x_n)$ is the joint distribution of the sample (MGB 223 and G74).

In explanation, each variable in $f_{X_1, X_2, \ldots, X_n}(x_1, x_2, \ldots, x_n)$ is a random variable; that is, observation $j$ can take different values, so observation $j$ is an rv. Denote this random variable $X_j$, and the specific value it takes $x_j$. $f_{X_1, X_2, \ldots, X_n}(x_1, x_2, \ldots, x_n)$ is therefore a joint density function for the $n$ random variables in the sample. Said in words, the sample is a random sample from $f_X(x)$ if each observation is an independent draw from $f_X(x)$. Just to be clear, let me write out the above in a little more detail. The sample $x_1, x_2, \ldots, x_n$ is a random sample from $f_X(x)$ if
\[
f_{X_1, X_2, \ldots, X_n}(x_1, x_2, \ldots, x_n) = f_{X_1}(x_1)\, f_{X_2}(x_2) \cdots f_{X_n}(x_n) = f_X(x_1)\, f_X(x_2) \cdots f_X(x_n) = f(x_1)\, f(x_2) \cdots f(x_n),
\]
because $f_{X_i}(x_i) = f_X(x_i)$; that is, each observation in the sample has the same distribution. Often we say (G74) a sample is random if the observations in it are independent and identically distributed (i.i.d.). That is, a sample is random if each observation in the sample is independently drawn from the same (identical) distribution. Said loosely, the sample is random if, for each observation, each value of the rv in the population has an equal chance of appearing as the $j$th observation, and this is true for all $j$. Give me an example of a nonrandom sample. Start by identifying the population you are sampling from. Can one tell, by observation, whether a sample is a random sample?
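As a concrete illustration (not part of the original notes), here is a minimal Python sketch of drawing a random sample; the normal population and its parameter values are assumptions made purely for the example.

```python
import numpy as np

# A minimal sketch: a random sample of size n is n independent draws from
# the same population distribution f_X(x). The normal population and its
# parameter values (mu = 5, sigma = 2) are assumed purely for illustration.
rng = np.random.default_rng(seed=42)
mu, sigma, n = 5.0, 2.0, 10

sample = rng.normal(loc=mu, scale=sigma, size=n)  # x_1, x_2, ..., x_n
print(sample)  # one realization of the random vector (X_1, ..., X_n)
```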

Note that random does not mean representative.

However, as $n$ increases, the sample will likely become more representative of the population. Because of sampling variation, samples differ. That is, any two random samples of size $n$ from the same population are unlikely to exhibit the same values of the $n$ random variables $X_1, X_2, \ldots, X_n$. Remember that the $n$ random variables $X_1, X_2, \ldots, X_n$ have some joint density function $f_{X_1, X_2, \ldots, X_n}(x_1, x_2, \ldots, x_n)$. We call this the distribution of the samples. Each sample is a draw from this distribution. Picture a sample with two observations; that is, $n = 2$.
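A sketch of that picture, again under an assumed normal population: each sample of size $n = 2$ is one draw from the joint distribution of the sample, and repeated draws differ.

```python
import numpy as np

# Sampling variation: two random samples of size n = 2 from the same
# (assumed normal) population. Each sample is one draw from the joint
# distribution f_{X_1,X_2}(x_1, x_2), so the two draws almost surely differ.
rng = np.random.default_rng(seed=7)
mu, sigma, n = 5.0, 2.0, 2

sample_1 = rng.normal(loc=mu, scale=sigma, size=n)
sample_2 = rng.normal(loc=mu, scale=sigma, size=n)
print(sample_1, sample_2)  # different values of (x_1, x_2) in each draw
```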

The central problem in statistics


1. We desire to study a population which has density $f_X(x; \theta)$, where the form of $f_X(x; \theta)$ is known but $\theta$ is unknown (MGB 226). This statement describes most of the econometrics you will ever do.

2. We take a random sample of size $n$ from $f_X(x; \theta)$: $x_1, x_2, \ldots, x_n$.

3. We then assume some function $t = t(X_1, X_2, \ldots, X_n)$ is an estimate of some element of $\theta$; call that element $\theta_k$. The issue is whether $t(X_1, X_2, \ldots, X_n)$ is a good estimator of $\theta_k$.

Definition: A function of $X_1, X_2, \ldots, X_n$ is called a statistic. That is, a statistic is just a function of the observed data (a function of the observed values of the $n$ random variables in the sample); $t(X_1, X_2, \ldots, X_n)$ is what we mean by a statistic. Note that statistics is just the plural of statistic, so statistics is the study of functions of observed values of random variables, or, said another way, statistics is the study of functions of the data. For example, if one takes a random sample of size $n$ from $f_X(x; \theta)$, the following are all statistics: $X_1$ (the first $X$ drawn);

the smallest (or largest) $X$ drawn; $e^{3X_1}\, e^{6X_2}\, e^{X_4}\, e^{3X_{17}}$; and $\frac{1}{n} \sum_{i=1}^n X_i$. Each of these statistics might or might not be a good estimator of some element of $\theta$. Consider some estimator $t = t(X_1, X_2, \ldots, X_n)$. If we use $t(x_1, x_2, \ldots, x_n)$ as an estimate of $\theta$, we say that $t(X_1, X_2, \ldots, X_n)$ is an estimator of $\theta$ and $t(x_1, x_2, \ldots, x_n)$ is an estimate of $\theta$.
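A short sketch computing several such statistics from one sample; the population, its parameter values, and $n = 20$ are assumed here purely for illustration.

```python
import numpy as np

# All of the following are statistics: functions of the observed data.
# The population (normal, mu = 5, sigma = 2) and n = 20 are assumed values.
rng = np.random.default_rng(seed=0)
n = 20
x = rng.normal(loc=5.0, scale=2.0, size=n)

t1 = x[0]        # X_1, the first X drawn
t2 = x.min()     # the smallest X drawn
t3 = x.max()     # the largest X drawn
# An arbitrary function of the draws (X_4 is x[3], X_17 is x[16]) -- still a statistic:
t4 = np.exp(3 * x[0]) * np.exp(6 * x[1]) * np.exp(x[3]) * np.exp(3 * x[16])
t5 = x.mean()    # (1/n) * sum_i X_i, the sample mean
print(t1, t2, t3, t4, t5)
```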

Estimating the population mean


One population parameter that we often want to estimate is the population mean. Let $\mu_x$ represent the population mean, so the population density is $f_X(x; \mu_x, \sigma_x^2)$. We want an estimator for $\mu_x$. The sample mean, from a random sample drawn from $f_X(x; \mu_x, \sigma_x^2)$, is an estimator of $\mu_x$. That is,
\[
\bar X = \frac{1}{n} \sum_{i=1}^n X_i = t(X_1, X_2, \ldots, X_n)
\]
is an estimator of $\mu_x$, and
\[
\bar x = \frac{1}{n} \sum_{i=1}^n x_i
\]
is an estimate of $\mu_x$ from the specific sample $x_1, x_2, \ldots, x_n$.

As alternative estimators of $\mu_x$, consider $\min(X_1, X_2, \ldots, X_n)$ and $X_3$. These are also both statistics and estimators of $\mu_x$.

In a general sense, every statistic from a sample is an estimator of each of the population parameters; maybe a bad estimator, but an estimator nevertheless. I have proposed three candidates for estimators of $\mu_x$:
\[
\bar X = \frac{1}{n} \sum_{i=1}^n X_i, \qquad \min(X_1, X_2, \ldots, X_n), \qquad \text{and} \qquad X_3.
\]
Do they have any desirable properties?
\[
E[\bar X] = E\left[\frac{1}{n} \sum_{i=1}^n X_i\right] = \frac{1}{n} \sum_{i=1}^n E[X_i] = \frac{1}{n}\left(E[X_1] + E[X_2] + \cdots + E[X_n]\right) = \frac{1}{n}\left(\mu_x + \mu_x + \cdots + \mu_x\right) = \frac{1}{n}\, n \mu_x = \mu_x.
\]
That is, $E[\bar X] = \mu_x$, which seems like a nice property for $\bar X$ to have. Does $X_3$ have this property? Yes: $E[X_3] = \mu_x$. How about $\min(X_1, X_2, \ldots, X_n)$? No. Can you prove it? Consider another estimator for $\mu_x$:
\[
s(X_1, X_2) = .5 X_1 + .25 X_2.
\]

\[
E[s(X_1, X_2)] = E[.5 X_1 + .25 X_2] = .5\, E[X_1] + .25\, E[X_2] = .5 \mu_x + .25 \mu_x = .75 \mu_x \neq \mu_x.
\]
$s(X_1, X_2)$ systematically underestimates $\mu_x$.

Definition: $t(X_1, X_2, \ldots, X_n)$ is an unbiased estimator of $\theta$ if $E[t(X_1, X_2, \ldots, X_n)] = \theta$.
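A Monte Carlo sketch makes the contrast concrete (the normal population and its parameter values are assumed for illustration): averaging each estimator over many samples approximates its expectation, which is roughly $\mu_x$ for $\bar X$ but roughly $.75 \mu_x$ for $s(X_1, X_2)$.

```python
import numpy as np

# Monte Carlo sketch (assumed normal population): average each estimator
# over many independent samples to approximate its expectation.
rng = np.random.default_rng(seed=1)
mu, sigma, n, reps = 5.0, 2.0, 10, 100_000

samples = rng.normal(loc=mu, scale=sigma, size=(reps, n))

xbar = samples.mean(axis=1)                       # X-bar for each sample
s12 = 0.5 * samples[:, 0] + 0.25 * samples[:, 1]  # s(X_1, X_2) for each sample

print(xbar.mean())  # approx mu = 5.0: X-bar is unbiased
print(s12.mean())   # approx .75 * mu = 3.75: biased downward here
```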

Estimating $\sigma_x^2$, the population variance


Consider the following two statistics as estimators of $\sigma_x^2$:
\[
\hat s_x^2 = \frac{1}{n} \sum_{i=1}^n (X_i - \bar X)^2
\]
and
\[
s_x^2 = \frac{1}{n-1} \sum_{i=1}^n (X_i - \bar X)^2,
\]
where $\bar X = \frac{1}{n} \sum_{i=1}^n X_i$. If one had to choose between these two estimators, at first blink I might go for the first one because it is the average of the squared deviations. Remember that $\sigma_x^2$ is the expectation of the squared deviations in the population, $E[(X - E[X])^2]$. $\hat s_x^2$ is called the method-of-moments estimator of $\sigma_x^2$. Note that
\[
\lim_{n \to \infty} \hat s_x^2 = \lim_{n \to \infty} s_x^2.
\]
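In numpy, these two estimators correspond to the ddof argument of var (ddof=0 divides by $n$, ddof=1 by $n-1$); a quick sketch, with the population values assumed as before:

```python
import numpy as np

# The two estimators correspond to numpy's ddof argument. Population values
# (mu = 5, sigma = 2, so sigma^2 = 4) and n = 10 are assumed for illustration.
rng = np.random.default_rng(seed=2)
x = rng.normal(loc=5.0, scale=2.0, size=10)

s2_hat = x.var(ddof=0)  # divides by n: the method-of-moments estimator
s2 = x.var(ddof=1)      # divides by n - 1
print(s2_hat, s2)       # s2 = s2_hat * n / (n - 1); the gap shrinks as n grows
```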

Consider the expectation of each. However, before we do this, consider the following algebra, which will turn out to be useful:
\[
\sum_{i=1}^n (x_i - \mu_x)^2 = \sum_{i=1}^n \left[(x_i - \bar x) + (\bar x - \mu_x)\right]^2 = \sum_{i=1}^n \left[(x_i - \bar x)^2 + (\bar x - \mu_x)^2 + 2 (x_i - \bar x)(\bar x - \mu_x)\right]
\]
\[
= \sum_{i=1}^n (x_i - \bar x)^2 + n (\bar x - \mu_x)^2 + 2 (\bar x - \mu_x) \sum_{i=1}^n (x_i - \bar x),
\]
but since
\[
\sum_{i=1}^n (x_i - \bar x) = \sum_{i=1}^n x_i - n \bar x = n \bar x - n \bar x = 0,
\]
we have
\[
\sum_{i=1}^n (x_i - \mu_x)^2 = \sum_{i=1}^n (x_i - \bar x)^2 + n (\bar x - \mu_x)^2.
\]
Solve this for $\sum_{i=1}^n (x_i - \bar x)^2$ to obtain
\[
\sum_{i=1}^n (x_i - \bar x)^2 = \sum_{i=1}^n (x_i - \mu_x)^2 - n (\bar x - \mu_x)^2.
\]
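The identity is easy to verify numerically; here is a minimal sketch, with an assumed population mean $\mu_x = 5$ and an arbitrary simulated sample:

```python
import numpy as np

# Numerical check of the identity, with an assumed population mean mu = 5
# and an arbitrary simulated sample.
rng = np.random.default_rng(seed=3)
mu = 5.0
x = rng.normal(loc=mu, scale=2.0, size=10)
xbar, n = x.mean(), x.size

lhs = np.sum((x - mu) ** 2)
rhs = np.sum((x - xbar) ** 2) + n * (xbar - mu) ** 2
print(np.isclose(lhs, rhs))  # True
```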

This will prove useful; we will use it in our derivation of $E[s_x^2]$:
\[
E[s_x^2] = E\left[\frac{1}{n-1} \sum_{i=1}^n (X_i - \bar X)^2\right] = \frac{1}{n-1}\, E\left[\sum_{i=1}^n (X_i - \bar X)^2\right].
\]
Substituting the algebraic relationship we just derived,
\[
E[s_x^2] = \frac{1}{n-1}\, E\left[\sum_{i=1}^n (X_i - \mu_x)^2 - n (\bar X - \mu_x)^2\right] = \frac{1}{n-1} \left[\sum_{i=1}^n E[(X_i - \mu_x)^2] - n\, E[(\bar X - \mu_x)^2]\right]
\]
\[
= \frac{1}{n-1} \left[\sum_{i=1}^n \sigma_x^2 - n \operatorname{var}(\bar X)\right] = \frac{1}{n-1} \left[n \sigma_x^2 - n\, \frac{\sigma_x^2}{n}\right] = \frac{1}{n-1} \left[n \sigma_x^2 - \sigma_x^2\right] = \frac{n-1}{n-1}\, \sigma_x^2 = \sigma_x^2.
\]
What did we just show? $E[s_x^2] = \sigma_x^2$. That is, $s_x^2$ is an unbiased estimator of $\sigma_x^2$. Therefore $\hat s_x^2$ is a biased estimator of $\sigma_x^2$. Note that the degree of bias in $\hat s_x^2$ decreases as $n$ increases. That is why we prefer $s_x^2$, over $\hat s_x^2$, as an estimator of $\sigma_x^2$. What is the intuition? If one has a sample of $n$ observations, once $\bar X$ is determined there are only $n-1$ independent $(X_i - \bar X)$. That is, if one knows $\bar X$ and $X_1, X_2, \ldots, X_{n-1}$, then $X_n$ is completely determined.
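A Monte Carlo check of the result, again with an assumed normal population ($\sigma_x^2 = 4$): averaging $s_x^2$ over many samples lands near $\sigma_x^2$, while the $1/n$ estimator lands near $\frac{n-1}{n} \sigma_x^2$.

```python
import numpy as np

# Monte Carlo check (assumed normal population with sigma^2 = 4): the average
# of s_x^2 over many samples lands near sigma^2, while the 1/n estimator
# lands near ((n - 1) / n) * sigma^2.
rng = np.random.default_rng(seed=4)
mu, sigma, n, reps = 5.0, 2.0, 10, 100_000

samples = rng.normal(loc=mu, scale=sigma, size=(reps, n))

s2 = samples.var(axis=1, ddof=1)      # divide by n - 1
s2_hat = samples.var(axis=1, ddof=0)  # divide by n

print(s2.mean())      # approx sigma^2 = 4.0: unbiased
print(s2_hat.mean())  # approx (n - 1)/n * sigma^2 = 3.6: bias shrinks with n
```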
