0% found this document useful (0 votes)

9 views8 pages

Data Analysis

The document discusses data analysis in scientific experiments, focusing on the importance of understanding errors, accuracy, and precision in measurements. It explains different types of errors, statistical methods for analyzing data, and the significance of results using t-tests and F-tests. Additionally, it provides exercises for calculating mean, standard deviation, and coefficients of variance to assess the reliability of experimental results.

Uploaded by

Maruthupandi M

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

9 views8 pages

Data Analysis

Uploaded by

Maruthupandi M

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 8

Unit I

Data Analysis

Science is based on quantitative data. They are generally derived from experimental
measurements and may have some error. So it is important to study the error to get accurate
value. Thus, here we discuss some methods used by analytical chemists in assessing the
significance of experimental results.
Error
The term error is used to deal the numerical difference between a measured value and
the true value. The true value of any quantity is something we never know, although we
generally accept a value as being true when it is believed that the uncertainty in the value is
less than the uncertainty in something else with which it is being compared. The percentage
composition of a standard sample is certified by the National Institute of Standards and
Technology (NIST), which may be regarded as standard or correct value.
Thus, error may be defined as the differences between the standard values and the
results obtained by the new method are treated as error.
While discussing the term error we should learn more about their different types. Generally,
error can be described by determinate and indeterminate error. Determinate errors are
generally unidirectional with respect to the correct or true value. However, indeterminate
errors lead to both high and low results with equal probability.
Determinate errors have been classified as methodic, operative and instrumental
accordance with their origin.
Determinate errors can also be classified as constant and proportional.
Examples of sources of determinate errors are incorrectly calibrated instruments such as pH
meter, balance, burette, etc.
Indeterminate errors cannot be attributed to any known cause, but they invariably
attend measurements made by human beings. These errors cannot be corrected and hence are
the ultimate limitation of the measurement.
Accuracy and Precision
A result can be treated as accurate when the value agrees closely with the true value
of a measured quantity. A comparison is usually made on the basis of an inverse measure of
the accuracy, i.e. the error. The absolute error is the difference between the experimental
value and the true value. Suppose an analyst obtained a value of 24.24% copper in a sample
which actually contains 24.14%, the absolute error is
24.24 -24.14 = 0.10%
It can be measured in percentage or in parts per thousand. Relative error is the absolute error
divided by the true value. It is measured in percentage or parts per thousand Here the relative
error is
(0.10/24.14) x 100 = 0.41%
Or, (0.10/24.14)x 1000 = 4.14 ppt
Precision is defined as the concordance of a series of measurement of the same
quantity. It implies nothing about their relation to the true value. The term precise is
commonly stated in terms of the standard deviation, average deviation or range.

The Normal Error Curve: Gaussian Distribution Curve

When a large number of replicate readings, at least 50 numbers are taken of a
continuous variable, the results attained will usually be distributed about the mean in a
roughly symmetrical manner. The mathematical model that best satisfies such a distribution
of random error is called the normal Gaussian distribution.
Significance: Gaussian distribution curve is a bell-shaped that is symmetrical about the mean
as shown in figure 1.
This curve satisfies the equation
1/(2) e- (x - )2/22
Where,  = standard deviation
 = mean of total population

In Gaussian distribution about 68% of all values will fall within one standard deviation on
either side of the mean, 95% will fall within two standard deviations and 99.7% within three
standard deviations.
Fig. 1: Normal or Gaussian Distribution curve

Statistical Treatment of Finite Samples

The central tendency of a group of results is simply that value about which the individual
results tend to “cluster”. For an infinite population, it is , the mean of a such sample.
Mean
The mean of a finite number of measurements, x1, x2, x3, x4,……..,xn is often
designated 𝑥̅ to distinguish it from . Of course, 𝑥̅ approaches  as a limit when the measured
value approaches infinity.
𝑥̅ = (x1 + x2 + x3 + x4 + ……..+ xn)n = ∑𝑖=𝑛
𝑖=1 𝑥𝑖 /𝑛

It may be shown that mean of n results is √𝑛 times as reliable as any of the individual results.
The mean of 4 results is twice as reliable as 1 result in measuring central tendency. The mean
of 9 results is three times as reliable, the mean of 25 results, five times as reliable etc.
Median
The median of an odd number of results is simply the middle value when the results
are listed in order. However the median of an even number of results is the average of the tow
middle ones. In a symmetrical distribution, the mean and the median are identical. The
median is a less efficient measure of central tendency than is the mean.
Range
It is the difference between the largest and smallest values. Like the median, the range
is sometimes useful in small statistics, but generally speaking it is an inefficient measure of
variability.
Average deviation
The average deviation from the mean is often given in scientific papers as a measure
of variability, although strictly it is not very significant from a statistical point of view. For a
large group of data which is normally distributed, the average deviation approaches 0.8 to
calculate the average or mean deviation, one simply finds the differences between individual
results and the mean, regardless of sign, adds these individual results, and by divides by the
number of results
Average deviation, 𝑑̅ = ∑𝑖=𝑛
𝑖=1 ⋮ 𝑥𝑖 − 𝑥̅ ⋮/n

Relative standard deviation

Often the average deviation is expressed relative to the magnitude of the measured
quantity, for example, as a percentage
Relative standard deviation (%), 𝑑̅ /𝑥̅ x 100 ={(∑𝑖=𝑛
𝑖=1 ⋮ 𝑥𝑖 − 𝑥̅ ⋮/n)/ 𝑥̅ }x100

in parts per thousand

Relative standard deviation (ppt), 𝑑̅ /𝑥̅ x 1000 ={(∑𝑖=𝑛
𝑖=1 ⋮ 𝑥𝑖 − 𝑥̅ ⋮/n)/ 𝑥̅ }x1000

Standard deviation
The standard deviation is much more meaningful statistically than is average
deviation. The symbol s is used for the standard deviation of a finite number of values. The
standard deviation, which may be thought as a root mean square deviation of values from
their average.

Standard deviation, s = √[ ∑𝑖=𝑛

𝑖=1 ⋮ 𝑥𝑖 − 𝑥̅ ⋮ / (n-1)
2

If n is large (50 or more), then it is immaterial whether the term in the denominator is n-1 or
n. When the standard deviation is expressed as a percentage of the mean, it is called the
coefficient of variance, 
 =(s/𝑥̅ ) x 100
Variance
It is the square of standard deviation, which is designated as s2. The variance is
fundamentally more important in statistics than is s itself.
Variance, s2 = (Standard deviation)2
Exercises
Ex. 1: An iron core gives the following results during the Fe estimation as the value 7.08,
7.21, 7.12, 7.09, 7.16, 7.14, 7.07, 7.14, 7.18, 7.11. Calculate the mean, the standard deviation
and coefficient of variance for the values.
Solution:
The mean, 𝑥̅ = (7.08 + 7.21 + 7.12 + 7.09 + 7.16 + 7.14 + 7.07 + 7.14 + 7.18 + 7.11)/10
= 71.98/10 =7.13
Standard deviation, s = √[ ∑𝑖=𝑛
𝑖=1 ⋮ 𝑥𝑖 − 𝑥̅ ⋮ / (n-1)
2

= √(7.08-7.13)2 + (7.21-7.13)2 + …………..(7.18-7.13)2 + (7.11-7.13)2/9

= √181.98 x 10-4/9
= 4.49 x 10-2 = 0.0449
Coefficient of variance,  = (s/𝑥̅ ) x 100
= {0.045/7.13} x 100
= 0.63 %
Ex. 2: The normality of a solution is determined by four analysts. The results being 0.2041,
0.2049, 0.2039 and 0.2043. Calculate mean, median, range, average deviation, relative
average deviation, standard deviation and coefficient of variance.
Solution:
The mean, 𝑥̅ = (0.2041 + 0.2049 + 0.2039 + 0.2043)/4
= 0.2043
Median, M = (0.2041 +0.2043)/2
= 0.2042
Range, R = 0.2043 -0.2039
= 0.0010
Average deviation, 𝑑̅ = ∑𝑖=𝑛
𝑖=1 ⋮ 𝑥𝑖 − 𝑥̅ ⋮/n

= (0.0002) + (0.0006) + (0.0004) + (0.0000)/4

= 0.0003
Relative standard deviation (ppt), 𝑑̅ /𝑥̅ x 1000 = {(∑𝑖=𝑛
𝑖=1 ⋮ 𝑥𝑖 − 𝑥̅ ⋮/n)/ 𝑥̅ }x1000

= (0.0003/0.2043) x 1000
= 1.5 ppt
Standard deviation, s = √[ ∑𝑖=𝑛
𝑖=1 ⋮ 𝑥𝑖 − 𝑥̅ ⋮ / (n-1)
2

= 0.0004
Coefficient of variance,  = (s/𝑥̅ ) x 100
= {0.0004/.2043} x 100
= 0.2 %

Analysis of results
Analysis of results can be made by two different ways. They are:
(i) Comparison of results
(ii) Reliability of results
Comparison of sets of values with either true value or with another set of values gives us
the trick to determine whether the sets of values or the analytical procedure is accurate of
precise.
There are two common methods
(a) Student’s t -test
(b) Variance ration test or F –test
t –test: Student’s t –test is used for small samples. It can also be used to test the
difference between the mean of two sets of data’s ( 𝑥̅ 1 and 𝑥̅ 2 ). The purpose of the test is
to compare the mean of samples with some standard value and to express some level of
confidence in the significance of comparison.
The t –test is obtained as
t = (𝑥̅ + ) √𝑁/s
Where, 𝑥̅ is the mean value
 is true value
N is number of determination
s is standard value
These calculated values is then compared with the sets of values obtained for different
probabilities and degree of freedom from the given table
(degree of freedom: It may be defined as the number of individual observations that could be
allowed to vary under the condition that 𝑥̅ and s, once determined, be held constant.)
Confidence interval of the mean
By rearranging the above equation, we obtain the confidence interval of the mean or
confidence limits
 = 𝑥̅  t.s/ √𝑁
We can use this equation to estimate the probability that the population mean, , lies within a
certain region centred at 𝑥̅ , the experimental mean of our measurements.
Exercises
Ex.3: If 𝑥̅ of twelve determination is 8.37 and a true value,  =7.91, say whether or not these
results is significant if the standard deviation, s is 0.17.
Solution.
The t value is
t = [(𝑥̅ - )√𝑁]/s
= [(8.37 – 7.91)√12]/0.17
= 9.37
20% 10% 5% 2% 1%
Probability (α) 0.20 0.10 0.05 0.02 0.01

Given t value 1.363 1.796 2.20 2.72 3.71

The calculated value for t is 9.37. Comparison of this value with true value, it implies that the
calculated t value is highly significant. The t value tells us that the probability of obtaining
the difference of 0.46 is less than 1 in 100.
Ex. 4: For the estimation of iron from a sample following results are obtained:
8.43, 8.41, 8.4, 8.32, 8.34, 8.24
Find out the mean standard deviation coefficient of variance and say whether or not these
results are significant if the true value is 8.37

F-test: This test is used to compare the precision of two sets of datas. It is calculated as
F = SA2/ SB2
Where SA is standard deviation for one method
SB is standard deviation of another method
Generally, SASB
The value obtained for ‘F’ is then check for its significance against values in the F table. If
the calculated F value can relate with the lower probability then the two sets of data’s is
highly significant.
The F test may be used to determine the validity of the sample t test described here, but it
may also be interest in its own right to determine whether two analytical procedures yield
significantly different precision.
Source: Quantitative Analysis, 6th edn, R.A. Day et al.

Statistical Analysis in Chemistry
No ratings yet
Statistical Analysis in Chemistry
8 pages
Experiment 1 Lab Report
No ratings yet
Experiment 1 Lab Report
10 pages
الفصل الثاني الاحصاء
No ratings yet
الفصل الثاني الاحصاء
27 pages
Error of Measurements
No ratings yet
Error of Measurements
9 pages
Statistical Analysis of Lab Data
100% (1)
Statistical Analysis of Lab Data
80 pages
Descriptive Statistics & Data Analysis
No ratings yet
Descriptive Statistics & Data Analysis
48 pages
Error of Measurements
No ratings yet
Error of Measurements
45 pages
Lesson 2 - Error of Measurements
No ratings yet
Lesson 2 - Error of Measurements
44 pages
Measure of Dispersion Kurtosi, Skiwness
No ratings yet
Measure of Dispersion Kurtosi, Skiwness
22 pages
Lecture of BIOSTATISTICS 12.2022 RMDC
No ratings yet
Lecture of BIOSTATISTICS 12.2022 RMDC
85 pages
APP601S Chapter 4 - Data Handling in Anal Chem
No ratings yet
APP601S Chapter 4 - Data Handling in Anal Chem
42 pages
Precision & Accuracy in Experiments
No ratings yet
Precision & Accuracy in Experiments
42 pages
Errors, Standard Deviation, Data Analysis
No ratings yet
Errors, Standard Deviation, Data Analysis
29 pages
CHAPTERS
No ratings yet
CHAPTERS
17 pages
AnalChem Chapter3 PDF
No ratings yet
AnalChem Chapter3 PDF
67 pages
1431364884L03.EE3121.Measures of Dispersion
No ratings yet
1431364884L03.EE3121.Measures of Dispersion
8 pages
Understanding Measures of Variation
No ratings yet
Understanding Measures of Variation
63 pages
Lecture III-Measures of Dispersion
No ratings yet
Lecture III-Measures of Dispersion
33 pages
Lecture 3
No ratings yet
Lecture 3
25 pages
Basic Statistics
No ratings yet
Basic Statistics
23 pages
Week 2. Errors in Chemical Analysis (Abstract)
No ratings yet
Week 2. Errors in Chemical Analysis (Abstract)
31 pages
Basic - Statistics 30 Sep 2013 PDF
100% (1)
Basic - Statistics 30 Sep 2013 PDF
20 pages
EWP 1 Practical - 1 PDF
No ratings yet
EWP 1 Practical - 1 PDF
10 pages
Basic Terms in Statistical Calculations
No ratings yet
Basic Terms in Statistical Calculations
6 pages
Analytical Chem
No ratings yet
Analytical Chem
188 pages
Basic Statistical Analysis
No ratings yet
Basic Statistical Analysis
80 pages
One Dimensional Statistics
No ratings yet
One Dimensional Statistics
21 pages
Topic III
No ratings yet
Topic III
27 pages
Lec Set 1 Data Analysis
No ratings yet
Lec Set 1 Data Analysis
55 pages
CHM 421 - ToPIC 3 - Statistics
No ratings yet
CHM 421 - ToPIC 3 - Statistics
58 pages
Understanding Frequency Distributions
No ratings yet
Understanding Frequency Distributions
9 pages
4x @6ote ) 'Btda2@m
No ratings yet
4x @6ote ) 'Btda2@m
55 pages
Skoog FAC 10e SAG Ch04 Final
100% (1)
Skoog FAC 10e SAG Ch04 Final
16 pages
Random Errors in Chemical Analysis
No ratings yet
Random Errors in Chemical Analysis
37 pages
Advanced Statistics for Research
No ratings yet
Advanced Statistics for Research
27 pages
Experiment 1-B Evaluation of Analytical Data
No ratings yet
Experiment 1-B Evaluation of Analytical Data
5 pages
History Reporting
No ratings yet
History Reporting
61 pages
Analysis Interpretation and Use of Test Data
No ratings yet
Analysis Interpretation and Use of Test Data
50 pages
Examples Biostatistics. Final
No ratings yet
Examples Biostatistics. Final
90 pages
Statistics Chapter-IV
No ratings yet
Statistics Chapter-IV
59 pages
Lecture Series On Bio-Statistics Prepared by DR Asit Jain, Assistant Professor
No ratings yet
Lecture Series On Bio-Statistics Prepared by DR Asit Jain, Assistant Professor
25 pages
Normal Distribution
No ratings yet
Normal Distribution
9 pages
Measures of Dispersion & Skewness
No ratings yet
Measures of Dispersion & Skewness
12 pages
Understanding Standard Deviation
No ratings yet
Understanding Standard Deviation
9 pages
Lecture 4
No ratings yet
Lecture 4
38 pages
Measure of Dispersion-1
No ratings yet
Measure of Dispersion-1
17 pages
Evaluation of Analytical Data
No ratings yet
Evaluation of Analytical Data
58 pages
3 Measures of Dispersion 2
No ratings yet
3 Measures of Dispersion 2
6 pages
Lecture 2 2014 Random Errors in Chemical Analysis
No ratings yet
Lecture 2 2014 Random Errors in Chemical Analysis
24 pages
Analytical Chemistry Finals Reviewer
No ratings yet
Analytical Chemistry Finals Reviewer
10 pages
Measures of Variability
No ratings yet
Measures of Variability
13 pages
Biostatistics Revision DR - NJ
No ratings yet
Biostatistics Revision DR - NJ
67 pages
Chapter 4 Basic Statistics
No ratings yet
Chapter 4 Basic Statistics
22 pages
Measure of Dispersion
No ratings yet
Measure of Dispersion
32 pages
Analytical Chemistry - Errors in Chemical Analyses
No ratings yet
Analytical Chemistry - Errors in Chemical Analyses
20 pages
B. Data Management Lesson Plan
No ratings yet
B. Data Management Lesson Plan
9 pages
AYURSURE (Research and Stat) 4
No ratings yet
AYURSURE (Research and Stat) 4
44 pages
Blood Urea Is A Measure of The Amount of Urea Nitrogen in The Blood
No ratings yet
Blood Urea Is A Measure of The Amount of Urea Nitrogen in The Blood
3 pages
Hall Tickt
No ratings yet
Hall Tickt
2 pages
Fe Proteins
No ratings yet
Fe Proteins
2 pages
Nuclear Spin States and Population Density
No ratings yet
Nuclear Spin States and Population Density
2 pages
Chholeterol
No ratings yet
Chholeterol
4 pages
Vitamins
No ratings yet
Vitamins
2 pages
Saturation
No ratings yet
Saturation
4 pages
Larmor Frequency
No ratings yet
Larmor Frequency
7 pages
Tem Late
No ratings yet
Tem Late
147 pages
Relation Methods
No ratings yet
Relation Methods
3 pages
Flow Techniques
No ratings yet
Flow Techniques
4 pages
Chemical Bonding - 9
No ratings yet
Chemical Bonding - 9
27 pages
Iupac - 05
No ratings yet
Iupac - 05
31 pages
II and IIIrd Law Thermodynamics SRV
No ratings yet
II and IIIrd Law Thermodynamics SRV
29 pages
Equilibroum JEE With Answer
No ratings yet
Equilibroum JEE With Answer
29 pages
Acids
No ratings yet
Acids
146 pages
Iupac - 04
No ratings yet
Iupac - 04
33 pages
Ionic Equilibrium l4
No ratings yet
Ionic Equilibrium l4
37 pages
Concentration Terms Abdul Kalam
No ratings yet
Concentration Terms Abdul Kalam
112 pages
Thermodynamics 5-9-2024 Only Notes
No ratings yet
Thermodynamics 5-9-2024 Only Notes
93 pages
Work Energy Power L5
No ratings yet
Work Energy Power L5
32 pages
Thermochemistry PDF
No ratings yet
Thermochemistry PDF
74 pages
Bus ID No.:: E-Ticket/Reservation Voucher-H
No ratings yet
Bus ID No.:: E-Ticket/Reservation Voucher-H
1 page
8 9 24 KPM To TVR
No ratings yet
8 9 24 KPM To TVR
1 page
Bus ID No.:: E-Ticket/Reservation Voucher-H
No ratings yet
Bus ID No.:: E-Ticket/Reservation Voucher-H
1 page
Bus ID No.:: E-Ticket/Reservation Voucher-H
No ratings yet
Bus ID No.:: E-Ticket/Reservation Voucher-H
1 page
Annual Exam Phy Che Bio
No ratings yet
Annual Exam Phy Che Bio
8 pages
TVR-TBM 2-2-2024
No ratings yet
TVR-TBM 2-2-2024
1 page
13 9 24 TVR To CGL
No ratings yet
13 9 24 TVR To CGL
3 pages
Coordination Compounds
No ratings yet
Coordination Compounds
27 pages
Graphs-Tables and MSR - Notes
No ratings yet
Graphs-Tables and MSR - Notes
24 pages
Measures of Central Tendency Guide
No ratings yet
Measures of Central Tendency Guide
23 pages
Eco154 Introduction To Quantitative Method II Summary
No ratings yet
Eco154 Introduction To Quantitative Method II Summary
47 pages
Sample and Sampling Designs
No ratings yet
Sample and Sampling Designs
15 pages
Chap 2
No ratings yet
Chap 2
51 pages
Unit 2 1
No ratings yet
Unit 2 1
54 pages
Nearest Neighbor Analysis
No ratings yet
Nearest Neighbor Analysis
9 pages
Lesson Plan Cum Freq
100% (2)
Lesson Plan Cum Freq
2 pages
Guidelines For Reliability Based Design
100% (1)
Guidelines For Reliability Based Design
236 pages
Practice Exercises For Week 2: Data Visualization in Tableau
No ratings yet
Practice Exercises For Week 2: Data Visualization in Tableau
5 pages
Maths U II (PART 2)
No ratings yet
Maths U II (PART 2)
17 pages
Central Tendency for Data Analysis
No ratings yet
Central Tendency for Data Analysis
4 pages
De&v Two Marks Questions With Answers
No ratings yet
De&v Two Marks Questions With Answers
19 pages
Traffic Survey
No ratings yet
Traffic Survey
65 pages
Basic Statistical Concepts For Nurses
100% (2)
Basic Statistical Concepts For Nurses
23 pages
KRIS - CLASS 9 - TEST 11 - Revision Test 1
No ratings yet
KRIS - CLASS 9 - TEST 11 - Revision Test 1
2 pages
Math
No ratings yet
Math
7 pages
Relationship Between Introverted Student Behavior and 2
No ratings yet
Relationship Between Introverted Student Behavior and 2
16 pages
Excel Statistics Guide for Students
No ratings yet
Excel Statistics Guide for Students
11 pages
Arithmetic Mean and Range Explained
No ratings yet
Arithmetic Mean and Range Explained
14 pages
Topic 5.1 - 5.3 IB TB Questions and Answers
100% (1)
Topic 5.1 - 5.3 IB TB Questions and Answers
28 pages
Psy 313 Lesson Proper 1 5
No ratings yet
Psy 313 Lesson Proper 1 5
14 pages
PROC MEANS Freq Corr Regression Annova
No ratings yet
PROC MEANS Freq Corr Regression Annova
60 pages
Bounds Based On Sample Parameters: Confidence Level (%) Pop SD (Sigma) (% LC)
No ratings yet
Bounds Based On Sample Parameters: Confidence Level (%) Pop SD (Sigma) (% LC)
12 pages
Module 16 - Analyzing Data - 2
No ratings yet
Module 16 - Analyzing Data - 2
37 pages
MLB Salary & Performance Analysis
No ratings yet
MLB Salary & Performance Analysis
23 pages
Logarithm, Geometry, Mensuration and Progression
No ratings yet
Logarithm, Geometry, Mensuration and Progression
14 pages
BMAT202L Probability & Statistics Tutorial
No ratings yet
BMAT202L Probability & Statistics Tutorial
6 pages
Stolzenberg, R. M. 1980. "The Measurement and Decomposition of Causal
No ratings yet
Stolzenberg, R. M. 1980. "The Measurement and Decomposition of Causal
31 pages
Paper Full and Solution
50% (6)
Paper Full and Solution
34 pages

Data Analysis

Uploaded by

Data Analysis

Uploaded by

Unit I

The Normal Error Curve: Gaussian Distribution Curve

Statistical Treatment of Finite Samples

Relative standard deviation

in parts per thousand

Standard deviation, s = √[ ∑𝑖=𝑛

= √(7.08-7.13)2 + (7.21-7.13)2 + …………..(7.18-7.13)2 + (7.11-7.13)2/9

= (0.0002) + (0.0006) + (0.0004) + (0.0000)/4

Given t value 1.363 1.796 2.20 2.72 3.71

You might also like