Syllabus
Syllabus
UNIVERSITY, NAGPUR
M. A. / M.Sc. STATISTICS
TO BE IMPLEMENTED
FROM
2023 – 24
&
M. Sc. STATISTICS
Semester I
Max.Marks
Passing
Credits
Duration in hrs.
Code Marks
Total Marks
Practical
Total
Theory
Practical
External
Internal
Theory
Marks
Ass
Paper1:Probability & - -
MST1T0 4 4 4 3 60 40 100 50
Distribution Theory
1
Paper 2: Statistical
MST1T0 Inference 4 - 4 4 3 60 40 100 50 -
2
Paper 3:
Electives
Industrial
Process & - -
MST1T0 4 4 4 3 60 40 100 50
Quality Control –
3
A
OR
Survival
Analysis – B
OR
Bioassay - C
Paper4:Research - -
MST1T0 4 4 4 3 60 40 100 50
Methodology
4
Practical 1 - 4 -
MST1P0 6 6 3 50 50 100 50
1
Practical 2 - 4 -
MST1P0 6 6 3 50 50 100 50
2
TOTAL 16 12 28 22 -- 340 260 600 200 100
Semester II
Max.Marks MinimumP
Duration in hrs.
Credits
Code assingMarks
TotalMarks
Practical
Theory
Total
Practical
External
Internal
Theory
Marks
Ass
MST2T01 Paper5:Sampling & 4 - 4 4 3 60 40 100 50 -
design of Experiment
Paper 6: 4 - 4 4 3 60 40 100 50 -
MST2T02
Stochastic
Processes &
Time Series
Paper7: Electives 4 - 4 4 3 60 40 100 50 -
Industrial Statistics
–A
OR
MST2T03 Reliability Theory –
B OR
Statistical Genetics
– C1/ Statistical
Ecology – C2
MST2T04 Paper8: - 8 8 4 3 60 40 100 50 -
On Job Training
MST2P03 Practical 3 - 6 6 3 4 50 50 100 - 50
MST2P04 Practical 4 - 6 6 3 4 50 50 100 - 50
TOTAL 12 20 32 22 - 340 260 600 200 100
Semester III
Max. Minimum
Credits
Duration in hrs.
Passing
Code Marks
TotalMarks
Marks
Practical
Theory
Total
Practical
External
Internal
Theory
Marks
Ass
MST Paper 9:Decision - -
3T01 4 4 4 3 60 40 100 50
Theory & Non
Parametric
Methods
MST Paper10:Linear - -
3T02 4 4 4 3 60 40 100 50
&Non linear
Modelling
Paper11: Electives
MST3T
Data Mining – A - -
03 OR 4 4 4 3 60 40 100 50
Mathematical
Programming – B
OR
Demography - C
Practical 5
MST3
P05 - 6 6 3 4 50 50 100 - 50
MST3 Practical 6 - -
6 6 3 4 50 50 100 50
P06
MST3R Research - -
8 8 4 4 60 40 100 50
P1 Project(RP)
Minor Work
TOTAL 16 12 28 22 - 340 260 600 150 150
Semester IV
Max.Marks MinimumP
Duration in hrs.
Credits
Code assingMarks
TotalMarks
Practical
Theory
Total
Practical
External
Internal
Theory
Marks
Ass
Paper 4 - 4 4 3 60 40 100 50 -
MST4T01
12Multivariate
Analysis
Paper13:Computat 4 - 4 4 3 60 40 100 50 -
MST4T02
ional Statistics
Paper14:Electives 4 - 4 4 3 60 40 100 50 -
Big Data
Analysis
MST4T03 –A
Or
Operations
Research – B
Or
Actuarial
Statistics - C
Practical 7
MST4
P07 - 4 4 2 4 50 50 100 - 50
Practical 8
MST4
P08 - 4 4 2 4 50 50 100 - 50
50 25 25 100
C] Project Work:
Project work will carry 100 marks and distribution of these 100 marks will be as
follows.
For written project work: 40 Marks- Evaluated jointly by External and Internal
Examiner
Presentation: 20 Marks- Evaluated jointly by External and Internal Examiner
Internal Assessment: 20 Marks- Evaluated by Internal Examiner
Viva Voce: 20 Marks- Evaluated by External Examiner
List of Electives
List of Electives
Note: The exit option at the end of one year of the Master’s
degree program will commence from AY 2024- 25.
Students who have joined a two-year Master’s degree
program may opt for exit at the end of the first year and
earn a PG Diploma.
SEMESTER I
PAPER I – MST1T01
Probability & Distributions Theory
Unit – III: Joint, marginal and conditional pmfs and pdfs, Basic discrete and continuous
distributions - Binomial, Poisson, negative binomial, geometric, uniform, multinomial,
hyper geometric distribution, Normal, bivariate normal, exponential. Joint distribution of
sample and induced sampling distribution of a statistic. Beta, Gamma, Cauchy, Log-
normal, Weibull, Laplace distributions.
Unit – IV: z, t and F distributions and their properties and applications. Chi-square
distribution and its properties. Compound, truncated and mixture distributions.
Distributions of quadratic forms under normality and related distribution theory. Order
statistics, their distribution and their properties, joint and marginal distribution of order
statistics. Extreme values and their asymptotic distributions (statement only) with
applications.
Outcomes :
● Students will learn the concept of probability as a measure and its various properties .
Besides this, they learn the concept of random variable, its types, probability distribution,
cumulative distribution function, and its various properties .
● Concept and importance of Central limit Theorem which helps in understanding the
limiting behavior and the concept of characteristic function and its properties.
●This course gives a strong base for advanced theoretical as well as advanced applied
probability.
● Students will get knowledge of some standard discrete and continuous distributions
along with their properties.
● Students also learn the applications of the random variables studied in real life
situations. They will be able to test whether the given random variable follows any of
the standard distributions based on which further analysis can be done.
● Students will get knowledge about some sampling distributions with their applications
and some useful inequalities.
● Students learn the concept of truncation and mixture distributions and also an
important concept of order statistics. These concepts are useful in many areas like
Acturial Science, Telecommunication and in non-parametric density estimation
References :
PAPER II – MST1T02
Statistical Inference
There are some standard statistical methods useful in testing various types of
hypotheses. These methods are widely used in almost all disciplines. Sometimes we
need to construct a test procedure if the situation is not as required in the standard
methods. The course includes basic lemma useful in construction of the test. The test
procedure also changes according to the nature of null hypothesis, alternative
hypothesis, the distribution of a random variable under consideration etc. The course
helps in constructing most powerful or uniformly most powerful tests for different types
of hypotheses for any given distribution. Some situations require sequential test
procedure.
Unit I
Unit II
Unit III
Test of hypothesis, concept of critical regions, test functions, two kinds of errors, size
function, power function, level, MP and UMP test in the class of size tests. N.P lemma,
MP test for simple against simple alternative hypothesis. UMP tests for simple null
hypothesis against one sided alternative and for one sided null against one sided
alternative in one parameter exponential family.
Unit IV
Likelihood ratio test. Asymptotic distribution of LRT statistics (without proof). Wald test,
Rao’s score test,Sequential testing. Sequential probability ratio test. Relation among
parameters Application of SPRT to Binomial, Poisson, Normal Distribution. unbiased
test, UMPUT and their existence in case of exponential family similar tests and tests
with Neyman structure.
Outcome : At the end of the course , students become well versed with
They learn about the two types of errors committed while constructing any test
and the probabilities associated with them.
References :
1) E. L. Lehman : Theory of Point estimation
2) B. K. Kale : First course on Parametric inference
3) C.R. Rao : Linear statistical inference and its applications
4) Ferguson T. S. : Mathematical statistics.
5) Zacks S. : Theory of statistical inference.
PAPER III
Elective Paper MST1T03-A
Industrial Process and Quality Control
Objective: In industry, it is important to maintain quality of the product. The quality of
the product can be checked with the help of quality control techniques available in
statistics. The course includes various methods for monitoring the quality of the
product. The control charts can be used to monitor quality of product, process, and
services in various disciplines.
Unit – I:Basic concept of process monitoring General theory and review of Shewhart
control charts for measurements and attributes (p, d = np, C, X and R chart) O.C. and
ARL for X control chart. General ideas on economic designing of control chart.
Assumptions and costs. Duncan’s model for the economic design of X chart. Moving
average and exponentially weighted moving average charts. Cu-sum charts using v
masks and decision intervals.
Students develop the ability to apply various types of control charts as given below in
different industries.
Unit – II:Life tables, Failure rate, mean residual life and their elementary
properticsAgeing classes IFR ,IFRA, NBU, NBUE, HNBUE and their duals, Bathtub Failure
rate.
Unit – IV:Semi Parametric regression for failure rate Cox’s Proportional hazards model
with one and several covariates.
Semi parametric regression for failure rate, Cox’s proportional hazard models.
References:
1. Cox, D.R. and Oakes, D.(1984) Analysis of Survival Data, Chapman and Hall,
Newyork.
3. Elandt Johnson, R.E. Johnso9n NL (1980) Survival models and Data Analysis,
John Wiley and sons.
5. Zacks, S. Reliability.
●Types of biological assays and methods for estimating dose response relationship.
●Logit and probit approach for estimating dose-response relationship.
References:
1.GovindRajulu,Z.(2000):Statistical Techniques in Bioassay,S.Karger.
2. Finney, D.J.(1971):Statistical Methods in Bioassay, Griffin.
3.Finney, D.J.(1971): Probit Analysis 3rdED.), Griffin.
4.Weatherile, G.B.(1966):Sequential Methods in Statistics, Methuen.
Paper IV – MST1T04
Research Methodology
Course objectives:
o To understand the role of research methodology in Engineering/Science/Pharmacy
o To understand literature review process and formulation of a research problem
o To understand methods and basic instrumentation
o To learn technical writing and communication skills required for research
o To learn about intellectual property rights and patents
Reference Books:
1. Ranjit Kumar, "Research Methodology: A Step by Step Guide for Beginners",
SAGE Publications Ltd., 2011.
2. Wayne Goddard, Stuart Melville, "Research Methodology: An Introduction"
JUTA and Company Ltd, 2004.
3. C.R. Kothari ,"Research Methodology: Methods and Trends",
New Age International,2004
4. S.D. Sharma , "Operational Research", Kedar Nath Ram Nath & Co.,1972
5. B.L. Wadehra,"Law Relating to Patents,Trademarks, Copyright Designs and
Geographical Indications", Universal Law Publishing, 2014.
6. Donald Cooper, Pamela Schindler, "Business Research Methods",
McGraw-Hill publication, 2005.
SEMESTER II
Paper I – MST2T01
Sampling & Designs of Experiments
Unit I:-
Basic method of sample selection: -Simple random sampling with replacement, Simple
random sampling without replacement. Unequal probability sampling: PPS WR/WORand
related estimators of finite population mean (Des-Raj estimators for general sample
size.) Horvitz Thompson’s estimator
Unit – II: Use of supplementary information for estimation: - Ratio and Regression
method of estimation based on SRSWOR. Double sampling for estimating strata sizes
in ratio and regression method of estimation. Cluster sampling, equal and unequal
sizes.
Unit – III:- Analysis of variance, elementary concepts (one and 2 way classified data )
Review of elementary design (CRD, RBD, LSD) Missing plot technique in RBD and LSD
with one and two missing values (only estimation of missing values)
BIBD : Elementary parametric relations, Analysis. Definitions and parametric relations of
SBIBD, RBIBD, ARBIBD, PBIBD.
Unit – IV: Analysis of covariance of one-way and two-way classified data., split plot
design: construction. factorial experiments, factorial effects, best estimates and testing
3
the significance of factorial effects, study of 2 factorial experiments in RBD.
Confounding in factorial experiments, complete and partial confounding.
Outcome :
Students will get knowledge about various methods of sampling like SRSWOR,
SRSWR, Stratified random sampling, Cluster sampling, systematic sampling etc.
Students will get knowledge about the efficiency of one method of sampling with
respect to the other so that in any given situation, they will be able to apply an
appropriate method of sampling.
They also learn about ratio method of estimation and regression method of
estimation when the data on the associated variable is also available along with
a study variable.
They learn basic design like CRD, RBD, LSD, techniques of estimating missing
values (if any) in the data. Use of BIBD and its various types such as RBIBD,
ARBIBD, SBIBD.
They learn use of factorial experiments as per the requirement of the situation
where the effects of treatments are studied at two levels and their interactions.
Use of confounding in factorial experiments to maintain homogeneity of the
block.
References:
Paper II – MST2T02
Stochastic Processes & Time Series
Objectives :
Almost all processes that we come across in real life are stochastic in nature. The
course includes some basic standard stochastic processes which help in understanding
the real life situations. Many processes are observed to be Markovian in nature. Study
of such processes is also included in this course. A time series is a series of data points
indexed in time order. When this data is plotted, one can observe trends and other
variations. Time series analysis is used to study the patterns of variations in data with
time. These methods are useful in finance, weather forecasting, agriculture, etc. where
the primary goal is forecasting. Objective here is to introduce the concept of time series
,different methods of analyzing and modeling the time series data and their use in
forecasting.
Unit IV : Stationary processes: General linear processes, moving average (MA), auto
regressive
(AR), and autoregressive moving average (ARMA). Auto regressive integrated moving
average
(ARIMA) models, Box –Jenkins models Stationarity and inevitability conditions.
Nonstationary and seasonal time series models: Seasonal ARIMA (SARIMA) models,
Transfer function models (Time series regression). Estimation of ARIMA model
parameters, maximum likelihood method, large sample theory (without proofs). Choice
of AR and MA periods, FPE, AIC, BIC, residual analysis and diagnostic checking.
The definition of stochastic processes and its classification. Markov chain and
its applications in different areas, Classification of states and various results
associated with Markov Chain.
Random walk model, Poisson processes & branching processes and will be able
to apply them in real life situations.
References :
1. J. Medhi : Stochastic Processes.
2. S. Karlin and H Taylor : First course in stochastic processes.
3. W. Feller : Introduction to probability theory and its applications Vol.
1.
4. Brockwell, P.J. and Davis, R. A. (2003). Introduction to Time Series Analysis,
Springer
5. Chatfield, C. (2001). Time Series Forecasting, Chapmann &Hall, London
6. Fuller, W. A. (1996). Introduction to Statistical Time Series, 2nd Ed. Wiley.
7. Hamilton N. Y. (1994). Time Series Analysis. Princeton University press.
8. Box, G.E.P & Jenkins G.M (1976): Time Series Analysis – Forecasting & Coontrol ,
Holden-Day, San Francisco .
9. Montgomery, D.C & Johnson, L.A ( 1977) : Forecasting and Time Series Analysis,
McGraw Hill
Elective Paper
Industrial Statistics
Objective: Objective here is to introduce to the students a branch of statistics that helps
in maintaining quality in the industry. Course contains various methods for quality
maintenance.
●Use of design of experiments for quality improvement and process Capability Analysis,
various capability indices.
References :
Unit III:. Reliability estimation based on failure times in variously censored life tests and
in tests with replacement of failed items, stress and strength reliability and its
estimation. aintenance and replacement policies, availability of repairable systems,
modeling of repairable system by a non-homogeneous Poisson process.
Unit IV: Reliability growth models, probability plotting techniques, Hollander- Proschan
and Deshpande tests for exponentially, tests for HPP vs. NHPP with repairable systems.
Outcome: Students learn about the following concepts in reliability and their
applications
REFERENCES:
1. Barlow R E and Proschan F (1985), Statistical Theory of Reliability and Life Testing .
2. Lawless J.F. (1982) Statistical Models and Methods of Life Time Data.
3. Bain L. J Engelhardt (1991), Statistical Analysis of Reliability and Life Testing Model.
4. Zacks S, Reliability Theory.
5. D C Montgomery-Design and Analysis of Experiments.
6. R H Myers and D C Montgomery –Response Surface Methodology.
7. J Fox: Quality through Design
8. J A Nelder and P McCullasn Generalized Linear Models.
Unit I: Basic biological concepts in genetics. Mendel’s law. Hardy Weinberge equilibrium.
Matrix theory of random mating. Mating tables. Estimation of allel frequency for
dominant and co dominant cases. Approach to equilibrium for X-linked gene.
Unit II:. Non random mating. Inbreeding. Coefficients of inbreeding. Inbreeding in
randomly mating populations of finite size. Phenotypic assortative mating.
Unit III: Natural selection, mutation, genetic drift. Equilibrium when both natural
selection and mutation are operative. Statistical problems in human genetics, Blood
group analysis.
Unit IV: Analysis of family data : (a) Relative pair data, I ,T,O matrices, identity by
descent. (b) Family data- estimation ofsegregation ratio under ascertainment bias. (c)
Pedigree data –Elston- Stewart algorithm for calculation of likelihoods, linkage,
Detection and estimation of linkage, estimation of recombination fraction, inheritance of
quantitative trials models and estimation of parameters.
Outcome: Students learn about the basic concepts of genetics and their development
using statistical tools
References:
1. Li,C.C.(1976):First Course on Population genetics. Boxwood Press, California.
2. Ewens, W.J.(1979):Mathematical Population genetics ,Springer Verlag.
3.Nagylaki, T.(1992): Introduction to theoretical population genetics. Springer
Verlag
4. Elandt – Johnson Probability Models and Statistical Methods in Genetics. John
Wiley
Unit – I:Population Dynamics One species exponential, logistic and Gompertz models,.
Two species competition, coexistence, predator prey oscillation, Lotka-Volterra
Equations, isoclines, Lestie matrix model for age structured populations. Survivorship
curves constant hazard rate, monotone hazard rate and bath tub shaped hazard rates.
Unit – II:Population density estimation: Capture recapture modesls, nearest neighbor
models, Line transect sampling, Ecological Diversin, Simpsons index, Diversity as
average rarity.
Unit – III:Optimal Harvesting of Natural Resources, Maximum Sustainble field, tragedy
of the commons Game theory in ecology, concepts of Evolutionarily stable strategy, its
Properties, simple cases such as Hawk-Dove geme.
Unit – IV: Foraging Theory : Diet choice Problem, patch choice problem meanvariance
trade off.
Outcome: Students develop the ability to analyze life processes, biodiversity etc using
statistical methods in ecology which includes
Various models of population dynamics for one species, two species, their co-
existence.
References:
1. Gore, A.P. and Paranjpe S.A. (2000) A course on Mathematical and Statistical
Ecology, Kluwer Academic Publishers.
2. Pielou, E.C. (1977) An Introduction to Mathematical Ecology (Wiley)
3. Seber, G.A.F (1982) The Estimation of animal abundance and related parameters
(2nd Ed) (Grittin)
4. Clark, C.W. (1976) Mathematical bio-economics : the optimal management of
renewable resources (John wiley)
5. Maynard Smith J. (1982) Evolution and the theory of games (Cambridge
University Press)
6. Stephenes, D.W. & Krebs JR (1986) Foraging Theory (Princeton University Press)
PAPER IV
SEMESTER III
Paper I – MST3T01
Decision Theory & Non Parametric methods
Objective : Decision making is very important in all walks of life. The statistical aspect
of decision making is based on the risk involved in any decision. There are methods
which come out with the decision having minimum risk. The objective here is to
introduce the topic to the students giving only basic knowledge of it. There are many
real life situations where the assumptions required for the application of parametric
statistical methods are not satisfied. Nonparametric inference is a branch of statistics
which has solutions in such situations. The course includes various non-parametric
methods.
Unit – I: Decision problem, loss function, expected loss, decision rules (nonrandomized
and randomized), decision principles (conditional Bayes, frequentist) inference and
estimation problems as decision problems, criterion of optimal decision rules. Concepts
of admissibility and completeness, Bayes rules, minimax rules, admissibility of Bayes
rules. Existence of Bayes decision rules.
Unit – II: Definition of non-parametric test, Advantages and disadvantages of Non-
parametric tests. Single sample problems :
a) test of randomness
b) test of goodness of fit : Empirical distribution function.
Kolmogorov – Smirnov test, 2 test, Comparison of 2 test & KS test
c) One sample problem of location : sign. Test, Wilcoxon’s signed rank
test,
Wilcoxons paired sample signed rank test
Unit –III: Two sample problems : different types of alternatives, sign test, Wilcoxans
two sample rank sum test, Wald-Wolfowitz run test, Mann-Whitney-Wilcoxons test,
Median test, KS-two sample test. Klotz Normal score test.
One sample U-statistic, Kernel and symmetric Kernel Variance of U-Statistic, two-
sample U-statistic, Linear rank statistics and their distributional properties under null
hypothesis.
Unit – IV: Concept of time order and random censoring, likelihood in these cases,
survival function, hazard function Non-parametric Estimation of Survival function, Cox’s
proportional hazards model, the actuarial estimator, Kaplan – Meier Estimator.
●To formulate decision making problem, methods to solve the problem by defining
decision functions and risks involved and the methods of minimizing the risk.
● How and when to apply various non-parametric methods to different types of data for
testing various types of hypotheses.
●Various nonparametric tests for one sample, paired samples and two independent
samples problem.
●Concept of censoring the data, its need and its types. Parametric and nonparametric
methods of analyzing the censored data.
References:
1) Ferguson T. S. : Mathematical Statistics – A decision theoretic approach
2) Berger J. O. : statistical decision theory and Bayesian analysis
3) Gibbons J.D. : Non parametric Statistical inference
4) Randles and Wolfe : Introduction to the theory of non parametric statistics.
Paper II – MST3T02
The objective of this course is to impart knowledge about the use different useful tools
used in regression analysis. The relationship between variables can be of different
types like linear, nonlinear etc. The relationship is represented in terms of a model. The
adequacy of any model can be checked using residual plots and residual analysis.
Appropriate statistical tools are required to check for the violations of model
assumptions and for dealing with problems of multicollinearity etc.
Unit – I: MultipleLinear regression : Model assumptions and checking for the violations
of model assumption., Residual analysis – definition of residuals, standardized
residuals, residual plots, statistical tests on residuals, Press statistics. Transformation
of variables, Box-Cox power transformation.
Unit – III: Variable selection : Problem of variable selection, criteria for evaluation
subset regression models (choosing subsets), coefficient of multiple determination,
residual mean square, Mallow’s Cp Statistics. Computational Techniques for variable
selection-Forward selection, Backward elimination, stepwise regression.
Unit – IV: Generalized linear models : Exponential families, Definition of GLM, Link
function, Estimation of parameters and inference in GLM.
Logistic regression model : Link function, logit, probit, complementary log-log,
estimation of parameters, odds ratio, hypothesis testing using model deviance.
Outcome: Students will get knowledge about
● To interpret different types of plots such as residual plots, normal probability plots
etc. To check for the violations of model assumptions using residual analysis and other
statistical tests.
Elective Paper
Paper III MST3T03 - A
Data Mining
Objective: All over the globe, a huge amount of data is getting generated at a very high
rate. This huge data needs to be analyzed everywhere around us . Data mining is an
interdisciplinary subfield of computer science and statistics. The techniques are useful
in discovering patterns in large data sets. Objective here is to introduce the students to
this branch of statistics and impart knowledge in data processing, data management,
analysis of large data, model and inference consideration and online updating.
●Use of machine learning and statistical models to uncover the hidden patterns in large
volume of data.
●Clustering methods from statistical and data mining point of view.Unsupervised and
supervised learning of data in different cases.
References:
1. Berson, A and Smith, S.J. (1997) Data Ware housing, Data mining and OLAP
(McGraw-Hill)
2. Brieman, L. Friedman, J.H. Olshen, RA, and Stone, C.J. (1984) Classification and
regression Trees
3. Han, J and Kamber, M (2000) Data Mining, Concepts and Techniques (Morgan
Kaufmann)
4. Mitchell, T.M. (1997) Machine Learning (McGraw Hill)
5. Ripley, B.D. (1996) Pattern Recognition and Neural Networks (Cambridge
University Press)
● To formulate and solve linear programming problem (LPP). They also learn various
methods to solve LPP.Application of LPP in industry, management, transportation,
assignment etc.
●Pure and mixed integer linear programming problem and formulation of non linear
programming problem and different methods to solve them.
●The problem and different methods of solving two person zero sum game.
References:
References :
PAPER IV
Research Project (Minor)
SEMESTER IV
Paper I – MST4T01
Multivariate Analysis
Objective: Multivariate Analysis is a branch of statistics where data on two or more
variables are analyzed simultaneously. Most of the statistical methods in univariate
analysis can be extended to the case of two or more variables. There are many
situations where we need to study effect of two or more independent variables on one
variable. This is studied as an extension of simple correlation and regression in case of
one independent variable. This course consists of all such distributions, statistical
methods etc. which are multivariate analogues of corresponding univariate analysis.
Many results that are derived only for multivariate cases are also included in the course.
Unit – I: Correlation : multiple and partial correlation. Linear and multiple regression co-
efficient of determination and its uses. Tests of significance of multiple and partial
correlation coefficient. Multivariate normal distribution, singular and nonsingular
normal distribution, characteristic function, moments, marginal and conditional
distributions, maximum likelihood estimators of the parameters of multivariate normal
distribution .
Unit – II: Wishart matrix-its distribution without proof and properties.Distribution of
MLEs of parameters of Multivariate Normal distribution ,Distribution of sample
generalized variance, Applications in testing and interval estimation, Wilks λ
[Introduction, definition, distribution (statement only)].
Unit – III: Hotelling’s T2 statistic and its null distribution. Application in tests on mean
vector for one and more multivariate normal populations and also on the equality of
the components of a mean vector in a multivariate normal population. Application of
T2 statistic and its relationship with Mahalanobis’ D2 statistic. Confidence region for the
mean vector. Applications of D2 statistics.
Unit – IV: Classification and discrimination : procedures for discrimination between two
multivariate normal populations. Fisher’s discriminant function, tests associated with
discriminant function, Sample discriminant function. Probabilities of misclassification
and their estimation. Classification into more than two multivariate populations.
Principal components. Dimension reduction. Canonical variables and anonical
correlation, definition, uses, estimation and computation.
Outcome: Students develop the ability of handling multivariate data and to draw
inferences from such data using following methods.
●Hotellings T2, its null distribution and its applications for testing hypotheses
associated with mean vector (vectors).
References :
●Some important methods of handling missing data and incomplete data problems like
EM algorithm etc.
Elective Paper
Unit II: Structured &Semi-structured data, relational and non relational data bases, real
time analytics using Hadoop systems, Hadoop ecosystem, Hadoop distributed file
system, stream processing engines, introducing &understanding text mining processes.
Unit III: Statistical methodologies for big data, re-sampling based methods, bag of little
bootstrap (BLB), leveraging, mean log likelihood, MCMC methods, divide & conquer
methods for linear regression (univariate & multivariate) models and GLM, online
updating method, implementation of factor analysis, cohort analysis and Time series
analysis.
Unit IV: Open source R and R packages, command level R, learning programming
concepts of R, breaking memory barriers,data management, numerical calculation,
sentiment analysis, R packages bigmemory & ff.
Outcome :
Students will learn application of statistical concepts to very large data sets.
References:
1. BigData (covers Hadoop 2, MapReduce, Hive, YARN, Pig, Rand Data Visualization)
Black Book, DTE ditorial Services, Dreamtech Press.
2. Data Science & BigData Analytics Discovering, Analyzing, Visualizing and Presenting
data EMC Education Services, Wiley Publication.
3. Beginner’s guide for data analysis using R : Jeeva, Jone, Khanna Publication.
4. Practical Statistics for Data Scientists: By Peter Bruce and Andrew Bruce
Operations Research
Objective: Operations research deals with the application of advanced analytical
methods which helps in taking better decisions. The course includes advanced
techniques that are useful in business, management, industry, project planning etc.
Unit – I:Inventory problems : Structure of inventory problem, EOQ formula, EOQ model
with uniform rate of demand & having no shortages, EOQ model with different rate of
demand in different cycles having no shortages, EOQ model with uniform rate of
demand & finite rate of replenishment having no shortages, EOQ model with uniform
rate of demand & finite rate of replenishment having shortages, EOQ model with
uniform rate of demand, infinite rate of replenishment having shortages, EOQ model
with single & double price breaks.
Unit – II:Single period probabilistic inventory models with
i) instantaneous demand & discrete units
ii) instantaneous demand& continuous units
iii) Continuous demand & discrete units
iv) Continuous demand & continuous units
References:
1) Taha H. A. : Operations Research
2) Hiller & Liberman ; Introduction to Operations research.
3) Kantiswaroop Gupta and Singh : Operations research.
4) Gross D and Harris C. M. : Fundamentals of queueing theory.
Unit I: Life table and its relation with survival function, assumptions for fractional ages,
some analytical laws of mortality, select and ultimate tables. Multiple life functions, joint
and last survivor status, insurance and annuity benefits through multiple life functions.
Multiple decrement models, deterministic and random survivor groups, associated
single decrement tables, central rates of multiple decrement, net single premiums and
their numerical evaluations.
Unit II:.Principals of compound interest: Nominal and effective rates of interest and
discount, force of interest and discount, compound interest, accumulation factor,
continuous compounding.
Life insurance : Insurance payable at the moment of death and at the end of the year of
death-level benefit insurance, endowment insurance, diferred insurance and varying
benefit insurance, recursion, commutation functions.
Unit III: Life annuities : Single payment, continuous life annuities, discrete life annuities,
life annuities with monthly payments, commutation functions, varying annuities,
recursion, complete annuities- immediate and apportionable annuities-due. Net
premiums : Continuous and discrete premiums, true monthly payments premiums,
apportionable premiums, commutation functions, accumulation type benefits.
Unit IV: Net premium reserves : Continuous and discrete net premium reserves on a
semi continuous basis, reserves based on true monthly premiums, reserves on an
apportionable or discounted continuous basis, reserves at fractional duration,
allocations of loss to policy years, recursive formulas and differential equations for
reserves, commutation functions. Some practical considerations: Premiums that
include expenses – general expenses, types of expenses, per policy expenses. Claim
amount distributions, approximating the individual model, stop-loss insurance.
Outcome: The course will make the students aware about an important branch of
Statistics called Actuarial Statistics by studying the following topics
●Life tables, its relation with survival function and application to life insurance.
References:
1. Bowers, N.L.; Gerber, H.U.; Hickman,J.C.; Jones D.A. and Nesbitt, C.J.(1986) :
Actuarial Mathematics. Society of Actuarials, Ithaca, Illiois, U.S.A. Second Ed
(1977).
2. Deshmukh S.R (2009): An introduction to Actuarial Statistics using R, Uni.Press
3. Spurgeon E.T (1972): Life Contingencies, Cambridge University.
Paper IV