Research Modure
Research Modure
COURSE OBJECTIVES
By the end of the module students should be able to;
   a.   Describe basic concepts in research, monitoring and evaluation
   b.   Outline steps in designing data collection tools for projects
   c.   Collect and analyze data using appropriate analytical tools
   d.   Develop and carry out research projects
A. INTRODUCTION
Meaning of Research
    In simple terms: - It’s the process of finding out information about something or an issue/
     an activity designed to provide answers to questions about simple day-to-day life
     activities/ It’s a way of gathering evidence
    In scientific terms, research is a mechanism of collecting, classifying, analyzing
     discussing summarizing and drawing valid conclusions about a population based on
     information contained in a sample
    Research is a systematic and objective search for, and analysis of, information relevant to
     the identification and solution of a problem or to the discovery of a principle
               Systematic approach: Involves careful planning through all stages of research
                  i.e clear and concise statement of the problem, required information,
                  analytical techniques
               Objectivity: Bringing new and reliable information for better decisions.
                  Hence, scientific methods are crossly accepted.
               Analysis: Data must be converted into information before it becomes useful in
                  decision making
Importance of Research
            Understanding a phenomenon/issue
            To gain knowledge about phenomena that may or may not have applications in
             the near future
            It helps researchers to apply techniques learnt from other disciplines to solve
             problems (Applied Research), therefore it helps in formulating models…)
            Research is geared at answering questions of immediate importance and
             contributes to theory testing processes
            To come up with solutions to address challenges in the society or any profession.
            To develop new techniques and procedures that form the body of research
             methodology
            It also assists us to approach our research in a manner that incorporates reality by
             coming up with conclusions and recommendations that are implementable
e.g. a particular product may not be selling well and the manager might want to find the reason
for this in order to take corrective action.
Research Ethics
There are some ethical issues involved in conducting reliable research.
What are Research Ethics?
 Ethics in research refer to a code of conduct or expected norm of behaviour while conducting
  research.
 Following research ethics and norms helps a researcher to pursue research objective rather
  than self-interests.
 Ethical conduct should be reflected in the entire research team including the researcher,
  participants providing the data, analysts who provide the results etc.
Ethics in data Collection
Ethics in data collection pertains to those who sponsor the research, those who collect the data
and those who provide the information.
  I. Sponsor
     The sponsor should respect the confidentiality of information obtained by the researcher.
     S/he should not ask for disclosure of individual sources of information. The sponsor can
     only request to see questionnaires for monitoring purposes
 II. Researchers
     Have informed consent from participants
     They should know the risks, objectives and agreed voluntarily to participate
      Safeguard social and physical environment of participants
      Explain reasons for the study
      Treat information given with strict confidentiality
      Personal or intrusive information should not be solicited or with strong reasons if it is
       needed by the project,
     No-participant observers should not be as non-intrusive as possible. If they intrude, the
       quality of data may be affected
     Never expose subjects/ respondents to situations that will physically or mentally harm
       them,
     Avoid distortions and misinterpretations of collected data.
     When children or other vulnerable groups are involved you need clearance & consent
       from guardians
     Always get authorization from relevant authorities
        Do research to the benefit of society/community i.e. academic community &
       development community •
     Safeguard against disseminating results that will be detrimental to society.
     Never claim knowledge of something you did not generate (lying/ failing to acknowledge
       source)
III. Respondents
    Ethical behaviour of respondents hinges on:
      Full cooperation once offer to participate is accepted,
      Truthfulness and honesty in responses.
2CLASSIFICATION OF RESEARCH
Research is generally classified into 3 types:
    Experimental Research
Biology, chemistry etc lab experiments
    Observational Research
Observation of socio-economic problems through questionnaires and interviews etc
Research methods are split broadly into quantitative and qualitative methods .
QUANTITATIVE RESEARCH METHODOLOGIES
Quantitative research
– Research strategy based on measurement of quantity or amount. – It is used to quantify the
problem by way of generating numerical data or data that can be transformed into useable
statistics. – Used to quantify attitudes, opinions, behaviours, and other variables. – Emphasizes
quantification in the collection and analysis of data. – E.g. Survey using structured
questionnaires, secondary quantitative data collection.
There are four main types of quantitative research designs: descriptive, correlational, quasi-
experimental and experimental
EXPERIMENTAL RESEARCH
   Is one of the most powerful research methodologies that researchers can use.
   Experiments are the best way to establish cause and effect relationships.
In experimental studies, researches look at the effect(s) of at least one in dependent variable on
one hand or more dependent variables.
An independent variable is also known as the Experimental or treatment variable (e.g. fertiliser).
The dependent variable is also known as the Criterion or Outcome variable.
The major characteristic of experimental research distinguishing it from all other types of
research is that researchers manipulate the independent variable e.g. change fertilizer dose to see
or observe changes in yield of a crop.
ESSENTIAL CHARACTERISTICS OF EXPERIMENTAL RESEARCH
The basic idea underlying all experimental research is quite simple: try something and
systematically observe what happens.
Comparison of groups
   An experiment involves two groups of subjects:
    - An experimental group
    - A control or comparison group
   It is however possible to conduct an experiment with only one group or three or more groups.
   The experimental group receives treatment of some sort .
   The control group receives no treatment or the comparison group receives a different
    treatment.
   The control or comparison group is very crucial in experiments because it enables the
    researcher to determine whether the treatment has had an effect or whether one treatment is
    more effective than another.
   A pure control historically receives no treatment at all (hence comparison groups since
    sometimes they receive treatment).
Randomisation
Experimental research is characterised by random selection and assignment
   Random assignment means that every individual participating in the experiment has an equal
    chance of being assigned to any of the experimental or control conditions being compared.
   Random selection on the other hand means that every member of a population has an equal
    chance of being selected to be a member of the sample.
Qualitative research refers to research studies that investigate the quality of relationships
activities situation or materials. It has more emphasis on holistic description (ie describing in
detail all of what goes on in a particular activity or situation rather than on comparing the effects
of a particular treatment)
Qualitative research strives to define human behavior and explain the reasons behind
that behavior. Often used in commercial areas such as market research, the goal of
qualitative research is to provide answers as to why and how people come to make
certain decisions. There are several different approaches to undertaking qualitative
research.
BIOGRAPHY
A biographical study is the study of a single individual and his or her experience as told by the
researcher or found in docs or the archives.
Examples of biographers include:
Biography
Biographies are life stories written by individuals themselves
Autobiography
An autobiography is a life story written by the individual himself/ herself.
Life Stories
Life stories are a combination of biographies and autobiography.
Oral Histories
Oral histories are stories in which a researcher gathers personal recollections from a variety of
individuals.
CASE STUDIES
A case involves the study of just one individual, programme or object, eg a school village or an
event. For example, if some people learn languages quickly, their behaviours and attitudes could
be studied to explore noticeable patterns in their behaviours or regularities in their behaviour.
HISTORICAL RESEARCH
Def: Historical research is the systematic collection and evaluation of data to describe, explain
     and understand actions or events that occurred in the past
   It is unique because it only focuses on the past
   It examines documents of the past, interviewing people who lived during that time.
   It aims at reconstructing the past and explain why certain things happened
ETHNOGRAPHIC RESEARCH
Is the most complex type of research because it deals with humans. It uses a variety of
approaches. It emphasizes on documenting or portraying the everyday experiences of
individuals.
Key tools used in ethnographic research include;
   In-depth interviewing,
   Continual and on-going observations of a situation
Ethnographic Concepts
These are concepts that ethnographers follow as they go about their work. These are as follows;
Culture
It is the sum of a social group’s observable patterns of behaviour, customs and ways of life.
A culture comprises ideas, beliefs and knowledge that characterises a particular group of people.
A holistic Approach
Ethnographers try to describe as much as they can about the culture of a group. Developing a
holistic perspective demands that the ethnographer spend a great deal of time out in the field
gathering data, only then can he/she develop a picture of the social or cultural whole of what is
being studied
Contextualisation
Contextualisation involves placing what was seen and heard into a larger perspective.
Emic perspective
An emic perspective is an insiders’ perspective of reality. It helps in understanding and
accurately describing behaviours and situations.
Member checking
Aims at validating what the ethnographer has written. Participants review what the researcher
has written.
Non-judgemental orientation
This aims at preventing researchers from making judgements about unfamiliar practices,
It helps to guard against biases.
RESEARCH BY PRACTITIONERS
ACTION RESEARCH
Action research is research conducted by one or more individuals or groups for the purpose of solving a
problem or obtaining information in order to inform local practice.
Sampling
    Sampling is a process of selecting a representative number of elements from a population.
    Sampling errors will always occur when a sample (and not a population) is used.
    Proper and accurate sampling therefore helps us to draw or establish closer relationships
       between a sample and the population (through a sample statistic or population
       parameter)
Basic Elements of Sampling
Population = entire group of people, or things of interest that researcher wishes to investigate.
           ◦    e.g., population of Lilongwe district comprises of all households in Lilongwe. Or
                AH25 class comprises of 31 students.
Element = a single member of the population.
           ◦    e.g., if AH25 class has 31 students, each student is an element
Sample = a subset of the population, i.e., as a finite part of a statistical population whose
properties are used to make estimates about the population
           ◦    e.g., if 20 students are randomly selected from 31 students in AH25 class, the 20
                students form the sample.
Sampling unit = an element (or set of elements) that is available for selection at some stage of
the
               sampling process, i.e., the unit of sampling
         ◦ e.g., if you wanted to select sample class, EPAs & sample households, the EPA,
             class and household comprise the sampling units
Sampling frame = list of elements (population members) from which you draw your sample.
           ◦    e.g. List of EPAs, Sections or households
Parameters = characteristics of the population
           ◦    e.g., population mean, SD & variance
Statistics = characteristics of the sample
           ◦    e.g., sample mean, SD & variance
Sampling error The difference between the results obtained from the survey sample and those
               that would have been obtained had the entire population been surveyed (a census).
Size of sampling error varies both with the size of the sample and with the percentages giving a
particular response
Why get a sample?
   Financial limitations,
   Logistical and geographical limitations,
Types of Sampling
There are two types of sampling: (i) probability sampling and (ii) nonprobability sampling.
probability sampling
In the case of probability sampling, the probability or chance of every unit in the population
being included in the sample is known due to randomization involved in the process. It is thus a
method of sampling that utilizes some form of random selection. Some methods of probability
sampling are as follows:
     a) simple random sampling
It is a method of selecting units out of a population where each element (item or person) has an
equal chance of being selected.
If somebody wanted to select n units from population N, each unit will have 1/N chance of being
selected.
Certain computer software can also help in randomly selecting units out of a population such as
SPSS or GENSTAT or some sophisticated calculators can generate random numbers.
        Steps:
            1. Obtain a complete sampling frame
            2. Assign each case a unique number starting from one.
            3. Decide on the required sample size.
            4. Select sample in form of raffle draw or using Ms excel
       Example: Consider a population of 255 EPAs, you want to draw a stratified random
       sample of 51 EPAs using proportional allocation. If stratum 1 (has) = 44 EPA’s, 2= 116
       EPA’s, 3= 48 EPA’s, and 4= 47 EPA’s. Then from each stratum you would sample the
       following EPA’s:
           •    N = 255, required sample size (n) = 51. Then overall sampling fraction is 51/255.
                From each stratum:
Stratum                   # of EPAs
  1                       44    >     n1 = 44*51/255 =     9
  2                       116   >     n2 = 116*51/255 = 23
  3                       48    >     n3 = 48*51/255 = 10
  4                       47    >     n4 = 47*51/255 =     9____
                                                          51____
d) Cluster sampling
Signifies that instead of selecting individual units from the population, entire group or clusters
are selected at random.
In cluster sampling, first we divide the population into clusters (usually along geographic
boundaries).
• Then we randomly select some clusters from all clusters formed to measure all units within
sampled clusters in the end. •
 Steps on how to select a cluster sample?
       Step 1: List all the sections in an EPA, and draw a random sample of section from this
       list.
       Step 2: From all the selected Sections, list all the farmers/households, then select a
       sample of farmers/households.
NON-PROBABILITY SAMPLING
Non-probability sampling does not involve the process of random selection, that is, in the case
on non-probability sampling, the probability of selection of each sampling unit is not known.
There is no rational way to prove/know whether the selected sample is representative of the
population.
In applied social research due to constraints such as time and cost and objectives of the research
study there are circumstances when it is not feasible to adopt a random process of selection and
in those circumstances usually nonprobabilistic sampling is adopted.
Non-probability sampling methods
                    a) Accidental or Convenience Sampling
                        as the name suggests, sampling units are selected out of convenience, for
                        example, in clinical practice, researchers are forced to use clients who are
                        available as samples, as they do not have many options.
     Are samples that are taken without plan among a set of units that are readily
        available.But, they are not random samples, and usually contain important information.
     Though the sampling appears to be random, the final sample is not representative of the
        entire population.
     Such samples may include a larger proportion of people possessing certain
        characteristics, than what is there in the population at large.
     Results from such samples cannot be considered to be accurate.
                    b) Purposive Sampling
 As the name suggests, is done with a purpose, which means that selection of sampling units is
purposive in nature. Purposive sampling can be very useful for situations where you need to
reach a targeted sample quickly and where a random process of selection or proportionality is not
the primary concern.
       These samples are not based on randomness.
       They are based on ones’ judgment that the sample will give them the information they are
       interested in (or have a better understanding of the problem they are examining)
       Such study will be more qualitative than quantitative.
                   c) Quota Sampling, .
      Shares some characteristics with stratified random samples, but they are non-
     probabilistic.
    They include various groups in the same proportions as in the general population.
    The groups are defined based on a certain criteria, such as, sex, age, income levels,
     ethnicity, etc.
    Quota samples reflect the characteristic of the population, but they are not probability
     samples
                   d) Expert Sampling
    Involves selecting a sample of persons, who are known to have demonstrable experience
     and expertise in a particular area of study interest.
    Researchers resort to expert sampling because it serves as the best way to elicit the views
     of persons who have specific expertise in the study area.
    Expert sampling, in some cases, may also be used to provide evidence for the validity of
     another sampling approach chosen for the study.
                    e) Snowball sampling
            generally used in the case of explorative research study/design, where researchers
             do not have much lead information.
            Snowball sampling is especially useful when you are trying to reach populations
             that are inaccessible or difficult to find, for example, in the case of identifying
             injecting drug users
       ERRORS IN SAMPLING
Sampling Error: It is defined as the differences between the sample and the population, which
occurs solely due to the nature or process in which particular units have been selected.
The first is chance, that is, due to chance some unusual/variant units, which exist in every
population, get selected as there is always a possibility of such selection. Researchers can avoid
this error by increasing sample size, which would minimize the probability of selection of
unusual/variant units. •
The second cause of sampling error is sampling bias, that is, the tendency to favour the selection
of units that have particular characteristics.
II. Non-sampling error
Defined as an error that results solely from the manner in which the observations are made and
the reason could be researchers’ fault in designing questionnaires, interviewers’ negligence in
asking questions, or even analysts’ negligence in analysing data.
• Non-sampling error may be due to human error, that is, error made by the interviewer, if he is
not able to communicate the objective of the study or he is not able to cull out the response from
respondents or it could be due to fault in the research tool/instrument.. •
The simplest example of non-sampling error is inaccurate measurements due to poor procedures
and poor measurement tools
 SAMPLING BIAS
General precautions to be followed in order to prevent sampling bias
  Do not use samples chosen at will by interviewer supervisor or director.
  Do not choose samples exclusively from particular groups such as only people you know.
  Do not restrict your sample to people living in easily accessible households. (Avoid
    roadside bias).
  Do not omit households where you did not find anyone at home during first visit revisit the
    later
Biases can be done through improper use of data collection methods. These biases and how
Table 1: Biases, their description and how they can be avoided or minimized
DATA COLLECTION
DATA
Data is raw information gathered through interviews, questionnaires, observations or secondary
sources. The type of data someone intends to collect/ use will determine the statistical model to
use.
CLASSIFICATION OF DATA
            Data are classified into the following:
                         Cross-sectional data
                         Time series data
                         Panel data
   A. Cross-sectional data
                o Usually contain independent observations
                o Exclude time factors or contains no element of time factors, and hence is
                   named spot data
                o Are analyzed through static models such as regression models, qualitative
                   models and simultaneous models
   B. Time series data
         – Usually contain inter-dependent observations
         – Includes time factors or patterns
         – Are analyzed through time series models, e.g.,
                • Autoregressive Integrated Moving Average (ARIMA),
                • Autoregressive Moving Average (ARMA),
                 • lag or dynamic models
   C. Panel Data
      Hybrid of cross-sectional and time-series
      • Repeated data collection on the same observation over similar time period
      • Are analyzed using panel data models, e.g., Fixed and random effect models
The method of data collection in research is determined by the purpose/objective of the study,
use of research results and available resources.
Combination of methods is used since there is usually a set of objectives which require a
combination of techniques.
There are three basic data collection approaches in social research:
                  A. Secondary Research;
                  B. Survey Research; and
                  C. Experimental Research.
   A. Secondary Research
What are secondary data?
   •   Secondary data are data that were collected for some purpose other than helping to solve
       the current problem, whereas primary data are collected expressly to help solve the
       problem at hand.
            Survey data are secondary data if they were collected earlier for another study and
             primary data if they were collected for the present study.
       Secondary data could be
            Internal – generated within the organization
            External – generated by outside organizations
   •   Potential secondary sources of information:
            Weather reports - rainfall, temperatures
            Soil maps - soil types, aerial photographs (natural vegetation)
            Population Census reports
            National Sample Survey of Agriculture
            Famine Early Warming Systems (FEWS) Publications
            Food Security and Nutrition Bulletins
            Economic Reports
             Statistical Yearbooks
             Integrated Household Surveys
             World Bank Annual Reports
             UNDP Human Development Reports
             FAO Food Production and Consumption Surveys
             Research reports
     •   In using the secondary data, one has to look at these aspects:
             Accuracy and reliability - should be checked by comparing data from different
              secondary sources;
             Adequacy of the data - e.g. is the rainfall data daily, weekly, monthly or annual;
              and
             Time period - recent data is more suitable. Socioeconomic secondary data that
              are more than 5 years old should be verified.
     B. Survey Research
            This is a systematic collection of information directly from respondents who are a
              sample/portion of entire population.
             Survey research may be grouped into:
             Telephone Interviews - collection of information from respondents via
              telephone;
             Mail Interviews - collection of information from respondents via mail or similar
              technique, e.g., email; and
             Personal Interviews - collection of information in a face-to-face situation:
                    o Key informant interviews;
                    o Group interviews/Focus Group Discussions;
                    o Exploratory/informal/reconnaissance survey; and
                    o Formal or verification survey.
       Disadvantages
                Personal interviews are costly
                They require properly trained interviewers to avoid bias.
        iv.        Formal (Verification) Survey
              •    Use formal methods of collecting information:
                       Collect information about a population by interviewing a random sample of the
                        population;
                       Use standardized semi-structured questionnaire – interviews conducted in a
                        uniform way by trained enumerators who ask questions in the same way using a
                        pretested written questionnaire; and
                        Responses are tabulated, studied and analysed
       C          Experiments
              •    Experiments have mostly been associated with natural (hard) sciences, e.g. physics,
                   chemistry, biology, medicine, etc.
              •    Social sciences have also borrowed the experimental concepts to conduct social
                   experiments.
QUESTIONNAIRE
       Designing a questionnaire
       Good design and filling of a questionnaire are a prerequisite to getting correct information. Once
       a data collection tool is bad, no proper conclusions and recommendations can result from a
       research process. (garbage in-garbage out)
       A good questionnaire has the following attributes;
  i.          It is complete and concise
              It obtains information required to meet survey objectives with as few questions as possible.
 ii.          It is reliable
              A good questionnaire enables the interviewer to get the same response regardless of who asks
              the question and where the question is asked. There are very few differences between
              interviewers. Proper training of interviewees can enhance reliability.
iii.          It provides valid data
              Validity means that the question elicits a true and accurate response that measures or
              explains whatever you are interested in measuring/ explaining.
iv.      Ensures consistency
         A good questionnaire allows an interviewer and an interviewee to interpret the meaning of
         the questions in the same way.
         Consistency ensures that there is a common understanding of the questions.
      HOW TO DESIGN A GOOD QUESTIONNAIRE
      A good questionnaire can be designed by following the following guidelines.
 I. Question ordering
     Questions on the same topic should follow each other,
     General questions should be asked first,
     Responses on closed questionnaires should also be well-ordered to avoid what is referred to
      as positional advantages,
     Avoid long lists of options for respondents to choose from,
     Print questionnaires with different orderings for different subsets of the sample.
       Open questions
       Open questions are questions in which a respondent is freely allowed to give an unstructured
       response.
       Open questions/ responses are difficult to analyse;
        They cannot be easily quantified,
        They cannot be easily compared across the questionnaire.
       They are mostly used in Focus Group Discussions.
       Closed questions
       Each question has a pre-determined answer of choices/options for a respondent to choose from.
       Closed questions are preferred to open question;
        They are easy to code,
        They are easy to analyse.
       Their major limitation is that they restrict an interviewee’s responses.
      DATA ANALYSIS
    Data analysis is a process of inspecting, cleansing, transforming, and modeling data with
     the goal of discovering useful information, informing conclusions, and supporting
     decision-making
   The process of evaluating data using analytical and logical reasoning to examine each
     component of the data provided
DATA ENTRY
       This is the process transferring information collected from a data collection instrument
        (usually a questionnaire) into a computer.
   •    Several spreadsheets can be used including: MS Excel, SPSS, MS Access, CSPro, Stata
        etc.
   •    At your this level, SPSS or Excel are recommended.
   •    Between Excel and SPSS, SPSS has the advantage that it has capabilities to label the
        variables (including categorical codes) unlike Excel
   •     Most data entry spreadsheets/statistical packages interact with each other: one can enter
        data in SPSS or Excel and analyze the data in Stata
        TYPES OF ANALYSIS
        1. Univariate data –
        This type of data consists of only one variable. The analysis of univariate data is thus the
        simplest form of analysis since the information deals with only one quantity that changes.
        It does not deal with causes or relationships and the main purpose of the analysis is to
        describe the data and find patterns that exist within it. The example of a univariate data
        can be height.
   Suppose that the heights of seven students of a class is recorded (figure 1),there is only
    one variable that is height and it is not dealing with any cause or relationship. The
    description of patterns found in this type of data can be made by drawing conclusions
    using central tendency measures (mean, median and mode), dispersion or spread of data
    (range, minimum, maximum, quartiles, variance and standard deviation) and by using
    frequency distribution tables, histograms, pie charts, frequency polygon and bar charts.
   2. Bivariate data –
    This type of data involves two different variables. The analysis of this type of data deals
    with causes and relationships and the analysis is done to find out the relationship among
    the two variables.Example of bivariate data can be temperature and ice cream sales in
    summer season.
   Suppose the temperature and ice cream sales are the two variables of a bivariate
    data(figure 2). Here, the relationship is visible from the table that temperature and sales
    are directly proportional to each other and thus related because as the temperature
    increases, the sales also increase. Thus bivariate data analysis involves comparisons,
    relationships, causes and explanations. These variables are often plotted on X and Y axis
    on the graph for better understanding of data and one of these variables is independent
    while the other is dependent.
      3. Multivariate data –
       the examination of more than two variables simultaneously (e.g., the relationship between
       gender, race, and college graduation)
Data Preparation
The first stage of analyzing data is data preparation, where the aim is to convert raw data into
something meaningful and readable. It includes four steps:
The purpose of data validation is to find out, as far as possible, whether the data collection was
done as per the pre-set standards and without any bias. It is a four-step process, which includes…
To do this, researchers would need to pick a random sample of completed surveys and validate
the collected data. (Note that this can be time-consuming for surveys with lots of responses.) For
example, imagine a survey with 200 respondents split into 2 cities. The researcher can pick a
sample of 20 random respondents from each city. After this, the researcher can reach out to them
through email or phone and check their responses to a certain set of questions.
Typically, large data sets include errors. For example, respondents may fill fields incorrectly or
skip them accidentally.
To make sure that there are no such errors, the researcher should conduct basic data
checks, check for outliers, and edit the raw research data to identify and clear out any data points
that may hamper the accuracy of the results.
For example, an error could be fields that were left empty by respondents. While editing the data,
it is important to make sure to remove or fill all the empty fields.
For example, if a researcher has interviewed 1,000 people and now wants to find the average age
of the respondents, the researcher will create age buckets and categorize the age of each of the
respondent as per these codes. (For example, respondents between 13-15 years old would have
their age coded as 0, 16-18 as 1, 18-20 as 2, etc.)
Then during analysis, the researcher can deal with simplified age brackets, rather than a massive
range of individual ages.
Data analysis for quantitative studies involves critical analysis and interpretation of figures and
numbers, and attempts to find rationale behind the emergence of main findings
The two most commonly used quantitative data analysis methods are descriptive statistics and
inferential statistics.
DESCRIPTIVE STATISTICS
Typically descriptive statistics (also known as descriptive analysis) is the first level of analysis. It
helps researchers summarize the data and find patterns. A few commonly used descriptive
statistics are:.
                       Frequency Distribution
    •   Classification of data according to some characteristics, e.g. sex (gender), height, weight,
        income, etc.
    •   Elements of a frequency distribution:
            • Variable (what is being measured, e.g. weight, occupation)
            • Frequency – the count (number) of elements in each class (category).
    •   See example below:
                       Frequency Distribution Example
                       Student Distribution By Sex
                         Measures of Central Tendency
         A measure of central tendency is a single value that attempts to describe a set of data by
         identifying the central position within that set of data.
         As such, measures of central tendency are sometimes called measures of central location.
         They are also classed as summary statistics.
         The mean (often called the average) is most likely the measure of central tendency that
         you are most familiar with, but there are others, such as the median and the mode.
     •   These aim at coming with a single value in a data set that helps describe the
         characteristics of the entire data set.
     •   Measures of central tendency also facilitate comparison between or among data sets, e.g.
         annual mean pass rates, average yield for different areas, years, etc.
     •   The following are some of the common measures of central tendency:
            A. Arithmetic mean
     •   The mean is equal to the sum of all the values in the data set divided by the number of
         values in the data set. So, if we have n values in a data set and they have values x 1, x2, ...,
         xn, the sample mean, usually denoted by (pronounced x bar), is:
         This formula is usually written in a slightly different manner using the Greek capitol
         letter,   , pronounced "sigma", which means "sum of...":
65 55 89 56 35 14 56 55 87 45 92
     •   Where x represents the values of observations and n is the total number of observations.
                 B. Median
     •   The median is the middle score for a set of data that has been arranged in order of
         magnitude. The median is less affected by outliers and skewed data. In order to calculate
         the median, suppose we have the data below:
65       55        89   56     35      14     56      55      87     45      92
     •   We first need to rearrange that data into order of magnitude (smallest first):
14 35 45 55 55 56 56 65 87 89 92
     •   Our median mark is the middle mark - in this case, 56 (highlighted in bold). It is the
         middle mark because there are 5 scores before it and 5 scores after it. This works fine
         when you have an odd number of scores, but what happens when you have an even
         number of scores? What if you had only 10 scores? Well, you simply have to take the
         middle two scores and average the result. So, if we look at the example below:
65 55 89 56 35 14 56 55 87 45
14 35 45 55 55 56 56 65 87 89
     •   Only now we have to take the 5th and 6th score in our data set and average them to get a
         median of 55.5.
C. Mode
The mode is the most frequent score in our data set. On a histogram it represents the highest bar
in a bar chart or histogram. You can, therefore, sometimes consider the mode as being the most
popular option. An example of a mode is presented below:
65 55 89 56 35 14 56 55 87 45
         Measures of Dispersion
               A. Range
     •   The difference between the largest and smallest item in a data set.
     •   Example: Find the range the following wages 1120, 1150, 1400, 1080, 1200, 1100, and
         1160.
            • Range = 1400-1080 = 320
              B. Standard deviation
        Measures how far observations move from their mean.
        A low standard deviation indicates that the data values tend to be closer to the mean.
    A high standard deviation indicates that the data points are spread over a wider range of
     values.
    Standard deviation is given by the following formula:
Sx 
                ( x  x ) 2
                       n 1
    Variance
•    This measures the variation of the data.
•    It is given by the square of the standard deviation, i.e. just remove the square root sign in
     the Std. Dev. formula:
                             S x2 
                                           ( x  x ) 2
                                                  n 1
•    Calculate the standard deviation and the range for the following wages 1120, 1150, 1400,
     1080, 1200, 1100, and 1160.
                  INFERENTIAL STATISTICS
Measures of association
                    A. Correlation
       It measures the relationship between two variables.
       Note that correlation measures the association and not the cause, i.e. correlation
       shows that x is related to y and not x causes y.
                   B Covariance
       Covariance measures the same association just like correlation
Hypothesis Testing
•   This is important when we want to compare two groups, e.g. adopters vs non-adopters,
    subsidy beneficiaries vs non-beneficiaries, males vs females, etc.
•   Two means may appear to be different but there is need to see whether that difference is
    significantly different.
              B. Association between two categorical variables (Chi-square test)
   •   A chi-square test is used to test whether there is a significant difference between the
       expected frequencies and observed frequencies in one or more categories.
   •   In other words, chi-square tests whether there is a significant relationship between the
       categorical variables, e.g. gender and belonging to a club, adoption, etc.
Qualitative data analysis works a little differently from quantitative data, primarily because
qualitative data is made up of words, observations, images, and even symbols. Deriving absolute
meaning from such data is nearly impossible; hence, it is mostly used for exploratory research.
While in quantitative research there is a clear distinction between the data preparation and data
analysis stage, analysis for qualitative research often begins as soon as the data is available.
Analysis and preparation happen in parallel and include the following steps:
   1. Getting familiar with the data: Since most qualitative data is just words, the researcher
      should start by reading the data several times to get familiar with it and start looking for
      basic observations or patterns. This also includes transcribing the data.
   2. Revisiting research objectives: Here, the researcher revisits the research objective and
      identifies the questions that can be answered through the collected data.
   3. Developing a framework: Also known as coding or indexing, here the researcher
      identifies broad ideas, concepts, behaviors, or phrases and assigns codes to them. For
      example, coding age, gender, socio-economic status, and even concepts such as the
      positive or negative response to a question. Coding is helpful in structuring and labeling
      the data.
   4. Identifying patterns and connections: Once the data is coded, the research can start
      identifying themes, looking for the most common responses to questions, identifying data
      or patterns that can answer research questions, and finding areas that can be explored
      further.
Several methods are available to analyze qualitative data. The most commonly used data analysis
methods are:
      Content analysis: This is one of the most common methods to analyze qualitative data. It
       is used to analyze documented information in the form of texts, media, or even physical
       items. When to use this method depends on the research questions. Content analysis is
       usually used to analyze responses from interviewees.
      Narrative analysis: This method is used to analyze content from various sources, such as
       interviews of respondents, observations from the field, or surveys. It focuses on using the
       stories and experiences shared by people to answer the research questions.
      Discourse analysis: Like narrative analysis, discourse analysis is used to analyze
       interactions with people. However, it focuses on analyzing the social context in which the
       communication between the researcher and the respondent occurred. Discourse analysis
       also looks at the respondent’s day-to-day environment and uses that information during
       analysis.
      Grounded theory: This refers to using qualitative data to explain why a certain
       phenomenon happened. It does this by studying a variety of similar cases in different
       settings and using the data to derive causal explanations. Researchers may alter the
       explanations or create new ones as they study more cases until they arrive at an
       explanation that fits all cases..
Weak analysis produces inaccurate results that not only hamper the authenticity of the research
but also make the findings unusable.
It’s imperative to choose your data analysis methods carefully to ensure that your findings are
insightful and actionable.
   B. Univariate analysis
          a. the examination of the distribution of cases on only one variable at a time (e.g.,
             college graduation)
   C. Bivariate analysis
          a. the examination of two variables simultaneously (e.g., the relation between
             gender and college graduation)
   D. Multivariate analysis
          a. the examination of more than two variables simultaneously (e.g., the relationship
             between gender, race, and college graduation)
INDICATORS AND VARIABLES
               A variable is a character or quality that is observed, measured, and recorded in a
                data file (generally, in a single column).
               Examples of commonly variables used in social-sciences.
               Socio-demographic variables: age, sex, religion, level of education, marital status
                etc.
               Psychological variables: level of motivation, IQ, level of anxiety etc.
               Economic variables: Income, work status, household assets,
               Variables that refer to individual units other than individual:
               Birth rate, fertility rate, unemployment rate etc.
TWO BASIC TYPES OF VARIABLES
        Quantitative variables – These are characters or features that are best expressed by
        numerical values eg., age, number of hh members, income, etc.
        Qualitative variables – These are characters or qualities that are not numerical, eg.,
        mother tongue or country of origin.
• 1.0 Introduction
• 3.0 Methodology
• 6.0 Budget
• 7.0 References/Bibliography
• 8.0 Appendices/Annexes
   •   EXECUTIVE SUMMARY - Brief statement of the major points from each of the other sections.
       The objective is to allow the reader develop a basic understanding of the proposal without
       reading the entire proposal (1 page).
   •   TABLE OF CONTENTS - Contains page numbers of chapter titles and their sub-headings (1-2
       pages).
• LIST OF TABLES - Contains titles of tables and their page numbers (1-2 pages).
• LIST OF FIGURES - Contains titles of figures and their page numbers (1 – 2 pages).
• Problem Definition – Statement of the Problem and the factors that influence it (2-3 pages)
   •   Objectives – what the research intends to achieve. Objectives should be SMART as much as
       possible (0.5 page)
– Specific Objectives
– Historical and descriptive: what research has been done on the topic
           –   Methodological: data collection and analytical methods used by others explaining why
               they are suitable for your study; and
METHODS (3 – 5 pages)
   •   Data collection methods – e.g., key informant interviews, focus group discussions,
       exploratory/informal survey, formal survey
   •   Sampling method - simple random sampling, systematic sampling, stratified sampling, cluster
       sampling, multi-stage sampling
• Field Supervision
• Analytical Techniques
   •   Anticipated key accomplishments and how they will be disseminated (1-2 pages)
5.0       WORK PLAN AND LABOUR INPUT (2 – 3 pages)
      •    Work plan - indicates when main activities will be carried out, normally presented in a chart
           form (1 – 2 pages).
Labour input - gives person days and cumulative person days required for the main activities (1 page
BUDGET (1 – 3 pages)
• Transport
• Computer time
• Fee/Honoraria
• Overhead (%)
      •    Present all literature cited in the text as well as other literature not cited in the text in the case
           of Bibliography;
• Present alphabetically.
APPENDICES
1. Title
    A research Proposal or Project must have a title.
    A title should be explicit and should contain key words that capture the intention of the
        researcher.
    Should avoid abbreviations.
2. Introduction
   Introduces the main theme of the research or project.
3. Literature Review
    Provides references on the technical and scientific basis of the proposed work.
       Helps the researcher to focus the research more on certain aspects that are found in the already
        published studies (especially those that are not very clear or those that require further
        research).
5. Introduction
   Introduces the main theme of the research or project.
6. Literature Review
    Provides references on the technical and scientific basis of the proposed work.
    Helps the researcher to focus the research more on certain aspects that are found in the already
       published studies (especially those that are not very clear or those that require further
       research).
7. Justification
    Also known as rationale to the study or project.
    Explains why the project is important and how it contributes to certain goals.
8. Objectives
    Are operationalized goals that specify the central results expected.
    Identifies the central theme or concern of study or research or project.
    Should be well defined.
    Should be matched to indicators (For monitoring and evaluation purposes)
9. Hypothesis
   Describe statements to be proved or disapproved.
10. Methodology
    Includes the methods, tools and approaches one is going to use in order to carry out ones work e.g.
    sampling procedure, data collection tools and methods, statistical analysis of the data or other
    logistical aspects of the proposed work.
11. Results
    Are outputs, effects and impacts resulting from a project or research.
12. Discussions
    Involves discussion and interpretation of results in terms of objectives and hypotheses.
    One also reports on key issues.
13. Conclusion
    Summarises the findings in relation to the objectives.
14. Recommendations
    State the limitations and main activities that ought to be done as a follow-up to the study or project
    or research.
15. References/Bibliography
    Provides literature cited in alphabetical order of first authors.
Mwanza J. (2005), Research Methods Monitoring and Evaluation: A Reference Manual for students
16. Justification
     Also known as rationale to the study or project.
     Explains why the project is important and how it contributes to certain goals.
17. Objectives
     Are operationalized goals that specify the central results expected.
     Identifies the central theme or concern of study or research or project.
     Should be well defined.
     Should be matched to indicators (For monitoring and evaluation purposes)
18. Hypothesis
    Describe statements to be proved or disapproved.
19. Methodology
    Includes the methods, tools and approaches one is going to use in order to carry out ones work e.g.
    sampling procedure, data collection tools and methods, statistical analysis of the data or other
    logistical aspects of the proposed work.
20. Results
    Are outputs, effects and impacts resulting from a project or research.
21. Discussions
    Involves discussion and interpretation of results in terms of objectives and hypotheses.
22. Conclusion
    Summarises the findings in relation to the objectives.
23. Recommendations
    State the limitations and main activities that ought to be done as a follow-up to the study or project
    or research.
24. References/Bibliography
    Provides literature cited in alphabetical order of first authors.
Mwanza J. (2005), Research Methods Monitoring and Evaluation: A Reference Manual for students
                                          Identification
             Evaluation
                                                           Preparation
                Implementation
                (Monitoring)                               Appraisal
1.   Identification (Conceptualization)
     Finding potentially fundable projects. Sources of information are
                     Local leaders
                     Politicians
                     Existing projects
                     Sector studies
                     Communities (ordinary people)
2.   Preparation
     Stage where objectives, activities and other requirements are outlined which would lead
     to the realization of the goal(s)
              Identify the goal
              Draw the objectives and their corresponding activities
3.   Appraisal
        Involves evaluation of the alternatives options/actions (called ex-ante analysis)
        Involves critical review by an independent team
        Approval or rejection is the outcome
4.   Implementation
     Has stages as follows:
        Investment period – when all inputs are assembled/purchased
        Development period – when inputs begin showing results
        Monitoring – to ensure that we are on the right track
           Completion – when all activities have been completed
MONITORING
Monitoring is the process of routinely gathering information on all aspects of a project and
utilizing this information in project management and decision making.
A monitoring plan should serve to provide essential information and act as a basic and vital
management tool. To fulfil this role, a monitoring plan must include systems for:
     Acquiring information on key project areas
     Summarizing information
     Analyzing information and
     Utilizing this information to make decisions
In addition to the above, information obtained from monitoring can contribute to:
     Demonstrating innovative and effective strategies
     Generating financial and other support and
     Marketing the institution
What do we Monitor?
(a)      Progress and effectiveness of project activities - training, meetings, outreach activities
(b)      Use of resources – finances, commodities, equipment, staff etc.
(c)      Timing – schedules, deadlines, achievement of work plans
(d)      Results – achieving objectives, coverage, impact on beneficiaries
Why do we Monitor?
Information from monitoring enables project managers to know
(a)       How activities are progressing, whether on track or not
(b)       Whether strategies used are effective
(c)       Establish the need for materials and/or technical support
(d)       To feed into evaluation
Reporting
This is very important and project managers need to regularly provide reports to stakeholders.
The aims of these reports are:
   To inform stakeholders of progress and constraints encountered so that remedial or corrective
    action can be taken
   To provide a formal documented record of achievement
   Facilitate future reviews and evaluation
   Promote transparency and accountability
EVALUATION
Definition: Evaluation involves a systematic, objective analysis of a project’s relevance,
efficiency, effectiveness, impact in relation to objectives, sustainability.
Efficiency:
Focuses on how well various activities transformed available resources into intended
outputs/results in terms of quantity, quality and timeliness. Efficiency therefore focuses on the
following:
   whether all planned benefits have delivered and received as perceived by ALL stakeholders
    and not implementers only
   whether there were any patterns of behavioural change among the target group and hoe these
    have produced planned improvements
   if assumptions or risks turned out to be true or not
   how unplanned outcomes or experiences may have affected results
Impact:
the extent to which the benefits received by the target group had a wider overall effect on larger
numbers of people in the area, sector etc. An important aspect of impact is sustainability.
Sustainability:
Relates to whether the positive outcomes of the project are likely to continue after external
assistance ands. Analysis of this will focus on:
   ownership of objectives and achievements
   whether products or services provided were affordable for the intended beneficiaries
PARTICIPATORY MONITORING AND EVALUATION
Participatory monitoring and evaluation is when beneficiaries are also involved in the processes
besides staff and external people. Beneficiaries and other stakeholders are involved in the design,
implementation and analysis of the results.
Participatory monitoring and evaluation must begin rights at the design stage of the project.
Beneficiaries will be part of the planning process and will therefore among other things, be
familiar with the monitoring tool as well as the bench marks set. Beneficiaries must have copies
of the project write up to enable them follow up and track progress.
Regular formal meetings must be scheduled where all concerned can compare notes on progress
of the project in case of monitoring and progress of the process in case of evaluation. If it is a
construction project, it may be advisable to have what are called “Site Meetings”. These are
meetings that take place right at the site of construction and targets agreed during the previous
meeting are reviewed. Contractors involved must be part of such meetings.
References for        Work plans, performance targets,   Programme objectives and strategy,
comparison            reference indicators               performance targets, widely
                                                         accepted benchmarks
Information sources   Routine systems, field             Same, plus specific surveys, studies
                      observation, progress reports,
                      rapid assessments