Unit Ii
Unit Ii
1. MEASUREMENT
1. The Nature of Measurement
• Measurement in research consists of assigning numbers to empirical events,
objects or properties, or activities in compliance with a set of rules.
• This definition implies that measurement is a three-part process:
– Selecting observable empirical events
– Developing a set of mapping rules: a scheme for assigning numbers or
symbols to represent aspects of the event being measured
– Applying the mapping rule(s) to each observation of that event
Characteristics of Measurement
What Is Measured?
• Variables being studied in research may be classified as objects or as
properties.
• Objects include the concepts of ordinary experience, such as tangible items like
furniture, laundry detergent, people, or automobiles.
• Objects also include things that are not as concrete, such as genes, attitudes,
and peer-group pressures.
• Properties are the characteristics of the object. A person’s physical properties
may be stated in terms of weight, height, and posture, among others.
– Psychological properties include attitudes and intelligence.
– Social properties include leadership ability, class affiliation, and status.
Page 1 of 31
UNIT II DATA COLLECTION AND SOURCES
1. Measurement Scales
In measuring, one devises some mapping rule and then translates the
observation of property indicants using this rule.
Each one has its own set of underlying assumptions about how the
numerical symbols correspond to real-world observations.
Mapping rules have four assumptions:
1. Numbers are used to classify, group, or sort responses. No order
exists.
2. Numbers are ordered. One number is greater than, less than, or
equal to another number.
3. Differences between numbers are ordered.
4. The difference between any pair of numbers is greater than, less
than, or equal to the difference between any other pair of numbers.
5. The number series has a unique origin indicated by the number
zero. This is an absolute and meaningful zero point.
Measurement Scales
Scales of Measurement
Page 2 of 31
UNIT II DATA COLLECTION AND SOURCES
Ordinal scale
• Set of categories that are ordered from least to most
• Don’t know numerical distance from each category to the next
• Example
– Military rank
– Letter Grade in Class
Interval scale
• Scale with values, and there is the same numerical distance between each value
• This scale has an aribitrary zero point(no true meaningful zero point)
• Examples
– How appealing is this cereal box to children?
Not at all Very
-3 -2 -1 0 1 2 3
– Current temperature
– Many behavioral science questionnaires
Ratio Scale
• Scale with scores where there is the same numerical distance between each
score
• The scale has a true, meaning full zero point that anchors the scale
• Only scale that allows you to make ratio comparisons, such as “Maribel’s income
is 35 % more than Susan’s?
• Examples
– Weight of a packages of candy
– Amount of money in your checking account
– Number of questions correct on a quiz
3. Sources of Measurement Differences
• The ideal study should be designed and controlled for precise and unambiguous
measurement of the variables.
• Since complete control is unattainable, error does occur. Much error is
systematic (results from a bias), while the remainder is random (occurs
erratically).
• One authority has pointed out several sources from which measured differences
can come
Error Sources
• The Respondent
– Opinion differences that affect measurement come from relatively stable
characteristics of the respondent.
– Typical of these are employee status, ethnic group membership, social
class, and nearness to manufacturing facilities.
Page 3 of 31
UNIT II DATA COLLECTION AND SOURCES
Page 4 of 31
UNIT II DATA COLLECTION AND SOURCES
Content Validity
– The content validity of a measuring instrument is the extent to which it
provides adequate coverage of the investigative questions guiding the
study
Criterion-Related Validity
• Criterion-related validity reflects the success of measures used for prediction or
estimation
• The researcher must ensure that the validity criterion used is itself “valid.”
• Any criterion measure must be judged in terms of four qualities:
– relevance,
– freedom from bias,
– reliability,
– Availability.
Construct Validity
• Construct validity is used to determine how well a test measures
• Construct validity is usually verified by comparing the test to other tests that
measure similar qualities to see how highly correlated the two measures
Reliability
• Reliability is a necessary contributor to validity but is not a sufficient condition
for validity.
• Reliability is concerned with estimates of the degree to which a measurement is
free of random or unstable error
Stability
• A measure is said to possess stability if you can secure consistent results with
repeated measurements of the same person with the same instrument
• Types
• Test–Retest
• Parallel Forms
• Split-Half
• Some of the difficulties that can occur in the test–retest methodology and cause
a downward bias in stability include:
– Time delay between measurements
– Insufficient time between measurements
– Respondent’s discernment of a study’s disguised purpose
– Topic sensitivity
Page 5 of 31
UNIT II DATA COLLECTION AND SOURCES
Equivalence
• Equivalence is concerned with variations at one point in time among observers
and samples of items.
Internal Consistency
• A third approach to reliability uses only one administration of an instrument or
test to assess the internal consistency or homogeneity among the items
Practicality
• Practicality has been defined as economy, convenience, and interpretability .
Economy
• Some trade-off usually occurs between the ideal research project and the
budget. Data are not free, and instrument length is one area where economic
pressures dominate
Convenience
• A measuring device passes the convenience test if it is easy to administer. A
questionnaire or a measurement scale with a set of detailed but clear
instructions.
Interpretability
• This aspect of practicality is relevant when persons other than the test
designers must interpret the results.
• A statement of the functions the test was designed to measure and the
procedures by which it was developed.
– Detailed instructions for administration.
– Scoring keys and instructions.
– Norms for appropriate reference groups.
– Evidence about reliability.
– Evidence regarding the inter correlations of sub scores.
– Evidence regarding the relationship of the test to other measures. Guides
for test use
2. MEASUREMENT SCALES
• Scales in business research are generally constructed to measure behavior,
knowledge, and attitudes.
• Attitude scales are among the most difficult to construct
1. The Nature of Attitudes
• An attitude is a learned, stable predisposition to respond to oneself, other
persons, objects, or issues in a consistently favorable or unfavorable way.
• cognitively based attitude
• Affectively based attitude
• behaviorally based attitudes
Page 6 of 31
UNIT II DATA COLLECTION AND SOURCES
Page 7 of 31
UNIT II DATA COLLECTION AND SOURCES
Attitude Scaling
• Attitude scales provide a quantitative measurement of attitudes, opinions or
values by summarizing numerical scores given by researchers to people's
responses to sets of statements exploring dimensions of an underlying theme.
2. Selecting a Measurement Scale
• Selecting and constructing a measurement scale requires the consideration of
several factors that influence the reliability, validity, and practicality of the
scale:
– Research objectives.
– Response types.
– Data properties.
– Number of dimensions.
– Balanced or unbalanced.
– Forced or unforced choices
– Number of scale points.
– Rater errors.
Research Objectives
• Researchers’ objectives are too numerous to list such as attitude change,
purchase intention
• Two general types of scaling objectives:
– To measure characteristics of the participants who participate in the
study.
– To use participants as judges of the objects or indicants presented to
them.
Response Types
• Measurement scales fall into one of four general types: rating, ranking,
categorization, and sorting.
• A rating scale is used when participants score an object or indicant without
making a direct comparison to another object or attitude
• Ranking scales constrain the study participant to making comparisons and
determining order among two or more properties or objects.
• Categorization asks participants to put themselves or property indicants in
groups or categories
• Sorting requires that participants sort cards into piles using criteria established
by the researcher
Data Properties
• Decisions about the choice of measurement scales are often made with regard
to the data properties generated by each scale.
• We classify scales in increasing order of power; scales are nominal, ordinal,
interval, or ratio.
Page 8 of 31
UNIT II DATA COLLECTION AND SOURCES
Number of Dimensions
Measurement scales are either unidimensional or multidimensional.
o An unidimensional scale, one seeks to measure only one attribute of the
participant or object
o A multidimensional scale recognizes that an object might be better
described with several dimensions than on a unidimensional continuum
Balanced or Unbalanced
• A balanced rating scale has an equal number of categories above and below the
midpoint.
• Example
– very good—good—average—poor—very poor.”
• An unbalanced rating scale has an unequal number of favorable and
unfavorable response choices.
• Example
– “poor—fair—good— very good—excellent
Forced or Unforced Choices
• An unforced-choice rating scale provides participants with an opportunity to
express no opinion when they are unable to make a choice among the
alternatives offered.
• A forced-choice rating scale requires that participants select one of the offered
alternatives
• Example
• “no opinion,” “undecided,” “don’t know,” “uncertain,” or “neutral”
Number of Scale Points
• A scale should be appropriate for its purpose. For a scale to be useful, it should
match the stimulus presented and extract information proportionate to the
complexity of the attitude object, concept, or construct
Rater Errors
• Some raters are reluctant to give extreme judgments, and this fact accounts for
the error of central tendency.
• Participants may also be “easy raters” or “hard raters,” making what is called an
error of leniency.
3. Rating Scales
• We use rating scales to judge properties of objects without reference to other
similar objects.
• These ratings may be in such forms as “like—dislike,” “approve— indifferent—
disapprove,” or other classifications using even more categories
Simple Attitude Scales
• The simple category scale offers two mutually exclusive response choices.
Page 9 of 31
UNIT-II DATA COLLECTION AND SOURCES
• When there are multiple options for the rater but only one answer is sought, the
multiple-choice, single-response scale is appropriate
• A variation, the multiple-choice, multiple-response scale (also called a
checklist ), allows the rater to select one or several alternatives
Likert Scales
• The Likert scale, developed by Rensis Likert the most frequently used variation
of the summated rating scale.
• Summated rating scales consist of statements that express either a favorable
or an unfavorable attitude toward the object of interest.
AVCCE/CA/TVN/SEM/I Page 10 of 31
UNIT-II DATA COLLECTION AND SOURCES
AVCCE/CA/TVN/SEM/I Page 11 of 31
UNIT-II DATA COLLECTION AND SOURCES
AVCCE/CA/TVN/SEM/I Page 12 of 31
UNIT-II DATA COLLECTION AND SOURCES
4. Ranking Scales
• In ranking scales, the participant directly compares two or more objects and
makes choices among them.
• Frequently, the participant is asked to select one as the “best” or the “most
preferred.”
• When there are only two choices, this approach is satisfactory.
• The forced ranking scale, some attributes that are ranked relative to each
other.
•The comparative scale is ideal for such comparisons if the participants are
familiar with the standard
5. Sorting
• Q-sorts require sorting of a deck of cards into piles that represent points
along a continuum.
• The participant—or judge—groups the cards based on his or her response
to the concept written on the card.
• Researchers using Q-sort resolve three special problems: item selection,
structured or unstructured choices in sorting, and data analysis.
• The basic Q-sort procedure involves the selection of a set of verbal
statements, phrases, single words, or photos related to the concept being
studied.
6. Cumulative Scales
• Total scores on cumulative scales have the same meaning.
• Given a person’s total score, it is possible to estimate which items were
answered positively and negatively.
• A pioneering scale of this type was the scalogram.
• Scalogram analysis is a procedure for determining whether a set of items
forms a unidimensional scale
• We are surveying opinions regarding a new style of running shoe. We have
developed a preference scale of four items:
1. The Airsole is good-looking.
2. I will insist on Airsole next time because it is great-looking.
3. The appearance of Airsole is acceptable to me.
4. I prefer the Airsole style to other styles.
AVCCE/CA/TVN/SEM/I Page 13 of 31
UNIT-II DATA COLLECTION AND SOURCES
AVCCE/CA/TVN/SEM/I Page 14 of 31
UNIT-II DATA COLLECTION AND SOURCES
AVCCE/CA/TVN/SEM/I Page 15 of 31
UNIT-II DATA COLLECTION AND SOURCES
AVCCE/CA/TVN/SEM/I Page 16 of 31
UNIT-II DATA COLLECTION AND SOURCES
AVCCE/CA/TVN/SEM/I Page 17 of 31
UNIT-II DATA COLLECTION AND SOURCES
AVCCE/CA/TVN/SEM/I Page 18 of 31
UNIT-II DATA COLLECTION AND SOURCES
AVCCE/CA/TVN/SEM/I Page 19 of 31
UNIT-II DATA COLLECTION AND SOURCES
AVCCE/CA/TVN/SEM/I Page 20 of 31
UNIT-II DATA COLLECTION AND SOURCES
AVCCE/CA/TVN/SEM/I Page 21 of 31
UNIT-II DATA COLLECTION AND SOURCES
3. Probability Sampling
Simple Random Sampling
• The unrestricted, simple random sample is the purest form of
probability sampling
Cluster Sampling
• Population is divided into internally subgroups. Some are randomly selected for
further study.
Area Sampling
• Much research involves populations that can be identifi ed with some
geographic area. When this occurs, it is possible to use area sampling
Double Sampling
It is Process includes collecting data from a sample using a previously defined
technique. Based on the information found, a subsample is selected for further
study.
4. Nonprobability Sampling
• The probability of selecting population elements is unknown.
• There are a variety of ways to choose persons or cases to include in the sample
Practical Considerations
• We may use nonprobability sampling procedures because they satisfactorily
meet the sampling objectives.
AVCCE/CA/TVN/SEM/I Page 22 of 31
UNIT-II DATA COLLECTION AND SOURCES
Methods
– Convenience-Nonprobability samples that are unrestricted are called
convenience samples.
– Judgment sampling occurs when a researcher selects sample members
to conform to some criterion.
– Quota sampling -The logic behind quota sampling is that certain
relevant characteristics describe the dimensions of the population
– Snowball-It is also especially appropriate for some qualitative studies.
– In the initial stage of snowball sampling, individuals are discovered and
may or may not be selected through probability methods.
5. DATA – PREPARING
1. Introduction
• Data preparation includes editing, coding, and data entry and is the activity that
ensures the accuracy of the data and their conversion from raw form to reduced
and classified forms that are more appropriate for analysis
2. Editing
• The customary first step in analysis is to edit the raw data. Editing detects
errors and omissions, corrects them when possible, and certifies that maximum
data quality standards are achieved.
– Accurate.
– Consistent with the intent of the question and other information in the
survey.
– Uniformly entered
–
AVCCE/CA/TVN/SEM/I Page 23 of 31
UNIT-II DATA COLLECTION AND SOURCES
Field Editing
• field editing review is a responsibility of the fi eld supervisor. It, too, should be
done soon after the data have been gathered. During the stress of data collection
in a personal interview and paper-and-pencil recording in an observation, the
researcher often uses ad hoc abbreviations and special symbols.
Central Editing
• At this point, the data should get a thorough editing. For a small study, the use
of a single editor produces maximum consistency
3. Coding
• Coding involves assigning numbers or other symbols to answers so that the
responses can be grouped into a limited number of categories. In coding,
categories are the partitions of a data set of a given variable (e.g., if the variable
is gender, the partitions are male and female ).
• Categorization is the process of using rules to partition a body of data. Both
closed- and open-response questions must be coded.
Codebook Construction
• A codebook, or coding scheme, contains each variable in the study and specifies
the application of coding rules to the variable
AVCCE/CA/TVN/SEM/I Page 24 of 31
UNIT-II DATA COLLECTION AND SOURCES
Coding Rules
• Four rules guide the pre- and post coding and categorization of a data set.
– Appropriate to the research problem and purpose.
– Exhaustive.
– Mutually exclusive.
– Derived from one classification dimension
Using Content Analysis for Open Questions
• Increasingly text-based responses to open-ended measurement questions are
analyzed with content analysis software. Content analysis measures the
semantic content or the what aspect of a message
Types of Content
• Syntactical units
• Referential units
• Propositional units
• Thematic units
Missing Data
• Missing data are information from a participant or case that is not available for
one or more variables of interest. In survey studies, missing data typically occur
when participants accidentally skip, refuse to answer, or do not know the
answer to an item on the questionnaire
AVCCE/CA/TVN/SEM/I Page 25 of 31
UNIT-II DATA COLLECTION AND SOURCES
AVCCE/CA/TVN/SEM/I Page 26 of 31
UNIT-II DATA COLLECTION AND SOURCES
Several useful techniques for displaying data are not new to EDA.
They are essential to any examination of the data. For example, a frequency
table is a simple device for arraying data
AVCCE/CA/TVN/SEM/I Page 27 of 31
UNIT-II DATA COLLECTION AND SOURCES
• The values and percentages are more readily understood in this graphic format,
and visualization of the media placements and their relative sizes is improved
Histograms
• Histograms are used when it is possible to group the variable’s values into
intervals.
Stem-and-Leaf Displays
• The stem-and-leaf display is a technique that is closely related to the histogram.
It shares some of the histogram’s features but offers several unique advantages
• Example
– Each line or row in this display is referred to as a stem, and each piece of
information on the stem is called a leaf. The first line or row is
5|455666788889
• The meaning attached to this line or row is that there are 12 items in the data
set whose first digit is five:
– 54, 55, 55, 56, 56, 56, 57, 58, 58, 58, 58, and 59.
AVCCE/CA/TVN/SEM/I Page 28 of 31
UNIT-II DATA COLLECTION AND SOURCES
Pareto Diagrams
• The Pareto diagram is a bar chart whose percentages sum to 100 percent. The
data are derived from a multiple-choice, single-response scale; a multiple-
choice, multiple-response scale; or frequency counts of words (or themes) from
content analysis
Mapping
• Increasingly, participant data are being attached to their geographic dimension
as Geographic Information System (GIS) software and coordinate measuring
devices have become more affordable and easier to use. Essentially a GIS works
by linking data sets to each other with at least one common data field (e.g., a
household’s street address).
2. Cross-Tabulation
• Cross-tabulation is a technique for comparing data from two or more
categorical variables such as gender and selection by one’s company for an
overseas assignment. Cross-tabulation is used with demographic variables and
the study’s target variables
AVCCE/CA/TVN/SEM/I Page 29 of 31
UNIT-II DATA COLLECTION AND SOURCES
• The technique uses tables having rows and columns that correspond to the
levels or code values of each variable’s categories.
• This table has two rows for gender and two columns for assignment selection.
The combination of the variables with their values produces four cells. Each cell
contains a count of the cases of the joint classification and also the row, column,
and total percentages.
• Percentages are used by virtually everyone dealing with number. The following
guidelines, if used during analysis will help to prevent errors in reporting
– Averaging percentages
– Use of too large percentages
– large percentage
– Using too small a base
– Percentage decreases can never exceed 100 percent
AVCCE/CA/TVN/SEM/I Page 30 of 31
UNIT-II DATA COLLECTION AND SOURCES
*****
AVCCE/CA/TVN/SEM/I Page 31 of 31