Choice Modeling
Choice Modeling
CME 508
Choice Modeling
Kouros Mohammadian
Department of Civil, Environmental, and
Materials Engineering
University of Illinois at Chicago
1
Urban Transportation Modeling System
Population &
Employment Forecasts
Trip Generation
Trip Distribution
Transportation
Mode Split Network & Service
Attributes
Trip Assignment
Tij,auto
Link & O-D Flows,
Times, Costs, etc. I Mode Split J
Tij,transit 2
Mode Split Modeling
• Trip-End Models
• Trip-Interchange Models
• Explanatory Variables
• Modes
• Decision Structures
• Logit Models
3
Trip-End Models
4
Trip-End Models - ii
5
Trip-Interchange Models
6
Trip-Interchange Models - ii
7
Explanatory Variables
8
Explanatory Variables - ii
9
Modes Modeled
11
Shared Ride Mode
12
Transit Mode Representation
13
Mixed Modes
14
Mixed Mode Example: Commuter Rail and Buses
Access Sta. 1
Access Sta. 2
Origin
Zone
Egress Station
auto or local transit
access trip link local transit or walk
egress trip link
Rail line-haul
Access Sta. 4 trip link Dest’n
(chosen station Zone
Access Sta. 3
for this trip)
15
Alternative Modes
17
Mode Definition & Choice Structures
Access Access
Sta. 1 ... Sta. n
18
Alternative Choice Structures
Auto Auto Transit Subway, Commuter Rail, Commuter Rail Walk Bicycle
Drive Passenger Allway Auto Access Auto Access Tran. Access Allway
Access Access
Sta. 1 ... Sta. n
19
General procedure for model application
Insert Predicted
Proportions for Each
Mode in the Four
Step Sequence
20
Discrete Choice
22
Theory from microeconomics
• Brief theoretical description of principles and
theorems
• Emphasize practical aspects
• Note: Dan McFadden is Professor of Economics
and Nobel Laureate in Economics
http://emlab.berkeley.edu/users/mcfadden/
• A site that contains a very good bibliography on
Random Utility Models
23
Dependent Variable:
Discrete: Nominal scale
Independent Variables:
Discrete or Continuous
Example:
Commuters had a choice of three alternate
routes; a four-lane arterial (speed limit = 60
km/h, 2 lanes each direction), a two-lane
highway (speed limit = 60 km/h, 1 lane each
direction) and a limited access four-lane freeway
(speed limit = 90 km/h, 2 lanes each direction).
24
Example:
Pipes carrying various commodities (oil, gas,
water, etc.) experience failures. Can the type of
pipe be predicted as a function of commodity,
location, age of pipe, ambient temperature,
etc.?
Example:
Can accident type (rear-end, rollover, run off
road, etc.) be predicted by roadway functional
class, VMT, time of day, age of driver, speed
limit, etc.?
25
Basic Assumptions (1)
• Suppose a trip maker i faces J options (choices
or alternatives) with index j=1,2,3…J.
• Assume that each trip maker associates with
each choice j=1,2,...,J a function called
UTILITY representing the "convenience" of
choosing mode j.
• j=1,2,..., J is called the choice set. This is the
set from which a decision maker chooses an
option.
• Note: Let’s assume that choice and consideration sets are the same.
26
Basic Assumptions Example
– A person, i, needs to go to work from home
to the downtown area.
– Suppose this person has three possible modes
to choose from: Car (j=1), Bus (j=2), and her
Bike (j=3). Total number of options (J=3).
– One possible form of the person ’ s
convenience function (called utility) is:
• Generally assume:
Vij = b’Xij [5.1]
= b0 + b1Xij1 + b2Xij2 + … + bmXijm [5.2]
b = Vector of parameters (utility function weights)
Xijm = Vector of explanatory variables
29
Utility components
n Variables describing the individual --> this is an attempt to
represent "taste variation" from person to person. In our
example if young persons have systematically differing
preferences from the older individuals, then, age would be one
of the variables.
30
Key Assumption (maximum utility)
• Travelers (decision makers) formulate for each
option a utility and they calculate its value.
• Then, they choose the option with the most
advantageous utility (maximum utility).
• Example: U(car,bus,bike)=-0.5*cost-2*waiting time
• Cost by bus=$1, Waiting time=5 minutes
• Cost by car=$2.5, Waiting time=1 minute
• Cost by bike=$0.2, Waiting time=0 minutes
32
Incorporating uncertainty and
traveler/trip characteristics
• The example becomes: U(bus,car,bike)=-
0.5*cost-2*waiting time + SOMETHING ELSE
33
Utility elements
nUij = aj-0.5*costj-
2*waiting timej
+ bj * agei + eij
nUij = aj-0.5*costj-2*waiting
timej + bj * agei + eij
35
Utility elements
Cost is different for each mode j
nUij = aj-0.5*costj-
2*waiting timej + bj*agei +eij
nUij = aj-0.5*costj-2*waiting
timej + bj * agei + eij
nUij = aj-0.5*costj-2*waiting
timej + bj * agei + eij
Random
Systematic & measurable part
40
Numerical example
(trip from home to work/school)
nUicar = - 0.5*cost - 2*waiting time + eicar
nUibus = 5 - 0.5*cost - 2*waiting time +
0.25 * agei + eibus
nUibike = 12 - 0.5*cost - 2*waiting time -
0.3 * agei + eibike
41
Note: Different age coefficients - why?
Compare systematic part (V)
• Compute for each person the systematic
part of utility for each mode
• Plot all V (syst. utilities) for all persons
• Horizontal = age
• Vertical = V the systematic part of utility
of each mode
42
Modal Utilities
20
10
Utility Value
Vbus
0
Vbike
0 20 40 60 80 100
-10
-20
Age
43
Probability of Choice
• We need to convert utilities to an estimate
of the chance to choose a mode
• The specific equation to use depends on
the probability distribution of the random
component (e) in the utility function
(U=V+e)
• Ease of calculations should be considered
in selecting a probability function
44
Random Utility Theory - ii
exp(Vibike )
Pi (bike) = bus ,bike
å exp(V
j = car
ij )
47
Probability
1.2
Probability to choose a
0.8
Pcar
mode
0.6 Pbus
Pbike
0.4
0.2
0
0 20 40 60 80 100
Age of Traveler
48
Applications
• Modal split (type of mode)
• Route choice (link by link or entire path)
• Car ownership (type of car)
• Destination choice (shopping place)
• Activity types (type of activity)
• Residential unit (size and type of home)
49
Example: Binomial Logit
51
Binary Logit Results
Beta Value Std. Error t value
(Intercept) 0.00759 1 0.0076 0.3995 0.0190
FEMALE -1.14253 0 0.0000 0.1336 -8.5505
AGE25 -0.58257 1 -0.5826 0.3257 -1.7884
AIVTT -0.01582 22 -0.3480 0.0179 -0.8823
TIVTT 0.02410 45 1.0843 0.0117 2.0595
ACTOT -0.07608 4 -0.3043 0.0158 -4.8299
TOVTT 0.04229 5 0.2114 0.0258 1.6394
P(auto) 51.71%
P(transit) 48.29%
P(auto) 25.46%
P(transit) 74.54%
52
Example: Work Trip Mode Choice
53
Example: Work Trip Mode Choice - ii
• Variable definitions:
– Vm= utility for mode m (m = d, drive; p, passenger; t,
transit)
– COSTm= out-of-pocket travel cost ($), mode m
– IVTTm = in-vehicle travel time for mode m (min.)
– OVTTm= out-of-vehicle travel time for mode m (min.)
– NVEH = avg. no. of vehicles per household in home
zone
– TWY = 1 if emp. zone is located within the catchment
area of a Transit-way station outside the CBD;
= 0 otherwise
– REGION =1 if the home zone is located in the Outaouais;
= 0 otherwise
54
Example: Work Trip Mode Choice - iii
Pij =
åe
k
Vik
56
Practical issues
• Choice set - consideration set
• Variables to include in utility
• Measurement of mode attributes (e.g.,in-
vehicle-travel-time)
• Need survey data and mode by mode
attributes!
57
Individual & Travel Data Predict Exogenous
Explanatory & Policy
Variables
Choice
Model
Formulation
Estimate Aggregate
Apply
Disaggregate (TAZ) Travel
Prediction
Choice Prediction
Procedure
Model(s)
Insert in the
Four Step
Sequence
58
For the four step modal split
• We need aggregate TAZ proportions by
each mode (% of trips by car, % trips
by bus, % trips by bike)
• We have a disaggregate (individual)
model which tells us the likelihood
(chance) of a person to choose each
mode
• We need a procedure to go from
disaggregate predictions of chance to
aggregate predictions of proportions
59
Taking Average TAZ Characteristics
Does Not Work
• (Pa+Pb)/2 is not the same as
P([Va+Vb]/2) - a and b are value points
for V
• When the two are equated we have the
Naïve method of aggregation
• Bias depends on how close the probability
function is to a linear function
• Following is an example from Probability
to choose bus as an option
60
Pbus
0.8
P(V=12)=0.679
0.6 Pbus
0.4
0.2
P(V=2)=0.034
0 V=2 V=12
-5 0 5 10 15 20
Systematic Utility of Bus (Vbus)
61
What is the correct TAZ Proportion of
Choosing the Bus?
• (P(V=2)+P(V=12))/2
• or
• P((2+12)/2)=P(V=7)
62
Pbus
The correct value is: [P(V=2)+P(V=12)]/2=0.357
1.2
Proability of Choice
0.8
P(V=12)=0.679
0.6 Pbus
0.4
P(V=7)=0.223
0.2
P(V=2)=0.034
0
V=2 V=7 V=12
-5 0 5 10 15 20
Systematic Utility of Bus (Vbus)
63
Pbus
1.2
Proability of Choice
0.8
0.6 Pbus
[P(V=2)+P(V=12)]/2=0.357
0.4
P(V=7)=0.223
0.2 Bias
0
V=2 V=7 V=12
-5 0 5 10 15 20
Systematic Utility of Bus (Vbus)
64
Naïve Aggregation
• For each TAZ take the average value of
explanatory variables
• Compute average value for each utility
function for each mode
• Compute the corresponding probability
and use it as the TAZ proportion choosing
each mode
65
Market Segmenation
• Divide the residents in each TAZ into
relatively homogeneous segments
• Apply Naïve aggregation to each segment
and get proportions for each mode
• Compute the TAZ proportion either as
average segment-specific proportion or
weighted segment-specific proportion
66
Complete Enumeration
• Compute for each person and for each
mode the probability to choose a mode
• Compute the proportion for each mode as
an average of the individual probabilities
• Stochastic microsimulation is a method
derived from this - see Chapter 12 of
Goulias, 2003 (Transportation Systems
Planning: Methods and Application)
67
Example
(TAZ with four persons)
Age Vcar Vbus Vbike
Segment 1 45 7.500 5.750 -1.600
Segment 2 21 3.900 -0.250 5.600
Segment 2 20 3.750 -0.500 5.900
Segment 3 79 12.600 14.250 -11.800
Average 41.25 6.938 4.813 -0.475
Exp (U) 1030.192 123.039 0.622
Naïve Prob 0.893 0.107 0.001
Vicar = - 0.5*cost - 2*waiting time + 0.15 * agei
Vibus = 5 - 0.5*cost - 2*waiting time + 0.25 * agei
Vibike = 12 - 0.5*cost - 2*waiting time - 0.3 * agei
68
Compare values of the three
methods
Pcar Pbus Pbike
Average of Segments 0.372 0.329 0.298
Weigthed Average
of Segments 0.305 0.247 0.447
69
Forecasting
70
Forecasting Example
71
Forecasting Example - ii
• Define:
Tijm = Predicted trips from i to j by mode m
Tij = Total trips from i to j
wijk = Fraction of workers living in i, working in j who
belong in NVEH-DLIC category k
Pijkm = Logit probability of a worker of type k, living in i,
working in j, using mode m
Then:
Tijm = Tij *{Sk wijk* Pijkm} [7]
72
Theoretical issues
73
Independence of Irrelevant Alternatives (IIA)
• Note that for two modes in the choice set, say m and
n:
Pim eVim
= Vin = eVim -Vin [8]
Pin e
– i.e., relative probability of choosing mode m versus mode
n depends only on the utilities of m and n, independent of
what other alternatives are in the choice set
• Problems with this?
• Blue Bus - Red Bus
74
IIA - ii
75
Elasticity-based models
77
Arc elasticity
• Elastic demand
– When the absolute value of elasticity is greater than 1,
i.e., a percentage change in x is accompanied by a greater
than one percentage change in demand, the demand is
said to be elastic wrt x
• Inelastic demand
– When the absolute value of elasticity is less than 1, i.e., a
percentage change in x is accompanied by a less than one
percentage change in demand, the demand is said to be
inelastic wrt x
• Unit elasticity
– When the absolute value of eDx is exactly 1
79
Direct and Cross elasticities
80
Direct Elasticity Formulae
Direct Elasticity of the individual selecting alternative i with
respect to a change in some attributes of the alternative i.
%age_Change_in_P dt 8.5%
Arc _ Elasticity = = = -0.85
%age_Change_in_Cost dt - 10%
81
Direct Elasticity :Auto Choice & Trip Costs
Direct Elasticity and Change in Auto Mode Choice
%-d_Pautod Elasticity
100.0% 0.00
80.0% -0.10
Percentage Change in Auto mode-splits
Percentage in Auto Mode Choice
-0.20
60.0%
-0.30
40.0%
Elasticity Values
Elasticity Values
-0.40
20.0%
-0.50
0.0%
-0.60
-90%
-80%
-70%
-60%
-50%
-40%
-30%
-20%
-10%
10%
20%
30%
40%
50%
60%
70%
80%
90%
-100%
100%
0%
-20.0%
-0.70
-40.0%
-0.80
-60.0% -0.90
-80.0% -1.00
Percentage Change in Auto Trip Cost
82
Cross Elasticity Formulae
Cross Elasticity of the individual selecting alternative i with respect
to a change in some attributes of the alternative j.
%age_Change_in_P dt - 2.6%
Cross _ Elasticity = = = 0.26
%age_Change_in_Cost tt - 10%
83
Cross Elasticities: Auto Choice & Transit Fare
%-d_Pautod
Percentage Change in Auto Driver Mode Choice
30.0%
20.0%
10.0%
0.0%
-10.0%
-20.0%
-30.0%
-100% -50% 0% 50% 100%
84
Additional sources
• Ortuzar Willumsen - Chapters 8 and 9
• http://www.bts.gov/ntl/DOCS/SICM.html
(Spear’s report on how to apply models)
• http://www.bts.gov/ntl/DOCS/UT.html
(self-instructional overview with examples)
• http://www.tfhrc.gov/safety/pedbike/vol2/
sec2.5.htm (simple description of most of
the key issues)
85
Summary
• Rational economic behavior
• Utility linear in systematic and random
components
• Choice probability is function of utilities
– non linear function!
• Application by enumeration is best -
weighted average by market segments
may be good - depends on application!
• Aggregate models are also available –
approximate!
• Surveys must be used for this step
86