Goeree 2001
Goeree 2001
This paper reports laboratory data for games that are played only once. These
games span the standard categories: static and dynamic games with complete and
incomplete information. For each game, the treasure is a treatment in which
behavior conforms nicely to predictions of the Nash equilibrium or relevant refine-
ment. In each case, however, a change in the payoff structure produces a large
inconsistency between theoretical predictions and observed behavior. These con-
tradictions are generally consistent with simple intuition based on the interaction of
payoff asymmetries and noisy introspection about others’ decisions. (JEL C72, C92)
The Nash equilibrium has been the center- sequential Nash equilibrium for dynamic games
piece of game theory since its introduction with incomplete information (e.g., Robert Gib-
about 50 years ago. Along with supply and bons, 1997). The rationality assumptions that
demand, the Nash equilibrium is one of the underlie this analysis are often preceded by
most commonly used theoretical constructs in persuasive adjectives like “perfect,” “intuitive,”
economics, and it is increasingly being applied and “divine.” If any noise in decision-making is
in other social sciences. Indeed, game theory admitted, it is eliminated in the limit in a pro-
has finally gained the central role envisioned by cess of “purification.” It is hard not to notice
John von Neumann and Oscar Morgenstern, and parallels with theology, and the highly mathe-
in some areas of economics (e.g., industrial matical nature of the developments makes this
organization) virtually all recent theoretical de- work about as inaccessible to mainstream econ-
velopments are applications of game theory. omists as medieval treatises on theology would
The impression one gets from recent surveys have been to the general public.
and game theory textbooks is that the field has The discordant note in this view of game
reached a comfortable maturity, with neat clas- theory has come primarily from laboratory ex-
sifications of games and successively stronger periments, but the prevailing opinion among
(more “refined”) versions of the basic approach game theorists seems to be that behavior will
being appropriate for more complex categories eventually converge to Nash predictions under
of games: Nash equilibrium for static games the right conditions.1 This paper presents a
with complete information, Bayesian Nash for much more unsettled perspective of the current
static games with incomplete information, sub- state of game theory. In each of the major types
game perfectness for dynamic games with com- of games, we present one or more examples for
plete information, and some refinement of the which the relevant version of the Nash equilib-
rium predicts remarkably well. These “trea-
sures” are observed in games played only once
* Goeree: Department of Economics, 114 Rouss Hall,
University of Virginia, P.O. Box 400182, Charlottesville, by financially motivated subjects who have had
VA 22904, and University of Amsterdam, Roetersstraat 11, prior experience in other, similar, strategic sit-
1018 WB Amsterdam, The Netherlands (e-mail: uations. In each of these games, however, we
jg2n@virginia.edu); Holt: Department of Economics, 114 show that a change in the payoff structure can
Rouss Hall, University of Virginia, P.O. Box 400182, Char-
lottesville, VA 22904 (e-mail: cah2k@virginia.edu). We
produce a large inconsistency between theoret-
wish to thank Rachel Parkin and Scott Saiers for research
assistance, and Colin Camerer, Monica Capra, Glenn Har-
1
rison, Susan Laury, Melayne McInnes, Theo Offerman, For example, George J. Mailath’s (1998) survey of
Amnon Rapoport, Joel Sobel, and an anonymous referee for evolutionary models cites the failure of backward induction
helpful comments. This research was funded in part by the as the main cause of behavioral deviations from Nash pre-
National Science Foundation (SES–9818683). dictions.
1402
VOL. 91 NO. 5 GOEREE AND HOLT: TREASURES AND CONTRADICTIONS 1403
ical prediction(s) and human behavior. For ex- offs for this game written on a blackboard, and
ample, a payoff change that does not alter the he invented the prisoner’s dilemma story that
unique Nash equilibrium may move the data to was later used in a lecture on game theory that
the opposite side of the range of feasible deci- he gave in the Psychology Department at Stan-
sions. Alternatively, a payoff change may cause ford (Tucker, 1950).
a major shift in the game-theoretic predictions Interestingly, Nash’s response to Dresher and
and have no noticeable effect on actual behav- Flood’s repeated prisoner’s dilemma experi-
ior. The observed contradictions are typically ment is contained in a note to the authors that
somewhat intuitive, even though they are not was published as a footnote to their paper:
explained by standard game theory. In a simul-
taneous effort-choice coordination game, for The flaw in the experiment as a test of
example, an increase in the cost of players’ equilibrium point theory is that the exper-
“effort” decisions is shown to cause a dramatic iment really amounts to having the play-
decrease in effort, despite the fact that any com- ers play one large multi-move game. One
mon effort is a Nash equilibrium for a range of cannot just as well think of the thing as a
effort costs. In some of these games, it seems sequence of independent games as one
can in zero-sum cases. There is just too
like the Nash equilibrium works only by coin- much interaction . . . (Nasar, 1998 p.
cidence, e.g., in symmetric cases where the 119).
costs of errors in each direction are balanced. In
other cases, the Nash equilibrium has consider- In contrast, the experiments that we report in
able drawing power, but economically signifi- this paper involved games that were played only
cant deviations remain to be explained. once, although related results for repeated
The idea that game theory should be tested games with random matching will be cited
with laboratory experiments is as old as the where appropriate. As Nash noted, the advan-
notion of a Nash equilibrium, and indeed, the tage of one-shot games is that they insulate
classic prisoner’s dilemma paradigm was in- behavior from the incentives for cooperation
spired by an experiment conducted at the and reciprocity that are present in repeated
RAND Corporation in 1950. Some of the stra- games. One potential disadvantage of one-shot
tegic analysts at RAND were dissatisfied with games is that, without an opportunity to learn
the received theory of cooperative and zero-sum and adapt, subjects may be especially prone to
games in von Neumann and Morgenstern’s the effects of confusion. The games used in
(1944) Theory of Games and Economic Behav- this paper, however, are simple enough in
ior. In particular, nuclear conflict was not structure to ensure that Nash-like behavior
thought of as a zero-sum game because both can be observed in the “treasure” treatment.
parties may lose. Sylvia Nasar (1998) describes In addition, the study of games played only
the interest at RAND when word spread that a once is of independent interest given the
Princeton graduate student, John Nash, had gen- widespread applications of game theory to
eralized von Neumann’s existence proof for model one-shot interactions in economics and
zero-sum games to the class of all games with other social sciences, e.g., the FCC license
finite numbers of strategies. Two mathemati- auctions, elections, military campaigns, and
cians, Melvin Dresher and Merrill Flood, had legal disputes.
been running some game experiments with their The categories of games to be considered are
colleagues, and they were skeptical that human based on the usual distinctions: static versus
behavior would be consistent with Nash’s no- dynamic and complete versus incomplete infor-
tion of equilibrium. In fact, they designed an mation. Section I describes the experiments
experiment that was run on the same day they based on static games with complete informa-
heard about Nash’s proof. Each player in this tion: social dilemma, matching pennies, and
game had a dominant strategy to defect, but coordination games. Section II contains results
both would earn more if they both used the from dynamic games with complete informa-
cooperative strategy. The game was repeated tion: bargaining games and games with threats
100 times with the same two players, and a fair that are not credible. The games reported in
amount of cooperation was observed. One of Sections III and IV have incomplete infor-
Nash’s professors, Al W. Tucker, saw the pay- mation about other players’ payoffs: in static
1404 THE AMERICAN ECONOMIC REVIEW DECEMBER 2001
settings (auctions) and two-stage settings (sig- models with elements of both belief and rein-
naling games). forcement learning (Colin Camerer and Teck-
It is well known that decisions can be af- Hua Ho, 1999). Learning from experience is not
fected by psychological factors such as framing, possible in games that are only played once, and
aspiration levels, social distance, and heuristics beliefs must be formed from introspective
(e.g., Daniel Kahneman et al., 1982; Catherine thought processes, which may be subject to
Eckel and Rick Wilson, 1999). In this paper we considerable noise. Without noise, iterated best
try to hold psychological factors constant and responses will converge to a Nash equilibrium,
focus on payoff changes that are primarily eco- if they converge at all. Some promising ap-
nomic in nature. As noted below, economic proaches to explaining deviations from Nash
theories can and are being devised to explain the predictions are based on models that limit play-
resulting anomalies. For example, the rational- ers’ capacities for introspection, either by lim-
choice assumption underlying the notion of a iting the number of iterations (e.g., Dale O.
Nash equilibrium eliminates all errors, but if the Stahl and Paul W. Wilson, 1995; Rosemarie
costs of “overshooting” an optimal decision are Nagel, 1995) or by injecting increasing amounts
much lower than the costs of “undershooting,” of noise into higher levels of iterated beliefs
one might expect an upward bias in decisions. (Goeree and Holt, 1999; Dorothea Kübler and
In a game, the endogenous effects of such bi- Georg Weizsäcker, 2000). The predictions de-
ases may be reinforcing in a way that creates a rived from these approaches, discussed in Sec-
“snowball” effect that moves decisions well tion V, generally conform to Nash predictions
away from a Nash prediction. Models that in- in the treasure treatments and to the systematic,
troduce (possibly small) amounts of noise into intuitive deviations in the contradiction treat-
the decision-making process can produce pre- ments. Some conclusions are offered in
dictions that are quite far from any Nash equi- Section VI.
librium (Richard D. McKelvey and Thomas R.
Palfrey, 1995, 1998; Goeree and Holt, 1999). I. Static Games with Complete Information
Equilibrium models of noisy behavior have
been used to explain behavior in a variety of In this section we consider a series of two-
contexts, including jury decision-making, bar- player, simultaneous-move games, for which
gaining, public goods games, imperfect price the Nash equilibria show an increasing degree
competition, and coordination (Simon P. of complexity. The first game is a “social di-
Anderson et al., 1998a, b, 2001a; C. Monica lemma” in which the pure-strategy Nash equi-
Capra et al., 1999, 2002; Stanley S. Reynolds, librium coincides with the unique rationalizable
1999; Serena Guarnaschelli et al., 2001). outcome. Next, we consider a matching pennies
A second type of rationality assumption that game with a unique Nash equilibrium in mixed
is built into the Nash equilibrium is that beliefs strategies. Finally, we discuss coordination
are consistent with actual decisions. Beliefs are games that have multiple Nash equilibria, some
not likely to be confirmed out of equilibrium, of which are better for all players.
and learning will presumably occur in such In all of the games reported here and in
cases. There is a large recent literature on in- subsequent sections, we used cohorts of student
corporating learning into models of adjustment subjects recruited from undergraduate econom-
in games that are played repeatedly with differ- ics classes at the University of Virginia. Each
ent partners.2 These models include adaptive cohort consisted of ten students who were paid
learning (e.g., Vincent P. Crawford, 1995; $6 for arriving on time, plus all cash they earned
David J. Cooper et al., 1997), naive Bayesian in the games played. These one-shot games
learning (e.g., Jordi Brandts and Holt, 1996; followed an initial “part A” in which the sub-
Dilip Mookherjee and Barry Sopher, 1997), re- jects played the same two-person game for ten
inforcement or stimulus-response learning (e.g., periods with new pairings made randomly in
Ido Erev and Alvin E. Roth, 1998), and hybrid each period.3 Earnings for the two-hour ses-
2 3
See, for instance, Drew Fudenberg and David K. Le- We only had time to run about six one-shot games in
vine (1998) for a survey. each session, so the data are obtained from a large number
VOL. 91 NO. 5 GOEREE AND HOLT: TREASURES AND CONTRADICTIONS 1405
sions ranged from $15 to $60, with an average 210 ⫺ R respectively. Since R ⬎ 1, the best
of about $35. Each one-shot game began with response is to undercut the other’s decision by 1
the distribution and reading of the instructions (if that decision were known), and therefore, the
for that game.4 These instructions contained upper bound 300 is never a best response to any
assurances that all money earned would be paid possible beliefs that one could have. Conse-
and that the game would be followed by “an- quently, a rational person must assign a proba-
other, quite different, decision-making experi- bility of zero to a choice of 300, and hence 299
ment.” Since the one-shot treatments were cannot be a best response to any possible beliefs
paired, we switched the order of the treasure that rule out choices of 300, etc. Only the lower
and contradiction conditions in each subsequent bound 180 survives this iterated deletion pro-
session. Finally, the paired treatments were al- cess and is thus the unique rationalizable action,
ways separated by other one-shot games of a and hence the unique Nash equilibrium.6 This
different type. game was introduced by Kaushik Basu (1994)
who coined it the “traveler’s dilemma” game.7
A. The One-Shot Traveler’s Dilemma Game Although the Nash equilibrium for this game
can be motivated by successively dropping
The Nash equilibrium concept is based on the those strategies that are never a best response
twin assumptions of perfect error-free decision- (to any beliefs about strategies that have not yet
making and the consistency of actions and be- been eliminated from consideration), this dele-
liefs. The latter requirement may seem tion process may be too lengthy for human
especially strong in the presence of multiple subjects with limited cognitive abilities. When
equilibria when there is no obvious way for the cost of having the higher number is small,
players to coordinate. More compelling argu- i.e., for small values of R, one might expect
ments can be given for the Nash equilibrium more errors in the direction of high claims, well
when it predicts the play of the unique justifi- away from the unique equilibrium at 180, and
able, or rationalizable, action (B. Douglas indeed this is the intuition behind the dilemma.
Bernheim, 1984; David G. Pierce, 1984). Ra- In contrast, with a large penalty for having the
tionalizability is based on the idea that players higher of the two claims, players are likely to
should eliminate those strategies that are never end up with claims that are near the unique
a best response for any possible beliefs, and Nash prediction of 180.
realize that other (rational) players will do the To test these hypotheses we asked 50 subjects
same.5 (25 pairs) to make choices in a treatment with
To illustrate this procedure, consider the R ⫽ 180, and again in a matched treatment
game in which two players independently and with R ⫽ 5. All subjects made decisions in
simultaneously choose integer numbers be- each treatment, and the two games were sepa-
tween (and including) 180 and 300. Both play- rated by a number of other one-shot games. The
ers are paid the lower of the two numbers, and,
in addition, an amount R ⬎ 1 is transferred
from the player with the higher number to the 6
In other games, rationalizability may allow outcomes
player with the lower number. For instance, if that are not Nash equilibria, so it is a weaker concept than
one person chooses 210 and the other chooses that of a Nash equilibrium, allowing a wider range of
possible behavior. It is in this sense that Nash is more
250, they receive payoffs of 210 ⫹ R and persuasive when it corresponds to the unique rationalizable
outcome.
7
The associated story is that two travelers purchase
of sessions where part A involved a wide range of repeated identical antiques while on a tropical vacation. Their lug-
games, including public goods, coordination, price compe- gage is lost on the return trip, and the airline asks them to
tition, and auction games that are reported in other papers. make independent claims for compensation. In anticipation
The one-shot games never followed a repeated game of the of excessive claims, the airline representative announces:
same type. “We know that the bags have identical contents, and we will
4
These instructions can be downloaded from http:// entertain any claim between $180 and $300, but you will
www.people.virginia.edu/⬃cah2k/datapage.html. each be reimbursed at an amount that equals the minimum of
5
A well-known example for which this iterated deletion the two claims submitted. If the two claims differ, we will
process results in a unique outcome is a Cournot duopoly also pay a reward R to the person making the smaller claim
game with linear demand (Fudenberg and Jean Tirole, 1993 and we will deduct a penalty R from the reimbursement to
pp. 47– 48). the person making the larger claim.”
1406 THE AMERICAN ECONOMIC REVIEW DECEMBER 2001
roles). The choice percentages are shown in TABLE 2—AN EXTENDED COORDINATION GAME
parentheses next to the decision labels in the top
L H S
part of Table 1. Note that the choice percentages L 90, 90 0, 0 x, 40
are essentially “50-50,” or as close as possible H 0, 0 180, 180 0, 40
given that there was an odd number of players
in each role.
Now consider what happens if the row play-
er’s payoff of $0.80 in the (Top, Left) box is in an intuitive manner, with row players’
increased to $3.20, as shown in the asymmetric choices responding to their own payoffs.11 In
matching pennies game in the middle part of this context, the Nash mixed-strategy prediction
Table 1. In a mixed-strategy equilibrium, a seems to work only by coincidence, when the
player’s own decision probabilities should be payoffs are symmetric.
such that the other player is made indifferent
between the two alternatives. Since the column C. A Coordination Game with a Secure
player’s payoffs are unchanged, the mixed- Outside Option
strategy Nash equilibrium predicts that row’s
decision probabilities do not change either. In Games with multiple Nash equilibria pose
other words, the row player should ignore the interesting new problems for predicting behav-
unusually high payoff of $3.20 and still choose ior, especially when some equilibria produce
Top or Bottom with probabilities of one-half. higher payoffs for all players. The problem of
(Since column’s payoffs are either 40 or 80 for coordinating on the high-payoff equilibrium
playing Left and either 80 or 40 for playing may be complicated by the possible gains and
Right, row’s decision probabilities must equal losses associated with payoffs that are not part
one-half to keep column indifferent between of any equilibrium outcome. Consider a coordi-
Left and Right, and hence willing to random- nation game in which players receive $1.80 if
ize.)10 This counterintuitive prediction is dra- they coordinate on the high-payoff equilibrium
matically rejected by the data, with 96 percent (H, H) $0.90 if they coordinate on the low-
of the row players choosing the Top decision payoff equilibrium (L, L), and they receive
that gives a chance of the high $3.20 payoff. nothing if they fail to coordinate (i.e., when one
Interestingly, the column players seemed to player chooses H and the other L). Suppose that,
have anticipated this, and they played Right 84 in addition, the column player has a secure
percent of the time, which is quite close to their option S that yields $0.40 for column and results
equilibrium mixed strategy of 7⁄8. Next, we low- in a zero payoff for the row player. This game is
ered the row player’s (Top, Left) payoff to given in Table 2 when x ⫽ 0. To analyze the
$0.44, which again should leave the row play- Nash equilibria of this game, notice that for the
er’s own choice probabilities unaffected in a column player a 50-50 combination of L and H
mixed-strategy Nash equilibrium. Again the ef- dominates S, and a rational column player
fect is dramatic, with 92 percent of the choices should therefore avoid the secure option. Elim-
being Down, as shown in the bottom part of inating S turns the game into a standard 2 ⫻ 2
Table 1. As before, the column players seemed coordination game that has three Nash equilibria:
to have anticipated this reaction, playing Left 80
percent of the time. To summarize, the unique
Nash prediction is for the bolded row-choice 11
This anomaly is persistent when subjects play the
percentages to be unchanged at 50 percent for game repeatedly. Jack Ochs (1995a, b) investigates a match-
all three treatments. This prediction is violated ing pennies game with an asymmetry similar to that of the
middle game in Table 1, and reports that the row players
continue to select Top considerably more than one-half of
the time, even after as many as 50 rounds. These results are
10
The predicted equilibrium probabilities for the row replicated in McKelvey et al. (2000). Similarly, Goeree et
player are not affected if we relax the assumption of risk al. (2000) report results for ten-period repeated matching
neutrality. There are only two possible payoff levels for pennies games that exactly match those in Table 1. The
column so, without loss of generality, columns’ utilities for results are qualitatively similar but less dramatic than those
payoffs of 40 and 80 can be normalized to 0 and 1. Hence in Table 1, with row’s choice probabilities showing strong
even a risk-averse column player will only be indifferent “own-payoff” effects that are not predicted by the Nash
when row uses choice probabilities of one-half. equilibrium.
1408 THE AMERICAN ECONOMIC REVIEW DECEMBER 2001
prefers a penny to nothing. Knowing this, the behavior in bargaining games hinge on models
first player should demand $3 and offer $2 to of preferences in which a person’s utility de-
the other in the first stage. In a subgame-perfect pends on the payoffs of both players, i.e., dis-
equilibrium, the first player receives the amount tribution matters (Bolton, 1998; Fehr and
by which the pie shrinks, so a larger cost of Schmidt, 1999; Bolton and Ockenfels, 2000;
delay confers a greater advantage to the player Miguel Costa-Gomes and Klaus G. Zauner,
making the initial demand, which seems reason- 2001). The role of fairness is illustrated dramat-
able. For example, a similar argument shows ically in the experiment reported in Goeree and
that if the pie shrinks by $4.50, from $5 to Holt (2000a), who obtained even larger devia-
$0.50, then the first player should make an tions from subgame-perfect Nash predictions
initial demand of $4.50. than those reported here by giving subjects
We used 60 subjects (six cohorts of ten sub- asymmetric money endowments that were paid
jects each), who were randomly paired for each independently of the bargaining outcome. These
of the two treatments described above (alternat- endowments were selected to accentuate the
ing in order and separated by other one-shot payoff inequities that result in the subgame-
games). The average demand for the first player perfect Nash equilibrium, and hence their effect
was $2.83 for the $5/$2 treatment, with a stan- was to exaggerate fairness issues without alter-
dard deviation of $0.29. This is quite close to ing the equilibrium prediction. The result (for
the predicted $3.00 demand, and 14 of the 30 seven different one-shot bargaining games) was
initial demands were exactly equal to $3.00 in for demands to be inversely related to the sub-
this treasure treatment. But the average demand game-perfect Nash predictions.
only increased to $3.38 for the other treatment
with a $4.50 prediction, and 28 of the 30 de- III. Static Games with Incomplete Information
mands were below the prediction of $4.50. Re-
jections were quite common in this contradiction William Vickrey’s (1961) models of auctions
treatment with higher demands and correspond- with incomplete information constitute one of the
ingly lower offers to the second player, which is most widely used applications of game theory. If
not surprising given the smaller costs of rejecting private values are drawn from a uniform distribu-
“stingy” offers. tion, the Bayesian Nash equilibrium predicts that
These results fit into a larger pattern surveyed bids will be proportional to value, which is gen-
in Douglas D. Davis and Holt (1993 Chapter 5) erally consistent with laboratory evidence. The
and Roth (1995); initial demands in two-stage main deviation from theoretical predictions is the
bargaining games tend to be “too low” relative tendency of human subjects to “overbid” (relative
to theoretical predictions when the equilibrium to Nash), which is commonly rationalized in terms
demand is high, say more than 80 percent of the of risk aversion, an explanation that has lead to
pie as in our $5.00/$0.50 treatment, and initial some controversy. Glenn W. Harrison (1989), for
demands tend to be close to predictions when instance, argues that deviations from the Nash
the equilibrium demand is 50 –75 percent of the equilibrium may well be caused by a lack of
pie (as in our $5.00/$2.00 treatment). Interest- monetary incentives since the costs of such devi-
ingly, initial demands are “too high” when the ations are rather small: the “flat maximum cri-
equilibrium demand is less than half of the pie. tique.” Our approach here is to specify two
Here is an example of why theoretical explana- auction games with identical Nash equilibria, but
tions of behavior should not be based on exper- with differing incentives not to overbid.
iments in only one part of the parameter space, First, consider a game in which each of two
and why theorists should have more than just a bidders receives a private value for a prize to be
casual, secondhand knowledge of the experi- auctioned in a first-price, sealed-bid auction. In
mental economics literature.14 Many of the di- other words, the prize goes to the highest bidder
verse theoretical explanations for anomalous for a price equal to that bidder’s own bid. Each
bidder’s value for the prize is equally likely to
14
be $0, $2, or $5. Bids are constrained to be
Another example is the development of theories of
generalized expected utility to explain “fanning out” pref- integer dollar amounts, with ties decided by the
erences in Allais paradox situations, when later experiments flip of a coin.
in other parts of the probability triangle found “fanning in.” The relevant Nash equilibrium in this game
VOL. 91 NO. 5 GOEREE AND HOLT: TREASURES AND CONTRADICTIONS 1413
TABLE 4—EQUILIBRIUM EXPECTED PAYOFFS FOR THE (0,2,5) TREATMENT (EQUILIBRIUM BIDS MARKED WITH AN ASTERISK*)
TABLE 5—EQUILIBRIUM EXPECTED PAYOFFS FOR THE (0,3,6) TREATMENT (EQUILIBRIUM BIDS MARKED WITH AN ASTERISK*)
with incomplete information about others’ pref- as indicated by the location of optimal bids for
erences is the Bayesian Nash equilibrium, each value. Even though the equilibria are the
which specifies an equilibrium bid for each pos- same, we expected more of an upward bias in
sible realization of a bidder’s value. It is bids in the second (0, 3, 6) treatment. The
straightforward but tedious to verify that the intuition can be seen by looking at payoff losses
Nash equilibrium bids are $0, $1, and $2 for a associated with deviations from the Nash equi-
value of $0, $2, and $5 respectively, as can be librium. Consider, for instance, the middle-
seen from the equilibrium expected payoffs in value bidder with expected payoffs shown in the
Table 4. For example, consider a bidder with a second rows of Tables 4 and 5. In the (0, 3, 6)
private value of $5 (in the bottom row) who treatment, the cost of bidding $1 above the
faces a rival that bids according to the proposed equilibrium bid is $1 ⫺ $0.83 ⫽ $0.17, which is
Nash solution. A bid of 0 has a one-half chance less than the cost of bidding $1 below the equi-
of winning (decided by a coin flip) if the rival’s librium bid: $1 ⫺ $0.50 ⫽ $0.50. In the (0, 2, 5)
value, and hence the rival’s bid, is zero, which treatment, the cost of an upward deviation from
happens with probability one-third. Therefore, the equilibrium bid is greater than the cost of a
the expected payoff of a zero bid with a value of downward deviation; see the middle row of
$5 equals 1⁄2 ⴱ 1⁄3 ⴱ ($5 ⫺ $0) ⫽ $5/6 ⫽ 0.83. Table 4. A similar argument applies to the high-
If the bid is raised to $1, the probability of win- value bidders, while deviation costs are the
ning becomes 1⁄2 (1⁄3 when the rival’s value is $0 same in both treatments for the low-value bid-
plus 1⁄6 when the rival’s value is $2). Hence, the der. Hence we expected more overbidding for
expected payoff of a $1 bid is 1⁄2 ⴱ ($5 ⫺ the (0, 3, 6) treatment.
$1) ⫽ $2. The other numbers in Table 4 are This intuition is borne out by bid data for the
derived in a similar way. The maximum expected 50 subjects who participated in a single auction
payoff in each row coincides with the equilibrium under each condition (again alternating the or-
bid, as indicated by an asterisk (ⴱ). Note that the der of the two treatments and separating the two
equilibrium involves bidding about one-half of auctions with other one-shot games). Eighty
the value.15 percent of the bids in the (0, 2, 5) treatment
Table 5 shows the analogous calculations for matched the equilibrium: the average bids for
the second treatment, with equally likely private low-, medium-, and high-value bidders were $0,
values of $0, $3, or $6. Interestingly, this in- $1.06, and $2.64, respectively. In contrast, the
crease in values does not alter the equilibrium average bids for the (0, 3, 6) treatment were $0,
bids in the unique Bayesian Nash equilibrium, $1.82, and $3.40 for the three value levels, and
only 50 percent of all bids were Nash bids. The
bid frequencies for each value are shown in
15
The bids would be exactly one-half of the value if the
Table 6. As in previous games, deviations from
highest value were $4 instead of $5, but we had to raise the Nash behavior in these private-value auctions
highest value to eliminate multiple Nash equilibria. seem to be sensitive to the costs of deviation. Of
1414 THE AMERICAN ECONOMIC REVIEW DECEMBER 2001
TABLE 7—SIGNALING WITH A SEPARATING EQUILIBRIUM (MARKED BY ASTERISKS) (SENDER’S PAYOFF, RESPONDER’S PAYOFF)
Evade. With some uncertainty about the send- tive criterion rules out these beliefs, and selects
er’s type, Evade is sufficiently unattractive to the separating equilibrium observed in the trea-
respondents that it is never selected. Consider sure treatment.19
the other two responses and note that a sender The game in Table 8 is a minor variation on
always prefers that the responder choose Con- the previous game, with the only change be-
cede instead of Deny. In the separating equilib- ing that the (500, 500) in the bottom left part
rium, the signals reveal the senders’ types, the of Table 7 is replaced by a (300, 300) pay-
responder always Concedes, and all players are off.20 As before, consider the sender’s ex-
satisfied. There is, however, a second equilib- pected payoffs when each response is
rium for the game in Table 7 in which the presumed to be equally likely, which leads
responder Concedes to Left and Denies Right, one to expect that type A senders will choose
and therefore both sender types send Left to Right and that type B senders will choose
avoid being Denied.18 Backward induction ra- Left, as indicated by the bold payoff numbers.
tionality (of the sequential Nash equilibrium) In the experiment, 10 of the 13 type A senders
does not rule out these beliefs, since a deviation did choose Right, and 9 of the 11 type B
does not occur in equilibrium, and the respon- senders did choose Left. But the separation
dent is making a best response to the beliefs. observed in this contradiction treatment is not
What is unintuitive about these beliefs (that a a Nash equilibrium.21 All equilibria for this
deviant Right signal comes from a type B) is
that the type B sender is earning 500 in this
(Left, Concede) equilibrium outcome, and no
deviation could conceivably increase this pay-
off. In contrast, the type A sender is earning 300 19
Brandts and Holt (1992, 1993) report experimental
in the Left side pooling equilibrium, and this data that contradict the predictions of the intuitive criterion,
type could possibly earn more (450 or even i.e., the decision converged to an equilibrium ruled out by
that criterion.
1,000), depending on the response to a devia- 20
Unlike the paired treatments considered previously,
tion. The In-Koo Cho and Kreps (1987) intui- the payoff change for these signaling games does alter the
set of Nash equilibria.
21
The respondents would prefer to Concede to a Right
18
To check that the responder has no incentive to devi- signal and Deny a Left signal. Type B senders would there-
ate, note that Concede is a best response to a Left regardless fore have an incentive to deviate from the proposed sepa-
of the sender’s type, and that Deny is a best response to a rating equilibrium and send a Right signal. In the
deviant Right signal if the responder believes that it was sent experiment, half of the Left signals were Denied, whereas
by a type B. only 2 of the 12 Right signals were Denied.
1416 THE AMERICAN ECONOMIC REVIEW DECEMBER 2001
22 23
For example, it is an equilibrium for both types to send Goeree et al. (2000) report formal econometric tests
Right if a Left signal triggers a C or a D response. The D that reject the predictions of inequity aversion models in the
response to Left is appropriate if the respondent thinks the context of a group of repeated asymmetric matching pennies
deviant signal comes from a type B sender, and the C games.
24
response is appropriate if the deviant is thought to be of type Payoff inequity aversion also has no effect in the
A. Beliefs that the deviant is of type A are intuitive, since minimum-effort coordination game; any common effort
type A earns 450 in equilibrium and could possibly earn level is still a Nash equilibrium. To see this, note that a
more (500) by switching to Left (if an E response follows). unilateral effort increase from a common level reduces
A second pooling equilibrium involves both types sending a one’s own payoff and creates an disadvantageous inequity.
Left signal to which the respondents Concede. A deviant Similarly, a unilateral decrease from a common effort level
Right signal is Denied, which is appropriate if the respon- reduces one’s payoff and creates an inequity where one
dent thinks the deviant signal comes from a type B sender. earns more than the other, since their costly extra effort is
Again these beliefs are intuitive since the type B sender wasted. Thus inequity aversion cannot explain the strong
could possibly gain by deviating. effect of an increase in effort costs.
VOL. 91 NO. 5 GOEREE AND HOLT: TREASURES AND CONTRADICTIONS 1417
resulting beliefs.25 The experiments reported reported by Nagel (1995).27 It is easy to verify
above indicate that magnitudes (not just signs) that level one rationality also provides good
of payoff differences matter, and it is thus nat- predictions for both treasure and contradiction
ural to consider a decision rule for which choice treatments in the traveler’s dilemma, the
probabilities are positively but imperfectly re- minimum-effort coordination game, and the
lated to payoffs. The logit rule, for example, Kreps game. There is evidence, however, that at
specifies that choice probabilities, pi , for op- least some subjects form more precise beliefs
tions i ⫽ 1, ... ,m, are proportional to exponen- about others’ actions, possibly through higher
tial functions of the associated expected levels of introspection.28 In the matching pen-
payoffs, ei : nies games in Table 1, for example, a flat prior
makes column indifferent between Left and
exp共 ei / 兲 Right, and yet most column players seem to
(1) p i ⫽ , i ⫽ 1, ... , m, anticipate that row will choose Top in the 320
¥ exp共 e / 兲 version and Bottom in the 44 version of this
j
j ⫽ 1,...,m
game.
Of course, what the other player does de-
where the sum in the denominator ensures that pends on what they think you will do, so the
the probabilities sum to one, and the “error next logical step is to assume that others make
parameter,” , determines how sensitive choice responses to a flat prior, and then you respond to
probabilities are to payoff differences.26 that anticipated response (Selten, 1991). This is
In order to use the “logit best response” in Stahl and Wilson’s (1995) “level two” rational-
(1), we need to model the process of belief ity. There is, however, no obvious reason to
formation, since belief probabilities are used to truncate the levels of iterated thinking. The no-
calculate the expected payoffs on the right side tion of rationalizability discussed above, for ex-
of (1). By the principle of insufficient reason ample, involves infinitely many levels of
one might postulate that each of the others’ iterated thinking, with “never-best” responses
actions are equally likely. This corresponds to eliminated in succession. But rationalizability
the Stahl and Wilson (1995) notion of “level seems to imply too much rationality, since it
one” rationality, which captures many of the predicts that all claims in the traveler’s dilemma
first-period decisions in the “guessing game” will be equal to the minimum claim, indepen-
dent of the penalty/reward parameter. One way
25
to limit the precision of the thought process,
Perhaps the best-known model of introspection is Har-
sanyi and Selten’s (1988) “tracing procedure.” This proce- without making an arbitrary assumption about
dure involves an axiomatic determination of players’ the number of iterations, is to inject increasing
common priors (the “preliminary theory”) and the construc- amounts of noise into higher levels of iterated
tion of a modified game with payoffs for each decision that thinking (Goeree and Holt, 1999; Kübler and
are weighted averages of those in the original game and of
the expected payoffs determined by the prior distribution.
Weizsäcker, 2000). Let denote the logit best-
By varying the weight on the original game, a sequence of response map (for error rate ) on the right side
best responses for the modified game are generated. This of (1). Just as a single logit response to beliefs,
process is used to select one of the Nash equilibria of the p0 , can be represented as p ⫽ (p0), a series of
original game. Gonzalo Olcina and Amparo Urbano (1994) such responses can be represented as:29
also use an axiomatic approach to select a prior distribution,
which is then revised by a simulated learning process that is
essentially a partial adjustment from current beliefs to best
27
responses to current beliefs. Since neither the Harsanyi/ In our own work, we have used a noisy response to a
Selten model nor the Olcina/Urbano model incorporates any flat prior as a way of starting computer simulations of
noise, they predict that behavior will converge to the Nash simulations of behavior in repeated games (Brandts and
equilibrium in games with a unique equilibrium, which is an Holt, 1996; Capra et al., 1999, 2002; Goeree and Holt,
undesirable feature in light of the contradictions data re- 1999).
28
ported above. Costa-Gomes et al. (2001), for example, infer some
26
As goes to zero, payoff differences are “blown up,” heterogeneity in the amount of introspection by observing
and the probability of the optimal decision converges to 1. the types of information that subjects acquire before making
In the other extreme, as goes to ⬁, the choice probabilities a decision.
29
converge to 1/m independently of expected payoffs. See R. Goeree and Holt (2000b) use continuity arguments to
Duncan Luce (1959) for an axiomatic derivation of the logit show that the limit in (2) exists even if the (increasing) error
choice rule in (1). parameters are person specific.
1418 THE AMERICAN ECONOMIC REVIEW DECEMBER 2001
(2) p ⫽ lim 1 共 2 共 ... n 共p0 兲兲兲, example. Several of the models discussed above
n3⬁ do a fairly good job of organizing the qualitative
patterns of conformity and deviation from the
where 1 ⱕ 2 ⱕ ... , with ⬁ converging predictions of standard theory, but there are
to infinity.30 This assumption captures the idea obvious discrepancies. We hope that this paper
that it becomes increasingly complex to do will stimulate further theoretical work on mod-
more and more iterations.31 Since for ⫽ ⬁ els of behavior in one-shot games. One poten-
maps the whole probability simplex to a single tially useful approach may be to elicit beliefs
point, the right side of (2) is independent of the directly as the games are played (Theo Offer-
initial belief vector p0. Moreover, the introspec- man, 1997; Andrew Schotter and Yaw Narkov,
tion process in (2) yields a unique outcome even 1998).
in games with multiple Nash equilibria. Note
that the choice probabilities on the left side of VI. Conclusion
(2) generally do not match the beliefs at any
stage of the iterative process on the right. In One-shot game experiments are interesting
other words, the introspective process allows because many games are in fact only played
for surprises, which are likely to occur in one- once; single play is especially relevant in appli-
shot games. cations of game theory in other fields, e.g.,
For games with very different levels of com- international conflicts, election campaigns, and
plexity such as the ones reported here, the error legal disputes. The decision makers in these
parameters that provide the best fit are likely to contexts, like the subjects in our experiments,
be different. In this case, the estimates indicate typically have experience in similar games with
the degree of complexity, i.e., they serve as a other people. One-shot games are also appeal-
measurement device. For games of similar com- ing because they allow us to abstract away from
plexity, the model in (2) could be applied to issues of learning and attempts to manipulate
predict behavior across games. We have used it others’ beliefs, behavior, or preferences (e.g.,
to explain data patterns in a series of 37 simple reciprocity, cooperativeness). This paper re-
matrix games, assuming a simple two-parame- ports the results of ten pairs of games that are
ter model for which n ⫽ t n , where t deter- played only once by subjects who have experi-
mines the rate at which noise increases with ence with other one-shot and repeated games.
higher iterations (Goeree and Holt, 2000b). The The Nash equilibrium (or relevant refinement)
estimated value (t ⫽ 4.1) implies that there is provides accurate predictions for standard ver-
more noise for higher levels of introspection, a sions of these games. In each case, however,
result that is roughly consistent with estimates there is a matched game for which the Nash
obtained by Kübler and Weizsäcker (2000) for prediction clearly fails, although it fails in a way
data from information-cascade experiments. that is consistent with simple (non-game-
The analysis of introspection is a relatively theoretic) intuition. The results for these expe-
understudied topic in game theory, as compared rienced subjects show:
with equilibrium refinements and learning, for
(1) Behavior may diverge sharply from the
unique rationalizable (Nash) equilibrium in
30
The case of a constant parameter (1 ⫽ 2 ⫽ ... ⫽ )
a social (traveler’s) dilemma. In these
is of special interest. In this case, the process may not games, the Nash equilibrium is located on
converge for some games (e.g., matching pennies), but one side of the range of feasible decisions,
when it does, the limit probabilities, p*, must be invariant and data for the contradiction treatment
under the logit map: ( p*) ⫽ p*. A fixed point of this
type constitutes a “logit equilibrium,” which is a special
have a mode on the opposite side of this
case of the quantal-response equilibrium defined in range. The most salient feature of the data
McKelvey and Palfrey (1995). It is in this sense that the is the extreme sensitivity to a parameter
logit equilibrium arises as a limit case of the noisy intro- that has no effect on the Nash outcome.
spective process defined in (2). (2) Students suffering through game theory
31
For an interesting alternative approach, see Capra
(1998). In her model, beliefs are represented by degenerate classes may have good reasons when they
distributions that put all probability mass at a single point. have trouble understanding why a change
The location of the belief points is, ex ante, stochastic. in one player’s payoffs only affects the
VOL. 91 NO. 5 GOEREE AND HOLT: TREASURES AND CONTRADICTIONS 1419
Games: Stochastic Potential and Logit Equi- Journal of Economics, May 1987, 102(2), pp.
librium.” Games and Economic Behavior, 179 –221.
February 2001b, 34(2), pp. 177–99. Cooper, David J.; Garvin, Susan and Kagel, John
Basu, Kaushik. “The Traveler’s Dilemma: Par- H. “Signaling and Adaptive Learning in an
adoxes of Rationality in Game Theory.” Entry Limit Pricing Game.” Rand Journal of
American Economic Review, May 1994 Economics, Winter 1997, 28(4), pp. 662– 83.
(Papers and Proceedings), 84(2), pp. 391– Cooper, Russell; DeJong, Douglas V.; Forsythe,
95. Robert and Ross, Thomas W. “Communica-
Beard, Randolph T. and Beil, Richard O. “Do tion in Coordination Games.” Quarterly
People Rely on the Self-interested Maximi- Journal of Economics, May 1992, 107(2), pp.
zation of Others: An Experimental Test.” 739 –71.
Management Science, February 1994, 40(2), Costa-Gomes, Miguel; Crawford, Vincent P. and
pp. 252– 62. Broseta, Bruno. “Cognition and Behavior in
Bernheim, B. Douglas. “Rationalizable Strategic Normal-Form Games: An Experimental
Behavior.” Econometrica, July 1984, 52(4), Study.” Econometrica, September 2001, 69,
pp. 1007–28. pp. 1193–235.
Bolton, Gary E. “Bargaining and Dilemma Costa-Gomes, Miguel and Zauner, Klaus G. “Ul-
Games: From Laboratory Data Towards The- timatum Bargaining Behavior in Israel, Ja-
oretical Synthesis.” Experimental Econom- pan, Slovenia, and the United States: A
ics, 1998, 1(3), pp. 257– 81. Social Utility Analysis.” Games and Eco-
Bolton, Gary E and Ockenfels, Axel. “A Theory nomic Behavior, February 2001, 34(2), pp.
of Equity, Reciprocity, and Competition.” 238 – 69.
American Economic Review, March 2000, Crawford, Vincent P. “Adaptive Dynamics in
90(1), pp. 166 –93. Coordination Games.” Econometrica, Janu-
Brandts, Jordi and Holt, Charles A. “An Exper- ary 1995, 63(1), pp. 103– 43.
imental Test of Equilibrium Dominance in Davis, Douglas D. and Holt, Charles A. Experi-
Signaling Games.” American Economic Re- mental economics. Princeton, NJ: Princeton
view, December 1992, 82(5), pp. 1350 – 65. University Press, 1993.
. “Adjustment Patterns and Equilibrium Eckel, Catherine and Wilson, Rick. “The Human
Selection in Experimental Signaling Games.” Face of Game Theory: Trust and Reciprocity
International Journal of Game Theory, 1993, in Sequential Games.” Working paper, Rice
22(3), pp. 279 –302. University, 1999.
. “Naive Bayesian Learning and Adjust- Erev, Ido and Roth, Alvin E. “Predicting How
ments to Equilibrium in Signaling Games.” People Play Games: Reinforcement Learning
Working paper, University of Virginia, 1996. in Experimental Games with Unique, Mixed
Camerer, Colin and Ho, Teck-Hua. “Experience- Strategy Equilibria.” American Economic Re-
Weighted Attraction Learning in Normal view, September 1998, 88(4), pp. 848 – 81.
Form Games.” Econometrica, July 1999, Fehr, Ernst and Schmidt, Klaus. “A Theory of
67(4), pp. 827–74. Fairness, Competition, and Cooperation.”
Capra, C. Monica. “Noisy Expectation Forma- Quarterly Journal of Economics, August
tion in One-Shot Games.” Ph.D. dissertation, 1999, 114(3), pp. 817– 68.
University of Virginia, 1998. Fudenberg, Drew and Levine, David K. The the-
Capra, C. Monica; Goeree, Jacob K.; Gomez, Ro- ory of learning in games. Cambridge, MA:
sario and Holt, Charles A. “Anomalous Be- MIT Press, 1998.
havior in a Traveler’s Dilemma?” American Fudenberg, Drew and Tirole, Jean. Game theory.
Economic Review, June 1999, 89(3), pp. Cambridge, MA: MIT Press, 1993.
678 –90. Gibbons, Robert. “An Introduction to Applica-
. “Learning and Noisy Equilibrium Be- ble Game Theory.” Journal of Economic Per-
havior in an Experimental Study of Imperfect spectives, Winter 1997, 11(1), pp. 127– 49.
Price Competition.” International Economic Goeree, Jacob K. and Holt, Charles A. “An
Review, 2002 (forthcoming). Experimental Study of Costly Coordina-
Cho, In-Koo and Kreps, David M. “Signaling tion.” Working paper, University of Vir-
Games and Stable Equilibria.” Quarterly ginia, 1998.
VOL. 91 NO. 5 GOEREE AND HOLT: TREASURES AND CONTRADICTIONS 1421