UNRAVELLING BUENO DE MESQUITA’S
GROUP DECISION MODEL
Jason B. Scholz, Gregory J. Calbert and Glen A. Smith
C3I Division, Defence Science and Technology Organisation, PO Box 1500, Edinburgh, South Australia
Keywords: Multi-agent theory, Group decision making, Expected utility, Alliance, Risk, Influence.
Abstract: The development of societies of human and machine agents should benefit from an understanding of human
group decision processes. Political Scientist and Professor, Bruce Bueno De Mesquita has made significant
claims for the predictive accuracy of his computational model of group decision making, receiving much
popular press including newspaper articles, books and a television documentary entitled “The New
Nostradamus”. Despite these and many journal and conference publications related to the topic, no clear
elicitation of the model exists in the open literature. We expose and present the model by careful navigation
of the literature and illustrate the soundness of our interpretation by replicating De Mesquita’s own results.
We also discuss concerns regarding model sensitivity and convergence.
1 INTRODUCTION
There is little doubt that some of the greatest social
challenges for the future of mankind include
terrorism, war, climate change, poverty, and
economics. So, the pursuit of an integrated theory
capable of explanation and prediction of group
decision outcomes is a worthy endeavour. Such
efforts, often classed under the realm of
computational political science, aim to form testable
yet tractable models for human agency (Kollman et
al, 2010). Bueno De Mesquita (herein abbreviated to
BDM) has laid claims to such an achievement. An
example prediction was “…the ability to dominate
Iran’s politics resides with Khamenei and
Rafsanjani. And between these two – though the
contest is close – the advantage seems to lie with
Khamenei.” (BDM, 1984, p. 233)
The accuracy of this prediction is demonstrated
by the fact that Khamenei succeeded Khomeinei as
Supreme Leader of Iran in June 1989 and Rafsanjani
became the fourth president of Iran in August 1989.
BDM’s model of group decision making
considers conflict and agreement, and is based on
expected utility theory. BDM (1997) states:
The model itself depicts a game in which
actors simultaneously make proposals, and
exert influence on one another. They evaluate
options and build coalitions by shifting
positions on the issue in question. The above
steps are repeated sequentially until the issue
is resolved. (p. 238)
A New York Times article by Thomson (2009) gives
some insight into why the model has never been
fully disclosed:
…Bueno de Mesquita does not publish the
actual computer code of his model. (Bueno
de Mesquita cannot do so because his former
firm owns the actual code, but he counters
that he has outlined the math behind his
model in enough academic papers and books
for anyone to replicate something close to his
work.)
At first BDM (1997) appears to offer the most
promise in elucidating the model, however first
impressions prove misleading. Significant errors and
obfuscations become apparent to anyone who tries to
replicate the model and results from this and later
works. In the following, we carefully navigate and
interpret earlier works to derive a working model
and agent software that reproduces his published
results to an adequate level of accuracy.
2 EVOLUTION OF THE MODEL
BDM’s predictions depend on two parts. First, his
18 Scholz J., Calbert G. and Smith G..
UNRAVELLING BUENO DE MESQUITA’S GROUP DECISION MODEL .
DOI: 10.5220/0003121500180030
In Proceedings of the 3rd International Conference on Agents and Artiﬁcial Intelligence (ICAART-2011), pages 18-30
ISBN: 978-989-8425-40-9
Copyright c 2011 SCITEPRESS (Science and Technology Publications, Lda.)
method of data collection and interpretation from
human subjects; second, the computational model
which he applies to that data.
The first part is significant, but has not been
described by BDM in the open literature and so
evades current interpretation or analysis. The second
part, the computational model we examine further.
The model deals with a single ‘issue’
decomposed into a metric scale, with ‘position’
values (x) corresponding to states of the issue. BDM
illustrates, “The term xi represents each nation’s
preferred date, measured in years, by which
emission standards should be applied to mediumsized
automobiles as revealed at the outset of
discussions on the issue.” (BDM, 1994, p.77). We
will continue with this example later in the results
section. A number of ‘actors’ (i=1,2,…,n) exist, each
of which hold a single ‘position’ (xi) with regard to
the issue, represented by their assignment to a
location.
Each ‘actor’ is also considered to possess some
‘capability’ (ci) with respect to the ‘issue’.
‘Capability’ is sometimes interchangeably referred
to as ‘power’ or ‘resources’ by BDM. Like
‘position’, ‘capability’ is given a value on a metric
scale. This value represents an actor’s level of
influence with regard to the issue.
Lastly, each ‘actor’ is also considered to possess
some ‘salience’ (si) with respect to the ‘issue’.
‘Salience’ is sometimes interchangeably referred to
as ‘importance’, ‘priority’, ‘attention’ by BDM. Like
‘position’ and ‘capability’, ‘salience’ is given a
value on a metric scale. This value represents an
actor’s level of energy with regard to the issue.
Table 1 from BDM (1994, p. 78) illustrates.
Table 1: Example input data for the computational model.
The issue is ‘the date (years) of introduction of emission
standards for medium-sized automobiles’.
Actor (i) Capability
(ci)
Position
(xi)
Salience
(si)
Netherlands 0.08 4 80
Belgium 0.08 7 40
Luxembourg 0.03 4 20
Germany 0.16 4 80
France 0.16 10 60
Italy 0.16 10 60
UK 0.16 10 90
Ireland 0.05 7 10
Denmark 0.05 4 100
Greece 0.08 7 70
BDM’s model decomposes the social fabric into
pairwise ‘contests’ between actors with support or
otherwise of third-party alliances. Based on actor i’s
perception of expected utility, actor i considers
whether or not to challenge each other actor j, in an
attempt to convince them to adopt i’s position. The
expected utility includes an assessment of the level
of third-party support for actor i’s challenge. If actor
i’s expected utility of challenging actor j versus not
challenging is greater than zero, actor i will
challenge actor j, otherwise it will not. This model of
mind or agency is confrontational and wholly self-
interested.
Not surprisingly, BDM has adapted the model
over the years. So it is necessary to clearly identify
which version we are using when considering its
form and results.
BDM (1980) provides the earliest form, which is
repeated in BDM (1981). The notation is later
revised in BDM (1985), and includes a modification
to include a risk exponent; however the basic
expected utility calculations remain the same from
1980 to 1985. We are readily familiar with the
expected value of a random variable Z, with various
states Zw each with probability Pw of occurring as:
  
w
wwZPzE
Expected utility follows the same structure in
that the utilities of different contest outcomes are
estimated along with the associated probabilities.
An apparent motivation for BDM’s expected
utility model was predicting the outbreak of war as
per BDM (1981). It is thus not surprising to find a
confrontational mentality to the basic form of the
model. BDM considers an actor i to choose to
‘challenge’ a rival or opponent actor j. Thereby the
expected utility for i to challenge j is:
    fiisiic
i
UPUPUE  1
Where Usi refers to the utility for actor i if it
succeeds and Ufi is the utility for actor i if it fails.
BDM (1985, p. 158) extends this with a third
term relating to the third-party contribution to i’s
expected utility (using BDM’s notation):
   




jik
i
kj
i
kijkik
i
fii
i
siicij
i
UUPP
UPUPUE
,
))(1(
1
(1)
If actor i does not challenge j, i stays at the same
position and j may either remain where it is (status
quo) or j may move to a different position. If j
moves, the utility of the outcome may prove either
UNRAVELLING BUENO DE MESQUITA'S GROUP DECISION MODEL
19
better for i or worse for i. The expected utility for i
not challenging is then, BDM (1985, p.158):
 
    i
wi
i
bi
i
bi
i
bi
i
qi
i
qi
i
qincij
i
UQUQQ
UQUE


11
(2)
Where Qq refers to the probability of status quo and
Qb refers to a switch (value either 1 or 0) depending
on whether the outcome was better or worse. These
and other issues will be explained fully later.
The full form of the expected utility difference
combines (1) and (2):
     ncij
i
cij
i
ij
i
UEUEUE  (3)
In BDM and Lalman (1986) a problem with the
following term in (1) is identified:
 
,
( 1)( )i i i
ij ik jk ki kjm
k i j
E U P P U U

    (4)
The problem is described in BDM and Lalman
(1986):
Because of the manner in which third parties
are treated in earlier studies (Bueno De
Mesquita, 1981, 1985), the operational
estimate of expected utility values for any
decision maker could vary between (2N-2)
and –(2N-2), where N is the total number of
nations in the relevant international system.
Variations in the size of the international
community, then, affected the possible range
of values in the expected utility models set
out earlier. This is a serious shortcoming in
that it makes comparison of a single nation’s
utility scores in different years difficult. …
The new formulation fixes the range of
values, irrespective of system size, in a
theoretically meaningful way. (p. 1119)
BDM’s proposed solution involves removing the
term (4) from (1) and incorporating a more complex
form of calculation of the probability Pi. In BDM
(1985, p.161) the probability iP refers to “Pi = i’s
probability of succeeding in a bilateral contest with
j”.
From 1986 onwards, the definition of the
probability is changed to account for multilateral
contributions to the contest between i and j. The new
form is denoted
i
iP . This will be defined later.
The form of the expected utilities as stated in
BDM and Lalman (1986, p. 1118) are:
     ncij
i
cij
i
ij
i
UEUEUE  (5)
    
  i
sij
i
fi
i
i
i
si
i
ijcij
i
Us
UPUPsUE


1
1
(6)
 
    i
wi
i
bi
i
sqncij
i
UTTUQ
QUUE


11
(7)
The notable change is the inclusion of salience sj and
an extra term in the expected utility for challenge.
Otherwise, notation changes are minimal.
This latter structure remains throughout BDM
(1994, 1997, and 2002). In BDM (2009b) a new
structure of model is announced, however we do not
consider this new model further. As a result of the
multi-lateral scaling issue with the model in its pre-
1986 form, we focus on the model structure and
results for 1986 and later, using the form from
equations 5, 6 and 7.
3 UTILITIES
BDM (1997, p.242-243) uses equations (5), (6) and
(7) though with different notation. The probability of
status quo might be determined in a number of ways,
however, a value of Q=1.0 is assumed in BDM
(1985, p.161), corresponding to a stoic opposition
and a value of Q=0.5 is assumed in both BDM and
Lalman (1986, p.1122) and BDM (2009a, p.5),
corresponding to a maximally uncertain outcome of
whether the actor j will move or stay in position. No
explicit value for Q is specified in other papers.
3.1 Base Utilities
To find the expected utility, we need to calculate the
basic utilities:
i
sq
i
wi
i
bi
i
fi
i
si UUUUU ,,,, (8)
These utilities are a function of the policy position of
actors, xi and xj. BDM (1997, p.264) tells us that “…
i’s utility for kx , k
i
xu , is a decreasing function of
the distance between the proposal and i’s preferred
resolution, so that
 *
Ikk
i
xxfxu  . ” (9)
ICAART 2011 - 3rd International Conference on Agents and Artificial Intelligence
20
The notation j
i
xu is not the same as that used in (8),
so some transformation is probably required. Thus
we interpret (9) as the general class of model only. It
is worth pointing out that a specific class is stated in
BDM (1997, p.245):
ir
jij
i
xxxu *
1  (10)
However, (10) is inconsistent with the more detailed
earlier explanations, as the following will now
reveal.
One clue to the utility calculations is given in
BDM (1994):
Should i succeed, then i will derive the utility
associated with convincing j to switch from
its current policy stance to that supported by
i. This is denoted by dxu j
i
|
 , which equals
)( ji
i
xxu  . Should i fail, then it confronts
the prospect of having to abandon its
objectives in favour of those pursued by j,
denoted by ).(| ij
i
j
i
xxudxu   (p. 84)
Once again, BDM introduces additional notation
)( ji
i
xxu  and )( ij
i
xxu  which remains
undefined. However, BDM (1985, p.158) gives
utility for i’s success which is of the form of a
difference between positions i and j:
  iri
ij
i
iii
si
UU
U







 

4
2
42 (11)
Also, the utility for i’s failure, which is of the form
of a difference between positions j and i:
  iri
ii
i
iji
fi
UU
U







 

4
2
42 (12)
Noting also in BDM (1985):
The reason for the transformations by 2’s and
4’s is to preserve the original scale of
numbers while avoiding the generation of
imaginary numbers. Because ri can be less
than 1.0, the absence of transformations
would mean that for negative values of, for
instance Ufi, no real root would exist. This
problem is eliminated with the introduction
of these transformations. (p. 158)
i
iiU and
i
ijU are defined by BDM (1985):
With
i
iiU being equal to the value i attaches
to his own policy portfolio (Both iiU and
jjU are assumed to equal 1.0, with ijU and
jiU ranging between possible values of 1.0
and -1.0), and with
i
ijU being equal to the
value i attaches to j’s policies as a function of
their similarity to the policies of i. (p. 158)
Thus,
1 j
jj
j
ii
i
jj
i
ii UUUU (13)
And to satisfy the stated range requirement we
propose,
minmax
21
xx
xx
UU
jii
ji
i
ij


 (14)
Equation (14) is consistent with the statement and
equation at (9). Note that -1≤ i
iiU ≤1. Where,
xmax−xmin is the range of positions. Note the
maximum value of +1 occurs when policy positions
of i and j coincide and is at its minimum of -1 when
the positions are maximally separated.
Summarising, so far we have now accounted for
(8) parts a and b, which simplify to:
ir
jii
si
xx
xx
U











minmax
5.05.042 (15)
ir
jii
fi
xx
xx
U











minmax
5.05.042 (16)
Note the ranges 2)5.0(42  i
si
r
Ui
and
iri
fiU )5.0(422  are consistent with the
diagram in BDM (1985, p.159).
BDM (1985) does not explicitly define i
biU or
i
wiU , however, we are given in BDM (1985, p.158):
    0
2 4 2 4
ir
i i i i i
qi ii ij ii ijtn t
U U U U U       
  
(17)
UNRAVELLING BUENO DE MESQUITA'S GROUP DECISION MODEL
21
The subscripts t0 and tn are not defined, but in
BDM (1981, p.48) these correspond to before and
after j’s policy change, respectively. The utility
subscript q is usually signifies status quo, but we
believe this is an error in (17) and should instead
refer to j making a policy change (or move) which
gains or betters the situation for i. We are led to
believe this by BDM (1981, p.47-48):
 0t
i
ij
i
ii UU  =i’s perception of what may be
gained by succeeding in a bilateral conflict
with j…  0t
i
ii
i
ij UU  = i’s perception of what
may be lost by failing in a bilateral conflict
with j…
So we adopt the subscript, b indicating ‘better’
consistent with (8). Furthermore, we note a problem
with scaling for (17), as it will become undefined
(negative number raised to a power less than 1.0).
Thus, corrections are also required to realign scaling.
The result is as follows:
    ir
t
i
ij
i
iitn
i
ij
i
iii
bi
UUUU
U







 

8
4
42 0 (18)
Similarly we expect that j’s movement may
potentially result in a worse condition for i:
    ir
t
i
ii
i
ijtn
i
ii
i
iji
wi
UUUU
U







 

8
4
42 0 (19)
Note this adjusted scaling ensures
2)5.0(42  bi
r
Ui and ir
wiU )5.0(422  .
We are given a clue that some relation to the
median voter position is important by
“ ]||[ dxuanddxu j
i
j
i 
 are approximated by
comparing the value actor i attaches to the current
median voter prediction to the value i attaches to the
median anticipated if i accepts j’s preferred
outcome.” (BDM 1997, p. 248)
BDM (2009a) expresses this most clearly:
… they are anticipated to move towards the
median voter position if they make an
uncoerced move. This means that if B lies on
the opposite side of the median voter from A,
then A anticipates that if B moves
(probability=0.5), then B will move in such a
way as to come closer to the policy outcome
A supports and so A’s welfare will improve
without A having to exert any effort. If B lies
between the median voter position and A,
then whether A’s welfare improves or
worsens depends on how far B is expected to
move compared to A. The same is true if A
lies between B and the median. (p. 6)
We interpret this to mean that for no challenge
(uncoerced), and B (or j) moves, that A (or i)
expects B (or j) will move to the median position.
The cases are illustrated in figures 1 to 4.
i j j i
Figure 1: Case 1:  between i and j utility for i gets
better as a result of j moving.
i j

j i

Figure 2: Case 2: j between i and utility for i gets worse
as a result of j moving.
j i

i j

Figure 3: Case 3A: i between j and  utility for i gets
better as a result of j moving.
j i

i j

Figure 4: Case 3B: i between j and  utility for i gets
worse as a result of j moving.
For any of these cases then we expect,
 
minmax
0
21
xx
xx
U
ji
t
i
ij


 (20)
 
minmax
21
xx
x
U i
tn
i
ij




(21)
Substituting (20) and (21) into (18) yields:
  ir
jiii
bi
xx
xxx
U











minmax
25.05.042

(22)
Substituting (20) and (21) into (19) yields:
ICAART 2011 - 3rd International Conference on Agents and Artificial Intelligence
22
  ir
jiii
wi
xx
xxx
U











minmax
25.05.042

(23)
We note further, that these utilities must be applied
in the appropriate cases. Thus if case 1 is true, then
the probability of i’s utility improving is 1.0 and by
implication the probability of i’s utility worsening is
0.0. We believe this defines the probability T in
equation (7), though no known publication by the
author states this explicitly. So, cases 1 and 3A
correspond to T=1 and cases 2 and 3B correspond to
T=0. This helps explain the description in BDM
(1986, p. 1122).
Lastly, for the situation of no change in policy
(status quo) i does not challenge and j does not
move, BDM (1985, p.158) defines:
 i
i
r
r
i
sqU 5.042
4
)02(
42 


 
 (24)
We expect this corresponds to (8) part e.
To determine the calculations for actor j, BDM
(1985) notes:
Of course, the j
f
j
s UU , and j
qU terms (with
appropriate superscripts) are defined
analogously. These terms vary as a function
of whose estimate of expected utility is being
calculated (i.e., who is the superscripted
actor) by varying the risk exponent, so that
for expected utility equations with a j
superscript, j’s risk taking propensity is used
to estimate what j perceives to be the value of
success, failure, or no challenge for i in
accordance with the equations delineated
below. (p. 158)
Thus, the two main equations become:
      
 
1 1
(1 ) (1 )
i i i i i i
ij j i si i fi j si
i i i
sq bi wi
E U s P U P U s U
QU Q TU T U
    
    
(25a)
      
 
1 1
(1 ) (1 )
j j j j j j
ji j j sj j fj j sj
j j j
sq bj wj
E U s P U P U s U
QU Q TU T U
    
    
(25b)
3.2 The Median Voter Position
In the previous section, it became clear that the
“median voter position” must be determined in order
to calculate the utility terms in (22) and (23).
BDM defines comparative votes in direct proportion
to utility difference, capability and salience. The
votes ‘cast’ by agent i in comparing positions
jx and kx is given as BDM (1997, p.239):
 k
i
j
i
iikj
jk
i xuxuscxxv ,| (26)
We emphasise, that these votes cast to k may
indeed be negative, if for example, agent i prefers j
to .k To map this notation to that used previously,
we interpret:
ijj
i
Uxu  & ikk
i
Uxu  (27)
Using (14) we get:











minmax
2,|
xx
xxxx
scxxv
jiki
iikj
jk
i (28)
According to BDM (1997):
The prospect that a proposal will succeed is
assumed to depend on how much support can
be mustered in its favour as compared with
the feasible alternatives. This is calculated as
the sum of “votes” across all actors in
comparison between xj and xk . (p. 240)



n
i
jk
i
jk
vv
1
(29)
In general, this pairwise determination is termed a
Condorcet Method of voting. A Condorcet winner is
the candidate whom voters prefer to each other
candidate, when compared to them one at a time.
Black’s Median Voter theory now comes into
play, so “the decision adopted by the committee
becomes determinant as soon as the position of the
one optimum – which we can refer to conveniently
enough as the median optimum – is given.” (Black,
1948).
That is, in a majority election where a voter’s
attitude is represented as a point in one dimension, if
all voters vote for a candidate closest to their own
preference and there are only two candidates, then if
the candidates want to maximise their votes they
should commit to the policy attitude preferred by the
median voter.
The median voter’s ideal attitude is always a
Condorcet winner (Congleton, 2003). Thus the
median voter attitude index and the number of votes
at the median attitude may be determined.
UNRAVELLING BUENO DE MESQUITA'S GROUP DECISION MODEL
23
4 ALLIANCE PROBABILITY
The probabilities of equations (25) are determined
by the bilateral alliances. BDM determines these
probabilities by combining across all pairs, an
assessment of ‘who is with me’ (positive valued
vote) versus ‘who is against me’ (negative valued
vote) and normalising. BDM (1997, p.244) states the
estimator as:



n
k
ij
k
xuxuk
ij
k
i
vvP
j
k
i
k
1|
(30)
When more agents are ‘for’ than ‘against’, this raises
the probability of winning the bilateral contest. As
per previous derivation of votes, substitute and
expand (31):






 n
k
ikjkkk
ifk
ikjkkk
i
xxxxsc
xxxxsc
Pi
1
0arg
)(
)(
(31)
5 RISK PROPENSITY
As seen in the previous section, utility calculations
involve a risk exponent. This risk exponent is in turn
derived from the expected utility. BDM (1985,
p.157) is first to describe the basis for risk
calculation,
I define each nation’s security level as

ij
jiUE )( . The greater the sum, the more
utility i believes its adversaries expect to
derive from challenging i. … as this sum
decreases, i’s relative security increases, so
that i is assumed to have adopted safe
policies ...
BDM (1985, p.157) goes on to define:
max min
max min
2 ( ) ( ) ( )
( ) ( )
ji ji ji
i
ji ji
E U E U E U
R
E U E U
 


  
 
(32)
Note that BDM (1997, p.247) reverses the subscripts
of the above, which is inconsistent with his
conceptual basis of security. Further, BDM
(1997,p.247) provides an inconsistent transformation
formula which would not accommodate the range
11  iR . Thus, we choose the earlier conversion
formula from BDM (1985, p.157):
3/1
3/1
i
i
i
R
R
r


 (33)
The purpose of the formula according to BDM
(1985, p.157) is to ensure that ri ranges between 0.5
and 2, noting that the divisor of 3 appears arbitrary,
but effects curvature. Equation (33) is illustrated in
figure 5.
0
0.5
1
1.5
2 
‐1  ‐0.6 ‐0.2 0  0.2 0.6 1
ri
Ri 
Figure 5: Scaling conversion formula.
The following equation (34), expresses (32)
more precisely:
1, 1, 1,
1, 1,
2 ( ) max ( ) min ( )
max ( ) min ( )
n n n
i i i
ji ji ji
ii
j j i j j i j j i
i n n
i i
ji ji
ii
j j i j j i
E U E U E U
R
E U E U
     
   
 


   
   
   
   
   
   
  
 
(34)
We still need to know, however, how to calculate the
expected utilities in (25), which use a modified
notation. BDM (1985) describes:
Thus, the risk terms are calculated by
manipulating the alliance portfolios used as
the policy indicator through simulation to
locate the best and worse portfolios for any
given nation, where the best and worst are
defined in terms of the sum of expected
utilities of all others vis-à-vis the nation in
question under the assumption that utilities
are strictly a function of similarities in
alliance commitments. (Note: That is,
temporarily applying the expected utility
equations (without risk or uncertainty taken
into account) as developed in The War Trap,
I identify the worst and best case alliance
strategy for each nation each year, using the
original, linear utility functions to define the
ICAART 2011 - 3rd International Conference on Agents and Artificial Intelligence
24
range of possible expected gains or losses for
each nation. These, then, are utilized to
measure risk propensities and thereby to
introduce curvature into the utility functions.)
(p. 167-168)
This implies a process to first determine the
expected utilities of equations (25) using ri=1, then
apply (34) and (33) to estimate ri and lastly apply the
ri estimates to re-estimate the expected utilities of
equations (25).
6 DECISION
6.1 Offer Categories
The expected utilities  ij
i
UE and  ji
j
UE are used
to classify the ‘offers’ between all actor pairs into
categories according to potential outcomes as
illustrated in figure 6. An actor may expect to
conflict, compromise, capitulate, or stalemate with
another. Unfortunately no single publication by
BDM explains how to quantify these.
Confrontation +
j moves to i
Confrontation i
moves to j
i’s Expected
Utility
j’s Expected Utility
Compromise +
j moves part
way to i
Compromise i
moves part
way to j
Compel i
moves to j
CONFLICTCONFLICT
STALEMATESTALEMATE
COMPROMISECOMPROMISE
COMPROMISECOMPROMISE
CAPITULATECAPITULATE
CAPITULATECAPITULATE
Status Quo
i stays put
Compel +
j moves to i
 ij
i
UE
 ji
j
UE
Figure 6: Classifying the outcome of challenges according
to i’s viewpoint.
Conflict. Actors i and j conflict if 0)( ij
i
UE and
0)( ji
j
UE . So “If both i and j believe that they
have the upper hand in the relationship, then conflict
is likely and that conflict has an uncertain outcome.”
BDM (1997, p.244)
BDM (1984, p. 230) labels for the
“Confrontation-” octant, “Challenger Favored” and
for the “Confrontation+” octant “Favoring Focal
Group”. We interpret this to mean i moves to j and j
moves to i respectively, as shown in figure 6.
Compromise. Actor i has the upper hand if
0)( ij
i
UE , 0)( ji
j
UE and )()( ji
j
ij
i
UEUE  .
Actor j has the upper hand if
0)( ij
i
UE , 0)( ji
j
UE and )()( ji
j
ij
i
UEUE  .
BDM (1997) describes:
… both players agree that i has the upper
hand. In this instance, j is expected to be
willing to offer concessions to i, although the
concessions are not likely to be as large as
what i would like. The likely resolution of
their exchange is a compromise reflecting the
weighted average of i’s expectation and j’s.
(p. 243-244)
However, the weighted average is not clear. BDM
(1994) states (presumably with regard to i having the
‘upper hand’):
… the concession is assumed to equal the
distance on Ra between xi and xj multiplied by
the ratio of the absolute value of j’s expected
utility to i’s expected utility. This treats the
compromise as the weighted average of the
perceived enforceability of the demand… (p.
96)
We might interpret this literally as:
 
)(
)(
ˆ
ij
i
ji
j
ji
UE
UE
xxx  (35)
Noting this relates only to the octant labeled
“Compromise +” in figure 6. Considering the
boundary conditions in this octant, if
)()( ji
j
ij
i
UEUE  then 0ˆ x and actor j does not
move from xj and if )()( ji
j
ij
i
UEUE  then
1ˆ x and actor j moves from xj to xi. For the octant
labelled “Compromise -” we use:
 
)(
)(
ˆ
ji
j
ij
i
ji
UE
UE
xxx  (36)
Acquiescence and Stalemate. The states for
acquiescence and stalemate are illustrated in figure 6
and require no further explanation.
6.2 Offer Selection
Given that each actor has chosen who to challenge
and to remain silent for those not to be challenged,
UNRAVELLING BUENO DE MESQUITA'S GROUP DECISION MODEL
25
then each actor will have received ‘challenge offers’
from other actors. How does an actor come to a
decision on it which challenge offer it should
accept? BDM (1997) elucidates:
Each player would like to choose the best
offer made to it and each proposer enforces
its bid to the extent that it can. Those better
able to enforce their wishes than others can
make their proposals stick. Given equally
enforceable proposals, players move the least
that they can. … When the players finish
sorting out their choices among proposals,
each shifts to the position contained in the
proposal it accepted. (p. 251-252)
If all offers are equally enforceable, we would
propose to order these according to an actor’s
preferred choice as follows, so that actor i moves
‘the least that it can’. Thus, summarising the order
of decision choice for actor i is as follows:
1. Actor i conflicts with actor j and actor j (or with
some chance actor i) acquiesces.
2. Actor i compromises to actor j. Actor i loses
some ground.
3. Actor i acquiesces to actor j. Actor i loses most
ground.
4. Actor i stalemate with actor j. Actor i status quo.
Thus, for example if actor i is in conflict with
several other actors, each of which have greater
expected utility than i, then the agent will need to
concede to the one that allows i to move the least.
If all offers are not equally enforceable, then we
might expect an actor to be more likely to concede
to the most powerful actor. Thus, in the prior
example, actor i concedes to the actor with highest
expected utility.
7 RESULTS
BDM (1994) provides an example. The data for this
was introduced in table 1. BDM (1994) provides
three graphs of results. These compare expected
utility for Belgium versus the others, France versus
the others and the Netherlands versus the others.
Figures 7 and 8 compare the result using our
interpretation of the algorithm as given in section 6,
compared directly with the results published in
BDM (1994, p.91). No value for Q was given. We
chose Q=1.0.
Figure 7: Comparison of results for BDM (1994) (top) and
our interpretation (bottom), view from Belgium.
Note that some countries are not shown on
BDM’s graphs. In figure 7, our expected utility
results for Ireland and Greece were (0,0) and in
figure 8, UK and Italy were at (0,0).
As a result of the fact that BDM does not
explicitly plot the point locus of the expected
utilities, we can only reasonably assume the
quadrants where the names are labelled corresponds
to the location of each locus. The correspondence of
our results to this level of accuracy (within a
quadrant) is 100%. We note that if the expected
utilities were derived randomly, the probability of
getting any one of these points located in the correct
quadrant is one in four. In order to get all nine
results in the correct quadrants for any one graph of
the two graphs above would constitute a probability
of (1/4)9
~ 4x 10-6
. We therefore assert that BDM’s
results have effectively been reproduced.
ICAART 2011 - 3rd International Conference on Agents and Artificial Intelligence
26
Figure 8: Comparison of results for BDM (1994) (top) and
our interpretation (bottom), view from France.
We came across an issue with trying to
reproduce the result given by BDM for the
Netherlands as compared in figure 9. We assert that
the result published by BDM was in error. Our result
showing ‘Others’ as  ji
j
UE against  ji
i
UE for
i=Luxembourg is compared with BDM’s quoted
result in figure 10. This shows 100%
correspondence in terms of quadrant accuracy as for
the previous two results.
BDM (1994) summarises the final result:
The dominant outcome would be, as
indicated above, a lag of 8.35 years.
However, if the participants were prepared to
bear the costs of slightly prolonged
negotiations, then the model’s predicted
dominant outcome rises to 9.05 years and
stabilizes at that point. … The actual
resolution was for a delay of 8.833 years.
(p.98)
We found the median voter position at the end of
the first round to be 8.4 years. At the end of the
second, third, fourth and fifth rounds the median
voter position was for each 9.9 years.
Figure 9: Comparison of results for BDM (1994) (top) and
our interpretation (below), view from Netherlands.
8 DISCUSSION
As identified earlier, we chose Q=1.0 to reproduce
the results above. We observed that a value of Q=0.5
produces very different results. The following figure
11 provides an illustrative example.
In figure 11, the positions of Greece and
Belgium change entire quadrants if Q=1 or Q=0.5 is
chosen.
Recalling that Q relates to the probability of a
status quo and is an arbitrary parameter, it is not
desirable for results to be so sensitive.
We examined the applicability of the
interpretation to other examples from later papers.
Despite the fact that BDM (1994) and BDM (1997)
differ only in the detailed example used, it is
UNRAVELLING BUENO DE MESQUITA'S GROUP DECISION MODEL
27
perplexing that we were unable to reproduce the
results from the 1997 paper. Indeed attempts to
apply this algorithm (using either Q=0.5 or Q=1.0)
to the 1997 “Sultan” problem yielded wildly
different results to those published.
Figure 10: Comparison of results for BDM (1994) ‘view
from Netherlands’ and our interpretation, view from
Luxembourg.
Further insight on the evolution of the median
voter position over rounds is also warranted. This is
shown in table 2.
Table 2 shows that the median appears to
stabilise, but then continues to change. Calculation
of the mean voter position provides insight. The
coarseness of the median voter position becomes
evident. Indeed given the fact that the “compromise”
state allows for intermediate valued positions (as per
equation 36) it is surprising that BDM would want to
continue with median over mean values. In general
the results do not stabilise. There is no reason from
examining the algorithm to expect that they should.
Figure 11: Example for Netherlands comparing results
using Q=1.0 (top) and Q=0.5 (bottom).
Table 2: Evolution of the median and mean voter positions
for ‘the date of introduction of emission standards for
medium-sized automobiles’ problem in BDM (1994).
Round 1 2 3 4 5 6 7 8
Median 8.4 9.9 9.9 9.9 9.9 7.4 8.8 9.6
Mean 7.4 7.5 7.6 7.3 7.3 7.4 7.5 7.6
9 CONCLUSIONS
The algorithm outlined (and summarised in the
Appendix) has for the first time exposed and
provided independent means of replicating the
results of BDM’s computational model. This opens
BDM’s model, method and claims to scientific
discussion.
The correctness of the interpretation was
illustrated using the example from BDM (1994). We
note the chance of replicating to this level of
accuracy by random selection would be much less
ICAART 2011 - 3rd International Conference on Agents and Artificial Intelligence
28
than one in one million. This fulfils Bueno De
Mesquita’s own prediction that enough material is
available so that “anyone (may) replicate something
close to his work”!
Concerns with regard to the model’s sensitivity
and convergence have been identified.
Given these concerns, we conclude that adoption
of BDM’s model for agent development would be
premature at this time.
REFERENCES
Black, D. (1948). On the Rationale of Group Decisionmaking.
Political Economy 56, No 1, Feb: 23-34.
Congleton, R. D. (2003). The Median Voter Model. The
Encyclopedia of Public Choice. Kluwer Academic
Press: 382-386.
Bueno De Mesquita, B. B. (1980). An Expected Utility
Theory of International Conflict. The American
Political Science Review, Vol. 74, No. 4 (Dec), pp.
917-931.
Bueno De Mesquita, B. B. (1981). The War Trap. Yale,
CT: Yale University Press.
Bueno De Mesquita, B. B. (1984). Forecasting Policy
Decisions: An Expected Utility Approach to PostKhomeini
Iran, The American Political Science
Review, Vol. 17, No. 2, pp. 226-236.
Bueno De Mesquita, B. B. (1985). The War Trap
Revisited: A Revised Expected Utility Model.
American Political Science Review 79, No 1, 156-177.
Bueno De Mesquita, B.B. and Lalman, D. (1986). Reason
and War, The American Political Science Review, Vol.
80, No. 4, (Dec), pp. 1113-1129.
Bueno De Mesquita, B. B. (1994). Political Forecasting:
An Expected Utility Method, European Community
Decision Making, Yale University Press, Stockman, F.
(ed.), Chapter 4, pp 71-104.
Bueno De Mesquita, B. B. (1997). A Decision Making
Model: Its Structure and Form. International
Interactions 23: No 3-4 235-266.
Bueno De Mesquita, B. B. (2002). Predicting Politics,
Ohio State University Press.
Bueno De Mesquita, B. B. (2009a). A New Model for
Predicting Policy Choices: Preliminary Tests, 50th
Meeting of the International Studies Association, New
York, Feb 15-18.
Bueno De Mesquita, B. B. (2009b). The Predictioneer’s
Game (1st
ed.). New York: Random House.
Kollman, K, Miller, J. H. and Page, S.E. eds (2010)
Computational models in political economy. MIT
Press.
APPENDIX
The following summarises the full procedure.
1. Given i=1,2,…,n actors, initial positions for each actor
xi(t=0), ci, si and number of rounds=.
2. Let ri=1
3. Calculate the pairwise votes:













n
i
jiki
ii
jk
xx
xxxx
scv
1 minmax
Then find the maximum value which corresponds to
the Condorcet winner position or median = .
4. Calculate basic utilities,
ir
jii
si
xx
xx
U











minmax
5.05.042
ir
jii
fi
xx
xx
U











minmax
5.05.042
  ir
jiii
bi
xx
xxx
U











minmax
25.05.042

  ir
jiii
wi
xx
xxx
U











minmax
25.05.042

 iri
sqU 5.042
5. Calculate probabilities:






 n
k
ikjkkk
ifk
ikjkkk
i
xxxxsc
xxxxsc
Pi
1
0arg
)(
)(
6. Let Q=0.5 (or 1.0).
7. Calculate:
      
 i
wi
i
bi
i
sq
i
sij
i
fi
i
i
i
si
i
ijij
i
UTTUQQU
UsUPUPsUE
)1()1(
11


      
 j
wj
j
bj
j
sq
j
sjj
j
fj
j
j
j
sj
j
jjji
j
UTTUQQU
UsUPUPsUE
)1()1(
11


If second pass (used the calculated values of ri) then,
go to step 11.
8. Calculate:








































n
ijj
ji
i
i
n
ijj
ji
i
i
n
ijj
ji
i
i
n
ijj
ji
i
i
n
ijj
ji
i
i
UEUE
UEUEUE
R
,1,1
,1,1,1
)(min)(max
)(min)(max)(2
UNRAVELLING BUENO DE MESQUITA'S GROUP DECISION MODEL
29
9. Calculate:
3/1
3/1
i
i
i
R
R
r



10. Go to step 4, using calculated values of ri.
11. Determine new position decisions x, based on rules in
section 5 for octant of Eij(i) vs Eji(j).
12. Increment the rounds, t=t+1
13. If t= then stop.
ICAART 2011 - 3rd International Conference on Agents and Artificial Intelligence
30