election2004.nb

The TradeSports data

In[153]:=

The table itself is in this node

Analysis

The two big assumptions here are that the outcomes for the various "states" (we think of D.C. as a state) are independent Bernoulli trials, and that the TradeSports quotes are reasonable proxies for the means of these trials. Neither is a particularly believable assumption, but it's the best I can do. Also, no attempt is made to account for non-winner-takes-all outcomes in ME, NE, or CO, nor for the vagaries of faithless electors.

Total EVs up for grabs

In[316]:=

Out[316]=

Dynamic program for the PDF over EVs

In[317]:=

$dp[0] := PadRight[{1}, evs + 1] dp[n_] := With[{ev = table[[n, 6]], prob = Rationalize[table[[ ... ]], 0]/100, prev = dp[n - 1]}, prob * PadLeft[prev, evs + 1, 0, -ev] + (1 - prob) * prev]$

The PDF and CDF

In[319]:=

In[320]:=

The mean Bush-advantage

In[321]:=

Out[321]=

Other parameters of the distribution. Probability of Bush loss/tie/win is 41.7%, 1.5%, 56.8%

In[322]:=

Out[322]=

In[66]:=

Log plot of the PDF

In[323]:=

[Graphics:HTMLFiles/election2004_20.gif]

Out[323]=

The binomial distribution with the same mean, for comparison

In[117]:=

In[324]:=

Out[324]=

In[325]:=

[Graphics:HTMLFiles/election2004_26.gif]

Out[325]=

Plot of the PDF, with the binomial distribution for comparison

In[326]:=

In[327]:=

In[328]:=

[Graphics:HTMLFiles/election2004_31.gif]

Plot of the CDF

In[329]:=

cumpts = Table[{Which[i<0, Blue, i0, Green, i>0, Red], Line[{{i, cumdist[[i + evs/2 + 1]] - dist[[i + evs/2 + 1]]}, {i, cumdist[[i + evs/2 + 1]]}}]}, {i, -evs/2, evs/2}] ;

In[330]:=

In[331]:=

[Graphics:HTMLFiles/election2004_35.gif]

Quintiles

In[332]:=

$RowBox[{RowBox[{Table, [, RowBox[{RowBox[{Count, [, RowBox[{cumdist, ,, RowBox[{x_, /;, RowBox[{x, <, RowBox[{i, *, 0.2}]}]}]}], ]}], ,, {i, 4}}], ]}], -, evs/2}]$

Out[332]=

Hmm... the CDF is shockingly linear between 20% and 80%. Is there a simple explanation for that?

The following revised dynamic program computes a PDF for a given subset of the states.

In[333]:=

The PDF of the lose/tie/win variable for a given subset of the states with a given bias

In[335]:=

$ltw[l_, bias_] := With[{pdf = pdp[l]},  {Total[Take[pdf, bias]], pdf[[bias + 1]], Total[Drop[pdf, bias + 1]]}]$

The entropy of a PDF, in bits

In[336]:=

$(* ent[l_] := Module[{x}, Total[Map[Limit[x * Log[1/2, x], x#] &, l]]] *)$

The entropy of the lose/tie/win variable for a given subset of the states with a given bias

In[337]:=

In[338]:=

Out[338]=

The entropy of the election

In[339]:=

Out[339]=

The conditional entropy over the states l given the result for state n

In[340]:=

$RowBox[{condent[n_, l_, bias_], :=, RowBox[{With, [, RowBox[{RowBox[{{, RowBox[{l2 = Complemen ... ent[l2, bias2], +, RowBox[{RowBox[{(, RowBox[{1., -, prob}], )}], *, ltwent[l2, bias]}]}]}], ]}]}]$

The conditional entropies for the various states, as percentages of the total entropy

In[341]:=

Out[341]=

RowBox[{{, RowBox[{0.00113566, ,, 0.0000797879, ,, 0.00508159, ,, 0.00238467, ,, 0.0213238, ,, ... 98, ,, 0.000119589, ,, 0.00850346, ,, 0.00403897, ,, 0.001484, ,, 0.0136118, ,, 0.000125142}], }}]

The states sorted by the conditional entropies

In[342]:=

Out[342]=

RowBox[{{, RowBox[{RowBox[{{, RowBox[{FLORIDA, ,, 0.126925}], }}], ,, RowBox[{{, RowBox[{OHIO, ... {, RowBox[{ALASKA, ,, 0.0000797879}], }}], ,, RowBox[{{, RowBox[{DC, ,, 0.0000355594}], }}]}], }}]

The upper levels of the decision tree of the lose/tie/win variable

In[343]:=

$dtree[0, l_, bias_] := ltw[l, bias] dtree[n_, l_, bias_] := With[{cents = Map[condent[#, l, bi ... s], dtree[n - 1, Complement[l, {l[[order[[1]]]]}], bias - table[[l[[order[[1]]]], 6]]]}]]$

In[345]:=

Out[345]=

RowBox[{{, RowBox[{1.08083, ,, RowBox[{{, RowBox[{0.417066, ,, 0.0150389, ,, 0.567895}], }}], ... ], ,, RowBox[{{, RowBox[{0.00960385, ,, 0.00157518, ,, 0.988821}], }}]}], }}]}], }}]}], }}]}], }}]

Created by Mathematica (November 1, 2004)