Estimating mass sentiment from noisy, biased signals:
recent Australian elections and the Voice referendum

25 August 2023

Polls are like 1940s radar

  • noisy sensor (sampling error)
  • likely a biased sensor (“house effects”)
  • snapshots of dynamic target (discrete field period)
  • target’s law of motion is unknown (not ballistic)
  • limited resolution (coarse reporting of published polls)
  • dependencies among multiple targets (vote shares sum to 100%)

“House” effects: biases specific to a polling company

  • sampling methodology (e.g., RDD, landline/mobile mix; quotas from web panel)
  • weighting procedures and selection of weighting variables (post-stratification via raking; propensity score matching)
  • survey mode (live interviewer, IVR, web self-complete)
  • question wording
  • response options (are minor parties or DK offered, or only recorded if volunteered? are DKs pushed?)
  • field operations (time of day, day of week)
  • reporting conventions (DKs reported or not)
  • compounded in low or uncertain voter turnout environments

Goals of model

  • combine information from multiple noisy/biased signals
  • recover trajectory, learn about campaigns and changes in public opinion
  • learn about pollster biases
  • forecasts for outcomes

Model for poll averaging: setup & notation, scalar target

  • Let \(t\) index campaign days.
  • Poll \(p\), fielded on day \(t\) by polling company \(j\), yields an estimated voting intention: a proportion \(\color{cyan}{y_p} \in [0,1]\), with sample size \(\color{cyan}{n_p}\). The variance of this estimate is approximately \(\color{cyan}{V_p = y_p (1-y_p)/n_p}\).
  • True, latent voting intentions on day \(t\) are \(\color{orange}{\xi_t} \in [0,1]\). These are observed exactly on election days, at \(t = 1\) and \(t = T\): \(\color{orange}{\xi_1}\) and \(\color{orange}{\xi_T}\), respectively.
  • Polling company \(j\) has a time-invariant “house effect” \(\color{orange}{\delta_j}\), such that \(E(\color{cyan}{y_p}) = \color{orange}{\xi_{t(p)}} + \color{orange}{\delta_{j(p)}}\).
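For concreteness, the variance approximation above can be checked numerically (the poll figures here are illustrative, not from any real poll):

```python
# Approximate sampling variance of a published poll estimate:
# V_p = y_p * (1 - y_p) / n_p
def poll_variance(y_p: float, n_p: int) -> float:
    """Binomial approximation to the variance of a poll proportion."""
    return y_p * (1.0 - y_p) / n_p

# Illustrative poll: 52% voting intention from a sample of 1,000
y_p, n_p = 0.52, 1000
V_p = poll_variance(y_p, n_p)
se = V_p ** 0.5
print(f"variance = {V_p:.6f}, standard error = {se:.4f}")
```

The standard error of roughly 0.016 is the familiar "plus or minus 1.6 points" margin attached to a poll of this size.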

State-space model for poll averaging: locally constant latent state

  • Measurement model: \(\color{cyan}{y_{p}} \sim N(\color{orange}{\xi_{t(p)}} + \color{orange}{\delta_{j(p)}} \, , \, \color{cyan}{V_p})\)

  • Dynamic model: \(\color{orange}{\xi_t} \sim N(\color{orange}{\xi_{t-1}}, \color{orange}{\omega^2})\)

  • Given published polls, \(\color{cyan}{\boldsymbol{Y}}\), sample sizes, field dates and identity of polling companies — and the model — we seek

  1. trajectory of latent voting intentions \(\color{orange}{\boldsymbol{\xi}} = (\color{orange}{\xi_1}, \ldots, \color{orange}{\xi_T})'\)
  2. house effects: \(\color{orange}{\boldsymbol{\delta}} = (\color{orange}{\delta_1}, \ldots, \color{orange}{\delta_J})'\)
  3. “pace of change” parameter (innovation variance), \(\color{orange}{\omega^2}\).
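A minimal Python sketch of this generative model, with made-up parameter values (the actual estimation uses the R/Stan tooling discussed next):

```python
import numpy as np

rng = np.random.default_rng(1)

T, J = 100, 3                 # campaign days, polling companies
omega = 0.002                 # daily innovation s.d. ("pace of change")
delta = np.array([0.01, -0.005, 0.0])   # house effects, one per pollster

# Dynamic model: xi_t ~ N(xi_{t-1}, omega^2), starting at 0.50
xi = 0.50 + np.cumsum(rng.normal(0, omega, T))

# Measurement model: y_p ~ N(xi_{t(p)} + delta_{j(p)}, V_p)
polls = []
for _ in range(30):
    t = rng.integers(0, T)          # field day
    j = rng.integers(0, J)          # pollster
    n = rng.choice([800, 1000, 1500])
    mu = xi[t] + delta[j]
    V = mu * (1 - mu) / n
    y = rng.normal(mu, np.sqrt(V))
    polls.append((t, j, n, y))
```

Each simulated poll carries exactly the metadata the model conditions on: field day, pollster identity, sample size, and the published proportion.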

Estimation and inference

  • Gaussian law of motion: Kalman filter.
  • in Bayesian statistics: dynamic linear model (West & Harrison).
  • house effects and partially observed polling data make the model slightly non-standard for off-the-shelf Kalman filtering (many packages in R)
  • EM or MCMC via R and C/C++
  • JAGS via rjags (Plummer 2019).
  • Stan via RStan (Stan Development Team 2020)
  • nimble (de Valpine et al. 2017)
  • pomp (King et al. 2016)
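For intuition, a hand-rolled local-level Kalman filter in Python; this is a sketch that treats house effects as known and subtracts them from each poll, not the production implementations listed above:

```python
import numpy as np

def kalman_filter(polls, T, delta, omega2, m0=0.5, C0=0.01):
    """Filtered means/variances of xi_t for the local-level model.

    polls: list of (t, j, n, y) tuples; delta: known house effects.
    Days with no poll are pure prediction steps.
    """
    by_day = {}
    for t, j, n, y in polls:
        by_day.setdefault(t, []).append((j, n, y))

    m, C = m0, C0
    means, variances = [], []
    for t in range(T):
        # predict: xi_t | y_{1:t-1} ~ N(m, C + omega2)
        C = C + omega2
        # update with each poll fielded on day t
        for j, n, y in by_day.get(t, []):
            V = y * (1 - y) / n               # measurement variance
            K = C / (C + V)                   # Kalman gain
            m = m + K * ((y - delta[j]) - m)  # bias-corrected innovation
            C = (1 - K) * C
        means.append(m)
        variances.append(C)
    return np.array(means), np.array(variances)

# toy usage: two polls from a single unbiased pollster
polls = [(2, 0, 1000, 0.52), (5, 0, 1000, 0.54)]
m, C = kalman_filter(polls, T=7, delta=[0.0], omega2=1e-5)
```

In the full model the \(\delta_j\) are unknown, which is why EM or MCMC is needed rather than a single filtering pass.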

Elaborations

  • optionally, use endpoint constraints from election results: \(\color{orange}{\xi_1}\) and (ex post) \(\color{orange}{\xi_T}\) are observed exactly.

  • Augment model with unknown step/discontinuities \(\color{orange}{\gamma}\) in \(\color{orange}{\boldsymbol{\xi}}\) trajectory, for known “event” days (e.g., leadership changes).

  • add trend component to model: \[ \begin{aligned} \begin{pmatrix} \xi_{t+1} \\ \zeta_{t+1} \end{pmatrix} & = \begin{bmatrix} 1 & 1 \\ 0 & 1 \end{bmatrix} \begin{pmatrix} \xi_t \\ \zeta_t \end{pmatrix} + \begin{pmatrix} v_t \\ w_t \end{pmatrix} \\[12pt] v_t & \sim N(0, \sigma^2_v) \\ w_t & \sim N(0, \sigma^2_w) \end{aligned} \]
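The trend component can be simulated directly from this state equation (innovation variances here are illustrative):

```python
import numpy as np

rng = np.random.default_rng(0)
T = 60
F = np.array([[1.0, 1.0],
              [0.0, 1.0]])         # local linear trend transition matrix
sigma_v, sigma_w = 0.002, 0.0001   # level and slope innovation s.d.

state = np.array([0.50, 0.0])      # (xi_0, zeta_0): level and slope
levels = []
for _ in range(T):
    noise = np.array([rng.normal(0, sigma_v), rng.normal(0, sigma_w)])
    state = F @ state + noise      # (xi_{t+1}, zeta_{t+1})
    levels.append(state[0])
```

The slope \(\zeta_t\) lets the latent voting intention drift persistently in one direction, rather than only via the level's random walk.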

Elaborations, continued

  • volatility regimes: “campaign period” vs bulk of the electoral cycle.

  • multivariate targets: US presidential politics, tracking 11 “battleground” states.

    • reasonable number of state-specific polls
    • interesting and subtle choices about covariance structure of latent state vector

Identification of model parameters

  • as initially presented, the model is over-parameterised
  • \(E(\color{cyan}{y_p}) = \color{orange}{\xi_{t(p)}} + \color{orange}{\delta_{j(p)}}\).
  • Invariant to translation: indistinguishable from \(E(\color{cyan}{y_p}) = [\color{orange}{\xi_{t(p)}} + \color{red}{c}] + [\color{orange}{\delta_{j(p)}} - \color{red}{c}], \quad \forall\ \color{red}{c} \neq 0\).
  • Post-election, end-point constraints: anchor \(\xi_T\) to known election result, and/or \(\xi_1\) to past election result as may be appropriate.
  • “Sum-to-zero” normalisation of house effects \(\color{orange}{\delta}\); i.e., set \(\color{red}{c} = \color{orange}{\bar{\delta}}\), such that \(\xi_t\) are identified up to a translation equal to the average bias of all pollsters.
  • With \(\color{orange}{\xi_1}\) or \(\color{orange}{\xi_T}\) known we pin down the \(\{ \color{orange}{\mathbf{\xi}} \}\) trajectory and can relax normalising restriction on house effects, revealing absolute (vs relative) pollster biases.
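Both the translation invariance and the sum-to-zero fix are easy to verify numerically (all values made up):

```python
import numpy as np

xi = np.array([0.50, 0.51, 0.52])      # latent path (days 0..2)
delta = np.array([0.01, -0.02, 0.03])  # house effects (pollsters 0..2)

# expectation of a poll by pollster j fielded on day t
def poll_mean(xi, delta, t, j):
    return xi[t] + delta[j]

# shifting xi by c and delta by -c leaves every expectation unchanged
c = 0.015
xi_shift, delta_shift = xi + c, delta - c
assert np.allclose(poll_mean(xi, delta, 1, 2),
                   poll_mean(xi_shift, delta_shift, 1, 2))

# sum-to-zero normalisation: absorb the average bias into the latent path
c_bar = delta.mean()
xi_id, delta_id = xi + c_bar, delta - c_bar
```

After the normalisation the \(\delta_j\) sum to zero, so they are interpretable only as biases relative to the industry average, not absolute biases.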

Examples

  • Australian federal elections 2007-2022

  • Pollster biases

  • Voice referendum

Example, 2019 Australian federal election

  • Widely considered one of the biggest “misses” for the polling industry.

  • prompted an international review and formation of the Australian Polling Council

2019 miss on 2PP was large

Example, 2019 Australian federal election

Election Day error of poll averages, 2007 to 2022

Positive/negative errors = polls are over/under estimates of party support (percentage points).

Year      ALP    GRN    LNP   LNP 2PP    OTH
2007     1.74   0.38  -1.00    -1.58  -1.37
2010     0.64   1.26  -1.48    -1.75  -0.27
2013     0.56   1.52  -1.19    -0.69  -0.77
2016    -1.56   1.29   0.06     0.59  -0.06
2019     1.60   0.10  -3.57    -2.46   1.97
2022     3.18  -0.30  -0.36    -1.18      -
Average  1.03   0.71  -1.26    -1.18  -0.10
MAE      1.58   0.82   1.10     1.38   0.77

Errors in poll averages, last 60 days of election campaigns 2007-2022

Correcting for the known 2007-2019 error in poll averages improves 2022 performance for ALP & 2PP poll averages

Corrected poll averages suggest LNP usually out-campaigns Labor

Voice referendum

  • 36 polls

  • 8 pollsters

    • two of them contributing just one poll each
    • two of them contributing just two polls each
    • Resolve 11 polls, Essential 9 polls, YouGov 7 polls.
  • for simplicity, we compute \(y\) = Yes/(Yes + No)

  • post-2019, many pollsters report effective sample sizes
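The Yes-share calculation is a simple renormalisation over decided respondents (raw figures here are illustrative, not from any published Voice poll):

```python
# illustrative raw poll figures (percent): Yes 43, No 47, undecided 10
yes, no = 43.0, 47.0
y = yes / (yes + no)   # Yes share of decided respondents
print(round(y, 3))     # 0.478
```

Dropping the undecideds this way sidesteps pollster-specific conventions about whether and how DKs are reported.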

“Yes” has shed nearly 20 percentage points in 12 months

Voice, Yes trend, weekly ∆

Voice, pollster biases

Summary

  • model can encompass different electoral & campaign settings

  • interesting professional journey from poll-averaging to election forecasting

  • in Australian context, discoveries include:

    • poll biases stubbornly persistent: a tendency to underestimate Coalition support and overestimate Labor's

    • after ex post correction for this bias, a persistent trend to the Coalition over the closing months of Australian election cycles.

  • biases in election poll averages (interpreted as election forecasts) suggest caution in extrapolating from the Voice poll average

    • little to no experience with how referendum poll averages translate into referendum results

Thank you