{"id":2264,"date":"2019-12-29T23:22:10","date_gmt":"2019-12-29T23:22:10","guid":{"rendered":"https:\/\/marriott-stats.com\/nigels-blog\/?p=2264"},"modified":"2021-01-19T12:53:36","modified_gmt":"2021-01-19T12:53:36","slug":"uk-general-elections-4-how-accurate-are-the-polls-updated-with-ge19","status":"publish","type":"post","link":"https:\/\/marriott-stats.com\/nigels-blog\/uk-general-elections-4-how-accurate-are-the-polls-updated-with-ge19\/","title":{"rendered":"UK General Elections #5 &#8211; How accurate are voting intention polls? &#8211; updated with GE19"},"content":{"rendered":"<p>After 3 general elections with severe polling errors, the UK opinion pollsters redeemed themselves in the <a href=\"https:\/\/marriott-stats.com\/nigels-blog\/tag\/ge2019\/\" target=\"_blank\" rel=\"noopener noreferrer\">2019 UK General Election<\/a> with their most accurate performance since 1955.\u00a0 I base this statement on data provided by <a href=\"http:\/\/www.markpack.org.uk\/opinion-polls\/\" target=\"_blank\" rel=\"noopener noreferrer\">Mark Pack who has systematically recorded every opinion poll published since 1945<\/a>.\u00a0 The challenge now for the industry is to maintain this level of performance for the next election which may be easier said than done given that 5 out of the last 8 elections have experienced a major polling error.<\/p>\n<p><!--more--><\/p>\n<p>My analysis looks at only the polls that took place in the week before every general election between 1950 &amp; 2019.\u00a0 Note this is based on the fieldwork dates, not the publication date which can be a few days later.\u00a0 For the elections in the 1950&#8217;s, Gallup were the only pollster so instead of using the week before, I used the month before.<\/p>\n<p>For each election, I calculated the average vote share recorded across all pollsters for the Conservatives, Labour and Liberal Democrats (Liberals 1950 to 1979, Alliance 1983 &amp; 1987).\u00a0 The polling error for each party can then be 
calculated as the actual election result for Great Britain minus the average vote share from the polls.\u00a0 I use the figures for Great Britain rather than the United Kingdom since almost no polls survey Northern Ireland.<\/p>\n<p>The results can be seen in the chart below.\u00a0 The black line is the number of polling companies that polled in the run-up to the election.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\" wp-image-2266 alignnone\" src=\"https:\/\/marriott-stats.com\/nigels-blog\/wp-content\/uploads\/2019\/12\/PollErrors1-GE19-300x141.png\" alt=\"\" width=\"732\" height=\"344\" srcset=\"https:\/\/marriott-stats.com\/nigels-blog\/wp-content\/uploads\/2019\/12\/PollErrors1-GE19-300x141.png 300w, https:\/\/marriott-stats.com\/nigels-blog\/wp-content\/uploads\/2019\/12\/PollErrors1-GE19-768x361.png 768w, https:\/\/marriott-stats.com\/nigels-blog\/wp-content\/uploads\/2019\/12\/PollErrors1-GE19-450x211.png 450w, https:\/\/marriott-stats.com\/nigels-blog\/wp-content\/uploads\/2019\/12\/PollErrors1-GE19.png 1001w\" sizes=\"auto, (max-width: 732px) 100vw, 732px\" \/><\/p>\n<p>After 3 general elections in a row with major polling errors, the polling industry breathed a sigh of relief in 2019 with all 3 parties predicted to within 2 percentage points.\u00a0 The last time this happened was in 1979.\u00a0 In fact, the pollsters&#8217; performance is better than in 1979 if you measure the RMSE (Root Mean Squared Error) based on the 3 parties.\u00a0 This is calculated by squaring the errors, finding the average of the squared errors and then taking the square root of the average.\u00a0 In 2019, the RMSE was 1.1% which is the lowest seen since 1955 and the second lowest on record.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\" wp-image-2270 alignnone\" src=\"https:\/\/marriott-stats.com\/nigels-blog\/wp-content\/uploads\/2019\/12\/PollErrors4-GE19-300x141.png\" alt=\"\" width=\"730\" height=\"343\" 
srcset=\"https:\/\/marriott-stats.com\/nigels-blog\/wp-content\/uploads\/2019\/12\/PollErrors4-GE19-300x141.png 300w, https:\/\/marriott-stats.com\/nigels-blog\/wp-content\/uploads\/2019\/12\/PollErrors4-GE19-768x361.png 768w, https:\/\/marriott-stats.com\/nigels-blog\/wp-content\/uploads\/2019\/12\/PollErrors4-GE19-450x212.png 450w, https:\/\/marriott-stats.com\/nigels-blog\/wp-content\/uploads\/2019\/12\/PollErrors4-GE19.png 1002w\" sizes=\"auto, (max-width: 730px) 100vw, 730px\" \/><\/p>\n<p>Has the polling industry overcome the issues it has had in the 3 elections prior to 2019?\u00a0 History tells us that an RMSE this low is rare and that larger errors can occur.\u00a0 During the election, I kept pointing out that 5 of the last 7 (now 8) elections saw at least one party experiencing a polling error of at least 4 percentage points, namely &#8211;<\/p>\n<ul>\n<li>1992 &#8211; An underestimate of 5.2% for the Conservatives.<\/li>\n<li>1997 &#8211; An overestimate of 4.0% for Labour.<\/li>\n<li>2010 &#8211; An overestimate of 4.1% for the Liberal Democrats.<\/li>\n<li>2015 &#8211; An underestimate of 4.1% for the Conservatives.<\/li>\n<li>2017 &#8211; An underestimate of 4.5% for Labour.<\/li>\n<\/ul>\n<p>Of the 3 elections that didn&#8217;t see such errors, 2001 still had a 3.5% overestimate for Labour and 1992 also had a 3.9% overestimate for Labour.\u00a0 Prior to 1992, only two elections experienced such large errors: 1970 (Labour overestimated by 4%) and 1951 (Labour underestimated by 4.3%).\u00a0 Such a shift in error rate from 2 out of 12 to 5 out of 8 is statistically significant.\u00a0 So whilst I hope that the industry has made progress, the worst thing they can do is be complacent and assume they have the issues licked.<\/p>\n<p>One of the things that strikes me about the first chart above is how the errors for Labour and the Liberal Democrats are inversely correlated, with a correlation coefficient of -0.61.\u00a0 This makes sense in 
today&#8217;s environment where there is a lot of talk of a progressive alliance between Labour, Lib Dems and the Greens and one can easily imagine a scenario where tactical voting means the polls overestimate Labour and underestimate the Lib Dems.\u00a0 However, it would seem that this has been a dynamic in British elections for a very long period of time.\u00a0 The equivalent correlation between the Conservatives and Lib Dems is only -0.13.<\/p>\n<p>Given this, I have redone the chart by combining the Conservatives &amp; UKIP &amp; Brexit Party into one group and Labour, Lib Dems &amp; Greens into another group.\u00a0 In practice, the UKIP, Brexit Party &amp; Green errors are only known for the last 3 elections as those have been the only elections where pollsters have recorded votes for these parties separately rather than lumping them into Others.\u00a0 So for the most part the chart below is comparing the Conservative poll error with the combined Lab\/LD error.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\" wp-image-2267 alignnone\" src=\"https:\/\/marriott-stats.com\/nigels-blog\/wp-content\/uploads\/2019\/12\/PollErrors2-GE19-300x141.png\" alt=\"\" width=\"732\" height=\"344\" srcset=\"https:\/\/marriott-stats.com\/nigels-blog\/wp-content\/uploads\/2019\/12\/PollErrors2-GE19-300x141.png 300w, https:\/\/marriott-stats.com\/nigels-blog\/wp-content\/uploads\/2019\/12\/PollErrors2-GE19-768x361.png 768w, https:\/\/marriott-stats.com\/nigels-blog\/wp-content\/uploads\/2019\/12\/PollErrors2-GE19-450x212.png 450w, https:\/\/marriott-stats.com\/nigels-blog\/wp-content\/uploads\/2019\/12\/PollErrors2-GE19.png 1001w\" sizes=\"auto, (max-width: 732px) 100vw, 732px\" \/><\/p>\n<p>The lines represent centred 5-election moving averages and currently sit at a 1.1% underestimate for the CON\/UKIP\/BXP error and a 1.3% overestimate for the LAB\/LD\/GRN error.<\/p>\n<p>This revised chart makes things a lot clearer and also emphasises the exceptional nature of 
2017.\u00a0 In 2010, the Lib Dems were badly overestimated but this was partly compensated by an underestimate in the Labour vote.\u00a0 In 2017, the underestimate in the Labour vote was only slightly compensated by an error in the Lib Dem &amp; Green vote.\u00a0 The CON+UKIP vote share was overestimated by 2.4% but this was almost entirely due to the UKIP vote being overestimated since the Conservative vote share was more or less spot on.<\/p>\n<p>In effect, 2017 was a repeat of the 1983 &amp; 1951 elections but on a larger scale.\u00a0 History shows that errors like these favouring Labour are exceptional and the norm has been errors (usually significant errors) that favour the Conservatives.\u00a0 The chart above shows only 6 elections out of 20 with errors favouring the &#8220;progressive alliance&#8221; with 4 of these taking place before 1966.\u00a0\u00a0Conversely, the 7 elections prior to 2017 all had errors favouring the Conservatives.<\/p>\n<p>So far I have been concentrating on the expected vote share for each party or combination of parties.\u00a0 In practice, when it comes to predicting the outcome of\u00a0an election that uses First Past the Post as its election system, the more important prediction is the Conservative lead over Labour.\u00a0 These parties have always been expected to take the top two places nationally so I have calculated the expected lead from the polls and compared it with the actual lead to produce the following chart.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\" wp-image-2265 alignnone\" src=\"https:\/\/marriott-stats.com\/nigels-blog\/wp-content\/uploads\/2019\/12\/PollErrors3-GE19-300x141.png\" alt=\"\" width=\"736\" height=\"346\" srcset=\"https:\/\/marriott-stats.com\/nigels-blog\/wp-content\/uploads\/2019\/12\/PollErrors3-GE19-300x141.png 300w, https:\/\/marriott-stats.com\/nigels-blog\/wp-content\/uploads\/2019\/12\/PollErrors3-GE19-768x360.png 768w, 
https:\/\/marriott-stats.com\/nigels-blog\/wp-content\/uploads\/2019\/12\/PollErrors3-GE19-450x211.png 450w, https:\/\/marriott-stats.com\/nigels-blog\/wp-content\/uploads\/2019\/12\/PollErrors3-GE19.png 1002w\" sizes=\"auto, (max-width: 736px) 100vw, 736px\" \/><\/p>\n<p>We see again how 2017 reflects 1983, though the error is larger than in 1983 but not as large as in 1951.\u00a0 The 5-election centred moving average still shows an underestimate of 1.5% in the Conservative lead over Labour and indeed this appears to have been the long run average since 1964.\u00a0 It is very tempting (and no doubt many people will try) to put reasons on this chart but, treated as a time series in its own right, I have to say that I do not see any apparent explanatory patterns.<\/p>\n<p>If we define a SIGNIFICANT error as being one where the CON-LAB lead is out by at least 3%, then we can make the following observations about the 20 elections since 1950:-<\/p>\n<ol>\n<li>8 out of 20 elections did not experience a significant polling error (1955, 1959, 1964, 1974F, 1979, 1987, 2010, 2019)<\/li>\n<li>8 out of 20 elections experienced a significant polling error favouring the Conservatives (1966, 1970, 1974O, 1992, 1997, 2001, 2005, 2015)<\/li>\n<li>4 out of 20 elections experienced a significant polling error favouring Labour (1950, 1951, 1983 &amp; 2017)<\/li>\n<li>The average polling error (in CON-LAB lead) is +1.5% and the standard deviation is 4.2%.<\/li>\n<li>If our null hypothesis is that the average polling error is 0%, then our t-statistic is +1.26 and the p-value (using 2-tailed t-test) is 23%.<\/li>\n<li>Nothing greatly changes if we confine our analysis to 1974 onwards i.e. 
from when the Northern Ireland parties and the Nationalists arrived on the political scene and the CON+LAB vote share saw a significant shift downwards.<\/li>\n<\/ol>\n<p>I chose 3% as a definition of a significant error as my experience shows that an error on this scale will mislead election forecasters as happened for the most part in 2017.<\/p>\n<p>Ahead of the 2019 election, I stated that <a href=\"https:\/\/marriott-stats.com\/nigels-blog\/uk-general-elections-2-how-accurate-are-the-opinion-poll-updated-with-ge17\/\" target=\"_blank\" rel=\"noopener noreferrer\">I would calculate 3 scenarios as below<\/a>:<\/p>\n<ol>\n<li>A 4% underestimate in the CON-LAB lead as stated by the polls\u00a0which favours the Conservatives.<\/li>\n<li>No error in the CON-LAB lead as stated by the polls.<\/li>\n<li>A 4% overestimate in the CON-LAB lead as stated by the polls which favours Labour.<\/li>\n<\/ol>\n<p>I also stated that I would use a 2:1:1 ratio of these scenarios as my official forecast i.e. I would give scenario 1 50% weight, scenario 2 25% weight and scenario 3 25% weight.\u00a0 This is what <a href=\"https:\/\/marriott-stats.com\/nigels-blog\/uk-general-election-2019-1-my-official-forecast\/\" target=\"_blank\" rel=\"noopener noreferrer\">I ended up doing<\/a> and I see no reason to change this for the next election though based on the numbers listed above, an alternative weighting would be 2:2:1.<\/p>\n<p>&nbsp;<\/p>\n<h4><strong><span style=\"color: #993300;\">&#8212;- Want to read more about polls and polling errors? 
&#8212;<\/span><\/strong><\/h4>\n<ol>\n<li><a href=\"https:\/\/marriott-stats.com\/nigels-blog\/forecasting2-do-election-pollsters-show-forecasting-skill\/\" target=\"_blank\" rel=\"noopener noreferrer\">Do pollsters show forecasting skill?<\/a><\/li>\n<li><a href=\"https:\/\/marriott-stats.com\/nigels-blog\/uk-general-elections-3-who-is-the-most-accurate-pollster\/\" target=\"_blank\" rel=\"noopener noreferrer\">Who is the most accurate pollster?<\/a><\/li>\n<li><a href=\"https:\/\/marriott-stats.com\/nigels-blog\/uk-opinion-poll-tracker-ge2019-final\/\" target=\"_blank\" rel=\"noopener noreferrer\">The final set of polls for GE2019.<\/a><\/li>\n<\/ol>\n","protected":false},"excerpt":{"rendered":"<p>After 3 general elections with severe polling errors, the UK opinion pollsters redeemed themselves in the 2019 UK General Election with their most accurate performance since 1955.\u00a0 I base this statement on data provided by Mark Pack who has systematically recorded every opinion poll published since 1945.\u00a0 The challenge now for the industry is to 
[&hellip;]<\/p>\n","protected":false},"author":3,"featured_media":2265,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_genesis_hide_title":false,"_genesis_hide_breadcrumbs":false,"_genesis_hide_singular_image":false,"_genesis_hide_footer_widgets":false,"_genesis_custom_body_class":"","_genesis_custom_post_class":"","_genesis_layout":"","footnotes":""},"categories":[2,6,3],"tags":[21,19,140,33],"class_list":{"0":"post-2264","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-elections","8":"category-forecasting","9":"category-polling","10":"tag-election-forecasting","11":"tag-elections","12":"tag-ge2019","13":"tag-opinion-polls","14":"entry","15":"override"},"_links":{"self":[{"href":"https:\/\/marriott-stats.com\/nigels-blog\/wp-json\/wp\/v2\/posts\/2264","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/marriott-stats.com\/nigels-blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/marriott-stats.com\/nigels-blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/marriott-stats.com\/nigels-blog\/wp-json\/wp\/v2\/users\/3"}],"replies":[{"embeddable":true,"href":"https:\/\/marriott-stats.com\/nigels-blog\/wp-json\/wp\/v2\/comments?post=2264"}],"version-history":[{"count":5,"href":"https:\/\/marriott-stats.com\/nigels-blog\/wp-json\/wp\/v2\/posts\/2264\/revisions"}],"predecessor-version":[{"id":3524,"href":"https:\/\/marriott-stats.com\/nigels-blog\/wp-json\/wp\/v2\/posts\/2264\/revisions\/3524"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/marriott-stats.com\/nigels-blog\/wp-json\/wp\/v2\/media\/2265"}],"wp:attachment":[{"href":"https:\/\/marriott-stats.com\/nigels-blog\/wp-json\/wp\/v2\/media?parent=2264"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/marriott-stats.com\/nigels-blog\/wp-json\/wp\/v2\/categories?post=2264"},{"taxonomy":"post_tag","embeddable":tru
e,"href":"https:\/\/marriott-stats.com\/nigels-blog\/wp-json\/wp\/v2\/tags?post=2264"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}