{"id":395,"date":"2017-05-01T16:38:21","date_gmt":"2017-05-01T15:38:21","guid":{"rendered":"https:\/\/marriott-stats.com\/nigels-blog\/?p=395"},"modified":"2019-02-09T10:45:41","modified_gmt":"2019-02-09T10:45:41","slug":"uk-general-elections-1-how-accurate-are-the-opinion-polls","status":"publish","type":"post","link":"https:\/\/marriott-stats.com\/nigels-blog\/uk-general-elections-1-how-accurate-are-the-opinion-polls\/","title":{"rendered":"UK General Elections #1 &#8211; How accurate are the opinion polls?"},"content":{"rendered":"<p>The French pollsters are congratulating themselves for getting the first round of the French Presidential Election right last weekend.\u00a0 Recently <a href=\"https:\/\/fivethirtyeight.com\/features\/the-u-k-snap-election-is-riskier-than-it-seems\/\" target=\"_blank\" rel=\"noopener noreferrer\">Nate Silver criticised the accuracy of polling in the UK<\/a>.\u00a0 For British pollsters, 2015 was a year they would like to forget and I am sure many of them will be nervous of the forthcoming general election but how nervous should they be?<\/p>\n<p><a href=\"https:\/\/www.youtube.com\/watch?v=b9vl6xrTIc0&amp;feature=youtu.be\" target=\"_blank\" rel=\"noopener noreferrer\">I was one of the few people to predict the polls would be wrong in 2015<\/a>\u00a0though the magnitude of the error was larger than I expected.\u00a0 Two years ago I based my conclusions on an analysis of polls between 1992 &amp; 2015 but for the 2017 election I have analysed a longer time period from 1950 to 2015.\u00a0 I have made use of the excellent work done by <a href=\"http:\/\/www.markpack.org.uk\/opinion-polls\/\" target=\"_blank\" rel=\"noopener noreferrer\">Mark Pack who has systematically recorded every opinion poll published since 1945<\/a>.\u00a0 Based on this, I am expecting the polls to be in error again with the Conservative lead over Labour underestimated by 2.6%.<\/p>\n<p><!--more--><\/p>\n<p>I arrived at this conclusion by identifying from Mark Pack&#8217;s Pollbase every poll that took place in the week leading up every general election between 1950 &amp; 2015.\u00a0 Note this is based on the fieldwork dates, not the publication date which can be a few days later.\u00a0 For the elections in the 1950&#8217;s, Gallup were the only pollster so instead of using the week before, I used the month before.<\/p>\n<p>For each election, I then calculated the average vote share recorded across all pollsters for the Conservatives, Labour and Liberal Democrats (Liberals 1950 to 1979, Alliance 1983 &amp; 1987).\u00a0 The polling error for each party can then be calculated as the actual election result for Great Britain minus the average vote share from the polls.\u00a0 I use the figures for Great Britain rather than the United Kingdom since nearly all polls do not survey in Northern Ireland.<\/p>\n<p>The results can be seen in the chart below.\u00a0 The black line is the number of polling companies that polled in the run up to the election.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\" wp-image-396 alignnone\" src=\"https:\/\/marriott-stats.com\/nigels-blog\/wp-content\/uploads\/2017\/05\/PollErrors1-300x141.png\" alt=\"\" width=\"638\" height=\"300\" srcset=\"https:\/\/marriott-stats.com\/nigels-blog\/wp-content\/uploads\/2017\/05\/PollErrors1-300x141.png 300w, https:\/\/marriott-stats.com\/nigels-blog\/wp-content\/uploads\/2017\/05\/PollErrors1-768x360.png 768w, https:\/\/marriott-stats.com\/nigels-blog\/wp-content\/uploads\/2017\/05\/PollErrors1-450x211.png 450w, https:\/\/marriott-stats.com\/nigels-blog\/wp-content\/uploads\/2017\/05\/PollErrors1.png 1002w\" sizes=\"auto, (max-width: 638px) 100vw, 638px\" \/><\/p>\n<p>The error in 2015 is clearly shown but what is worse is that it was not the worse election for the polls.\u00a0 That dubious honour is held by 1992 with the Conservative underestimated by 5% and Labour overestimated by 4%.\u00a0 What is not well known is that the pollsters continued to get the Labour vote wrong by a similar order of magnitude in 1997 &amp; 2001.\u00a0 Those errors passed unnoticed because the pollsters did get the outcome right i.e. a Labour landslide, but I would contend that the pollsters should have been just as concerned about those years as they were about 1992 &amp; 2015.<\/p>\n<p>What is clear from the chart is that polls have been systematically underestimating the Conservatives and overestimating Labour for quite some time.\u00a0 When Gallup was the only pollster in the 1950s, the situation was the other way around but since new pollsters joined the party in 1959, the error for the Conservatives has varied between a 2.2% overestimate (in 1983) and a 5.2% underestimate (in\u00a01992) with a median underestimate of 1.1%.\u00a0 For Labour, the error ranges from a 4% overestimate (in 1970 &amp; 1997) to a 2.3% underestimate (in 1974 &amp; 2010) with a median overestimate of 2.3%.<\/p>\n<p>One of the things that strikes me about this chart is how the errors for Labour and the Liberal Democrats are inversely correlated with each other with a correlation coefficient of -0.67.\u00a0 This makes sense in today&#8217;s environment where there is a lot of talk of a progressive alliance between Labour, Lib Dems and the Greens and one easily imagine a scenario where tactical voting means the polls overestimate Labour and underestimate the Lib Dems.\u00a0 However it would seem that this has been a dynamic in British elections for a very long period of time.\u00a0 The equivalent correlation between the Conservatives and Lib Dems is only -0.13.<\/p>\n<p>A similar thing may happen in this election with the Conservatives and UKIP so I have redone the chart by combining the Conservatives &amp; UKIP into one group and Labour, Lib Dems &amp; Greens into another group.\u00a0 In practice, the UKIP &amp; Green errors are only known for 2015 as that has been the only election where pollsters have recorded votes for these parties separately rather than putting lumping them into Others.\u00a0 So for the most part the\u00a0chart below\u00a0is comparing the Conservative poll error with the combined Lab\/LD error.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\" wp-image-397 alignnone\" src=\"https:\/\/marriott-stats.com\/nigels-blog\/wp-content\/uploads\/2017\/05\/PollErrors2-300x141.png\" alt=\"\" width=\"642\" height=\"302\" srcset=\"https:\/\/marriott-stats.com\/nigels-blog\/wp-content\/uploads\/2017\/05\/PollErrors2-300x141.png 300w, https:\/\/marriott-stats.com\/nigels-blog\/wp-content\/uploads\/2017\/05\/PollErrors2-768x360.png 768w, https:\/\/marriott-stats.com\/nigels-blog\/wp-content\/uploads\/2017\/05\/PollErrors2-450x211.png 450w, https:\/\/marriott-stats.com\/nigels-blog\/wp-content\/uploads\/2017\/05\/PollErrors2.png 1002w\" sizes=\"auto, (max-width: 642px) 100vw, 642px\" \/><\/p>\n<p>The lines represent centred 5-election moving averages and currently sit at a 1.9% underestimate for the CON\/UKIP error and a 2.4% overestimate for the LAB\/LD\/GRN error.\u00a0 The difference between these two numbers represent a 0.5% underestimate for other parties which always happens as polls can never pick up the votes for the myriad of tiny parties and independents.<\/p>\n<p>This revised chart makes things a lot clearer.\u00a0 For the last 7 elections, the polls have always underestimated the Conservatives (&amp;UKIP) and overestimated the combined Labour\/Lib Dem (&amp; Green) vote.\u00a0 It was this kind of information that led me to predict a poll error in 2015.\u00a0 Since 2015, there has been a lot of work by polling companies to try and avoid a repetition but as 1997 shows, the last time they tried to rectify a major error, they didn&#8217;t actually succeed so why should it be different this time around?<\/p>\n<p>The one point that might bring this streak to an end is that the fact that the 2017 general election bears a lot of similarities with the 1983 general election.\u00a0 1983 is the first election I can remember in detail and the similarities are quite striking namely:<\/p>\n<ul>\n<li>The previous saw a major event that realigned British politics, the Falklands War in 1982 and the EU referendum in 2016.<\/li>\n<li>The incumbent government was Conservative and the opposition Labour.<\/li>\n<li>Labour was led by a leader widely seen as weak and ineffective, Michael Foot in 1983, Jeremy Corbyn in 2017.<\/li>\n<li>The Conservatives were led by a female Prime Minister perceived as strong, Margaret Thatcher in 1983, Theresa May in 2017.<\/li>\n<li>Labour&#8217;s policies were seen as very left wing in both elections whilst the Conservatives were seen as distinctly right wing.<\/li>\n<li>There was a 3rd party insurgency.\u00a0 In 1983 this was the Liberal\/SDP alliance.\u00a0 In 2017, arguably the insurgency incurred in the previous election with UKIP &amp; SNP\u00a0but the combined votes of UKIP, Greens &amp; SNP plus the Lib Dems who are acting as a insurgent party stills adds up to quite a bit.<\/li>\n<li>A Conservative landslide was widely expected with the only question being how large their majority would be.<\/li>\n<\/ul>\n<p>As the chart above shows, 1983 was the last time a significant overestimate in the Conservative vote occurred with their actual vote 2.3% below what the polls were showing.\u00a0 Does that mean with all the similarities between 1983 &amp; 2017 that we will see a similar overestimate.\u00a0 After all, the two major polling errors of 2015 &amp; 1992 both occurred in elections where the parties were thought to be neck &amp; neck with a hung parliament the most likely outcome.\u00a0 Can we make the case that the narrative of the polls has an effect on voter behaviour?<\/p>\n<p>Some people do indeed make such a connection but as a professional statistician, I can&#8217;t subscribe to an interpretation that reeks of &#8220;correlation equals causation&#8221;.\u00a0 For a start I can point out 1987, when another Conservative landslide was expected, the polls got it bang on.\u00a0 Also there is no justification for overemphasising one data point and completely ignoring all the other elections.\u00a0 Saying that, I do intend to produce forecasts assuming a poll error similar to 1983 but such a forecast will merely be one scenario rather than a true forecast.\u00a0 My final forecast is likely to be a weighted average of a number of scenarios including an 1983 one.<\/p>\n<p>In the end, I decided to calculate a weighted average of all errors since 1950 using the number of pollsters in my first chart as my weight.\u00a0 I think one can give less weight to the early years given that there were fewer pollsters and the trend over the last 7 years is quite striking and by using the number of pollsters as my weight, I am using an independently derived weighting scheme rather than a cherry picked one.\u00a0\u00a0 This gives the following figures.<\/p>\n<ul>\n<li>The expected polling error for the Conservatives + UKIP is an underestimate of 1.3%<\/li>\n<li>The expected polling error for Labour + Lib Dems + Greens is an overestimate of 1.8%<\/li>\n<li>The expected polling error for the SNP &amp; Plaid Cymru is zero.<\/li>\n<li>The expected polling error for other minor parties is an underestimate of 0.5%<\/li>\n<li>The probability of the Conservatives + UKIP being OVERESTIMATED is 25%<\/li>\n<li>The probability of Labour + Lib Dems + Greens being OVERESTIMATED is 85%<\/li>\n<\/ul>\n<p>The two probabilities stated are again weighted averages.\u00a0 The Conservatives one appears to be quite high given that no such overestimate has taken place for 7 years but statistically this is a valid calculation using the concept of likelihoods.\u00a0 These probabilities are almost certainly going to be the weights I will use when generating and combining scenarios but I will decide for sure nearer the time.<\/p>\n<p>So far I have been concentrating on the expected vote share for each party.\u00a0 In practice, when it comes to predicting the outcome of the election that uses First Past the Post as its election system, the more important prediction is the Conservative lead over Labour.\u00a0 These parties have always been expected to take the top two places nationally so I have calculated the expected lead from the polls and compared it with the actual lead to produce the following chart.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\" wp-image-398 alignnone\" src=\"https:\/\/marriott-stats.com\/nigels-blog\/wp-content\/uploads\/2017\/05\/PollErrors3-300x141.png\" alt=\"\" width=\"625\" height=\"294\" srcset=\"https:\/\/marriott-stats.com\/nigels-blog\/wp-content\/uploads\/2017\/05\/PollErrors3-300x141.png 300w, https:\/\/marriott-stats.com\/nigels-blog\/wp-content\/uploads\/2017\/05\/PollErrors3-768x360.png 768w, https:\/\/marriott-stats.com\/nigels-blog\/wp-content\/uploads\/2017\/05\/PollErrors3-450x211.png 450w, https:\/\/marriott-stats.com\/nigels-blog\/wp-content\/uploads\/2017\/05\/PollErrors3.png 1002w\" sizes=\"auto, (max-width: 625px) 100vw, 625px\" \/><\/p>\n<p>In 1983, the Conservative lead over Labour was overestimated by 3.4% but otherwise the weighted average of the expected error in the estimate of the Conservative lead is 2.6% and the probability of this being overestimated still 25%.\u00a0 Interestingly, the last time the Conservative lead was overestimated was in 2010.\u00a0 This occurred because although the Conservative vote was underestimated as expected and the combined Lab\/LD vote was overestimated as expected, the Lib Dems enjoyed a major bounce during the election campaign which was not sustained in the ballot box so their votes were significantly overestimated whilst Labour&#8217;s vote was underestimated.\u00a0 2010 was the first time I made an election prediction (CON 277, LAB 225, LD 114, OTH 34) which turned out to be wide of the mark!\u00a0 Whilst I got the Conservative lead over Labour right, 2o10 taught me not to trust the polls and this led to my forecast that there would be an error in 2015.\u00a0 For 2017, the similarity with 1983 is weighing on my mind at present which is why <a href=\"https:\/\/marriott-stats.com\/nigels-blog\/uk-general-election-2017-forecast-1-latest-prediction\/\">my general election forecast currently assumes no polling error <\/a>but nearer the time, I will make a decision on how to incorporate expected polling errors.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>The French pollsters are congratulating themselves for getting the first round of the French Presidential Election right last weekend.\u00a0 Recently Nate Silver criticised the accuracy of polling in the UK.\u00a0 For British pollsters, 2015 was a year they would like to forget and I am sure many of them will be nervous of the forthcoming [&hellip;]<\/p>\n","protected":false},"author":3,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_genesis_hide_title":false,"_genesis_hide_breadcrumbs":false,"_genesis_hide_singular_image":false,"_genesis_hide_footer_widgets":false,"_genesis_custom_body_class":"","_genesis_custom_post_class":"","_genesis_layout":"","footnotes":""},"categories":[90],"tags":[21,19,33],"class_list":{"0":"post-395","1":"post","2":"type-post","3":"status-publish","4":"format-standard","6":"category-archive","7":"tag-election-forecasting","8":"tag-elections","9":"tag-opinion-polls","10":"entry","11":"override"},"_links":{"self":[{"href":"https:\/\/marriott-stats.com\/nigels-blog\/wp-json\/wp\/v2\/posts\/395","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/marriott-stats.com\/nigels-blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/marriott-stats.com\/nigels-blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/marriott-stats.com\/nigels-blog\/wp-json\/wp\/v2\/users\/3"}],"replies":[{"embeddable":true,"href":"https:\/\/marriott-stats.com\/nigels-blog\/wp-json\/wp\/v2\/comments?post=395"}],"version-history":[{"count":3,"href":"https:\/\/marriott-stats.com\/nigels-blog\/wp-json\/wp\/v2\/posts\/395\/revisions"}],"predecessor-version":[{"id":401,"href":"https:\/\/marriott-stats.com\/nigels-blog\/wp-json\/wp\/v2\/posts\/395\/revisions\/401"}],"wp:attachment":[{"href":"https:\/\/marriott-stats.com\/nigels-blog\/wp-json\/wp\/v2\/media?parent=395"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/marriott-stats.com\/nigels-blog\/wp-json\/wp\/v2\/categories?post=395"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/marriott-stats.com\/nigels-blog\/wp-json\/wp\/v2\/tags?post=395"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}