{"id":2028,"date":"2019-10-30T23:23:35","date_gmt":"2019-10-30T23:23:35","guid":{"rendered":"https:\/\/marriott-stats.com\/nigels-blog\/?p=2028"},"modified":"2019-11-04T08:20:57","modified_gmt":"2019-11-04T08:20:57","slug":"rugby-world-cup-who-will-win-in-2019-3","status":"publish","type":"post","link":"https:\/\/marriott-stats.com\/nigels-blog\/rugby-world-cup-who-will-win-in-2019-3\/","title":{"rendered":"Rugby World Cup #3 &#8211; Who will win in 2019? &#8211; Model Evaluation"},"content":{"rendered":"<p>After 43 matches with 37 correctly predicted, the stage is set for an epic final between England &amp; South Africa to settle the 2019 Rugby World Cup (Men&#8217;s).\u00a0 Ahead of making a prediction for that match, I have examined my model in depth and in this post I explore whether or not the model needs to be adjusted.<\/p>\n<p><!--more-->I<a href=\"https:\/\/marriott-stats.com\/nigels-blog\/rugby-world-cup-who-will-win-in-2019\/\" target=\"_blank\" rel=\"noopener noreferrer\">f you&#8217;ve read my earlier posts<\/a>, then you will know that I have used two models for predicting match outcomes:<\/p>\n<ul>\n<li><span style=\"color: #008000\"><strong>HIGHRANK<\/strong><\/span> &#8211; whichever team has the higher rank is expected to win the match.<\/li>\n<li><strong><span style=\"color: #008000\">EXPWIN<\/span><\/strong> &#8211; a linear regression model which uses the difference in ranking points to predict the margin of victory and the probability of winning for the higher ranked team.<\/li>\n<\/ul>\n<p>For 41 matches, both models have given the same prediction.\u00a0 The two matches where they disagreed were the two South Africa games against Japan &amp; Wales in the knockout rounds with each model having one correct &amp; one incorrect prediction.\u00a0 I will evaluate each model separately but I have been using <span style=\"color: #008000\"><strong>EXPWIN<\/strong><\/span> as my main model due to its greater flexibility so I will spend most of the time 
on that.<\/p>\n<h4><span style=\"color: #008000\"><strong>How has HIGHRANK performed?<\/strong><\/span><\/h4>\n<p>The <span style=\"color: #008000\"><strong>HIGHRANK<\/strong><\/span> performance is summarised in the table below and clearly shows that it has performed in line with history.\u00a0 Whilst it might look like the closest games [0,3) have performed worse than history, the smaller sample size for the world cup means such differences can be expected.\u00a0 Indeed if I combine [0,3) &amp; [3,7) then history shows 32 correct predictions (or 71%) out of 45 matches compared to 12 correct predictions (or 75%) out of 16 world cup matches which is no meaningful difference.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\" wp-image-2035 alignnone\" src=\"https:\/\/marriott-stats.com\/nigels-blog\/wp-content\/uploads\/2019\/10\/WC19-14-300x107.png\" alt=\"\" width=\"479\" height=\"171\" srcset=\"https:\/\/marriott-stats.com\/nigels-blog\/wp-content\/uploads\/2019\/10\/WC19-14-300x107.png 300w, https:\/\/marriott-stats.com\/nigels-blog\/wp-content\/uploads\/2019\/10\/WC19-14-450x161.png 450w, https:\/\/marriott-stats.com\/nigels-blog\/wp-content\/uploads\/2019\/10\/WC19-14.png 706w\" sizes=\"auto, (max-width: 479px) 100vw, 479px\" \/><\/p>\n<h4><span style=\"color: #008000\"><strong>How has EXPWIN performed?<\/strong><\/span><\/h4>\n<p>Let&#8217;s remind ourselves of the <span style=\"color: #008000\"><strong>EXPWIN<\/strong><\/span> model formula which was built by linear regression on the entire dataset shown in the<img loading=\"lazy\" decoding=\"async\" class=\"alignright wp-image-1989\" src=\"https:\/\/marriott-stats.com\/nigels-blog\/wp-content\/uploads\/2019\/09\/WC19-4-300x250.png\" alt=\"\" width=\"412\" height=\"343\" srcset=\"https:\/\/marriott-stats.com\/nigels-blog\/wp-content\/uploads\/2019\/09\/WC19-4-300x250.png 300w, https:\/\/marriott-stats.com\/nigels-blog\/wp-content\/uploads\/2019\/09\/WC19-4-419x350.png 419w, 
https:\/\/marriott-stats.com\/nigels-blog\/wp-content\/uploads\/2019\/09\/WC19-4.png 751w\" sizes=\"auto, (max-width: 412px) 100vw, 412px\" \/> chart.<\/p>\n<p style=\"padding-left: 40px\"><strong><span style=\"color: #008000\">Expected Score Gap<\/span> = 1.75 * <span style=\"color: #008000\">Ranking Points Gap<\/span> &#8211; 1.58<\/strong><\/p>\n<p>The Residual Standard Error is 13.7 points, or roughly 2 converted tries, and the R-squared is 0.33.\u00a0 There is no evidence that this model differed between the 5 datasets shown in the chart (three 6 Nations &amp; two Autumn Internationals).\u00a0 The errors are consistent with a normal distribution which means that <span style=\"color: #008000\"><strong>EXPWIN<\/strong><\/span> can also be used to calculate the probability of the stronger ranked team winning or <strong><span style=\"color: #008000\">P(WIN)<\/span><\/strong>.\u00a0 This is the probability that the Score Gap is greater than zero assuming a normal distribution where the mean is the Expected Score Gap using the above formula and the standard deviation is 13.7.<\/p>\n<p>The next chart shows the same scatter plot for the 43 world cup matches so far (pool stages are solid circles, knockouts are hollow diamonds) along with the <span style=\"color: #008000\"><strong>EXPWIN<\/strong><\/span> model as a thick dashed black line.\u00a0 Despite the lack of evidence that there was a difference between the 6 <img loading=\"lazy\" decoding=\"async\" class=\" wp-image-2030 alignleft\" src=\"https:\/\/marriott-stats.com\/nigels-blog\/wp-content\/uploads\/2019\/10\/WC19-11-300x260.png\" alt=\"\" width=\"382\" height=\"331\" srcset=\"https:\/\/marriott-stats.com\/nigels-blog\/wp-content\/uploads\/2019\/10\/WC19-11-300x260.png 300w, https:\/\/marriott-stats.com\/nigels-blog\/wp-content\/uploads\/2019\/10\/WC19-11-404x350.png 404w, https:\/\/marriott-stats.com\/nigels-blog\/wp-content\/uploads\/2019\/10\/WC19-11.png 721w\" sizes=\"auto, (max-width: 382px) 100vw, 382px\" \/>Nations 
&amp; Autumn Internationals, I decided it was still worth showing separate thinner dashed black lines for the <span style=\"color: #008000\"><strong>EXPWIN<\/strong><\/span> model had I based this solely on 6 Nations data or Autumn International data.\u00a0 This is one way of showing how much parameter error (error in the model coefficients) might be expected to occur in the world cup since no model is ever perfect.<\/p>\n<p>The solid red line is the same model formulation as <strong><span style=\"color: #008000\">EXPWIN<\/span><\/strong> but this time using the world cup data alone.\u00a0 Let&#8217;s call this <span style=\"color: #ff0000\"><strong>EXPWINWC<\/strong><\/span> and the formula for this line is as follows:<\/p>\n<p><strong><span style=\"color: #008000\">Expected Score Gap<\/span> = 1.99 * <span style=\"color: #008000\">Ranking Points Gap<\/span> + 0.79<\/strong><\/p>\n<p>The question we have to ask ourselves is: does this constitute evidence that the <strong><span style=\"color: #008000\">EXPWIN<\/span><\/strong> model was wrong?\u00a0 One can answer this using multivariate linear regression to determine if the differences in the model coefficients are statistically significant but I still like to look at the question visually.\u00a0 What I see in this scatter plot is that the slope is more or less the same as <strong><span style=\"color: #008000\">EXPWIN<\/span><\/strong> especially when we focus on close matches with ranking point gaps of less than 7 points.\u00a0 This is the scenario we have for the final where the ranking point gap is only 1.6 points in favour of England.<\/p>\n<p>The issue is the model intercept.\u00a0 It would appear that in this world cup, the stronger teams have been doing slightly better than expected.\u00a0 <img loading=\"lazy\" decoding=\"async\" class=\"alignright wp-image-2036\" src=\"https:\/\/marriott-stats.com\/nigels-blog\/wp-content\/uploads\/2019\/10\/WC19-12-300x284.png\" alt=\"\" width=\"417\" height=\"395\" 
srcset=\"https:\/\/marriott-stats.com\/nigels-blog\/wp-content\/uploads\/2019\/10\/WC19-12-300x284.png 300w, https:\/\/marriott-stats.com\/nigels-blog\/wp-content\/uploads\/2019\/10\/WC19-12-369x350.png 369w, https:\/\/marriott-stats.com\/nigels-blog\/wp-content\/uploads\/2019\/10\/WC19-12.png 723w\" sizes=\"auto, (max-width: 417px) 100vw, 417px\" \/>One way to explore this question is to plot the distribution of the errors and compare this with the expected distribution of errors assuming a normal distribution with mean 0 and standard deviation 13.7.\u00a0 This is what is shown in the chart here.\u00a0 I see two things here.\u00a0 First, the spread of errors is consistent with expectations.\u00a0 Indeed the standard deviation of the observed errors is 15.0 points which is similar to the expected 13.7 points.\u00a0 Second, the observed errors are skewed towards the right instead of being symmetrical thus indicating that model errors have tended to favour the stronger team.<\/p>\n<p>The 95% confidence interval for the mean error of these 43 matches is +0.4 to +9.6 points and the mean error is +5.0 points.\u00a0 In other words, if we decide that this is evidence that the model is underestimating the performance of stronger teams then England&#8217;s expected margin of victory should be 5 points higher than the current expected margin of only 1 point using <span style=\"color: #008000\"><strong>EXPWIN<\/strong><\/span>. 
Of course given that the residual standard error is 14 points this is no great change in the grand scheme of things but for a bookie seeking to price their odds properly, such a small change in the expected margin of victory could be quite significant.\u00a0 <span style=\"color: #008000\"><strong>P(WIN)<\/strong> <\/span>changes from 53% to 67% or in betting odds from 11\/10 on to 2\/1 on which is definitely not small beer!<\/p>\n<p>A natural question to ask is whether the 6 knockout matches behave differently to the pool stages.\u00a0 Alternatively, we might ask whether the errors are correlated with time i.e. were the errors larger in the early games but are now averaging zero?\u00a0 The best way to answer these questions is to use the principles of <strong>Statistical Process Control (SPC)<\/strong> and plot a <strong>Control Chart<\/strong> as below.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\" wp-image-2034 alignnone\" src=\"https:\/\/marriott-stats.com\/nigels-blog\/wp-content\/uploads\/2019\/10\/WC19-13-300x177.png\" alt=\"\" width=\"660\" height=\"389\" srcset=\"https:\/\/marriott-stats.com\/nigels-blog\/wp-content\/uploads\/2019\/10\/WC19-13-300x177.png 300w, https:\/\/marriott-stats.com\/nigels-blog\/wp-content\/uploads\/2019\/10\/WC19-13-768x454.png 768w, https:\/\/marriott-stats.com\/nigels-blog\/wp-content\/uploads\/2019\/10\/WC19-13-450x266.png 450w, https:\/\/marriott-stats.com\/nigels-blog\/wp-content\/uploads\/2019\/10\/WC19-13.png 1004w\" sizes=\"auto, (max-width: 660px) 100vw, 660px\" \/><\/p>\n<p>The black line is a measure of the underlying trend and I can&#8217;t see any great change over the tournament.\u00a0 Errors favouring stronger teams were immediately apparent in the first round of matches and remained largely stable until the 5th round of matches (recall 3 matches were cancelled in the last round due to typhoon Hagibis).\u00a0 However, the knockout matches do seem to be consistent with the pool stages overall.\u00a0 So 
taking everything together it does look like our forecast for the final should be giving more weight to England.<\/p>\n<p>&nbsp;<\/p>\n<h4><span style=\"color: #008000\"><strong>My articles on the 2019 Rugby World Cup<\/strong><\/span><\/h4>\n<ol>\n<li><a href=\"https:\/\/marriott-stats.com\/nigels-blog\/rugby-world-cup-who-will-win-in-2019\/\" target=\"_blank\" rel=\"noopener noreferrer\">Who will win in 2019 &#8211; Initial predictions ahead of Pool stage<\/a><\/li>\n<li><a href=\"https:\/\/marriott-stats.com\/nigels-blog\/rugby-world-cup-who-will-win-in-2019-2\/\" target=\"_blank\" rel=\"noopener noreferrer\">Who will win in 2019 &#8211; Revised predictions ahead of Knockout stage<\/a><\/li>\n<li><a href=\"https:\/\/marriott-stats.com\/nigels-blog\/rugby-world-cup-who-will-win-in-2019-4\/\" target=\"_blank\" rel=\"noopener noreferrer\">Who will win in 2019 &#8211; Final prediction ahead of the Final<\/a><\/li>\n<li><a href=\"https:\/\/marriott-stats.com\/nigels-blog\/rugby-world-cup-who-will-win-in-2019-3\/\" target=\"_blank\" rel=\"noopener noreferrer\">How accurate were my predictions &#8211; written before the Final<\/a><\/li>\n<\/ol>\n","protected":false},"excerpt":{"rendered":"<p>After 43 matches with 37 correctly predicted, the stage is set for an epic final between England &amp; South Africa to settle the 2019 Rugby World Cup (Men&#8217;s).\u00a0 Ahead of making a prediction for that match, I have examined my model in depth and in this post I explore whether or not the model needs 
[&hellip;]<\/p>\n","protected":false},"author":3,"featured_media":2036,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_genesis_hide_title":false,"_genesis_hide_breadcrumbs":false,"_genesis_hide_singular_image":false,"_genesis_hide_footer_widgets":false,"_genesis_custom_body_class":"","_genesis_custom_post_class":"","_genesis_layout":"","footnotes":""},"categories":[6,4],"tags":[14,12,134,15,133],"class_list":["post-2028","post","type-post","status-publish","format-standard","has-post-thumbnail","category-forecasting","category-sport","tag-forecasts","tag-rugby","tag-rwc2019","tag-sport-analytics","tag-world-cup","entry","override"],"_links":{"self":[{"href":"https:\/\/marriott-stats.com\/nigels-blog\/wp-json\/wp\/v2\/posts\/2028","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/marriott-stats.com\/nigels-blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/marriott-stats.com\/nigels-blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/marriott-stats.com\/nigels-blog\/wp-json\/wp\/v2\/users\/3"}],"replies":[{"embeddable":true,"href":"https:\/\/marriott-stats.com\/nigels-blog\/wp-json\/wp\/v2\/comments?post=2028"}],"version-history":[{"count":3,"href":"https:\/\/marriott-stats.com\/nigels-blog\/wp-json\/wp\/v2\/posts\/2028\/revisions"}],"predecessor-version":[{"id":2049,"href":"https:\/\/marriott-stats.com\/nigels-blog\/wp-json\/wp\/v2\/posts\/2028\/revisions\/2049"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/marriott-stats.com\/nigels-blog\/wp-json\/wp\/v2\/media\/2036"}],"wp:attachment":[{"href":"https:\/\/marriott-stats.com\/nigels-blog\/wp-json\/wp\/v2\/media?parent=2028"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/marriott-stats.com\/nigels-blog\/wp-json\/wp\/v2\/categories?post=2028"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/marriott-stats.com\/nigels-blog\/wp-json\/wp\/v2\/tags?post=202
8"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}