{"id":2842,"date":"2020-06-28T13:23:25","date_gmt":"2020-06-28T12:23:25","guid":{"rendered":"https:\/\/marriott-stats.com\/nigels-blog\/?p=2842"},"modified":"2020-06-28T13:23:25","modified_gmt":"2020-06-28T12:23:25","slug":"estimating-excess-deaths-in-england-to-june-26th","status":"publish","type":"post","link":"https:\/\/marriott-stats.com\/nigels-blog\/estimating-excess-deaths-in-england-to-june-26th\/","title":{"rendered":"COVID19 Deaths #2H &#8211; Estimated Excess Deaths in England up to 26th June"},"content":{"rendered":"<p>In many countries across the world, the total effect of the Coronavirus pandemic is now being measured using the concept of <a href=\"https:\/\/medium.com\/@theintersectuk\/in-excess-10dfc0548b87\" target=\"_blank\" rel=\"noopener noreferrer\">Excess Deaths.<\/a>\u00a0 However, publication of such data by the Office for National Statistics for England is up to 2 weeks slower than the daily deaths published by Public Health England.\u00a0 In this post, I update my model, which uses the PHE series to estimate what the ONS will publish for excess deaths in England on Tuesday 30th June.<\/p>\n<p><!--more--><\/p>\n<p>I intend to update this post every week and you can <a href=\"https:\/\/twitter.com\/MarriottNigel\" target=\"_blank\" rel=\"noopener noreferrer\">follow me on Twitter<\/a> to be told when I have made updates.\u00a0 Previous posts are listed below.<\/p>\n<ol>\n<li>Estimates using my first model specification for weeks ending <a href=\"https:\/\/marriott-stats.com\/nigels-blog\/how-many-people-have-died-of-covid19\/\" target=\"_blank\" rel=\"noopener noreferrer\">20th April<\/a>,\u00a0<a href=\"https:\/\/marriott-stats.com\/nigels-blog\/estimating-excess-deaths-in-england\/\" target=\"_blank\" rel=\"noopener noreferrer\">1st May<\/a>,\u00a0<a href=\"https:\/\/marriott-stats.com\/nigels-blog\/estimating-excess-deaths-in-england-to-may-8th\/\" target=\"_blank\" rel=\"noopener noreferrer\">8th May<\/a>,\u00a0<a 
href=\"https:\/\/marriott-stats.com\/nigels-blog\/estimating-excess-deaths-in-england-to-may-15th\/\" target=\"_blank\" rel=\"noopener noreferrer\">15th May<\/a>,\u00a0<a href=\"https:\/\/twitter.com\/MarriottNigel\/status\/1265194829249679360\" target=\"_blank\" rel=\"noopener noreferrer\">22nd May<\/a> (a tweet instead of a blog post)<\/li>\n<li>Estimates using my second model specification (described in this post) for weeks ending\u00a0<a href=\"https:\/\/marriott-stats.com\/nigels-blog\/estimating-excess-deaths-in-england-to-may-29th\/\" target=\"_blank\" rel=\"noopener noreferrer\">29th May<\/a>,\u00a0<a href=\"https:\/\/marriott-stats.com\/nigels-blog\/estimating-excess-deaths-in-england-to-june-5th\/\" target=\"_blank\" rel=\"noopener noreferrer\">5th June<\/a>,\u00a0<a href=\"https:\/\/marriott-stats.com\/nigels-blog\/estimating-excess-deaths-in-england-to-june-12th\/\" target=\"_blank\" rel=\"noopener noreferrer\">12th June<\/a>,\u00a0<a href=\"https:\/\/marriott-stats.com\/nigels-blog\/estimating-excess-deaths-in-england-to-june-19th\/\" target=\"_blank\" rel=\"noopener noreferrer\">19th June<\/a><\/li>\n<\/ol>\n<p>Readers are advised to read these previous posts to familiarise themselves with the methods and terminology used throughout this post.<\/p>\n<h4><strong><span style=\"color: #993300\">Time Series used in this post<\/span><\/strong><\/h4>\n<p>I&#8217;ve used the following 4 time series, each denoted by a 4-letter code.\u00a0 Clicking on a code will take you to the source data.<\/p>\n<ol>\n<li><span style=\"color: #ff0000\"><b><a href=\"https:\/\/coronavirus.data.gov.uk\/downloads\/csv\/coronavirus-deaths_latest.csv\" target=\"_blank\" rel=\"noopener noreferrer\"><span style=\"color: #333399\">PHEr<\/span><\/a> <span style=\"color: #333399\">&#8211; Public Health England COVID19 Registrations &#8211; <\/span><\/b><span style=\"color: #000000\">Daily number of deaths by date of registration with COVID19 on the death certificate and 
confirmed with a positive test in an NHS\/PHE laboratory<\/span><\/span><span style=\"color: #000000\">.\u00a0 Published every day, this is the most common headline figure.\u00a0 The link given here contains a further link to a spreadsheet with the relevant data.<\/span><\/li>\n<li><strong><span style=\"color: #008000\"><a style=\"color: #008000\" href=\"https:\/\/www.ons.gov.uk\/peoplepopulationandcommunity\/birthsdeathsandmarriages\/deaths\/datasets\/weeklyprovisionalfiguresondeathsregisteredinenglandandwales\" target=\"_blank\" rel=\"noopener noreferrer\">ONSr<\/a>\u00a0<\/span><\/strong>&#8211; <span style=\"color: #008000\"><strong>ONS COVID19 Registrations<\/strong><\/span> &#8211; <span style=\"color: #ff0000\"><span style=\"color: #000000\">Daily number of deaths by date of registration with COVID19 on the death certificate from all locations.\u00a0 This is published weekly on a Tuesday but the daily data can be found on the COVID19-ENGLAND tab of the downloaded spreadsheet.<\/span><\/span><\/li>\n<li><span style=\"color: #993300\"><strong><a style=\"color: #993300\" href=\"https:\/\/www.ons.gov.uk\/peoplepopulationandcommunity\/birthsdeathsandmarriages\/deaths\/datasets\/weeklyprovisionalfiguresondeathsregisteredinenglandandwales\" target=\"_blank\" rel=\"noopener noreferrer\">ONSx<\/a>\u00a0<\/strong>&#8211; <strong>ONS Excess Death Registrations<\/strong><\/span> &#8211; <span style=\"color: #ff0000\"><span style=\"color: #000000\">Number of deaths registered from all causes over and above a baseline, defined as the average of 2015 to 2019.\u00a0 This is published weekly on a Tuesday and can be extracted from the WEEKLY DATA tab of the downloaded spreadsheet.\u00a0 I use the day of week pattern of the ONSr series to convert the ONSx weekly data into ONSx daily data.<\/span><\/span><\/li>\n<li><a 
href=\"https:\/\/www.ons.gov.uk\/peoplepopulationandcommunity\/birthsdeathsandmarriages\/deaths\/datasets\/numberofdeathsincarehomesnotifiedtothecarequalitycommissionengland\" target=\"_blank\" rel=\"noopener noreferrer\"><strong><span style=\"color: #800080\">CQCn<\/span><\/strong><\/a> &#8211; <span style=\"color: #800080\"><strong>Care Quality Commission COVID19 Notifications<\/strong> <\/span>\u00a0&#8211; All care homes are required to notify the CQC of any death in their home within a short period.\u00a0 Since the outbreak began, care homes have been able to say if they suspect the death was COVID19 related without a test.\u00a0 The data is passed on to the ONS, which publishes it weekly.<\/li>\n<\/ol>\n<p>I have only extracted data for England from these sources but some also cover Scotland, Wales &amp; Northern Ireland.\u00a0 For more information about these and other COVID19 related time series, <a href=\"https:\/\/marriott-stats.com\/nigels-blog\/covid19-deaths-latest-data-england\/\" target=\"_blank\" rel=\"noopener noreferrer\">please click here<\/a>.<\/p>\n<h4><span style=\"color: #993300\"><strong>My Weekly Estimates &amp; Extrapolations for Excess Deaths in England<\/strong><\/span><\/h4>\n<p>My estimates of excess deaths for the weeks ending 19th &amp; 26th June are shown below along with <a href=\"https:\/\/marriott-stats.com\/nigels-blog\/covid19-deaths-latest-data-england\/\" target=\"_blank\" rel=\"noopener noreferrer\">extrapolations (not estimates) for ONSr which I explain in a separate post (see sections 1 &amp; 4)<\/a>.\u00a0 Two estimates for ONSx are given: ESTPHE, based on PHEr data, and ESTCQC, based on CQCn data. 
Both are described in this post and in posts 6 to 9 above.\u00a0 Please note that the posts in links 1 to 4 above used an older model, which is now retired and is not explained here.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\" wp-image-2853 alignnone\" src=\"https:\/\/marriott-stats.com\/nigels-blog\/wp-content\/uploads\/2020\/06\/ONSx-FC-200626-300x283.png\" alt=\"\" width=\"520\" height=\"491\" srcset=\"https:\/\/marriott-stats.com\/nigels-blog\/wp-content\/uploads\/2020\/06\/ONSx-FC-200626-300x283.png 300w, https:\/\/marriott-stats.com\/nigels-blog\/wp-content\/uploads\/2020\/06\/ONSx-FC-200626-768x725.png 768w, https:\/\/marriott-stats.com\/nigels-blog\/wp-content\/uploads\/2020\/06\/ONSx-FC-200626-371x350.png 371w, https:\/\/marriott-stats.com\/nigels-blog\/wp-content\/uploads\/2020\/06\/ONSx-FC-200626.png 944w\" sizes=\"auto, (max-width: 520px) 100vw, 520px\" \/><\/p>\n<p>There were 599 excess deaths in England in the week ending 12th June, <a href=\"https:\/\/marriott-stats.com\/nigels-blog\/estimating-excess-deaths-in-england-to-june-12th\/\" target=\"_blank\" rel=\"noopener noreferrer\">800 higher than my estimate.<\/a>\u00a0 I have said before that I would rather overestimate than underestimate.\u00a0 The error is within the 95% confidence interval, but I regard this as a serious underestimate.<\/p>\n<h4><span style=\"color: #993300\"><strong>Why write a series of posts on estimating excess deaths?<\/strong><\/span><\/h4>\n<p>I intend this weekly series of posts about estimating excess deaths to be a real-time case study about the difference between <strong>Technical &amp; Fundamental forecasting<\/strong>, a concept that I talk about in more depth in my 1-day training course &#8220;<strong><span style=\"color: #008000\"><em>I<\/em><\/span><\/strong><a href=\"https:\/\/marriott-stats.com\/identifying-trends-in-data-making-forecasts\/\" target=\"_blank\" rel=\"noopener noreferrer\"><span style=\"color: 
#008000\"><strong><em><strong>de<\/strong>ntify trends &amp; make forecasts<\/em><\/strong><\/span><\/a>&#8221;.\u00a0 These are the two avenues open to a forecaster when trying to forecast a quantity Q over a timeline T.<\/p>\n<ol>\n<li>Predict Q(t+i) using the history of Q up to time period t only.\u00a0 This involves identifying the underlying pattern of Q over time and then <strong>extrapolating<\/strong> that pattern into the future.\u00a0 This is sometimes known as <strong>Technical forecasting<\/strong> in financial markets.<\/li>\n<li>Predict Q(t+i) based on its relationship with an input variable X(t+j) (i not necessarily equal to j).\u00a0 This requires statistical <strong>modelling<\/strong> to quantify the relationship between Q &amp; X.\u00a0 X can then be used to predict Q in the future.\u00a0 This is sometimes known as <strong>Fundamental forecasting<\/strong> in financial markets.<\/li>\n<\/ol>\n<p>There is never a right or wrong answer as to which avenue to choose.\u00a0 The advantage of extrapolation is that it only requires the history of Q itself and no other information.\u00a0 The disadvantage is that no insight is gained as to why Q is changing and you have to assume that the historical pattern observed will repeat itself in the future.\u00a0 Modelling, on the other hand, will give you insight and can spot if the pattern of Q is going to change in the future.\u00a0 The difficulty is that you may need to forecast X itself before you can use it to predict Q, which has the effect of shifting uncertainty in Q to uncertainty in X rather than giving you greater accuracy.<\/p>\n<h4><strong><span style=\"color: #993300\">Modelling ONSx as a function of PHEr<\/span><\/strong><\/h4>\n<p><img loading=\"lazy\" decoding=\"async\" class=\" wp-image-2828 alignnone\" src=\"https:\/\/marriott-stats.com\/nigels-blog\/wp-content\/uploads\/2020\/06\/ONS-Excess-2020-200622-300x197.png\" alt=\"\" width=\"673\" height=\"442\" 
srcset=\"https:\/\/marriott-stats.com\/nigels-blog\/wp-content\/uploads\/2020\/06\/ONS-Excess-2020-200622-300x197.png 300w, https:\/\/marriott-stats.com\/nigels-blog\/wp-content\/uploads\/2020\/06\/ONS-Excess-2020-200622-1024x673.png 1024w, https:\/\/marriott-stats.com\/nigels-blog\/wp-content\/uploads\/2020\/06\/ONS-Excess-2020-200622-768x505.png 768w, https:\/\/marriott-stats.com\/nigels-blog\/wp-content\/uploads\/2020\/06\/ONS-Excess-2020-200622-1536x1009.png 1536w, https:\/\/marriott-stats.com\/nigels-blog\/wp-content\/uploads\/2020\/06\/ONS-Excess-2020-200622-450x296.png 450w, https:\/\/marriott-stats.com\/nigels-blog\/wp-content\/uploads\/2020\/06\/ONS-Excess-2020-200622.png 1542w\" sizes=\"auto, (max-width: 673px) 100vw, 673px\" \/><\/p>\n<p>In the case of excess deaths, our output time series Q(t) is ONSx(t) and our input time series X(t) is PHEr(t).\u00a0 Because PHEr is published at least two weeks in advance of ONSx, we do not have the problem of not knowing what PHEr is going to be in the future since we already have the data as shown in the table above.\u00a0 Therefore modelling would appear to be the better option but how good is it?<\/p>\n<p>Since both ONSx and PHEr are based on death registrations, one would expect there to be some relationship in terms of timing.\u00a0 The big difference between the two time series is that PHEr only counts deaths with a positive test for COVID19 undertaken in a PHE\/NHS laboratory whereas ONSx counts all deaths over and above a baseline.<\/p>\n<p>We now have 12 weeks of data with significant excess deaths plus a couple of weeks beforehand when the first COVID19 deaths were recorded.\u00a0 That is enough to start building a model using weekly data only.\u00a0 In this model the output variable I am aiming to predict is not ONSx but ONSm.\u00a0 ONSx is defined as<\/p>\n<p><span style=\"color: #993300\"><strong>ONSx\u00a0 =\u00a0 ONSa\u00a0 &#8211;\u00a0 \u00a0ONSb<\/strong><\/span><\/p>\n<p>where ONSa is the total number of 
deaths from all causes and ONSb is the baseline number of deaths, defined to be the average of 2015 to 2019.\u00a0 My new output variable is<\/p>\n<p><span style=\"color: #993300\"><strong>ONSm\u00a0 \u00a0=\u00a0 \u00a0ONSa\u00a0 \/\u00a0 ONSb<\/strong><\/span><\/p>\n<p>I call ONSm the <strong>Mortality Ratio<\/strong>.\u00a0 The advantage of this is that it makes it easier to predict negative excess deaths, which occur when ONSm is less than 1.\u00a0 It also allows for log transformations of the output variable, which cannot be done with ONSx when it is negative.\u00a0 The log of ONSm is always defined and is equal to log(ONSa) minus log(ONSb).<\/p>\n<p>I have plotted ONSm against both PHEr and CQCn on the same scatter plot here since PHEr and CQCn are similar in scale.\u00a0 ONS week numbers are used <img loading=\"lazy\" decoding=\"async\" class=\"alignright wp-image-2829\" src=\"https:\/\/marriott-stats.com\/nigels-blog\/wp-content\/uploads\/2020\/06\/ONSmPHErCQCf-200622-300x272.png\" alt=\"\" width=\"375\" height=\"340\" srcset=\"https:\/\/marriott-stats.com\/nigels-blog\/wp-content\/uploads\/2020\/06\/ONSmPHErCQCf-200622-300x272.png 300w, https:\/\/marriott-stats.com\/nigels-blog\/wp-content\/uploads\/2020\/06\/ONSmPHErCQCf-200622-768x697.png 768w, https:\/\/marriott-stats.com\/nigels-blog\/wp-content\/uploads\/2020\/06\/ONSmPHErCQCf-200622-385x350.png 385w, https:\/\/marriott-stats.com\/nigels-blog\/wp-content\/uploads\/2020\/06\/ONSmPHErCQCf-200622.png 795w\" sizes=\"auto, (max-width: 375px) 100vw, 375px\" \/>as labels and the most recent week, ending 12th June, is week 24.\u00a0 We are trying to predict ONSm for week 25, where we already know what PHEr (866) &amp; CQCn (197) are from the table at the beginning.\u00a0 The labels with white backgrounds (weeks 15 &amp; 19) had Friday bank holidays (Good Friday &amp; VE Day respectively).\u00a0 I highlight this because PHEr and ONSm are based on death registrations and bank holidays result in reduced staffing levels for compiling the 
data and thus artificially lower death counts.\u00a0 In contrast, I believe the effect of Monday bank holidays is more limited since staff have the rest of the week to catch up.<\/p>\n<p>One new effect is now apparent following the week 23 data.\u00a0 It would appear the slope of the relationship when excess deaths were increasing between weeks 11 &amp; 15 is shallower than the slope for weeks 16 to 24 when excess deaths were falling.\u00a0 I have taken this effect into account to arrive at an estimated mortality ratio of 0.94 for week ending 19th June and 0.89 for week ending 26th June with 95% confidence intervals of +\/- 0.1.\u00a0 This converts into estimates for ONSx of -526 for week ending 19th June and -757 for week ending 26th June with 95% confidence intervals of +\/- 882.\u00a0 These are the numbers appearing in the ESTPHE column in the table shown at the start of this post and, if correct, would mark the end of the first wave of the COVID19 pandemic.<\/p>\n<p><span style=\"color: #993300\"><em>**IMPORTANT &#8211; PHE made a change in the way they <a href=\"https:\/\/marriott-stats.com\/nigels-blog\/covid19-deaths-latest-data-england\/\">record deaths in the week ending 29th May (week 22) as described in this link<\/a>.\u00a0 For the purposes of using the model shown in the chart here, I have included an extra dummy variable from week 22 onwards in my model, which is why this week is highlighted in purple in the chart.\u00a0 My forecasts for weeks 25 &amp; 26 take this effect into account.<\/em><\/span><\/p>\n<p>CQCn data is only available from week 16 (week ending 17th April) and so cannot be incorporated directly into the model above.\u00a0 If I build a separate model for the blue labels on the scatter plot, I get an estimate for ONSm of 1.02 which converts to an estimate for ONSx of +138.\u00a0\u00a0Clearly a CQCn based forecast is very different from a PHEr based forecast and is based on 9 weeks of data as opposed to 14 weeks for PHEr.\u00a0 Despite this, I now think I have 
enough data to start including CQCn based forecasts in my table at the beginning and these are shown in the ESTCQC column.<\/p>\n<p>&nbsp;<\/p>\n<h4><span style=\"color: #993300\"><strong>Comparing Estimated ONSx with Extrapolated ONSx<\/strong><\/span><\/h4>\n<p><a href=\"https:\/\/marriott-stats.com\/nigels-blog\/estimating-excess-deaths-in-england\/\" target=\"_blank\" rel=\"noopener noreferrer\">A few weeks ago, I pointed out the value<\/a> of comparing my modelled (or fundamental) estimate above with <a href=\"https:\/\/marriott-stats.com\/nigels-blog\/covid19-deaths-latest-data-england\/\" target=\"_blank\" rel=\"noopener noreferrer\">an extrapolated (or technical) estimate (see section 6 of this link)<\/a> as a sense check.\u00a0 My extrapolated estimate for week ending 12th June was +760 deaths, which was much closer than my ESTPHE estimate.\u00a0 For week ending 19th June, my extrapolated estimate is +395 deaths, which is higher than both my CQCn (+138) and PHEr (-526) estimates.\u00a0 I have however pointed out before that it is not possible for my ONSx extrapolation model to predict negative excess deaths.\u00a0 This is a known flaw, so we may now be at the level where the extrapolation model will no longer work.<\/p>\n<p>&nbsp;<\/p>\n<h4><span style=\"color: #993300\"><strong>&#8211; More posts about COVID19 &#8211;<\/strong><\/span><\/h4>\n<ol>\n<li>A very useful <a href=\"https:\/\/www.statslife.org.uk\/features\/4474-a-statistician-s-guide-to-coronavirus-numbers\" target=\"_blank\" rel=\"noopener noreferrer\">guide to interpreting statistics of COVID19<\/a> published by the Royal Statistical Society.<\/li>\n<li><a href=\"https:\/\/marriott-stats.com\/nigels-blog\/coronavirus-useful-data-and-links\/\" target=\"_blank\" rel=\"noopener noreferrer\">My collection of links to all kinds of material<\/a> related to the statistics of COVID19, epidemiological modelling and testing.<\/li>\n<li><a 
href=\"https:\/\/marriott-stats.com\/nigels-blog\/be-more-accurate-with-a-smaller-sample-size\/\" target=\"_blank\" rel=\"noopener noreferrer\">How large a sample is needed<\/a> in order to decide whether COVID19 restrictions can be lifted?\u00a0 A lot, lot less than you think!<\/li>\n<li><a href=\"https:\/\/marriott-stats.com\/nigels-blog\/covid19-deaths-latest-data-england\/\" target=\"_blank\" rel=\"noopener noreferrer\">Latest trends and data for COVID19 deaths in England<\/a><\/li>\n<\/ol>\n","protected":false},"excerpt":{"rendered":"<p>In many countries across the world, the total effect of the Coronavirus pandemic is now being measured using the concept of Excess Deaths.\u00a0 However, publication of such data by the Office for National Statistics for England is up to 2 weeks slower than the daily deaths published by Public Health England.\u00a0 In this post, I [&hellip;]<\/p>\n","protected":false},"author":3,"featured_media":2829,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_genesis_hide_title":false,"_genesis_hide_breadcrumbs":false,"_genesis_hide_singular_image":false,"_genesis_hide_footer_widgets":false,"_genesis_custom_body_class":"","_genesis_custom_post_class":"","_genesis_layout":"","footnotes":""},"categories":[6],"tags":[164,163,25,180,179,169,182,168,183],"class_list":{"0":"post-2842","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-forecasting","8":"tag-coronavirus","9":"tag-covid19","10":"tag-forecasting-model","11":"tag-nhs","12":"tag-ons","13":"tag-pandemic","14":"tag-phe","15":"tag-sars-cov-2","16":"tag-trend-extrapolation","17":"entry","18":"override"},"_links":{"self":[{"href":"https:\/\/marriott-stats.com\/nigels-blog\/wp-json\/wp\/v2\/posts\/2842","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/marriott-stats.com\/nigels-blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/marriott-stats.com\/ni
gels-blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/marriott-stats.com\/nigels-blog\/wp-json\/wp\/v2\/users\/3"}],"replies":[{"embeddable":true,"href":"https:\/\/marriott-stats.com\/nigels-blog\/wp-json\/wp\/v2\/comments?post=2842"}],"version-history":[{"count":2,"href":"https:\/\/marriott-stats.com\/nigels-blog\/wp-json\/wp\/v2\/posts\/2842\/revisions"}],"predecessor-version":[{"id":2858,"href":"https:\/\/marriott-stats.com\/nigels-blog\/wp-json\/wp\/v2\/posts\/2842\/revisions\/2858"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/marriott-stats.com\/nigels-blog\/wp-json\/wp\/v2\/media\/2829"}],"wp:attachment":[{"href":"https:\/\/marriott-stats.com\/nigels-blog\/wp-json\/wp\/v2\/media?parent=2842"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/marriott-stats.com\/nigels-blog\/wp-json\/wp\/v2\/categories?post=2842"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/marriott-stats.com\/nigels-blog\/wp-json\/wp\/v2\/tags?post=2842"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}