{"id":3341,"date":"2020-10-11T15:31:43","date_gmt":"2020-10-11T14:31:43","guid":{"rendered":"https:\/\/marriott-stats.com\/nigels-blog\/?p=3341"},"modified":"2026-04-19T14:11:31","modified_gmt":"2026-04-19T13:11:31","slug":"data-driven-decision-making-statistical-guidance-for-contaminated-land-surveys","status":"publish","type":"post","link":"https:\/\/marriott-stats.com\/nigels-blog\/data-driven-decision-making-statistical-guidance-for-contaminated-land-surveys\/","title":{"rendered":"Data Driven Decision Making #1 &#8211; New Statistical Guidance for Land Contamination Surveys"},"content":{"rendered":"<p>On 30th September 2020, <span style=\"color: #008000;\"><a style=\"color: #008000;\" href=\"https:\/\/www.claire.co.uk\/\" target=\"_blank\" rel=\"noopener noreferrer\"><strong>CL:AIRE<\/strong><\/a><\/span> (the industry body for the land contamination &amp; remediation sector) published new professional guidance for &#8220;<span style=\"color: #993300;\"><em><strong><a style=\"color: #993300;\" href=\"https:\/\/claire.co.uk\/home\/news-new\/new-cl-aire-publication-guidance-on-applying-statistics-to-land-contamination-decision-making.html?highlight=WyJjb21wYXJpbmciLCJzb2lsIl0=\" target=\"_blank\" rel=\"noopener noreferrer\">Comparing Soil Contamination Data with a Critical Concentration<\/a><\/strong><\/em><\/span>&#8220;.\u00a0 The 46-page document advises how to use statistics when assessing land contamination and deciding whether it is safe for development.\u00a0 I was the lead author of the guidance and I spent 4 years working with CL:AIRE&#8217;s steering committee on what the guidance should cover.\u00a0 The 4 years were bookended by statement &amp; editorial published by the <span style=\"color: #008000;\"><a style=\"color: #008000;\" href=\"https:\/\/www.amstat.org\/\" target=\"_blank\" rel=\"noopener noreferrer\"><strong>ASA<\/strong> (American Statistical Association<\/a><\/span>) on the use &amp; misuse of P-Values in <a href=\"https:\/\/www.amstat.org\/\/asa\/files\/pdfs\/P-ValueStatement.pdf\" target=\"_blank\" rel=\"noopener noreferrer\">2016<\/a> &amp; <a href=\"https:\/\/www.tandfonline.com\/doi\/full\/10.1080\/00031305.2019.1583913\" target=\"_blank\" rel=\"noopener noreferrer\">2019<\/a> respectively and in writing this guidance I felt was I an ambassador for turning those into something that could used by non-statisticians to make real life decisions that have an impact on us all.<\/p>\n<p><!--more--><\/p>\n<h5><span style=\"color: #993300;\"><strong><a style=\"color: #993300;\" href=\"https:\/\/claire.co.uk\/phoca-download\/1-publications-library\/9-other-cl-aire-documents.html?download=745:2020-stats-guidance\" target=\"_blank\" rel=\"noopener noreferrer\">Click here to download the full CL:AIRE guidance document as a PDF<\/a><\/strong>.\u00a0<\/span><\/h5>\n<p>&nbsp;<\/p>\n<h5><strong><span style=\"color: #008000;\">My presentations &amp; webinars about the new guidance<\/span><\/strong><\/h5>\n<ol>\n<li><a href=\"https:\/\/marriott-stats.com\/nigels-blog\/wp-content\/uploads\/2020\/10\/MSC-SILC-Annual-Forum-Presentation-March-2020.pdf\" target=\"_blank\" rel=\"noopener noreferrer\">Presentation to the SILC conference on 8th March 2020<\/a> with the subtitle &#8220;<span style=\"color: #993300;\"><em>What&#8217;s changed in the guidance and why<\/em><\/span>&#8220;<\/li>\n<li><a href=\"https:\/\/www.gotostage.com\/channel\/cd149a82410a45fba1ff796da9301533\/recording\/343ad0dc662245b6902366279a2c1aed\/watch?source=CHANNEL\" target=\"_blank\" rel=\"noopener noreferrer\">Presentation to the SOBRA virtual conference on 2nd December 2020<\/a>.\u00a0 The link takes you to the whole conference and I am the first speaker about 10 mins in.\u00a0 My talk lasts about 40 mins.\u00a0 My thanks to the Society of Brownfield Risk Assessment and the other presenters for allowing me to share this link.<\/li>\n<li><a href=\"https:\/\/www.youtube.com\/watch?v=Ftohl87AF1c&amp;feature=youtu.be\" target=\"_blank\" rel=\"noopener noreferrer\">Presentation to the SCLF AGM on 1oth December 2020<\/a> &#8211; This is a longer presentation than what was presented to SOBRA and starts 2m30s in.\u00a0 My thanks to the Scottish Contaminated Land Forum for allowing me to share this Youtube link.<\/li>\n<li><span style=\"color: #993300;\"><strong><a style=\"color: #993300;\" href=\"https:\/\/www.youtube.com\/watch?v=Uc5fBahnjhU\" target=\"_blank\" rel=\"noopener\"><em>&#8220;Dr Groundlove &#8211; or how I learned to stop worrying and love the Central Limit Theorem&#8221;<\/em><\/a>, <\/strong><span style=\"color: #333333;\">a Royal Statistical Society webinar<\/span><\/span><span style=\"color: #333333;\">\u00a0o<\/span>n 1st March 2021 &#8211; I was one of 3 speakers as listed below.\n<ul>\n<li>4 mins in &#8211; <strong>Peter Witherington<\/strong> from <strong>RSK Environmental Ltd<\/strong> on what is contaminated land and why statistics is needed.<\/li>\n<li>32 mins in &#8211; <strong>Myself<\/strong> on the statistical issues that had to be considered during the writing of the new guidance, especially on the limitations of the Central Limit Theorem.<\/li>\n<li>69 mins in &#8211; <strong>Ron Wasserstein<\/strong>, executive director of the <strong>American Statistical Association<\/strong> and lead author of the ASA P-value statements referred to below.\u00a0 He compared the reality of the guidance with what he hoped to see when the ASA statements were published.<\/li>\n<li>93 mins in &#8211; an interesting discussion of some of the statistical issues including Bayesian approaches.<\/li>\n<\/ul>\n<\/li>\n<li><a href=\"https:\/\/www.linkedin.com\/feed\/update\/urn:li:activity:6815782885187026944\/\" target=\"_blank\" rel=\"noopener\">Presentation to the ELQF (East Land Quality Forum) on 22nd June 2021<\/a>.\u00a0 This repeats some of the material of the earlier presentation but I added some slides referring to prequal report I wrote in 2017 which laid the groundwork for the guidance.<\/li>\n<li><a href=\"https:\/\/marriott-stats.com\/nigels-blog\/wp-content\/uploads\/2020\/10\/MSC-ENBIS-CLAIRE-moving-beyond-p-0.05-June-2022.pdf\" target=\"_blank\" rel=\"noopener\">Presentation to ENBIS (European Network of Business &amp; Industrial Statisticians)<\/a> on 1st July 2022.\u00a0 I describe the different statistical approach taken in the new guidance compared to the 2008 version.\u00a0 In particular I describe how the ASA 2016 statement on p-values influenced the 2020 version.\u00a0 The link takes you to a PDF of the slides.\u00a0 The session was recorded but I haven&#8217;t been able to get a link to it.<\/li>\n<\/ol>\n<p>&nbsp;<\/p>\n<h5><span style=\"color: #008000;\"><strong>My approach to writing the new guidance<\/strong><\/span><\/h5>\n<p>In the late 90s, I bought a house in Reading, a new development built on a former industrial estate.\u00a0 I received a survey report which summarised the tests made on the soil in my garden and how much risk it presented to potential occupants and the wider environment.\u00a0 My brother, who was a laboratory scientist at the time, remarked I could mine my garden for metals and suggested I shouldn&#8217;t grow fruit and vegetables in the garden.\u00a0 I had no interest in doing so but it was my first contact with the land contamination industry.<\/p>\n<p>This industry surveys land to ensure a site is suitable for its new use and to prevent unacceptable risks from contamination.\u00a0 Planning officers decide if the appropriate processes and decisions on surveys and analysis have been undertaken by industry practitioners and the right decisions have been made.\u00a0 Practitioners working in the land contamination industry are a mixture of scientists and engineers and whilst many will have received basic training in statistics, they are not experts in <a href=\"https:\/\/marriott-stats.com\/nigels-blog\/stats-training-materials-hypothesis-testing\/\" target=\"_blank\" rel=\"noopener noreferrer\">statistical inference<\/a> hence the need for professional guidance in statistics.<\/p>\n<p>Whilst writing the new guidance, I realised <a href=\"https:\/\/amstat.tandfonline.com\/doi\/full\/10.1080\/00031305.2016.1154108\" target=\"_blank\" rel=\"noopener noreferrer\">the concluding paragraph from the 2016 ASA statement<\/a> perfectly captured what I wanted the guidance to convey and I reproduce it here broken down as 6 bullet points &#8211;<\/p>\n<p style=\"padding-left: 40px;\"><span style=\"color: #993300;\"><em>&#8220;Good statistical practice, as an essential component of good scientific practice, emphasizes &#8230;<\/em><\/span><\/p>\n<ol>\n<li style=\"list-style-type: none;\">\n<ol>\n<li><span style=\"color: #993300;\"><em>&#8230; principles of good study design and conduct,<\/em><\/span><\/li>\n<li><strong><span style=\"color: #993300;\"><em>&#8230; a variety of numerical and graphical summaries of data, <\/em><\/span><\/strong><\/li>\n<li><span style=\"color: #993300;\"><em>&#8230; understanding of the phenomenon under study, <\/em><\/span><\/li>\n<li><strong><span style=\"color: #993300;\"><em>&#8230; interpretation of results in context, <\/em><\/span><\/strong><\/li>\n<li><span style=\"color: #993300;\"><em>&#8230; complete reporting and <\/em><\/span><\/li>\n<li><span style=\"color: #993300;\"><em><strong>&#8230; proper logical and quantitative understanding of what data summaries mean.<\/strong> <\/em><\/span><\/li>\n<\/ol>\n<\/li>\n<\/ol>\n<p style=\"padding-left: 40px;\"><span style=\"color: #993300;\"><em>&#8230; No single index should substitute for scientific reasoning.&#8221;<\/em><\/span><\/p>\n<p>I wish I had fully realised the importance of this paragraph at the beginning of the project as I could have then recommended the guidance was laid out in this fashion.\u00a0 For reasons that were perfectly understandable at the time given the wishes of the steering committee, the draft guidance followed a different layout but during the revision process, I tried to steer the layout back to the ASA layout with the result the final version ended up somewhere in between.\u00a0 However, I did add Appendix A1 to the final version where I explicitly made the link between what was written and the ASA 2016 statement.<\/p>\n<p>When taking a sample for the purpose of making decisions, the first thing a statistician wants to know is what is the population that has to be sampled and what are the criteria for making decisions.\u00a0 In the land contamination industry, this is delivered by something called the <strong>Conceptual Site Model (CSM)<\/strong> where a competent practitioner pulls together all that is already known about the site and combines that knowledge with his or her understanding of how contaminants behave in soil &amp; groundwater and what are the potential risks to humans and the wider environment.\u00a0 The result is a model of the site called the CSM which is then used to break the site down into 3 parts &#8211;<\/p>\n<ol>\n<li>Areas that are suitable for use and safe for development.<\/li>\n<li>Areas that are not suitable for use and the risks will need to be addressed and may require remediation.<\/li>\n<li>Areas that are unclear and need to be sampled further in order for a decision to be made.<\/li>\n<\/ol>\n<p>For areas of type 3, a suitable sampling &amp; measurement plan using statistical principles will then need to be developed and a threshold for decision making, known as a <strong>Critical Concentration<\/strong>, needs to be specified in advance.\u00a0 The results of the land survey can then be analysed and interpreted using the new guidance hence its title of &#8220;<em><strong><span style=\"color: #993300;\">Comparing Soil Contamination Data with a Critical Concentration<\/span><\/strong>&#8220;.<\/em><\/p>\n<p>It is important to note the new guidance only covers the last step of this process, the statistical analysis and decision making, and to my mind focuses on the even numbered bullet points of the ASA statement.\u00a0 The odd numbered bullet points are covered by the CSM and Sample Design steps which are not explored in the guidance but are essential pre-requisites in order to use the guidance.\u00a0 This explains the copious number of caveats and pre-requisites at the beginning of the document as the steering committee was worried about people jumping to the analysis without having done the CSM and Sample Design work.\u00a0 These are large subjects in their own right and they need separate guidance to be written.\u00a0 It was this debate over the pre-requisites and the extent to which they should be referred to in the new guidance that explains why it took 4 years to publish it.<\/p>\n<p>&nbsp;<\/p>\n<h5><span style=\"color: #008000;\"><strong>Acknowledgements<\/strong><\/span><\/h5>\n<p>I would like to thank CL:AIRE for asking me to write the new guidance.\u00a0 It was a hugely educational process and one that forced me to examine my understanding of some basic statistical ideas (such as the <a href=\"https:\/\/en.wikipedia.org\/wiki\/Central_limit_theorem\" target=\"_blank\" rel=\"noopener\">Central Limit Theorem<\/a>) as well as teaching me about the issues the land contamination industry has to deal with.\u00a0 I sincerely hope the eventual outcome for the guidance is that the sentiment I expressed at the end of Appendix A1 is the one that comes to pass.<\/p>\n<p style=\"padding-left: 40px;\"><span style=\"color: #993300;\"><em>&#8220;The guidance is written on the assumption that it will be read and used by people with a scientific training who are capable of exercising scientific judgement and who wish to use statistics to SUPPLEMENT their professional judgement, not to REPLACE their professional judgement.&#8221;<\/em><\/span><\/p>\n<h5><span style=\"color: #993300;\"><strong>Buy my Dot &amp; Box Plot Template!<\/strong><\/span><\/h5>\n<p>If you are a practitioner who wishes to put the new CL:AIRE guidance into practice, my Dot &amp; Box Plot Template spreadsheet can get you started.\u00a0 This is a Microsoft Excel spreadsheet which allows you to enter sample data for as many sites as you wish and to produce dot plots, box plots, summary statistics and confidence intervals for each site.<\/p>\n<p>You can find <a href=\"https:\/\/marriott-stats.com\/product\/msc-dot-box-plot-template-latest-version\/\" target=\"_blank\" rel=\"noopener\">more information and details on how to purchase your copy here<\/a>.\u00a0 Please note, my ecommerce platform is used for training courses only hence why when you land on that page it will look like you are booking a training course but you are in fact purchasing a copy of the template.<\/p>\n<p>If you would like to have a free 30 minute demonstration of the template before you make your purchase, please <a href=\"https:\/\/marriott-stats.com\/contact-us\/\" target=\"_blank\" rel=\"noopener\">contact me to arrange this<\/a>.<\/p>\n<h5><strong><span style=\"color: #993300;\">&#8212; Want to be notified of future posts like this one? &#8212;<\/span><\/strong><\/h5>\n<p>If you enjoyed this article, why not subscribe to my newsletter to receive notifications of future articles, news and offers?\u00a0 You can choose from a variety of categories and articles similar to this one will appear under the <span style=\"color: #008000;\"><strong>Statistics Training<\/strong><\/span> category.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>On 30th September 2020, CL:AIRE (the industry body for the land contamination &amp; remediation sector) published new professional guidance for &#8220;Comparing Soil Contamination Data with a Critical Concentration&#8220;.\u00a0 The 46-page document advises how to use statistics when assessing land contamination and deciding whether it is safe for development.\u00a0 I was the lead author of the [&hellip;]<\/p>\n","protected":false},"author":3,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_genesis_hide_title":false,"_genesis_hide_breadcrumbs":false,"_genesis_hide_singular_image":false,"_genesis_hide_footer_widgets":false,"_genesis_custom_body_class":"","_genesis_custom_post_class":"","_genesis_layout":"","footnotes":""},"categories":[7],"tags":[200,159,194,101,195,196,198,36,197,100,40,199,119],"class_list":{"0":"post-3341","1":"post","2":"type-post","3":"status-publish","4":"format-standard","6":"category-stats-training","7":"tag-asa","8":"tag-box-plot","9":"tag-claire","10":"tag-confidence-intervals","11":"tag-contaminated-land","12":"tag-contamination","13":"tag-dot-plot","14":"tag-evidence","15":"tag-land-survey","16":"tag-p-values","17":"tag-presenting-data","18":"tag-professional-guidance","19":"tag-spatial-statistics","20":"entry","21":"override"},"_links":{"self":[{"href":"https:\/\/marriott-stats.com\/nigels-blog\/wp-json\/wp\/v2\/posts\/3341","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/marriott-stats.com\/nigels-blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/marriott-stats.com\/nigels-blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/marriott-stats.com\/nigels-blog\/wp-json\/wp\/v2\/users\/3"}],"replies":[{"embeddable":true,"href":"https:\/\/marriott-stats.com\/nigels-blog\/wp-json\/wp\/v2\/comments?post=3341"}],"version-history":[{"count":26,"href":"https:\/\/marriott-stats.com\/nigels-blog\/wp-json\/wp\/v2\/posts\/3341\/revisions"}],"predecessor-version":[{"id":6907,"href":"https:\/\/marriott-stats.com\/nigels-blog\/wp-json\/wp\/v2\/posts\/3341\/revisions\/6907"}],"wp:attachment":[{"href":"https:\/\/marriott-stats.com\/nigels-blog\/wp-json\/wp\/v2\/media?parent=3341"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/marriott-stats.com\/nigels-blog\/wp-json\/wp\/v2\/categories?post=3341"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/marriott-stats.com\/nigels-blog\/wp-json\/wp\/v2\/tags?post=3341"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}