{"id":905,"date":"2020-07-07T15:53:06","date_gmt":"2020-07-07T15:53:06","guid":{"rendered":"http:\/\/blogs.kcl.ac.uk\/behaviouralinsights\/?p=905"},"modified":"2020-07-07T16:08:21","modified_gmt":"2020-07-07T16:08:21","slug":"why-you-should-read-the-book-of-why","status":"publish","type":"post","link":"https:\/\/blogs.kcl.ac.uk\/behaviouralinsights\/2020\/07\/07\/why-you-should-read-the-book-of-why\/","title":{"rendered":"Why you should read \u2018The Book of Why\u2019?"},"content":{"rendered":"<p><em><strong>By Nadia Chechli\u0144ska, Research Associate<\/strong><\/em><strong><em>|<\/em><\/strong><\/p>\n<p><em>We all know &#8216;correlation does not imply causation\u2019 &#8211; but how do we imply causation? In the second of her book reviews, Nadia explains how \u2018The Book of Why\u2019<\/em><em>\u00a0can help you to accurately attribute causality.<\/em><\/p>\n<p>Judea Pearl is one of the most influential scientists in research on causality. In his latest book \u2018<em>The Book of Why \u2013 The new science of cause and effect<\/em>\u2019<a href=\"#_edn1\" name=\"_ednref1\"><strong>[i]<\/strong><\/a><em> (<\/em>co-authored with Dana Mackenzie) Pearl describes how thinking about causality has developed to the form we now recognise, with a strong emphasis on the concept of <strong>causal inference<\/strong>; the process of analysing the relationships between events in a way that reveals the true causes of a phenomenon, when it can be observed, and most importantly \u2013 \u201cWhy?\u201d.<\/p>\n<h3><strong>Causal diagrams \u2013 a tool for visualising causes and effects<\/strong><\/h3>\n<p>The core message Pearl wants to convey is that the world around us is more complex than we realise. We are often tempted to simplify the relationships between phenomena and to attribute cause-and-effect relations when they are absent, and we fail to notice the real cause of an event \u2013 even if it is right in front of us. This happens when we want to reduce cognitive effort when making a judgement, and use shortcuts to facilitate <a href=\"https:\/\/blogs.kcl.ac.uk\/behaviouralinsights\/2020\/04\/07\/are-we-still-thinking-fast-and-slow-review\/\">fast thinking<\/a>. To protect us from such biases when designing a study or a project, researchers need to appreciate the pre-requisites for causal inference.<\/p>\n<p>In What Works (WW) we engage in a similar process when creating a <a href=\"https:\/\/blogs.kcl.ac.uk\/behaviouralinsights\/2019\/12\/20\/all-i-want-for-christmas-is-a-theory-of-change\/\">Theory of Change<\/a>, where we outline a cause-and-effect logic chain, propose mediating variables, and consider the assumptions behind our reasoning. At this stage, we hypothesise what the causal relationships are between relevant variables and outline how these will be investigated.<\/p>\n<p>To make this process more systematic and understandable, Pearl proposes a tool for boosting our cognitive abilities when thinking about causality &#8211; <strong>causal diagrams<\/strong>. In the past, researchers lacked language to talk about causality and therefore their methodology was limited to correlational studies. Causal diagrams are used to visually represent variables and the possible causal relationships between, enhancing researchers\u2019 understanding of causal relationships.<\/p>\n<p>The diagram below is the simplest example of how we could use causal models in WW. It indicates that participation in K+, <a href=\"https:\/\/www.kcl.ac.uk\/study\/widening-participation\/our-activities\/k-plus\">our flagship outreach programme,<\/a> causes changes in expected university enrolment rates for K+ students \u2013 which is in line with our assumptions.<\/p>\n<p>&nbsp;<\/p>\n<p><a href=\"http:\/\/blogs.kcl.ac.uk\/behaviouralinsights\/files\/2020\/07\/BookofWhyFigure1.png\"><img loading=\"lazy\" decoding=\"async\" class=\"alignleft wp-image-910\" src=\"http:\/\/blogs.kcl.ac.uk\/behaviouralinsights\/files\/2020\/07\/BookofWhyFigure1.png\" alt=\"\" width=\"469\" height=\"98\" srcset=\"https:\/\/blogs.kcl.ac.uk\/behaviouralinsights\/files\/2020\/07\/BookofWhyFigure1.png 646w, https:\/\/blogs.kcl.ac.uk\/behaviouralinsights\/files\/2020\/07\/BookofWhyFigure1-300x63.png 300w\" sizes=\"auto, (max-width: 469px) 100vw, 469px\" \/><\/a><\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>Figure 1<\/p>\n<p>Causal models help us visualise where we can expect a causal relationship, but also where causation cannot be inferred because of limited information. According to Pearl, the main reason for engaging in causal inference is to determine research questions and then to consider what data needs to be collected to answer them. This way, we can acknowledge potential outcomes and limitations <em>before<\/em> conducting the research.<\/p>\n<p>Pearl\u2019s Ladder of Causation (see picture on the right) is a hierarchical representation of what causal conclusions (if any) can be drawn from a particular set of data.<\/p>\n<h3><strong>We can\u2019t make causal inferences with correlation data<a href=\"http:\/\/blogs.kcl.ac.uk\/behaviouralinsights\/files\/2020\/07\/BookofWhyFigure2.png\"><img loading=\"lazy\" decoding=\"async\" class=\" wp-image-909 alignright\" src=\"http:\/\/blogs.kcl.ac.uk\/behaviouralinsights\/files\/2020\/07\/BookofWhyFigure2.png\" alt=\"\" width=\"196\" height=\"421\" srcset=\"https:\/\/blogs.kcl.ac.uk\/behaviouralinsights\/files\/2020\/07\/BookofWhyFigure2.png 287w, https:\/\/blogs.kcl.ac.uk\/behaviouralinsights\/files\/2020\/07\/BookofWhyFigure2-140x300.png 140w\" sizes=\"auto, (max-width: 196px) 100vw, 196px\" \/><\/a><\/strong><\/h3>\n<p>Through seeing different phenomena, we can detect regularities in our environment and adapt accordingly. Pearl argues that <strong>association<\/strong> is the bottom rung in the Ladder of Causation \u2013 we associate two events if observing one increases or decreases the chances of observing the other. For example, when enrolment at college decreases, the number of teachers also decreases.<\/p>\n<p>Let\u2019s come back to our example from above. After collecting data on the K+ university enrolment rates, you run a statistical correlation and you may notice that you \u2018see\u2019 enrolment most often when you \u2018see\u2019 participation in K+ programme. In other words, if you observe an increase in K+ participation, you expect to observe an increase in enrolment as well.<\/p>\n<p>However, knowing that K+ participation is associated with higher enrolment rates provides no information about whether the K+ participation is the<em> cause<\/em> of higher enrolment rates. Cognitively, we are very quick to assign a causal relationship between our correlated variables, but according to Pearl \u2013 we cannot make causal conclusions based on simple associations between variables.<\/p>\n<p style=\"text-align: right\">Figure 2<\/p>\n<h3><strong>Interventions help us detect causal relationships between events<\/strong><\/h3>\n<p style=\"text-align: left\">One of the reasons why we may see a correlation between two events, even if they are not causally related, is that they are both caused by a third variable \u2013 a <em>confounder<\/em>. Constructing a causal model helps in detecting confounding variables, because it requires us to think more carefully about the phenomena relevant to the variable of interest. If we want to measure whether it is the K+ programme that causes an increase in enrolment, we need to do more than analyse passively collected data \u2013 we need to intervene with the reality.<\/p>\n<p><strong>Intervention<\/strong>, which is a second rung up the Ladder of Causation, is a method of testing whether the causal relationship between variables is real or illusionary (caused by an extraneous variable). To this aim, researchers carry out experiments in which they perform an action while making sure the potential confounders (observable and latent) stay constant.<\/p>\n<p>Causal models help in performing an intervention mentally before performing it in real life. Visualising the relationships between the K+ programme and enrolment helps us better understand the whole process of an intervention, what results one may expect, and what conclusions can be drawn.<\/p>\n<p>What Works supports other departments in statistical evaluation of their projects. To show what we do with their data, we present them with a statistical formula (see below). Here we try to explain algebraically how the extraneous variables (confounders; X<sub>i<\/sub> &#8211; e.g. previous attainment) affect the variables of interest (Y \u2013 enrolment rates).<\/p>\n<p><a href=\"http:\/\/blogs.kcl.ac.uk\/behaviouralinsights\/files\/2020\/07\/Bookofwhyfigure3.png\"><img loading=\"lazy\" decoding=\"async\" class=\"alignleft size-full wp-image-908\" src=\"http:\/\/blogs.kcl.ac.uk\/behaviouralinsights\/files\/2020\/07\/Bookofwhyfigure3.png\" alt=\"\" width=\"338\" height=\"80\" srcset=\"https:\/\/blogs.kcl.ac.uk\/behaviouralinsights\/files\/2020\/07\/Bookofwhyfigure3.png 338w, https:\/\/blogs.kcl.ac.uk\/behaviouralinsights\/files\/2020\/07\/Bookofwhyfigure3-300x71.png 300w\" sizes=\"auto, (max-width: 338px) 100vw, 338px\" \/><\/a><\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>Now let\u2019s see how Pearl\u2019s causal models help us express the same relationships visually:<\/p>\n<p><a href=\"http:\/\/blogs.kcl.ac.uk\/behaviouralinsights\/files\/2020\/07\/BookofWHyFig4.png\"><img loading=\"lazy\" decoding=\"async\" class=\"alignleft wp-image-907\" src=\"http:\/\/blogs.kcl.ac.uk\/behaviouralinsights\/files\/2020\/07\/BookofWHyFig4.png\" alt=\"\" width=\"438\" height=\"227\" srcset=\"https:\/\/blogs.kcl.ac.uk\/behaviouralinsights\/files\/2020\/07\/BookofWHyFig4.png 625w, https:\/\/blogs.kcl.ac.uk\/behaviouralinsights\/files\/2020\/07\/BookofWHyFig4-300x156.png 300w\" sizes=\"auto, (max-width: 438px) 100vw, 438px\" \/><\/a><\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>Figure 3<\/p>\n<p>Here we can clearly see and explain that previous attainment is a confounding variable, which affects both K+ participation and enrolment. If we see that the group in the K+ programme was more likely to enrol into university than the control group, when statistically keeping previous attainment constant, we can conclude that the two events are <em>causally related <\/em>(this is simplified, in reality there are many potential observable and latent confounders).<\/p>\n<p>A common tool used to run interventions is the <strong>Randomised Controlled Trial (RCT).<\/strong> The main idea behind the RCTs is that we randomly assign participants to two conditions, collect data from both groups, and only then compare them and draw causal conclusions (as in the example above). This is the only intervention design which ensures control over confounders but is also a one which is not always possible to run. There are cases in which randomisation is difficult or impossible \u2013 and this is where the causal models are particularly useful in determining potential confounding variables.<\/p>\n<h3><strong>Moving on to the \u201cWhy\u201d question <\/strong><\/h3>\n<p>In research, however, we often want to know more than that \u2013 we want to understand <em>why<\/em> we observed the causal relationship between variables. To do so, we need to think about <strong>counterfactuals<\/strong>: if something had been done in a different way, would we have achieved the same situation? It is not the same as comparing a treatment group to a control group, because counterfactuals are considered only <em>after<\/em> we know the results of our actions (e.g. interventions). When considering counterfactuals, we can only make guesses and <em>imagine<\/em> the alternative world, because we are not able to go back in time and actually change the past.<\/p>\n<p>Our guesses are based on some assumptions and these assumptions are reflected in a form of a causal model. Controlling for confounders and manipulating a single variable is not enough to make accurate guesses about the alternative worlds. We simply need more data, which can give us more clues about the past to better predict the alternative future. Pearl proposes using a <strong>mediation analysis<\/strong>, which allows us to account for indirect effects of our action in an intervention, but it requires collecting additional data.<\/p>\n<p>In What Works, we already account for mediators at the stage of constructing a Theory of Change. We measure the short-term outcomes and hypothesise that they mediate the long-term impacts of an intervention, which makes causal and counterfactual inferences more accurate. For example, imagine that besides the enrolment rates (long-term impact), researchers also collected data on student\u2019s self-efficacy (short-term outcome) and included them in the causal model (see below).<\/p>\n<p><a href=\"http:\/\/blogs.kcl.ac.uk\/behaviouralinsights\/files\/2020\/07\/BookofWhyfig5.png\"><img loading=\"lazy\" decoding=\"async\" class=\"alignleft wp-image-906\" src=\"http:\/\/blogs.kcl.ac.uk\/behaviouralinsights\/files\/2020\/07\/BookofWhyfig5.png\" alt=\"\" width=\"523\" height=\"252\" srcset=\"https:\/\/blogs.kcl.ac.uk\/behaviouralinsights\/files\/2020\/07\/BookofWhyfig5.png 793w, https:\/\/blogs.kcl.ac.uk\/behaviouralinsights\/files\/2020\/07\/BookofWhyfig5-300x145.png 300w, https:\/\/blogs.kcl.ac.uk\/behaviouralinsights\/files\/2020\/07\/BookofWhyfig5-768x370.png 768w\" sizes=\"auto, (max-width: 523px) 100vw, 523px\" \/><\/a><\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>Figure 4<\/p>\n<p>Figure four illustrates that K+ programme causes an increase in self-efficacy, which in turn causes a change in enrolment rates \u2013 now the counterfactuals are easier to reason about and we are much closer to understanding <em>why<\/em> the causation occurs.<\/p>\n<p>Climbing up to the top of the ladder of causation allows us not only to detect the cause-and-effect relationships but also enables us to place our research question in a broader context and understand it better.<\/p>\n<h3><strong>Conclusions<\/strong><\/h3>\n<p>The main lesson from Pearl\u2019s book is that prior to analysing or even collecting data, we need to realise what causal questions we really want to ask and what type of data we need to answer them. In What Works, we encourage practitioners to formulate research questions, plan methodologies, and create analytical strategies through a Theory of Change and\/or research protocols. We believe that this way we are one step closer to finding out what are the true causes of phenomena we investigate, and how we can use this knowledge to design the most effective behaviour change in the future.<\/p>\n<p>_______________________________________________________________________<\/p>\n<p><em><a href=\"https:\/\/confirmsubscription.com\/h\/j\/B4359A69338427CC\">Click here<\/a>\u00a0to join our mailing list.<br \/>\nFollow us on Twitter:\u00a0<a href=\"https:\/\/twitter.com\/kclwhatworks\" target=\"_blank\" rel=\"noopener noreferrer\">@KCLWhatWorks<\/a><\/em><\/p>\n<p><a href=\"#_ednref1\" name=\"_edn1\">[i]<\/a> Pearl, J. (2019) The Book of Why: the new science of cause and effect<\/p>\n","protected":false},"excerpt":{"rendered":"<div class=\"mh-excerpt\"><p>By Nadia Chechli\u0144ska, Research Associate| We all know &#8216;correlation does not imply causation\u2019 &#8211; but how do we imply causation? In the second of her <a class=\"mh-excerpt-more\" href=\"https:\/\/blogs.kcl.ac.uk\/behaviouralinsights\/2020\/07\/07\/why-you-should-read-the-book-of-why\/\" title=\"Why you should read \u2018The Book of Why\u2019?\">&#8212; [Read&nbsp;More] <\/a><\/p>\n<\/div>","protected":false},"author":75,"featured_media":924,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[9,10],"tags":[],"class_list":["post-905","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-behavioural-insights","category-evaluation"],"_links":{"self":[{"href":"https:\/\/blogs.kcl.ac.uk\/behaviouralinsights\/wp-json\/wp\/v2\/posts\/905","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/blogs.kcl.ac.uk\/behaviouralinsights\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/blogs.kcl.ac.uk\/behaviouralinsights\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/blogs.kcl.ac.uk\/behaviouralinsights\/wp-json\/wp\/v2\/users\/75"}],"replies":[{"embeddable":true,"href":"https:\/\/blogs.kcl.ac.uk\/behaviouralinsights\/wp-json\/wp\/v2\/comments?post=905"}],"version-history":[{"count":16,"href":"https:\/\/blogs.kcl.ac.uk\/behaviouralinsights\/wp-json\/wp\/v2\/posts\/905\/revisions"}],"predecessor-version":[{"id":928,"href":"https:\/\/blogs.kcl.ac.uk\/behaviouralinsights\/wp-json\/wp\/v2\/posts\/905\/revisions\/928"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/blogs.kcl.ac.uk\/behaviouralinsights\/wp-json\/wp\/v2\/media\/924"}],"wp:attachment":[{"href":"https:\/\/blogs.kcl.ac.uk\/behaviouralinsights\/wp-json\/wp\/v2\/media?parent=905"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/blogs.kcl.ac.uk\/behaviouralinsights\/wp-json\/wp\/v2\/categories?post=905"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/blogs.kcl.ac.uk\/behaviouralinsights\/wp-json\/wp\/v2\/tags?post=905"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}