{"id":1878,"date":"2017-11-29T10:00:21","date_gmt":"2017-11-29T09:00:21","guid":{"rendered":"http:\/\/blogs.kcl.ac.uk\/editlab\/?p=1878"},"modified":"2018-02-16T15:50:05","modified_gmt":"2018-02-16T14:50:05","slug":"replication-in-the-age-of-genomics","status":"publish","type":"post","link":"https:\/\/blogs.kcl.ac.uk\/editlab\/2017\/11\/29\/replication-in-the-age-of-genomics\/","title":{"rendered":"Replication in the Age of Genomics"},"content":{"rendered":"<h2 style=\"text-align: right\"><strong>Replication is the lifeblood of science \u2013 but how do you replicate when your cohort is the biggest available? EDiT Lab Associate Member and genomics <span style=\"text-decoration: line-through\">crank-turner <\/span>postdoc extraordinaire Joni gives three ways how.<br \/>\n<\/strong><\/h2>\n<p><!--more--><\/p>\n<p style=\"text-align: left\"><strong><a href=\"http:\/\/blogs.kcl.ac.uk\/editlab\/wp-content\/blogs.dir\/166\/files\/2016\/07\/joni-coleman-200x280.jpg\"><img loading=\"lazy\" decoding=\"async\" class=\" wp-image-250 size-thumbnail alignright\" src=\"http:\/\/blogs.kcl.ac.uk\/editlab\/wp-content\/blogs.dir\/166\/files\/2016\/07\/joni-coleman-200x280-150x150.jpg\" alt=\"joni-coleman-200x280\" width=\"150\" height=\"150\" srcset=\"https:\/\/blogs.kcl.ac.uk\/editlab\/files\/2016\/07\/joni-coleman-200x280-150x150.jpg 150w, https:\/\/blogs.kcl.ac.uk\/editlab\/files\/2016\/07\/joni-coleman-200x280-50x50.jpg 50w, https:\/\/blogs.kcl.ac.uk\/editlab\/files\/2016\/07\/joni-coleman-200x280-100x100.jpg 100w\" sizes=\"auto, (max-width: 150px) 100vw, 150px\" \/><\/a><\/strong><\/p>\n<hr \/>\n<p style=\"text-align: left\">There is little more important than replication in science. While we might lust over novelty and fetishise the unexpected, it\u2019s the confirmation of the known that actually moves science forward. Traditionally, that has meant that you find something, you ask your frenemy with some similar data to see if they see the same thing, you confirm your finding, and then you get all the power and wealth and everyone wants to hire you. Or, more normally, they don\u2019t see anything of the sort, you swear loudly, sulk for a day and then spend the next few weeks trying to understand exactly what\u2019s gone wrong.<\/p>\n<div id=\"attachment_1882\" style=\"width: 310px\" class=\"wp-caption aligncenter\"><a href=\"http:\/\/blogs.kcl.ac.uk\/editlab\/wp-content\/blogs.dir\/166\/files\/2017\/11\/Astrid_smiling.jpg\"><img loading=\"lazy\" decoding=\"async\" aria-describedby=\"caption-attachment-1882\" class=\"wp-image-1882 size-medium\" src=\"http:\/\/blogs.kcl.ac.uk\/editlab\/wp-content\/blogs.dir\/166\/files\/2017\/11\/Astrid_smiling-300x211.jpg\" alt=\"I have no idea who these scientists are, but they're probably laughing at your failure to replicate\" width=\"300\" height=\"211\" srcset=\"https:\/\/blogs.kcl.ac.uk\/editlab\/files\/2017\/11\/Astrid_smiling-300x211.jpg 300w, https:\/\/blogs.kcl.ac.uk\/editlab\/files\/2017\/11\/Astrid_smiling.jpg 1024w\" sizes=\"auto, (max-width: 300px) 100vw, 300px\" \/><\/a><p id=\"caption-attachment-1882\" class=\"wp-caption-text\">I have no idea who these scientists are, but they&#8217;re probably laughing at your failure to replicate that finding<\/p><\/div>\n<p>But what do you do when there are no frenemies left? Not because you have finally completed your nefarious plan for world domination (<em><a title=\"mentioning no names\" href=\"https:\/\/twitter.com\/psychgenomics\" target=\"_blank\">Gerome<\/a><\/em>), but because all the frenemies are now friends, and all the data are now one. In short, what do you do when there is no replication cohort? This, to an extent, is the situation that many of the investigators interested in psychiatric disorders now find themselves in. Many of the available genetic data on psychiatric disorders is now gathered together by the <a href=\"http:\/\/www.med.unc.edu\/pgc\" target=\"_blank\">Psychiatric Genetics Consortium<\/a>. So, how can you test whether your findings are real? Here\u2019s three ways how\u2026<\/p>\n<h3>1. Look within<\/h3>\n<p>In 2009, the International Schizophrenia Consortium had a slight problem. They had expended a great deal of time and effort to gather a <a href=\"https:\/\/www.ncbi.nlm.nih.gov\/pmc\/articles\/PMC3912837\/\" target=\"_blank\">large sample of individuals with schizophrenia<\/a> and had consciously created a carefully-curated control cohort (which isn\u2019t easy to say with a mouthful of beans). But, for all their work, they hadn\u2019t quite got the punchline \u2013 they had found only a single genetic region (albeit <a href=\"https:\/\/www.ncbi.nlm.nih.gov\/pmc\/articles\/PMC4426292\/\" target=\"_blank\">one<\/a> that <a href=\"https:\/\/www.ncbi.nlm.nih.gov\/pmc\/articles\/PMC4112379\/\" target=\"_blank\">is<\/a>\u00a0quite <a href=\"http:\/\/www.ebi.ac.uk\/gwas\/search?query=MHC\" target=\"_blank\">important<\/a>) that differed between cases and controls at a level greater than chance (given that they had searched the whole genome), and the difference observed was only very slight.<\/p>\n<p>So how to prove that this fledgling \u201cgenome-wide association study\u201d (GWAS) thing\u00a0for which they\u2019d been pushing wasn\u2019t a wash, and that, against all the <a href=\"https:\/\/www.youtube.com\/watch?v=kEs8rK5Cqt8\" target=\"_blank\">slings and arrows of outrage fortune<\/a>, the field should persevere? To do so, they developed the concept of \u201cleave one out polygenic risk scoring\u201d. Rather than examining how variants differ between cases and controls in all samples (call them ABCDEFG), they instead only looked at a sub-sample (ABCDEF) Then, they took the results from that sub-sample analysis and looked at whether weighting variants in the rest of the cohort (G) by the ABCDEF results gave a significant prediction. And it did \u2013 the genetic differences seen in one part of the cohort were (overall if not individually) mirrored in the other, independent part. Furthermore, this happened in all of the seven possible set-ups, suggesting that the overall effect was likely to be robust in entirely independent cohorts. And so, a crucial technique for internal replication was born, along with a <a href=\"https:\/\/www.ncbi.nlm.nih.gov\/pubmed?term=((genetic%20risk%20score)%20OR%20polygenic%20risk%20score)%20OR%20genomic%20risk%20score\" target=\"_blank\">whole field<\/a>\u2026<\/p>\n<div id=\"attachment_1881\" style=\"width: 310px\" class=\"wp-caption aligncenter\"><a href=\"http:\/\/blogs.kcl.ac.uk\/editlab\/wp-content\/blogs.dir\/166\/files\/2017\/11\/btu848f1p.gif\"><img loading=\"lazy\" decoding=\"async\" aria-describedby=\"caption-attachment-1881\" class=\"wp-image-1881 size-medium\" src=\"http:\/\/blogs.kcl.ac.uk\/editlab\/wp-content\/blogs.dir\/166\/files\/2017\/11\/btu848f1p-300x195.gif\" alt=\"A Polygenic Risk Scoring Plot (Euesden et al, 2015: https:\/\/academic.oup.com\/bioinformatics\/article\/31\/9\/1466\/200539)\" width=\"300\" height=\"195\" \/><\/a><p id=\"caption-attachment-1881\" class=\"wp-caption-text\">A Polygenic Risk Scoring Plot (Euesden et al, 2015: https:\/\/academic.oup.com\/bioinformatics\/article\/31\/9\/1466\/200539)<\/p><\/div>\n<h3>2. Squint a bit<\/h3>\n<p>At the beginning of the year, a <a href=\"https:\/\/www.nature.com\/articles\/ng.3869\" target=\"_blank\">big GWAS of IQ<\/a> was published, identifying 18 parts of the genome where variants differ with increasing IQ. This was not the first such GWAS, but it represented a big step forward in the number of regions identified. But furthermore, it represented a big chunk of the available data for replication \u2013 there were a few cohorts not included, but they were generally difficult to access, or considerably smaller than this new GWAS (and so a failed replication might simply be because the replication cohort was too small to see anything. So how to replicate? Leave one out polygenic scoring showed promising results, but could the investigators go further? Obviously. Why else would I be bringing up this paper (other than the scientific brilliance, wit and sheer aura of raw physical attractiveness of the fifth-named author?). Sniekers and colleagues used another technique for replication, that of the proxy-replication.<\/p>\n<p>Educational success is correlated with IQ. It is also relatively straightforward to measure. If you have a group of genotyped people, and permission to access their exam results (or to ask them how many years they were in education for), you can do a GWAS of educational attainment. This had in fact been very successful \u2013 the <a href=\"https:\/\/www.ncbi.nlm.nih.gov\/pmc\/articles\/PMC4883595\/\" target=\"_blank\">latest such GWAS<\/a> had identified 74 regions. Looking up 306 variants that differed strongly with IQ in their data, the authors showed that 305 of these affected educational attainment in the same direction as in IQ, and that 9 of the 16 top regions they could test from the IQ differed in educational attainment more than would be expected by chance alone. When the exact same trait isn\u2019t available, related traits can be informative.<\/p>\n<h3>3. Wait<\/h3>\n<p>GWAS are like buses. They carry a lot of people, and you tend to get on strange ones when you\u2019re drunk. <em>[Edit: What?! From context alone I can only assume you were actually going for \u201cthere\u2019s always another one along in a minute\u201d\u2026]<\/em><\/p>\n<div id=\"attachment_1880\" style=\"width: 310px\" class=\"wp-caption aligncenter\"><a href=\"http:\/\/blogs.kcl.ac.uk\/editlab\/wp-content\/blogs.dir\/166\/files\/2017\/11\/Thomas_Built_Buses_Saf-T-Liner_C2_Yellow_School_Bus.jpg\"><img loading=\"lazy\" decoding=\"async\" aria-describedby=\"caption-attachment-1880\" class=\"wp-image-1880 size-medium\" src=\"http:\/\/blogs.kcl.ac.uk\/editlab\/wp-content\/blogs.dir\/166\/files\/2017\/11\/Thomas_Built_Buses_Saf-T-Liner_C2_Yellow_School_Bus-300x172.jpg\" alt=\"A shiny GWAS\" width=\"300\" height=\"172\" srcset=\"https:\/\/blogs.kcl.ac.uk\/editlab\/files\/2017\/11\/Thomas_Built_Buses_Saf-T-Liner_C2_Yellow_School_Bus-300x172.jpg 300w, https:\/\/blogs.kcl.ac.uk\/editlab\/files\/2017\/11\/Thomas_Built_Buses_Saf-T-Liner_C2_Yellow_School_Bus.jpg 1024w\" sizes=\"auto, (max-width: 300px) 100vw, 300px\" \/><\/a><p id=\"caption-attachment-1880\" class=\"wp-caption-text\">A shiny GWAS<\/p><\/div>\n<p>With time come new samples, new data and replication. Take the 18 IQ loci \u2013 in a cohort nearly four times the size, 15 of those loci remained significant. Or look at the schizophrenia region found in 2009 \u2013 that\u2019s now arguably the most robust region in schizophrenia genetics, with a plausible functional role (that perhaps wasn\u2019t the one initially expected) and is now coming up in other psychiatric disorders like <a href=\"https:\/\/www.biorxiv.org\/content\/early\/2017\/07\/24\/167577\" target=\"_blank\">depression<\/a> (arguing that it might be associated with psychiatric disorders more generally). As such, a kind of replication reminiscent of the traditional method is still possible in GWAS, through building new cohorts into the older cohorts, and seeing what survives.<\/p>\n<p>Because it\u2019s now Christmas, as told by the <a href=\"https:\/\/www.youtube.com\/watch?v=Jw1Y-zhQURU\" target=\"_blank\">traditional herald angels\u00a0John and Lewis<\/a>, I leave you with a present of blogs yet to come focussed on the spirit of GWAS past. Here, I\u2019ve said a kind of replication can be achieved by the addition of new samples to older GWAS. Next time, I will show that it can be informative to look back at old GWAS to understand the evolution of newly-discovered hits.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Replication is the lifeblood of science \u2013 but how do you replicate when your cohort is the biggest available? EDiT Lab Associate Member and genomics crank-turner postdoc extraordinaire Joni gives three ways how.<\/p>\n","protected":false},"author":163,"featured_media":1880,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[9],"tags":[38,22,71,83,230,67,219],"class_list":["post-1878","post","type-post","status-publish","format-standard","has-post-thumbnail","category-research-matters","tag-genetics","tag-genomics","tag-mental-health-research","tag-methodology","tag-replication","tag-research-methods","tag-statistics"],"_links":{"self":[{"href":"https:\/\/blogs.kcl.ac.uk\/editlab\/wp-json\/wp\/v2\/posts\/1878","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/blogs.kcl.ac.uk\/editlab\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/blogs.kcl.ac.uk\/editlab\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/blogs.kcl.ac.uk\/editlab\/wp-json\/wp\/v2\/users\/163"}],"replies":[{"embeddable":true,"href":"https:\/\/blogs.kcl.ac.uk\/editlab\/wp-json\/wp\/v2\/comments?post=1878"}],"version-history":[{"count":17,"href":"https:\/\/blogs.kcl.ac.uk\/editlab\/wp-json\/wp\/v2\/posts\/1878\/revisions"}],"predecessor-version":[{"id":1913,"href":"https:\/\/blogs.kcl.ac.uk\/editlab\/wp-json\/wp\/v2\/posts\/1878\/revisions\/1913"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/blogs.kcl.ac.uk\/editlab\/wp-json\/wp\/v2\/media\/1880"}],"wp:attachment":[{"href":"https:\/\/blogs.kcl.ac.uk\/editlab\/wp-json\/wp\/v2\/media?parent=1878"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/blogs.kcl.ac.uk\/editlab\/wp-json\/wp\/v2\/categories?post=1878"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/blogs.kcl.ac.uk\/editlab\/wp-json\/wp\/v2\/tags?post=1878"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}