Genome-wide de novo risk score implicates promoter variation in autism spectrum disorder

  • Joon-Yong An
    Department of Psychiatry, UCSF Weill Institute for Neurosciences, University of California, San Francisco, CA, USA.
  • Kevin Lin
    Department of Statistics and Data Science, Carnegie Mellon University, Pittsburgh, PA 15213, USA.
  • Lingxue Zhu
    Department of Statistics and Data Science, Carnegie Mellon University, Pittsburgh, PA 15213, USA.
  • Donna M. Werling
    Department of Psychiatry, UCSF Weill Institute for Neurosciences, University of California, San Francisco, CA, USA.
  • Shan Dong
    Department of Psychiatry, UCSF Weill Institute for Neurosciences, University of California, San Francisco, CA, USA.
  • Harrison Brand
    Center for Genomic Medicine and Department of Neurology, Massachusetts General Hospital, Boston, MA, USA.
  • Harold Z. Wang
    Center for Genomic Medicine and Department of Neurology, Massachusetts General Hospital, Boston, MA, USA.
  • Xuefang Zhao
    Center for Genomic Medicine and Department of Neurology, Massachusetts General Hospital, Boston, MA, USA.
  • Grace B. Schwartz
    Department of Psychiatry, UCSF Weill Institute for Neurosciences, University of California, San Francisco, CA, USA.
  • Ryan L. Collins
    Center for Genomic Medicine and Department of Neurology, Massachusetts General Hospital, Boston, MA, USA.
  • Benjamin B. Currall
    Center for Genomic Medicine and Department of Neurology, Massachusetts General Hospital, Boston, MA, USA.
  • Claudia Dastmalchi
    Department of Psychiatry, UCSF Weill Institute for Neurosciences, University of California, San Francisco, CA, USA.
  • Jeanselle Dea
    Department of Psychiatry, UCSF Weill Institute for Neurosciences, University of California, San Francisco, CA, USA.
  • Clif Duhn
    Department of Psychiatry, UCSF Weill Institute for Neurosciences, University of California, San Francisco, CA, USA.
  • Michael C. Gilson
    Department of Psychiatry, UCSF Weill Institute for Neurosciences, University of California, San Francisco, CA, USA.
  • Lambertus Klei
    Department of Psychiatry, University of Pittsburgh School of Medicine, Pittsburgh, PA 15213, USA.
  • Lindsay Liang
    Department of Psychiatry, UCSF Weill Institute for Neurosciences, University of California, San Francisco, CA, USA.
  • Eirene Markenscoff-Papadimitriou
    Department of Psychiatry, UCSF Weill Institute for Neurosciences, University of California, San Francisco, CA, USA.
  • Sirisha Pochareddy
    Department of Neuroscience and Kavli Institute for Neuroscience, Yale School of Medicine, New Haven, CT 06510, USA.
  • Nadav Ahituv
    Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA, USA.
  • Joseph D. Buxbaum
    Seaver Autism Center for Research and Treatment, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA.
  • Hilary Coon
    Department of Psychiatry, University of Utah School of Medicine, Salt Lake City, UT, USA.
  • Mark J. Daly
    Program in Medical and Population Genetics and the Stanley Center for Psychiatric Research, Broad Institute, Cambridge, MA, USA.
  • Young Shin Kim
    Department of Psychiatry, UCSF Weill Institute for Neurosciences, University of California, San Francisco, CA, USA.
  • Gabor T. Marth
    Department of Human Genetics, University of Utah School of Medicine, Salt Lake City, UT, USA.
  • Benjamin M. Neale
    Program in Medical and Population Genetics and the Stanley Center for Psychiatric Research, Broad Institute, Cambridge, MA, USA.
  • Aaron R. Quinlan
    Department of Biomedical Informatics, University of Utah School of Medicine, Salt Lake City, UT, USA.
  • John L. Rubenstein
    Department of Psychiatry, UCSF Weill Institute for Neurosciences, University of California, San Francisco, CA, USA.
  • Nenad Sestan
    Department of Neuroscience and Kavli Institute for Neuroscience, Yale School of Medicine, New Haven, CT 06510, USA.
  • Matthew W. State
    Department of Psychiatry, UCSF Weill Institute for Neurosciences, University of California, San Francisco, CA, USA.
  • A. Jeremy Willsey
    Department of Psychiatry, UCSF Weill Institute for Neurosciences, University of California, San Francisco, CA, USA.
  • Michael E. Talkowski
    Center for Genomic Medicine and Department of Neurology, Massachusetts General Hospital, Boston, MA, USA.
  • Bernie Devlin
    Department of Psychiatry, University of Pittsburgh School of Medicine, Pittsburgh, PA 15213, USA.
  • Kathryn Roeder
    Department of Statistics and Data Science, Carnegie Mellon University, Pittsburgh, PA 15213, USA.
  • Stephan J. Sanders
    Department of Psychiatry, UCSF Weill Institute for Neurosciences, University of California, San Francisco, CA, USA.

Description

INTRODUCTION

The DNA of protein-coding genes is transcribed into mRNA, which is translated into proteins. The “coding genome” describes the DNA that contains the information to make these proteins and represents ~1.5% of the human genome. Newly arising de novo mutations (variants observed in a child but not in either parent) in the coding genome contribute to numerous childhood developmental disorders, including autism spectrum disorder (ASD). Discovery of these effects is aided by the triplet code, which enables the functional impact of many mutations to be readily deciphered. In contrast, the “noncoding genome” covers the remaining ~98.5% and includes elements that regulate when, where, and to what degree protein-coding genes are transcribed. Understanding this noncoding sequence could provide insights into human disorders and refined control of emerging genetic therapies. Yet little is known about the role of mutations in noncoding regions, including whether they contribute to childhood developmental disorders, which noncoding elements are most vulnerable to disruption, and the manner in which information is encoded in the noncoding genome.

RATIONALE

Whole-genome sequencing (WGS) provides the opportunity to identify the majority of genetic variation in each individual. By performing WGS on 1902 quartet families, each comprising a child affected with ASD, one unaffected sibling control, and their parents, we identified ~67 de novo mutations across each child’s genome. To characterize the functional role of these mutations, we integrated multiple datasets relating to gene function, genes implicated in neurodevelopmental disorders, conservation across species, and epigenetic markers, thereby combinatorially defining 55,143 categories. The scope of the problem (testing for an excess of de novo mutations in cases relative to controls for each category) is challenging because there are more categories than families.

RESULTS

Comparing cases to controls, we observed an excess of de novo mutations in cases in individual categories in the coding genome but not in the noncoding genome. To overcome the challenge of detecting noncoding association, we used machine learning tools to develop a de novo risk score to look for an excess of de novo mutations across multiple categories. This score demonstrated a contribution to ASD risk from coding mutations and a weaker, but significant, contribution from noncoding mutations. This noncoding signal was driven by mutations in the promoter region, defined as the 2000 nucleotides upstream of the transcription start site (TSS) where mRNA synthesis starts. The strongest promoter signals were defined by conservation across species and transcription factor binding sites. Well-defined promoter elements (e.g., TATA-box) are usually observed within 80 nucleotides of the TSS; however, the strongest ASD association was observed distally, 750 to 2000 nucleotides upstream of the TSS.

CONCLUSION

We conclude that de novo mutations in the noncoding genome contribute to ASD. The clearest evidence of noncoding ASD association came from mutations at evolutionarily conserved nucleotides in the promoter region. The enrichment for transcription factor binding sites, primarily in the distal promoter, suggests that these mutations may disrupt gene transcription via their interaction with enhancer elements in the promoter region, rather than interfering with transcriptional initiation directly.

Figure: Promoter regions in autism. De novo mutations from 1902 quartet families are assigned to 55,143 annotation categories, which are each assessed for autism spectrum disorder (ASD) association by comparing mutation counts in cases and sibling controls. A de novo risk score demonstrated a noncoding contribution to ASD driven by promoter mutations, especially at sites conserved across species, in the distal promoter or targeted by transcription factors.
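
The per-category comparison of mutation counts in cases and sibling controls described above can be illustrated with a small sketch. The abstract does not say which statistical test the authors used, so the exact two-sided binomial test here is an assumption chosen for simplicity. It relies on the fact that each family contributes one case and one control, so under the null hypothesis a de novo mutation in a given category is equally likely to fall in either group. The function names and the example counts are hypothetical.

```python
from math import exp, lgamma, log


def binomial_two_sided_p(k: int, n: int, p: float = 0.5) -> float:
    """Exact two-sided binomial p-value.

    Sums the probabilities of all outcomes that are no more likely
    than the observed count k (computed in log space for stability).
    """
    if n == 0:
        return 1.0

    def pmf(i: int) -> float:
        log_choose = lgamma(n + 1) - lgamma(i + 1) - lgamma(n - i + 1)
        return exp(log_choose + i * log(p) + (n - i) * log(1 - p))

    probs = [pmf(i) for i in range(n + 1)]
    threshold = probs[k] * (1 + 1e-9)  # tolerate floating-point ties
    return min(1.0, sum(q for q in probs if q <= threshold))


def category_burden(case_count: int, control_count: int):
    """Return (case/control ratio, p-value) for one annotation category.

    Assumption: equal numbers of cases and controls, so the null
    probability that a mutation belongs to a case is 0.5.
    """
    total = case_count + control_count
    ratio = case_count / control_count if control_count else float("inf")
    return ratio, binomial_two_sided_p(case_count, total)


# Hypothetical counts for a single category (not data from the study).
ratio, p_value = category_burden(case_count=30, control_count=10)
print(f"ratio={ratio:.2f}, p={p_value:.4f}")
```

With tens of thousands of overlapping categories, a per-category p-value like this would need a stringent multiple-testing correction, which is the difficulty the abstract points to when it notes that there are more categories than families and motivates the combined de novo risk score.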

Published in

  • Science

    Science 362 (6420), eaat6576, 2018-12-14

    American Association for the Advancement of Science (AAAS)
