Integrative analysis of public ChIP-seq experiments reveals a complex multi-cell regulatory landscape
- Aurélien Griffon
  INSERM, UMR1090 TAGC, Marseille, F-13288, France
- Quentin Barbier
  INSERM, UMR1090 TAGC, Marseille, F-13288, France
- Jordi Dalino
  INSERM, UMR1090 TAGC, Marseille, F-13288, France
- Jacques van Helden
  INSERM, UMR1090 TAGC, Marseille, F-13288, France
- Salvatore Spicuglia
  INSERM, UMR1090 TAGC, Marseille, F-13288, France
- Benoit Ballester
  INSERM, UMR1090 TAGC, Marseille, F-13288, France
Abstract
The large collections of ChIP-seq data rapidly accumulating in public data warehouses provide genome-wide binding site maps for hundreds of transcription factors (TFs). However, the extent of the regulatory occupancy space in the human genome has not yet been fully assessed by integrating public ChIP-seq data sets and combining them with the ENCODE TF maps. To enable genome-wide identification of regulatory elements, we have collected, analysed and retained 395 available ChIP-seq data sets, merged with ENCODE peaks, covering a total of 237 TFs. This enhanced repertoire complements and refines current genome-wide occupancy maps by increasing the human genome regulatory search space by 14% compared to ENCODE alone, and also increases the complexity of the regulatory dictionary. As a direct application, we used this unified binding repertoire to annotate variant enhancer loci (VELs) derived from the H3K4me1 mark in two cancer cell lines (MCF-7, CRC) and observed enrichments of specific TFs involved in key biological functions related to cancer development and proliferation. These TF enrichments within VELs provide a direct annotation of non-coding regions detected in cancer genomes. Finally, full access to this catalogue is available online together with the TF enrichment analysis tool (http://tagc.univ-mrs.fr/remap/).
Journal
- Nucleic Acids Research
Nucleic Acids Research 43 (4), e27-e27, 2014-12-03
Oxford University Press (OUP)