A synergistic DNA logic predicts genome-wide chromatin accessibility
Author(s)
Sherwood, Richard I.; Emons, Bart J.M.; Hashimoto, Tatsunori Benjamin; Kang, Daniel D.; Rajagopal, Nisha; Barkal, Amira; Zeng, Haoyang; Srinivasan, Sharanya; Jaakkola, Tommi S.; Gifford, David K.; ... Show more Show less
DownloadHashimoto_A synergistic DNA lo.pdf (1.286Mb)
PUBLISHER_CC
Publisher with Creative Commons License
Creative Commons Attribution
Terms of use
Metadata
Show full item recordAbstract
Enhancers and promoters commonly occur in accessible chromatin characterized by depleted nucleosome contact; however, it is unclear how chromatin accessibility is governed. We show that log-additive cis-acting DNA sequence features can predict chromatin accessibility at high spatial resolution. We develop a new type of high-dimensional machine learning model, the Synergistic Chromatin Model (SCM), which when trained with DNase-seq data for a cell type is capable of predicting expected read counts of genome-wide chromatin accessibility at every base from DNA sequence alone, with the highest accuracy at hypersensitive sites shared across cell types. We confirm that a SCM accurately predicts chromatin accessibility for thousands of synthetic DNA sequences using a novel CRISPR-based method of highly efficient site-specific DNA library integration. SCMs are directly interpretable and reveal that a logic based on local, nonspecific synergistic effects, largely among pioneer TFs, is sufficient to predict a large fraction of cellular chromatin accessibility in a wide variety of cell types.
Date issued
206-08Department
Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory; Massachusetts Institute of Technology. Department of Electrical Engineering and Computer ScienceJournal
Genome Research
Publisher
Cold Spring Harbor Laboratory Press
Citation
Hashimoto, Tatsunori et al. “A Synergistic DNA Logic Predicts Genome-Wide Chromatin Accessibility.” Genome Research 26.10 (2016): 1430–1440.
Version: Final published version
ISSN
1088-9051
1549-5469