The evolution and mutational robustness of chromatin accessibility in Drosophila.

TitleThe evolution and mutational robustness of chromatin accessibility in Drosophila.
Publication TypeJournal Article
Year of Publication2023
AuthorsKhodursky S, Zheng EB, Svetec N, Durkin SM, Benjamin S, Gadau A, Wu X, Zhao L
JournalGenome Biol
Volume24
Issue1
Pagination232
Date Published2023 Oct 16
ISSN1474-760X
KeywordsAnimals, Chromatin, Chromosomes, Drosophila, Mutation, Regulatory Sequences, Nucleic Acid
Abstract

BACKGROUND: The evolution of genomic regulatory regions plays a critical role in shaping the diversity of life. While this process is primarily sequence-dependent, the enormous complexity of biological systems complicates the understanding of the factors underlying regulation and its evolution. Here, we apply deep neural networks as a tool to investigate the sequence determinants underlying chromatin accessibility in different species and tissues of Drosophila.

RESULTS: We train hybrid convolution-attention neural networks to accurately predict ATAC-seq peaks using only local DNA sequences as input. We show that our models generalize well across substantially evolutionarily diverged species of insects, implying that the sequence determinants of accessibility are highly conserved. Using our model to examine species-specific gains in accessibility, we find evidence suggesting that these regions may be ancestrally poised for evolution. Using in silico mutagenesis, we show that accessibility can be accurately predicted from short subsequences in each example. However, in silico knock-out of these sequences does not qualitatively impair classification, implying that accessibility is mutationally robust. Subsequently, we show that accessibility is predicted to be robust to large-scale random mutation even in the absence of selection. Conversely, simulations under strong selection demonstrate that accessibility can be extremely malleable despite its robustness. Finally, we identify motifs predictive of accessibility, recovering both novel and previously known motifs.

CONCLUSIONS: These results demonstrate the conservation of the sequence determinants of accessibility and the general robustness of chromatin accessibility, as well as the power of deep neural networks to explore fundamental questions in regulatory genomics and evolution.

DOI10.1186/s13059-023-03079-5
Alternate JournalGenome Biol
PubMed ID37845780
PubMed Central IDPMC10578003
Grant ListR35 GM133780 / GM / NIGMS NIH HHS / United States
T32 GM066699 / GM / NIGMS NIH HHS / United States
T32 GM007739 / GM / NIGMS NIH HHS / United States

Person Type: