Title | The evolution and mutational robustness of chromatin accessibility in Drosophila. |
Publication Type | Journal Article |
Year of Publication | 2023 |
Authors | Khodursky S, Zheng EB, Svetec N, Durkin SM, Benjamin S, Gadau A, Wu X, Zhao L |
Journal | Genome Biol |
Volume | 24 |
Issue | 1 |
Pagination | 232 |
Date Published | 2023 Oct 16 |
ISSN | 1474-760X |
Keywords | Animals, Chromatin, Chromosomes, Drosophila, Mutation, Regulatory Sequences, Nucleic Acid |
Abstract | BACKGROUND: The evolution of genomic regulatory regions plays a critical role in shaping the diversity of life. While this process is primarily sequence-dependent, the enormous complexity of biological systems complicates the understanding of the factors underlying regulation and its evolution. Here, we apply deep neural networks as a tool to investigate the sequence determinants underlying chromatin accessibility in different species and tissues of Drosophila. RESULTS: We train hybrid convolution-attention neural networks to accurately predict ATAC-seq peaks using only local DNA sequences as input. We show that our models generalize well across substantially evolutionarily diverged species of insects, implying that the sequence determinants of accessibility are highly conserved. Using our model to examine species-specific gains in accessibility, we find evidence suggesting that these regions may be ancestrally poised for evolution. Using in silico mutagenesis, we show that accessibility can be accurately predicted from short subsequences in each example. However, in silico knock-out of these sequences does not qualitatively impair classification, implying that accessibility is mutationally robust. Subsequently, we show that accessibility is predicted to be robust to large-scale random mutation even in the absence of selection. Conversely, simulations under strong selection demonstrate that accessibility can be extremely malleable despite its robustness. Finally, we identify motifs predictive of accessibility, recovering both novel and previously known motifs. CONCLUSIONS: These results demonstrate the conservation of the sequence determinants of accessibility and the general robustness of chromatin accessibility, as well as the power of deep neural networks to explore fundamental questions in regulatory genomics and evolution. |
DOI | 10.1186/s13059-023-03079-5 |
Alternate Journal | Genome Biol |
PubMed ID | 37845780 |
PubMed Central ID | PMC10578003 |
Grant List | R35 GM133780 / GM / NIGMS NIH HHS / United States T32 GM066699 / GM / NIGMS NIH HHS / United States T32 GM007739 / GM / NIGMS NIH HHS / United States |
Submitted by bel2021 on February 16, 2024 - 10:40am