
pmid: 39443793
pmc: PMC11525185
AbstractCis-regulatory elements (CREs) control gene expression, orchestrating tissue identity, developmental timing and stimulus responses, which collectively define the thousands of unique cell types in the body1–3. While there is great potential for strategically incorporating CREs in therapeutic or biotechnology applications that require tissue specificity, there is no guarantee that an optimal CRE for these intended purposes has arisen naturally. Here we present a platform to engineer and validate synthetic CREs capable of driving gene expression with programmed cell-type specificity. We take advantage of innovations in deep neural network modelling of CRE activity across three cell types, efficient in silico optimization and massively parallel reporter assays to design and empirically test thousands of CREs4–8. Through large-scale in vitro validation, we show that synthetic sequences are more effective at driving cell-type-specific expression in three cell lines compared with natural sequences from the human genome and achieve specificity in analogous tissues when tested in vivo. Synthetic sequences exhibit distinct motif vocabulary associated with activity in the on-target cell type and a simultaneous reduction in the activity of off-target cells. Together, we provide a generalizable framework to prospectively engineer CREs from massively parallel reporter assay models and demonstrate the required literacy to write fit-for-purpose regulatory code.
Male, 570, Neural Networks, Cells, Regulatory Sequences, Nucleic Acid, Article, Cell Line, Machine Learning, Computer, Mice, Deep Learning, Genes, Reporter, Animals, Humans, Computer Simulation, Reporter, Zebrafish, Genome, JGM, Nucleic Acid, Genome, Human, Reproducibility of Results, Genes, Gene Expression Regulation, Organ Specificity, Female, Neural Networks, Computer, Genetic Engineering, Regulatory Sequences, Human
Male, 570, Neural Networks, Cells, Regulatory Sequences, Nucleic Acid, Article, Cell Line, Machine Learning, Computer, Mice, Deep Learning, Genes, Reporter, Animals, Humans, Computer Simulation, Reporter, Zebrafish, Genome, JGM, Nucleic Acid, Genome, Human, Reproducibility of Results, Genes, Gene Expression Regulation, Organ Specificity, Female, Neural Networks, Computer, Genetic Engineering, Regulatory Sequences, Human
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 109 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 1% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 10% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 0.1% |
