
Abstract The problem of reverse engineering gene regulatory networks from high-throughput expression data is one of the biggest challenges in bioinformatics. In order to benchmark network inference algorithms, simulators of well-characterized expression datasets are often required. However, existing simulators have been criticized because they fail to emulate key properties of gene expression data. In this study we address two problems. First, we propose mechanisms to faithfully assess the realism of a synthetic gene expression dataset. Second, we design an adversarial simulator of expression data, gGAN, based on a Generative Adversarial Network. We show that our model outperforms existing simulators by a large margin, achieving realism scores that are up to 17 times higher than those of GeneNetWeaver and SynTReN. More importantly, our results show that gGAN is, to our best knowledge, the first simulator that passes the Turing test for gene expression data proposed by Maier et al. (2013).
Gene Expression Profiling, Escherichia coli, Humans, Gene Expression, Escherichia coli; Gene Expression; Gene Expression Profiling; Humans, Original Papers
Gene Expression Profiling, Escherichia coli, Humans, Gene Expression, Escherichia coli; Gene Expression; Gene Expression Profiling; Humans, Original Papers
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 35 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 10% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
