
AbstractSeveral implicit methods to infer horizontal gene transfer (HGT) focus on pairs of genes that have diverged only after the divergence of the two species in which the genes reside. This situation defines the edge set of a graph, the later-divergence-time (LDT) graph, whose vertices correspond to genes colored by their species. We investigate these graphs in the setting of relaxed scenarios, i.e., evolutionary scenarios that encompass all commonly used variants of duplication-transfer-loss scenarios in the literature. We characterize LDT graphs as a subclass of properly vertex-colored cographs, and provide a polynomial-time recognition algorithm as well as an algorithm to construct a relaxed scenario that explains a given LDT. An edge in an LDT graph implies that the two corresponding genes are separated by at least one HGT event. The converse is not true, however. We show that the complete xenology relation is described by an rs-Fitch graph, i.e., a complete multipartite graph satisfying constraints on the vertex coloring. This class of vertex-colored graphs is also recognizable in polynomial time. We finally address the question “how much information about all HGT events is contained in LDT graphs” with the help of simulations of evolutionary scenarios with a wide range of duplication, loss, and HGT events. In particular, we show that a simple greedy graph editing scheme can be used to efficiently detect HGT events that are implicitly contained in LDT graphs.
binary relation, FOS: Computer and information sciences, Gene families, gene families, Gene Transfer, Horizontal, Discrete Mathematics (cs.DM), Applications of graph theory, Polynomial-time recognition algorithm, Article, 106005 Bioinformatik, Problems related to evolution, DUPLICATIONS, HISTORY, Computer Science - Data Structures and Algorithms, Binary relation, RECONSTRUCTION, Data Structures and Algorithms (cs.DS), Quantitative Biology - Populations and Evolution, Phylogeny, Indirect phylogenetic methods, MAXIMUM, COMPLEXITY, Fitch graph, ALGORITHMS, Populations and Evolution (q-bio.PE), 106045 Theoretische Biologie, polynomial-time recognition algorithm, Horizontal gene transfer, 106045 Theoretical biology, EVOLUTION, Later-divergence-time, REPLACEMENT, FOS: Biological sciences, BACTERIA, later-divergence-time, TREES, horizontal gene transfer, Genetics and epigenetics, xenology, indirect phylogenetic methods, 106005 Bioinformatics, Xenology, Algorithms, Computer Science - Discrete Mathematics
binary relation, FOS: Computer and information sciences, Gene families, gene families, Gene Transfer, Horizontal, Discrete Mathematics (cs.DM), Applications of graph theory, Polynomial-time recognition algorithm, Article, 106005 Bioinformatik, Problems related to evolution, DUPLICATIONS, HISTORY, Computer Science - Data Structures and Algorithms, Binary relation, RECONSTRUCTION, Data Structures and Algorithms (cs.DS), Quantitative Biology - Populations and Evolution, Phylogeny, Indirect phylogenetic methods, MAXIMUM, COMPLEXITY, Fitch graph, ALGORITHMS, Populations and Evolution (q-bio.PE), 106045 Theoretische Biologie, polynomial-time recognition algorithm, Horizontal gene transfer, 106045 Theoretical biology, EVOLUTION, Later-divergence-time, REPLACEMENT, FOS: Biological sciences, BACTERIA, later-divergence-time, TREES, horizontal gene transfer, Genetics and epigenetics, xenology, indirect phylogenetic methods, 106005 Bioinformatics, Xenology, Algorithms, Computer Science - Discrete Mathematics
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 11 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
