
Abstract Background Many Bioinformatics studies begin with a multiple sequence alignment as the foundation for their research. This is because multiple sequence alignment can be a useful technique for studying molecular evolution and analyzing sequence structure relationships. Results In this paper, we have proposed a Vertical Decomposition with Genetic Algorithm (VDGA) for Multiple Sequence Alignment (MSA). In VDGA, we divide the sequences vertically into two or more subsequences, and then solve them individually using a guide tree approach. Finally, we combine all the subsequences to generate a new multiple sequence alignment. This technique is applied on the solutions of the initial generation and of each child generation within VDGA. We have used two mechanisms to generate an initial population in this research: the first mechanism is to generate guide trees with randomly selected sequences and the second is shuffling the sequences inside such trees. Two different genetic operators have been implemented with VDGA. To test the performance of our algorithm, we have compared it with existing well-known methods, namely PRRP, CLUSTALX, DIALIGN, HMMT, SB_PIMA, ML_PIMA, MULTALIGN, and PILEUP8, and also other methods, based on Genetic Algorithms (GA), such as SAGA, MSA-GA and RBT-GA, by solving a number of benchmark datasets from BAliBase 2.0. Conclusions The experimental results showed that the VDGA with three vertical divisions was the most successful variant for most of the test cases in comparison to other divisions considered with VDGA. The experimental results also confirmed that VDGA outperformed the other methods considered in this research.
570, Evolution, QH301-705.5, Computer applications to medicine. Medical informatics, anzsrc-for: 46 Information and Computing Sciences, R858-859.7, anzsrc-for: 49 Mathematical sciences, Biochemistry, 3105 Genetics, Evolution, Molecular, 3102 Bioinformatics and Computational Biology, 46 Information and Computing Sciences, Humans, anzsrc-for: 31 Biological Sciences, Biology (General), Child, Molecular Biology, Base Sequence, anzsrc-for: 080201 Analysis of Algorithms and Complexity, anzsrc-for: 01 Mathematical Sciences, Molecular, Computational Biology, DNA, Sequence Analysis, DNA, anzsrc-for: 4602 Artificial Intelligence, 004, Computer Science Applications, anzsrc-for: 3105 Genetics, 4602 Artificial Intelligence, anzsrc-for: 06 Biological Sciences, Generic health relevance, anzsrc-for: 3102 Bioinformatics and Computational Biology, anzsrc-for: 08 Information and Computing Sciences, Sequence Analysis, Sequence Alignment, Algorithms, 31 Biological Sciences, Research Article
570, Evolution, QH301-705.5, Computer applications to medicine. Medical informatics, anzsrc-for: 46 Information and Computing Sciences, R858-859.7, anzsrc-for: 49 Mathematical sciences, Biochemistry, 3105 Genetics, Evolution, Molecular, 3102 Bioinformatics and Computational Biology, 46 Information and Computing Sciences, Humans, anzsrc-for: 31 Biological Sciences, Biology (General), Child, Molecular Biology, Base Sequence, anzsrc-for: 080201 Analysis of Algorithms and Complexity, anzsrc-for: 01 Mathematical Sciences, Molecular, Computational Biology, DNA, Sequence Analysis, DNA, anzsrc-for: 4602 Artificial Intelligence, 004, Computer Science Applications, anzsrc-for: 3105 Genetics, 4602 Artificial Intelligence, anzsrc-for: 06 Biological Sciences, Generic health relevance, anzsrc-for: 3102 Bioinformatics and Computational Biology, anzsrc-for: 08 Information and Computing Sciences, Sequence Analysis, Sequence Alignment, Algorithms, 31 Biological Sciences, Research Article
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 38 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 10% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
