Alignment-free Sequence Comparison: Benefits, Applications, and Tools
- Published: 01 Oct 2017
- Publisher: Springer Science and Business Media LLC
- Funder: Fundação para a Ciência e a Tecnologia, I.P. (FCT)
- Project Code: 147353
- Funding stream: 5876
Altschul, SF, Madden, TL, Schäffer, AA, Zhang, J, Zhang, Z, Miller, W. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 1997; 25: 3389-402 [OpenAIRE] [PubMed] [DOI]
Pearson, WR, Lipman, DJ. Improved tools for biological sequence comparison. Proc Natl Acad Sci U S A. 1988; 85: 2444-8 [OpenAIRE] [PubMed] [DOI]
Thompson, JD, Higgins, DG, Gibson, TJ. CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 1994; 22: 4673-80 [OpenAIRE] [PubMed] [DOI]
Edgar, RC. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004; 32: 1792-7 [OpenAIRE] [PubMed] [DOI]
Katoh, K, Misawa, K, Kuma, K, Miyata, T. MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Res. 2002; 30: 3059-66 [OpenAIRE] [PubMed] [DOI]
Finn, RD, Bateman, A, Clements, J, Coggill, P, Eberhardt, RY, Eddy, SR. Pfam: the protein families database. Nucleic Acids Res. 2014; 42: D222-30 [OpenAIRE] [PubMed] [DOI]
Darling, AE, Mau, B, Perna, NT. ProgressiveMauve: multiple genome alignment with gene gain, loss and rearrangement. PLoS One. 2010; 5: e11147 [OpenAIRE] [PubMed] [DOI]
Schwartz, S, Kent, WJ, Smit, A, Zhang, Z, Baertsch, R, Hardison, RC. Human-mouse alignments with BLASTZ. Genome Res. 2003; 13: 103-7 [OpenAIRE] [PubMed] [DOI]
Blanchette, M, Kent, WJ, Riemer, C, Elnitski, L, Smit, AF, Roskin, KM. Aligning multiple genomic sequences with the threaded blockset aligner. Genome Res. 2004; 14: 708-15 [OpenAIRE] [PubMed] [DOI]
Duffy, S, Shackelton, LA, Holmes, EC. Rates of evolutionary change in viruses: patterns and determinants. Nat Rev Genet. 2008; 9: 267-76 [OpenAIRE] [PubMed] [DOI]
Song, N, Joseph, JM, Davis, GB, Durand, D. Sequence similarity network reveals common ancestry of multidomain proteins. PLoS Comput Biol. 2008; 4: e1000063 [OpenAIRE] [PubMed] [DOI]
Terrapon, N, Weiner, J, Grath, S, Moore, AD, Bornberg-Bauer, E. Rapid similarity search of proteins using alignments of domain arrangements. Bioinformatics. 2014; 30: 274-81 [OpenAIRE] [PubMed] [DOI]
Xiong, J. Essential bioinformatics. 2006
Rost, B. Twilight zone of protein sequence alignments. Protein Eng. 1999; 12: 85-94 [OpenAIRE] [PubMed] [DOI]
Chattopadhyay, AK, Nasiev, D, Flower, DR. A statistical physics perspective on alignment-independent protein sequence comparison. Bioinformatics. 2015; 31: 2469-74 [OpenAIRE] [PubMed] [DOI]
Supplementary Outcomes
Related research
- Funder: Fundação para a Ciência e a Tecnologia, I.P. (FCT)
- Project Code: 147353
- Funding stream: 5876
Altschul, SF, Madden, TL, Schäffer, AA, Zhang, J, Zhang, Z, Miller, W. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 1997; 25: 3389-402 [OpenAIRE] [PubMed] [DOI]
Pearson, WR, Lipman, DJ. Improved tools for biological sequence comparison. Proc Natl Acad Sci U S A. 1988; 85: 2444-8 [OpenAIRE] [PubMed] [DOI]
Thompson, JD, Higgins, DG, Gibson, TJ. CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 1994; 22: 4673-80 [OpenAIRE] [PubMed] [DOI]
Edgar, RC. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004; 32: 1792-7 [OpenAIRE] [PubMed] [DOI]
Katoh, K, Misawa, K, Kuma, K, Miyata, T. MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Res. 2002; 30: 3059-66 [OpenAIRE] [PubMed] [DOI]
Finn, RD, Bateman, A, Clements, J, Coggill, P, Eberhardt, RY, Eddy, SR. Pfam: the protein families database. Nucleic Acids Res. 2014; 42: D222-30 [OpenAIRE] [PubMed] [DOI]
Darling, AE, Mau, B, Perna, NT. ProgressiveMauve: multiple genome alignment with gene gain, loss and rearrangement. PLoS One. 2010; 5: e11147 [OpenAIRE] [PubMed] [DOI]
Schwartz, S, Kent, WJ, Smit, A, Zhang, Z, Baertsch, R, Hardison, RC. Human-mouse alignments with BLASTZ. Genome Res. 2003; 13: 103-7 [OpenAIRE] [PubMed] [DOI]
Blanchette, M, Kent, WJ, Riemer, C, Elnitski, L, Smit, AF, Roskin, KM. Aligning multiple genomic sequences with the threaded blockset aligner. Genome Res. 2004; 14: 708-15 [OpenAIRE] [PubMed] [DOI]
Duffy, S, Shackelton, LA, Holmes, EC. Rates of evolutionary change in viruses: patterns and determinants. Nat Rev Genet. 2008; 9: 267-76 [OpenAIRE] [PubMed] [DOI]
Song, N, Joseph, JM, Davis, GB, Durand, D. Sequence similarity network reveals common ancestry of multidomain proteins. PLoS Comput Biol. 2008; 4: e1000063 [OpenAIRE] [PubMed] [DOI]
Terrapon, N, Weiner, J, Grath, S, Moore, AD, Bornberg-Bauer, E. Rapid similarity search of proteins using alignments of domain arrangements. Bioinformatics. 2014; 30: 274-81 [OpenAIRE] [PubMed] [DOI]
Xiong, J. Essential bioinformatics. 2006
Rost, B. Twilight zone of protein sequence alignments. Protein Eng. 1999; 12: 85-94 [OpenAIRE] [PubMed] [DOI]
Chattopadhyay, AK, Nasiev, D, Flower, DR. A statistical physics perspective on alignment-independent protein sequence comparison. Bioinformatics. 2015; 31: 2469-74 [OpenAIRE] [PubMed] [DOI]