Found an issue? Give us feedback

https://doi.org/10.1...arrow_drop_down

https://doi.org/10.1017/978131...

Part of book or chapter of book . 2017 . Peer-reviewed

Data sources: Crossref

https://www.cambridge.org/core...

Other literature type

Data sources: Microsoft Academic Graph

Select content type to embed

All Research products

arrow_drop_down

<script type="text/javascript">
<!--
document.write('<div id="oa_widget"></div>');
document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=undefined&type=result"></script>');
-->
</script>

COPY SCRIPT

For further information contact us at helpdesk@openaire.eu

Multiple Sequence Alignment

Name: Multiple Sequence Alignment
Creator: Tandy Warnow

descriptionPublicationkeyboard_double_arrow_right Part of book or chapter of book , Other literature type 26 Oct 2017Publisher:Cambridge University Press (CUP)

Authors: Tandy Warnow;

doi: 10.1017/9781316882313.011

Multiple Sequence Alignment

- Summary
- Related research
  (1)
- Metrics

Abstract

Introduction Phylogeny estimation generally begins by estimating a multiple sequence alignment on the set of sequences. Once the multiple sequence alignment is computed, a tree can then be computed on the alignment (Figure 9.1). Not surprisingly, errors in multiple sequence alignment estimation tend to produce errors in estimated trees (Ogden and Rosenberg, 2006; Nelesen et al., 2008; Liu et al., 2009a; Wang et al., 2012) and other downstream analyses. Hence, multiple sequence alignment is an important part of phylogeny estimation. As we have seen, there are many methods for estimating trees from gap-free data. However, because multiple sequence alignments almost always contain gaps, represented as dashes, phylogeny estimation methods must be modified to be able to analyze alignments with dashes. Typically this is performed by treating the dashes as missing data (i.e., missing data means there is an actual nucleotide or amino acid, but it is not known). Alternatively, the dashes are sometimes treated as an additional state in the sequence evolution model, thus producing five states for nucleotide alignments or 21 states for amino acid alignments. Finally, sometimes sites (i.e., columns in the multiple sequence alignment) containing dashes are eliminated from the alignment before a tree is computed. The different treatments of sequence alignments can result in quite different theoretical and empirical performance. Multiple sequence alignments are computed for different purposes, including phylogeny estimation and protein structure prediction, and the definition of what constitutes a correct alignment depends, at least in part, on the purpose for the alignment. For some biological datasets, curated alignments, typically based on experimentally confirmed structural features of the molecules (e.g., secondary structures or tertiary structures of RNAs and proteins), are used as benchmarks for evaluating alignment methods. Examples of such benchmarks for evaluating large amino acid alignments include HomFam (Sievers et al., 2011), BAliBASE (Thompson et al., 1999), and the 10AA collection (Nguyen et al., 2015b), while the Comparative Ribosomal Website (CRW) provides benchmarks for RNA alignment (Cannone et al., 2002). Evolutionary alignments, on the other hand, are defined by the evolutionary history relating the sequences.

Related Organizations

University of Illinois at Urbana Champaign
United States
University of Illinois System
United States

1 Research products, page 1 of 1

Datasets for EMMA: A New Method for Computing Multiple Sequence Alignments given a Constraint Subset Alignment
2022IsAmongTopNSimilarDocuments

Impact byBIP!

	citations This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	0
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

Average

Upload OA version

Are you the author? Do you have the OA version of this publication?

uploadUpload now!

Multiple Sequence Alignment

Multiple Sequence Alignment

1 Research products, page 1 of 1

Datasets for EMMA: A New Method for Computing Multiple Sequence Alignments given a Constraint Subset Alignment