
Whole genome comparison and clustering cannot be routinely performed without access to significant resources. If as expected, repositories continue to grow at the current rate, increasingly large and expensive systems will be required in order to maintain the status quo. The high-proportion of uncharacterised gene-sequences, combined with the fact that the majority of sequence analysis techniques are alignment-based, raises the possibility that alternative approaches might be able to identify relationships that have otherwise been missed. There is a need for alternative ways to predict function. PSST is an analysis tool with parallels to both pairwise algorithms and multiple motif-based pattern approaches. It is significantly faster than BLAST, and for some families including GPCRs, the tool is more sensitive and selective as well. For others it is worse. This paper describes the algorithm, its implementation, its evaluation against a diverse set of protein families, and discusses the reasons behind its behaviour.
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 1 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
