A genotype calling algorithm for affymetrix SNP arrays

descriptionPublicationkeyboard_double_arrow_right Article 02 Nov 2005 English Publisher:Oxford University Press (OUP)Journal:Bioinformatics, volume 22, pages 7-12 (issn: 1367-4803, eissn: 1367-4811,

Copyright policy )

Authors: Nusrat, Rabbee; Terence P, Speed;

doi: 10.1093/bioinformatics/bti741

pmid: 16267090

A genotype calling algorithm for affymetrix SNP arrays

- Summary
- Subjects
- Related research
  (1)
- Metrics

Abstract

Abstract Motivation: A classification algorithm, based on a multi-chip, multi-SNP approach is proposed for Affymetrix SNP arrays. Current procedures for calling genotypes on SNP arrays process all the features associated with one chip and one SNP at a time. Using a large training sample where the genotype labels are known, we develop a supervised learning algorithm to obtain more accurate classification results on new data. The method we propose, RLMM, is based on a robustly fitted, linear model and uses the Mahalanobis distance for classification. The chip-to-chip non-biological variance is reduced through normalization. This model-based algorithm captures the similarities across genotype groups and probes, as well as across thousands of SNPs for accurate classification. In this paper, we apply RLMM to Affymetrix 100 K SNP array data, present classification results and compare them with genotype calls obtained from the Affymetrix procedure DM, as well as to the publicly available genotype calls from the HapMap project. Availability: The RLMM software is implemented in R and is available from Bioconductor or from the first author at nrabbee@post.harvard.edu. Contact: nrabbee@stat.berkeley.edu Supplementary information:

Related Organizations

University of California, Berkeley
United States
Walter and Eliza Hall Institute of Medical Research
Australia

Keywords

Models, Statistical, Genotype, Models, Genetic, Gene Expression Profiling, DNA Mutational Analysis, Normal Distribution, Computational Biology, Sequence Analysis, DNA, Polymorphism, Single Nucleotide, Alternative Splicing, Gene Frequency, Haplotypes, Humans, Regression Analysis, Algorithms, Alleles, Software, Oligonucleotide Array Sequence Analysis

1 Research products, page 1 of 1

RLMM
2017IsSupplementedBy

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	309
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 1%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Top 1%
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Top 1%

Found an issue? Give us feedback

309

Top 1%

gold

Fields of Science (3) View all

medical and health sciences

basic medicine

Fields of Science

medical and health sciences

basic medicine

View all

Related to Research communities

ELIXIR GR