Weighted Distance Weighted Discrimination and Pairwise Variable Selection for Classification

Name: Weighted Distance Weighted Discrimination and Pairwise Variable Selection for Classification
Creator: Qiao, Xingye

Qiao, Xingye

Found an issue? Give us feedback

UNC Dataversearrow_drop_down

UNC Dataverse

Other literature type . 2010

Data sources: Datacite

Weighted Distance Weighted Discrimination and Pairwise Variable Selection for Classification

descriptionPublicationkeyboard_double_arrow_right Other literature type 01 Jan 2010 English Publisher:The University of North Carolina at Chapel Hill University Libraries

Authors: Qiao, Xingye;

doi: 10.17615/zcaw-ad20

Weighted Distance Weighted Discrimination and Pairwise Variable Selection for Classification

- Summary
- Related research
  (3)
- Metrics

Abstract

Statistical machine learning has attracted a lot of attention in recent years due to its broad applications in various fields. The driving statistical problem that is common throughout this dissertation is classification. This dissertation covers two major topics in classification. The first topic is weighted Distance Weighted Discrimination (weighted DWD or wDWD), an improved version of a recently proposed classification method. We show significant improvements are available in several situations. Using our proposed optimal weighting schemes, we show that wDWD is Fisher consistent under the overall misclassification criterion. In addition, we propose three alternative criteria and provide the corresponding optimal weights or adaptive weighting schemes for each of them. Mathematical validation of these ideas is established through the High-Dimensional, Low Sample-Size (HDLSS) asymptotic properties of wDWD. An important contribution is the weakening of the assumptions from Hall et al. (2005) and Ahn et al. (2007). We then extend the results to two classes. The HDLSS asymptotic properties of wDWD that we discuss here contain two results, one is about the misclassification rate of wDWD, the other explores the angle between the DWD direction and the optimal classification direction. The second topic of this dissertation is variable selection for classification. The goal is to find those variables that have weak marginal effects, but can lead to good classification results when they are viewed jointly. To accomplish this, we use a within-class permutation test called Significance test of Joint Effect (SigJEff). The resulting object of SigJEff is a set of pairs of variables with statistically significant joint effects. To extend our scope to joint effects with more than two variables, we introduce a new visualization approach to display the mutiscale joint effects, called Multiscale Significance Display (MSD), and a general framework for variable selection procedures based on MSD, called Multiscale Variable Screening (MVS). MSD is a moving window approach, and it evaluates the joint effects of the variables in this window. The moving window is based on an order of variables. MVS seeks to find the best initial ordering in an iterative manner.

Related Organizations

University of North Carolina at Chapel Hill
United States

3 Research products, page 1 of 1

Structural and Thermodynamic Basis of Epitope Binding by Neutralizing and Nonneutralizing Forms of the Anti-HIV-1 Antibody 4E10
2015IsAmongTopNSimilarDocuments
COMMON PROPER-MOTION WIDE WHITE DWARF BINARIES SELECTED FROM THE SLOAN DIGITAL SKY SURVEY
2012IsAmongTopNSimilarDocuments
Asymptotic properties of distance-weighted discrimination and its bias correction for high-dimension, low-sample-size data
2021IsAmongTopNSimilarDocuments

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	0
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

0

Average

Upload OA version

Are you the author of this publication? Upload your Open Access version to Zenodo!

It’s fast and easy, just two clicks!

uploadUpload now

Weighted Distance Weighted Discrimination and Pairwise Variable Selection for Classification

Weighted Distance Weighted Discrimination and Pairwise Variable Selection for Classification

3 Research products, page 1 of 1

Structural and Thermodynamic Basis of Epitope Binding by Neutralizing and Nonneutralizing Forms of the Anti-HIV-1 Antibody 4E10

COMMON PROPER-MOTION WIDE WHITE DWARF BINARIES SELECTED FROM THE SLOAN DIGITAL SKY SURVEY

Asymptotic properties of distance-weighted discrimination and its bias correction for high-dimension, low-sample-size data