descriptionPublicationkeyboard_double_arrow_right Article , Preprint 02 May 2013Embargo end date: 01 Jan 2013Publisher:Society for Industrial & Applied Mathematics (SIAM)Journal:Proceedings of the 2013 SIAM International Conference on Data MiningFunded by:NSF | III: Medium: Collaborativ..., NSF | PIRE: Training and Worksh..., NSF | Collaborative Research: G... +2 projects

Authors: Kong, Xiangnan; Yu, Philip S.; Wang, Xue; Ragin, Ann B.;

doi: 10.1137/1.9781611972832.10 , 10.48550/arxiv.1301.6626

pmid: 25949925

pmc: PMC4418485

arXiv: 1301.6626

Discriminative Feature Selection for Uncertain Graph Classification

- Summary
- Subjects
- Metrics

Abstract

Mining discriminative features for graph data has attracted much attention in recent years due to its important role in constructing graph classifiers, generating graph indices, etc. Most measurement of interestingness of discriminative subgraph features are defined on certain graphs, where the structure of graph objects are certain, and the binary edges within each graph represent the "presence" of linkages among the nodes. In many real-world applications, however, the linkage structure of the graphs is inherently uncertain. Therefore, existing measurements of interestingness based upon certain graphs are unable to capture the structural uncertainty in these applications effectively. In this paper, we study the problem of discriminative subgraph feature selection from uncertain graphs. This problem is challenging and different from conventional subgraph mining problems because both the structure of the graph objects and the discrimination score of each subgraph feature are uncertain. To address these challenges, we propose a novel discriminative subgraph feature selection method, DUG, which can find discriminative subgraph features in uncertain graphs based upon different statistical measures including expectation, median, mode and phi-probability. We first compute the probability distribution of the discrimination scores for each subgraph feature based on dynamic programming. Then a branch-and-bound algorithm is proposed to search for discriminative subgraphs efficiently. Extensive experiments on various neuroimaging applications (i.e., Alzheimer's Disease, ADHD and HIV) have been performed to analyze the gain in performance by taking into account structural uncertainties in identifying discriminative subgraph features for graph classification.

Related Organizations

University of Illinois at Chicago
United States
King Abdulaziz University
Saudi Arabia
University of Chicago
United States
University of Illinois at Urbana Champaign
United States
Northwestern University
United States

View all View all

Keywords

FOS: Computer and information sciences, Computer Science - Machine Learning, Computer Science - Databases, Statistics - Machine Learning, Databases (cs.DB), Machine Learning (stat.ML), H.2.8 Database Management, Database Applications-Data Mining, Machine Learning (cs.LG)

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	33
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 10%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Top 10%
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Top 10%

Found an issue? Give us feedback

Top 10%

Green

Fields of Science

engineering and technology

electrical engineering, electronic engineering, information engineering

Fields of Science

engineering and technology

electrical engineering, electronic engineering, information engineering

Funded byView all

NSF| III: Medium: Collaborative Research: Towards On-Line Analytical Mining of Heterogeneous Information Networks, NSF| PIRE: Training and Workshops in Data Intensive Computing Using The Open Science Data Cloud, NSF| Collaborative Research: G-SESAME Cloud: A Dynamically Scalable Collaboration Community for Biological Knowledge Discovery, NSF| III:Small:Privacy Preserving Data Publishing: A Second Look on Group based Anonymization