Semi-supervised graph partitioning with decision trees.

Timothy Hancock, Hiroshi Mamitsuka
Author Information
  1. Timothy Hancock: Bioinformatics Center, Institute for Chemical Research, Kyoto University, Kyoto, Japan. timhancock@kuicr.kyoto-u.ac.jp

Abstract

In this paper we investigate a new framework for graph partitioning using decision trees to search for sub-graphs within a graph adjacency matrix. Graph partitioning by a decision tree seeks to optimize a specified graph partitioning index such as ratio cut by recursively applying decision rules found within nodes of the graph. Key advantages of tree models for graph partitioning are they provide a predictive framework for evaluating the quality of the solution, determining the number of sub-graphs and assessing overall variable importance. We evaluate the performance of tree based graph partitioning on a benchmark dataset for multiclass classification of tumor diagnosis based on gene expression. Three graph cut indices will be compared, ratio cut, normalized cut and network modularity and assessed in terms of their classification accuracy, power to estimate the optimal number of sub-graphs and ability to extract known important variables within the dataset.

MeSH Term

Computer Graphics
Decision Trees
Humans
Models, Genetic
Neoplasms
Predictive Value of Tests
Reference Values
Reproducibility of Results
Sensitivity and Specificity
Software

Word Cloud

Created with Highcharts 10.0.0graphpartitioningdecisioncutsub-graphswithintreeframeworktreesrationumberbaseddatasetclassificationpaperinvestigatenewusingsearchadjacencymatrixGraphseeksoptimizespecifiedindexrecursivelyapplyingrulesfoundnodesKeyadvantagesmodelsprovidepredictiveevaluatingqualitysolutiondeterminingassessingoverallvariableimportanceevaluateperformancebenchmarkmulticlasstumordiagnosisgeneexpressionThreeindiceswillcomparednormalizednetworkmodularityassessedtermsaccuracypowerestimateoptimalabilityextractknownimportantvariablesSemi-supervised

Similar Articles

Cited By