A hybrid unsupervised machine learning model with spectral clustering and semi-supervised support vector machine for credit risk assessment.

Tao Yu, Wei Huang, Xin Tang, Duosi Zheng
Author Information
  1. Tao Yu: School of Mathematics, Harbin Institute of Technology, Harbin, China.
  2. Wei Huang: College of Business, Southern University of Science and Technology, Shenzhen, China.
  3. Xin Tang: College of Business, Southern University of Science and Technology, Shenzhen, China.
  4. Duosi Zheng: College of Business, Southern University of Science and Technology, Shenzhen, China.

Abstract

In credit risk assessment, unsupervised classification techniques can reduce human resource expenses and expedite decision-making. Although unsupervised learning methods handle unlabeled datasets effectively, their performance remains limited by challenges such as imbalanced data, local optima, and the complexity of parameter tuning. This paper therefore introduces a novel hybrid unsupervised classification method, the two-stage hybrid system with spectral clustering and semi-supervised support vector machine (TSC-SVM), which addresses class imbalance in the unsupervised setting of credit risk assessment by targeting globally optimal solutions. Furthermore, a multi-view combined unsupervised method is designed to thoroughly mine the data and enhance the robustness of label predictions. This method mitigates discrepancies in prediction outcomes from three distinct perspectives. The effectiveness, efficiency, and robustness of the proposed TSC-SVM model are demonstrated through various real-world applications. The proposed algorithm is anticipated to expand the customer base for financial institutions while reducing economic losses.
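
The abstract does not give implementation details, so the following is only a minimal sketch of the general two-stage idea, not the authors' TSC-SVM. It assumes a spectral bipartition for the first stage and swaps in a simple self-training, class-weighted linear SVM for the semi-supervised SVM in the second stage; the multi-view combination is omitted. All function names, parameters (`gamma`, `seed_frac`, `rounds`), and the synthetic imbalanced data are hypothetical choices made for illustration.

```python
import math
import random

def rbf_affinity(X, gamma):
    """Dense RBF affinity matrix with a zero diagonal."""
    n = len(X)
    W = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            d2 = sum((a - b) ** 2 for a, b in zip(X[i], X[j]))
            W[i][j] = W[j][i] = math.exp(-gamma * d2)
    return W

def spectral_embedding(W, iters=300, seed=0):
    """Second eigenvector of D^-1/2 W D^-1/2 via deflated power iteration."""
    n = len(W)
    s = [math.sqrt(sum(row)) for row in W]           # square roots of degrees
    norm = math.sqrt(sum(v * v for v in s))
    u = [v / norm for v in s]                        # known top eigenvector
    rng = random.Random(seed)
    v = [rng.uniform(-1.0, 1.0) for _ in range(n)]
    for _ in range(iters):
        dot = sum(a * b for a, b in zip(v, u))
        v = [a - dot * b for a, b in zip(v, u)]      # remove trivial component
        Sv = [sum(W[i][j] * v[j] / (s[i] * s[j]) for j in range(n))
              for i in range(n)]
        v = [0.5 * (a + b) for a, b in zip(Sv, v)]   # shift spectrum into [0, 1]
        nv = math.sqrt(sum(a * a for a in v))
        v = [a / nv for a in v]
    return [v[i] / s[i] for i in range(n)]

def train_linear_svm(X, y, lam=0.01, lr=0.1, epochs=300):
    """Class-weighted linear SVM fitted by batch subgradient descent."""
    n, d = len(X), len(X[0])
    n_pos = sum(1 for t in y if t > 0)
    weight = {1: n / (2.0 * n_pos), -1: n / (2.0 * (n - n_pos))}
    w, b = [0.0] * d, 0.0
    for _ in range(epochs):
        gw, gb = [lam * wk for wk in w], 0.0
        for xi, yi in zip(X, y):
            if yi * (sum(wk * xk for wk, xk in zip(w, xi)) + b) < 1.0:
                c = weight[yi] / n                   # hinge term is active
                gw = [g - c * yi * xk for g, xk in zip(gw, xi)]
                gb -= c * yi
        w = [wk - lr * g for wk, g in zip(w, gw)]
        b -= lr * gb
    return w, b

def decision(w, b, x):
    return sum(wk * xk for wk, xk in zip(w, x)) + b

def two_stage_labels(X, gamma=0.2, seed_frac=0.5, rounds=5):
    n = len(X)
    # Stage 1: the sign of the spectral embedding gives a pseudo-label per sample.
    emb = spectral_embedding(rbf_affinity(X, gamma))
    pseudo = [1 if e > 0 else -1 for e in emb]
    # Assumption: embedding magnitude is used as a crude confidence proxy,
    # and only the top fraction of each cluster is kept as labeled seeds.
    labels = {}
    for cls in (1, -1):
        idx = sorted((i for i in range(n) if pseudo[i] == cls),
                     key=lambda i: -abs(emb[i]))
        for i in idx[:max(1, int(seed_frac * len(idx)))]:
            labels[i] = cls

    def fit():
        ids = sorted(labels)
        return train_linear_svm([X[i] for i in ids], [labels[i] for i in ids])

    # Stage 2: self-training; unlabeled samples outside the margin get labeled.
    for _ in range(rounds):
        w, b = fit()
        new = {i: (1 if decision(w, b, X[i]) > 0 else -1) for i in range(n)
               if i not in labels and abs(decision(w, b, X[i])) >= 1.0}
        if not new:
            break
        labels.update(new)
    w, b = fit()
    return [1 if decision(w, b, x) > 0 else -1 for x in X]

# Synthetic, imbalanced toy data (40 vs. 8 samples); not real credit data.
rng = random.Random(42)
majority = [(rng.gauss(0.0, 0.5), rng.gauss(0.0, 0.5)) for _ in range(40)]
minority = [(rng.gauss(6.0, 0.5), rng.gauss(6.0, 0.5)) for _ in range(8)]
X = majority + minority
truth = [-1] * 40 + [1] * 8
pred = two_stage_labels(X)
agree = sum(p == t for p, t in zip(pred, truth))
accuracy = max(agree, len(X) - agree) / len(X)       # cluster signs are arbitrary
```

The class weights in `train_linear_svm` are one simple way to keep the minority cluster from being ignored; the paper's own treatment of imbalance and of global optimality may differ. The sketch also assumes both clusters are non-empty after the spectral split.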

MeSH Terms

Support Vector Machine
Unsupervised Machine Learning
Risk Assessment
Cluster Analysis
Algorithms
Humans
Supervised Machine Learning
