应用科学学报 ›› 2024, Vol. 42 ›› Issue (2): 350-363.doi: 10.3969/j.issn.0255-8297.2024.02.015

• 计算机科学与应用 • 上一篇    下一篇

直觉模糊的结构化最小二乘孪生支持向量机

张法滢1,2, 吕莉1,2, 韩龙哲1,2, 刘东晓1,2, 樊棠怀1,2   

  1. 1. 南昌工程学院 信息工程学院, 江西 南昌 330099;
    2. 南昌工程学院 南昌市智慧城市物联感知与协同计算重点实验室, 江西 南昌 330099
  • 收稿日期:2022-11-02 出版日期:2024-03-31 发布日期:2024-03-28
  • 通信作者: 吕莉,教授,研究方向为智能计算与计算智能、大数据与人工智能。E-mail:lvli623@163.com E-mail:lvli623@163.com
  • 基金资助:
    国家自然科学基金(No.62066030);江西省重点研发计划项目(No.20192BBE50076,No.20203BBGL-73225)资助

Intuition Fuzzy and Structural Least Squares Twin Support Vector Machine

ZHANG Faying1,2, LYU Li1,2, HAN Longzhe1,2, LIU Dongxiao1,2, FAN Tanghuai1,2   

  1. 1. School of Information Engineering, Nanchang Institute of Technology, Nanchang 330099, Jiangxi, China;
    2. Nanchang Key Laboratory of IoT Perception and Collaborative Computing for Smart City, Nanchang Institute of Technology, Nanchang 330099, Jiangxi, China
  • Received:2022-11-02 Online:2024-03-31 Published:2024-03-28

摘要: 针对最小二乘孪生支持向量机(least squares twin support vector machine,LSTSVM)对噪声或是异常数据敏感和忽略数据内在结构信息的问题,提出了一种直觉模糊的结构化最小二乘孪生支持向量机(intuition fuzzy and structural least squares twin support vector machine,IF-SLSTSVM)。首先采用孤立森林对输入样本点进行预处理;然后通过直觉模糊数的概念,赋予输入样本点不同的权重以减少噪声或是异常数据对分类超平面产生的影响;最后采用K-Means算法,以协方差的形式获取输入样本点之间的结构信息。IFSLSTSVM在LS-TSVM的基础上,考虑了输入样本点在特征空间中的分布信息及输入样本点之间的关系,提高了模型的鲁棒性。实验采取UCI数据集,在0%、5%、10%以及20%的不同比例噪声环境对IF-SLSTSVM算法的有效性进行验证。结果显示相较于6种对比算法,IF-SLSTSVM算法有更好的鲁棒性。

关键词: 支持向量机, 孤立森林, 结构信息, 直觉模糊, 聚类, 协方差

Abstract: Addressing the sensitivity of the least squares twin support vector machine(LS-SVM) to noise or abnormal data, and its tendency to overlook intrinsic structural information in the data, this paper introduces an intuition fuzzy and structural least squares twin support vector machine(IF-SLSTSVM). Firstly, the input sample points undergo preprocessing using isolated forest. Subsequently, leveraging the concept of intuitionistic fuzzy, varying weights are assigned to the input sample points to mitigate the impact of noise or abnormal data on the classification hyperplane. Finally, the K-Means algorithm is employed to extract structural information, represented in the form of covariance, among the input sample points. Built upon LS-SVM, IF-SLSTSVM takes into account the distribution information of input sample points in the feature space and their interrelationships,thereby enhancing the model's robustness. Experimental validation is performed using the UCI dataset in noise environments with different proportions of 0%, 5%, 10%, and 20%. The results demonstrate that the IF-SLSTSVM algorithm exhibits superior robustness compared to six other evaluated algorithms.

Key words: support vector machine, isolated forest, structural information, intuition fuzzy, clustering, covariance

中图分类号: