应用科学学报 ›› 2026, Vol. 44 ›› Issue (1): 1-20.doi: 10.3969/j.issn.0255-8297.2026.01.001

• 计算机应用专辑 • 上一篇    下一篇

基于结构增强和深度聚类的网络群体识别

李永桢1,2, 马涪元1,2, 马世旋1,2, 王钰涵1,2, 王英1,2   

  1. 1. 吉林大学 计算机科学与技术学院, 吉林 长春 130012;
    2. 吉林大学 符号计算与知识工程教育部重点实验室, 吉林 长春 130012
  • 收稿日期:2025-08-05 出版日期:2026-01-30 发布日期:2026-02-03
  • 通信作者: 王英,教授,研究方向为人工智能、机器学习。E-mail:wangying2010@jlu.edu.cn E-mail:wangying2010@jlu.edu.cn
  • 基金资助:
    科技创新2030 “新一代人工智能”重大项目(No.2021ZD0112500);吉林省科技厅国际科技合作项目(No.20240402067GH)

Network Community Detection Based on Structure-Enhanced Deep Clustering

LI Yongzhen1,2, MA Fuyuan1,2, MA Shixuan1,2, WANG Yuhan1,2, WANG Ying1,2   

  1. 1. College of Computer Science and Technology, Jilin University, Changchun 130012, Jilin, China;
    2. Key Laboratory of Symbolic Computation and Knowledge Engineering, Ministry of Education, Jilin University, Changchun 130012, Jilin, China
  • Received:2025-08-05 Online:2026-01-30 Published:2026-02-03

摘要: 社交网络群体识别在信息传播、推荐与广告等领域具有重要的研究意义与应用价值。但现有方法在特征融合、稀疏图建模及多源信息利用上仍存在不足。为此,提出一种基于结构增强与深度聚类(structure-enhanced deep clustering,SDC)的网络群体识别模型,包含4个关键模块:首先,网络拓扑增强模块通过建模节点二阶相似性生成增强邻接矩阵,缓解稀疏社交网络的高阶关系缺失;其次,多视图特征融合模块在节点级动态融合节点属性特征与拓扑特征,在图级整合原始图与增强图的语义信息;再次,多源分布融合聚类模块在分布级利用可学习权重集成不同特征空间的聚类信息,平衡局部拓扑与全局语义;最后,双重自监督模块通过KL散度(Kullback-Leibler divergence)对齐、节点重构与相似性约束进行优化。实验表明,相较于主流基线方法,SDC网络群体识别模型在3个基准数据集上的ACC、NMI、ARI、F1指标平均提升了3.80%、9.09%、11.21%和7.43%。在Facebook动态交互数据上的仿真也验证了SDC网络群体识别模型捕捉社区结构演化的能力。

关键词: 社交网络, 群体识别, 结构增强, 深度聚类, 注意力机制

Abstract: Community detection in social networks is important for applications such as information diffusion, recommendation, and advertising. However, existing methods still face challenges in feature fusion, sparse graph modeling, and multi-source information utilization. To address these issues, this paper proposed a structure-enhanced deep clustering (SDC) model for network community detection, which consisted of four key modules. First, the topology enhancement module built an enhanced adjacency matrix by modeling second-order similarity between nodes, which alleviated the problem of missing high-order relations in sparse social networks. Second, the multi-view feature fusion module dynamically fused node attributes and topology features at the node level, and integrated semantic information from both the original and enhanced graphs at the graph level. Third, the multi-source distribution fusion clustering module used learnable weights to integrate clustering information from different feature spaces at the distribution level, balancing local topology and global semantics. Finally, the dual self-supervised module optimized the model through Kullback-Leibler (KL) divergence alignment, node reconstruction, and similarity constraints. Experiments show that compared with mainstream baseline methods, SDC model improves ACC, NMI, ARI, and F1 by an average of 3.80%, 9.09%, 11.21%, and 7.43%, respectively on the three benchmark datasets. Simulations based on Facebook interaction data also demonstrate the ability of SDC model to capture community structure evolution.

Key words: social network, community detection, structure-enhanced, deep clustering, attention mechanism

中图分类号: