应用科学学报

• 论文 • 上一篇    下一篇

面向数据流挖掘过程的算法管理框架

朱小栋 黄志球 陈圣青 黄 凤 沈国华   

  1. 南京航空航天大学 信息科学与技术学院,江苏 南京 210016
  • 收稿日期:2007-09-12 修回日期:2007-11-26 出版日期:2008-01-31 发布日期:2008-01-31

Algorithm Management Framework for Data Stream Mining

ZHU Xiao-dong; HUANG Zhi-qiu; CHEN Sheng-qing; HUANG Feng; SHEN Guo-hua   

  1. College of Information Science & Technology, Nanjing University of Aeronautics & Astronautics,Nanjing 210016,China
  • Received:2007-09-12 Revised:2007-11-26 Online:2008-01-31 Published:2008-01-31

摘要: 结合数据流的特点,提出了一种面向数据流挖掘的过程模型PM-DSM。针对目前数据流挖掘过程中存在算法众多但利用率低的问题,提出了一种基于Web服务的数据流挖掘过程模型算法管理框架PMAMF-DSM,描述了该框架的体系结构和运行机制,并用UML活动图给出了框架的实现语义。在Eclipse上基于该框架实现了一个数据流挖掘算法管理系统,实验结果表明了该框架的灵活性与自适应性。

关键词: 数据挖掘, 过程模型, 数据流, 预测模型标记语言, Web服务

Abstract: In developing algorithms for data stream mining, little work has been done in the management of various algorithms. Although there have been many effective algorithms developed specifically for data stream mining, the application rate is quite low. In this research, we establish a process model for data stream mining. A process model for the algorithm management framework based on web services, PMAMF-DSM, is proposed. We analyze the construction of data stream algorithm repository and the architecture of the framework. Using the framework, a data stream oriented algorithm management system is implemented on Eclipse. Experiments indicate that the framework has high flexibility and self adaptability.

Key words: data mining, process model, data streams, predictive model markup language (PMML), web service