Journal of Applied Sciences, 2006, Vol. 24, Issue (2): 203-207.

Biased Sampling of Data Streams Based on Density

YANG Yi-dong, SUN Zhi-hui   

  1. Department of Computer Science and Engineering, Southeast University, Nanjing 210096, China
  • Received: 2004-12-29 Revised: 2005-03-29 Online: 2006-03-31 Published: 2006-03-31

Abstract: As an important kind of data source, data streams have received increasing attention. Data stream management systems and data mining on data streams have also attracted much research interest. In this paper, the distribution density of a data stream is approximated by dynamically partitioning the data space into a grid, and a density-biased sampling method based on this approximation is presented. To test its efficiency, the proposed sampling method is applied to clustering data streams. Experimental results show promising applicability of the approach.

Key words: data streams, clustering, biased sampling
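
The idea summarized in the abstract, estimating density from grid cell counts and biasing the sample by that density, can be illustrated with a minimal Python sketch. It is not the authors' algorithm: the abstract does not give the partitioning rule or the bias function, so the fixed `cell_width`, the keep probability `count ** -exponent`, and the names `GridDensitySampler`, `cell_of`, and `offer` are all assumptions made for illustration. In particular, the paper partitions the grid dynamically, whereas this sketch uses fixed-width cells.

```python
import random
from collections import defaultdict


class GridDensitySampler:
    """One-pass density-biased sampler over a fixed-width grid (illustrative only)."""

    def __init__(self, cell_width, exponent=0.5, seed=None):
        if cell_width <= 0:
            raise ValueError("cell_width must be positive")
        self.cell_width = cell_width      # assumed fixed; the paper adapts the grid
        self.exponent = exponent          # 0 keeps every point; larger values
                                          # thin dense cells more aggressively
        self.counts = defaultdict(int)    # points seen so far in each grid cell
        self.sample = []                  # points kept so far
        self._rng = random.Random(seed)

    def cell_of(self, point):
        """Map a point (sequence of floats) to the index tuple of its grid cell."""
        return tuple(int(x // self.cell_width) for x in point)

    def offer(self, point):
        """Process one arriving point; return True if it was kept in the sample."""
        cell = self.cell_of(point)
        self.counts[cell] += 1
        # Assumed bias function: the n-th point seen in a cell is kept with
        # probability n ** -exponent, so sparse cells are sampled at a higher
        # rate than dense ones. The first point in any cell is always kept.
        keep_probability = self.counts[cell] ** -self.exponent
        if self._rng.random() < keep_probability:
            self.sample.append(point)
            return True
        return False


if __name__ == "__main__":
    sampler = GridDensitySampler(cell_width=1.0, exponent=0.5, seed=0)
    stream = [(0.5, 0.5)] * 1000 + [(10.5, 3.5), (-4.2, 7.1)]
    for p in stream:
        sampler.offer(p)
    print(len(sampler.sample), "of", len(stream), "points kept")
```

Under these assumptions, a cell that has received n points contributes roughly the sum of k ** -exponent for k from 1 to n to the sample, so dense regions are thinned while isolated points survive. The retained sample could then be passed to any clustering routine, which is the use case the abstract evaluates.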
