Journal of Applied Sciences ›› 2003, Vol. 21 ›› Issue (3): 284-288.

• Articles • Previous Articles     Next Articles

Quick Clustering of Large Log Data in Library

SONG Ai-bo, ZHUANG Xiao-qing, HE Jie-yue, YE Ning, DONG Yi-sheng   

  1. Department of Computer Science & Engineering, Southeast University, Nanjing 210096, China
  • Received:2002-07-08 Revised:2003-02-18 Online:2003-09-10 Published:2003-09-10

Abstract: In this paper, a simple and efficient method is presented for quick clustering and trend analyzing of library large log data. First, log data is clustered into a number of subclasses based on the underlying regularity of reader's borrowing and returning books. Then a fuzzy clustering algorithm is given for clustering the subclasses. The time complexity is linear, so our method can scale to large dataset. Finally, regression analysis is performed on the each cluster in order to dis cover the trend of borrowing and returning books. The experiment shows that this approach is successful.

Key words: digital library, clustering, regression analysis, log data

CLC Number: