Journal of Applied Sciences

• Articles • Previous Articles     Next Articles

Kernel-Based Nonlinear Discriminant Method in Text Classification

LIU Hai-feng 1, YAO Ze-qing 1, LIU Shou-sheng 1, WANG Qian 2   

  1. 1.Institute of Sciences, PLA University of Science and Technology, Nanjing 210007, China; 2.Institute of Xuzhou Engineering, Xuzhou 221116, China
  • Received:2008-06-16 Revised:2008-09-09 Online:2008-12-10 Published:2008-12-10
  • Contact: LIU Hai-feng

Abstract: To achieve feature reduction in text categorization, the scatter difference criterion is improved to satisfy a broad range of text categorization problems using kernel commutation in the pre-treatment. A kernel-based nonlinear method is proposed to extract features. By kernel commutation, the stylebook categorization problem is solved with less linear separability. Dimension of the feature space is significantly reduced without incurring excessive information loss. Experiments show that performance of the proposed method is better than maximal scatter difference with an efficiency improvement of 4.7% for the value of F1.

Key words: text categorization, feature extraction, scatter difference, kernel commutation

CLC Number: