Shandong Science ›› 2019, Vol. 32 ›› Issue (6): 106-111.doi: 10.3976/j.issn.1002-4026.2019.06.015

• Other Research Article • Previous Articles     Next Articles

A deep learning-based method for Chinese text-feature extraction and classification

CAO Lu-hui1,DENG Yu-xiang2,CHEN Tong3*,LI Zhao4   

  1. 1. Shandong University, Jinan 250100, China;2. Shandong Financial Security and Evaluation Center,Jinan 250001, China;
    3. Big Data Engineering Technology Research Center of E-Government, Jinan 250014, China; 4. Shandong Provincial Key 
    Laboratory of Computer Networks,Shandong Computer Science Center(National Super Computer in Jinan), Qilu University of Technology(Shandong Academy of Sciences) , Jinan 250014, China
  • Received:2019-08-28 Online:2019-12-20 Published:2019-12-11

Abstract: This paper proposes a text-feature extraction method based on a convolutional recurrent neural network, and in the meanwhile, it also compares the statistical methods TF-IDF and Word2vec for text-feature representation. Text features are then fed into the SVM and Random forest classifier to classify the Chinese academic papers from CNKI. Experimental results show that the classification results obtained from the feature extraction models based on the convolutional neural network and convolutional recurrent neural network are better than those obtained from the TF-IDF and Word2vec feature extraction methods. Furthermore, the classification results obtained from the SVM and Random forest classifier are slightly better than those obtained from the native neural network

Key words: convolutional neural network, convolutional recurrent neural network, feature extraction, text classification

CLC Number: 

  • TP391.1