Journal of Lanzhou University of Technology ›› 2025, Vol. 51 ›› Issue (3): 99-106.

• Automation Technique and Computer Technology • Previous Articles     Next Articles

Chinese authorship identification based on fine-grained word feature

ZHAO Hong, ZHANG Chen-peng, WANG Ao-long, ZHANGYang   

  1. School of Computer and Communication, Lanzhou University of Technology, Lanzhou 730050, China
  • Received:2022-07-09 Online:2025-06-28 Published:2025-06-30

Abstract: Most existing authorship identification models are primarily designed for English texts. However, due to the differences between Chinese and English in grammar and language elements, the English authorship identification models have large deviations when applied to Chinese text. To solve the problem of Chinese author identification, a model adapted to Chinese features is proposed, termed the Chinese author recognition model with fine-grained word features. The model uses parallel convolution to extract fine-grained features from 1 to 4 characters and combines with an attention mechanism for weight assignment. Finally, Chinese authorship identification is obtained by the classifier. The experimental results show that the accuracy of this model is average improved by 2.09%, 7.2%, and 6.71% on three Chinese author identification datasets compared with the baseline models of BERT, TextCNN, and RNN, separately. Therefore, this model has a high value in reality.

Key words: Chinese authorship identification, BERT, attention mechanism, parallel convolutional layers, fine-grained feature

CLC Number: