Multi-label text classification based on word-label probability

Journal of Lanzhou University of Technology, 2023, Vol. 49, Issue (1): 103-109.

• Automation Technology and Computer Technology •

ZHAO Hong*, ZHENG Hou-ze, GUO Lan

School of Computer and Communication, Lanzhou University of Technology, Lanzhou 730050, Gansu, China

Received: 2021-09-10 Online: 2023-02-28 Published: 2023-03-21
Corresponding author: ZHAO Hong (born 1971), male, from Xihe, Gansu; Ph.D., professor, doctoral supervisor. Email: zhaoh@lut.edu.cn
Funding: National Natural Science Foundation of China (62166025); Key Research and Development Program of Gansu Province (21YF5GA073)


Abstract: Multi-label text classification is an important task in natural language processing; its goal is to find, within a given label set, the subset of labels associated with a text. To address the problems of extracting text features effectively and capturing the latent correlations between labels, a model that combines convolutional neural networks (CNN) with bi-directional long short-term memory (Bi-LSTM) is proposed. First, text features are extracted by a CNN with max pooling. Next, a trained labeled latent Dirichlet allocation (Labeled-LDA) model supplies the word-label probabilities between all words and labels. Then, a Bi-LSTM network and a CNN extract word-label information features for each word of the text being predicted. Finally, these features are combined with the extracted text features to predict the label set associated with the current text. Experimental results show that using word-label probabilities to capture the correlation between the words of a text and the labels effectively improves the model's F1 score.
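The word-label probability step can be illustrated with a small sketch. It is not Labeled-LDA, which the paper trains for this purpose; as a plainly simpler stand-in, it estimates P(label | word) from smoothed co-occurrence counts, then looks up a label-probability vector for each word of a text, which is the kind of per-word sequence a Bi-LSTM or CNN could consume. The function names, the smoothing constant `alpha`, and the toy corpus are all hypothetical.

```python
from collections import Counter, defaultdict


def estimate_word_label_probs(docs, labels_per_doc, all_labels, alpha=1.0):
    """Estimate P(label | word) from co-occurrence counts with add-alpha smoothing.

    Assumption: this is a stand-in for a trained Labeled-LDA model. Each
    token is credited to every label of its document, whereas Labeled-LDA
    would resolve that credit assignment by inference.
    """
    counts = defaultdict(Counter)  # word -> Counter(label -> count)
    for tokens, labels in zip(docs, labels_per_doc):
        for word in tokens:
            for label in labels:
                counts[word][label] += 1

    probs = {}
    for word, label_counts in counts.items():
        total = sum(label_counts.values()) + alpha * len(all_labels)
        probs[word] = [(label_counts[l] + alpha) / total for l in all_labels]
    return probs


def word_label_features(tokens, probs, num_labels):
    """Map each token to its label-probability vector (uniform if unseen)."""
    uniform = [1.0 / num_labels] * num_labels
    return [probs.get(word, uniform) for word in tokens]


# Toy corpus (hypothetical data, for illustration only).
ALL_LABELS = ["sports", "finance"]
DOCS = [["match", "goal", "team"], ["stock", "market", "team"], ["goal", "stock"]]
DOC_LABELS = [["sports"], ["finance"], ["sports", "finance"]]

PROBS = estimate_word_label_probs(DOCS, DOC_LABELS, ALL_LABELS)
FEATURES = word_label_features(["goal", "market", "unseen"], PROBS, len(ALL_LABELS))
```

Here `FEATURES` holds one vector of length `len(ALL_LABELS)` per token, so a text of n words becomes an n × (number of labels) matrix. Unseen words fall back to a uniform vector, which is one possible choice among several.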

Key words: multi-label text classification, convolutional neural networks, bi-directional long short-term memory, labeled latent Dirichlet allocation
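The abstract reports its results as an F1 value without saying which averaging is used or how scores become a label set. As a minimal sketch under those gaps, the code below assumes micro-averaged F1 (true positives, false positives, and false negatives pooled over all documents and labels) and a fixed score threshold of 0.5; both choices, along with the function names and toy data, are assumptions made for illustration.

```python
def predict_labels(scores, labels, threshold=0.5):
    """Turn per-label scores into a label set by thresholding (assumed rule)."""
    return {label for label, score in zip(labels, scores) if score >= threshold}


def micro_f1(true_sets, pred_sets):
    """Micro-averaged F1: pool TP/FP/FN across all documents and labels."""
    tp = fp = fn = 0
    for true, pred in zip(true_sets, pred_sets):
        true, pred = set(true), set(pred)
        tp += len(true & pred)   # labels correctly predicted
        fp += len(pred - true)   # labels predicted but not relevant
        fn += len(true - pred)   # relevant labels that were missed
    if tp == 0:
        return 0.0  # no correct prediction: define F1 as 0 to avoid 0/0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Toy example (hypothetical labels and scores).
LABELS = ["sports", "finance", "politics"]
TRUE = [{"sports"}, {"finance", "politics"}]
SCORES = [[0.9, 0.6, 0.1], [0.2, 0.8, 0.3]]

PRED = [predict_labels(s, LABELS) for s in SCORES]
SCORE = micro_f1(TRUE, PRED)
```

In this toy case there are two correct labels, one spurious label, and one missed label, so precision and recall are both 2/3 and `SCORE` is 2/3.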

CLC number: TP389.1

ZHAO Hong, ZHENG Hou-ze, GUO Lan. Multi-label text classification based on word-label probability[J]. Journal of Lanzhou University of Technology, 2023, 49(1): 103-109.


