| 37 | 1 | 78 |
| 下载次数 | 被引频次 | 阅读次数 |
文本观点检索旨在从特定文档集中检索出与查询相关的观点文本,过程中涉及查询-文本相关性及文本观点特征的提取.现有模型在文档倾向性特征提取时,主要采用传统的机器学习方法进行观点特征挖掘,该方法忽视了文本中词语节点间关系对判断观点倾向的影响,导致了模型在文本语义层面的泛化能力受限,进而影响了观点检索的性能.提出一种基于图卷积网络的观点检索方法,旨在将文档中的词语映射到高维语义空间中,自适应地学习文档词语节点间的连接关系,以获取更准确的文档语义信息表述,应用于模型之中.实验表明,与当前最优的模型相比,改进后的模型在两个Twitter数据集上,检索主指标分别提升1.2%和0.8%,验证了本文所提方法的有效性.
Abstract:Opinion retrieval text aims to retrieve the opinion documents related to a query from a specific document set, which involves the combination of query and document relevance as well as the orientation of the document itself.The existing models focus on mining opinion features based on traditional machine learning when extracting document opinion features.These methods ignore the information representation of adjacent word nodes in the document, which leads to limited generalization ability at the semantic level of the text and cannot obtain more accurate opinion features, thus affecting the opinion retrieval performance.To solve the above problems, this paper proposes an opinion retrieval method based on graph convolutional network, which aims to map the words in the document into a high-dimensional semantic space, adaptively learn the connection relationship between the document word nodes, and obtain more accurate semantic information representation of the document.Experiments show that, compared with the current best model, the improved model improves the main index by 1.2% and 0.8% respectively on two Twitter public data, which well verifies the effectiveness of the proposed method.
[1] 廖祥文,许洪波,孙乐,等.第三届中文倾向性分析评测(COAE2011)语料的构建与分析[J].中文信息学报,2013,27(1):56-64.
[2] HEMMATIAN F,SOHRABI M K.A survey on classification techniques for opinion mining and sentiment analysis[J].Artificial Intelligence Review,2019,52(3):1495-1545.
[3] ZHANG M,YE X.A generation model to unify topic relevance and lexicon-based sentiment for opinion retrieval [C]//Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval,2008:411-419.
[4] 马飞翔,廖祥文,於志勇,等.基于知识图谱的文本观点检索方法 [J].山东大学学报(理学版),2016,51(11):33-40.
[5] 张铭洲,魏晶晶,陈开志,等.基于循环卷积网络的观点检索方法[C]//第二十五届全国信息检索学术会议,2019:103-115.
[6] DEVLIN J,CHANG M W,LEE K,et al.BERT:pre-training of deep bidirectional transformers for language understanding[C]//In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics:Human Language Technologies,2019:4171-4186.
[7] KIPF T N,WELLING M.Semi-supervised classification with graph convolutional networks[EB/OL].(2017-04-24)[2023-08-22].https://openreview.net/pdf?id=SJU4ayYgl.
[8] 张铭洲.基于交互注意力机制的观点检索方法[J].通化师范学院学报,2023,44(2):33-38.
[9] 廖祥文,刘德元,桂林,等.融合文本概念化与网络表示的观点检索[J].软件学报,2018,29(10):2899-2914.
[10] JIANG L,YU M,ZHOU M,et al.Target-dependent twitter sentiment classification[C]//Proceedings of the 49th annual meeting of the association for computational linguistics:human language technologies,2011:151-160.
基本信息:
中图分类号:TP391.1;TP183
引用信息:
[1]张铭洲.基于图卷积网络的文本观点检索方法[J].鞍山师范学院学报,2024,26(02):74-80.
基金信息:
福建省中青年项目(JAT220718); 福建省教育厅B类科技研究项目(JB12487S); 黎明职业大学校级课题(LZ202301)
2024-04-20
2024-04-20