M-ABSA: A Multilingual Dataset for Aspect-Based Sentiment Analysis
2025-02-17
Status: Published
Abstract: Aspect-based sentiment analysis (ABSA) is a crucial task in information extraction and sentiment analysis, aiming to identify aspects with associated sentiment elements in text. However, existing ABSA datasets are predominantly English-centric, limiting the scope for multilingual evaluation and research. To bridge this gap, we present M-ABSA, a comprehensive dataset spanning 7 domains and 21 languages, making it the most extensive multilingual parallel dataset for ABSA to date. Our primary focus is on triplet extraction, which involves identifying aspect terms, aspect categories, and sentiment polarities. The dataset is constructed through an automatic translation process with human review to ensure quality. We perform extensive experiments using various baselines to assess performance and compatibility on M-ABSA. Our empirical findings highlight that the dataset enables diverse evaluation tasks, such as multilingual and multi-domain transfer learning, and large language model evaluation, underscoring its inclusivity and its potential to drive advancements in multilingual ABSA research.
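The triplet extraction task named in the abstract can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the `Triplet` field names, the example sentence, the category labels, and the use of exact-match F1 as a score are not taken from the M-ABSA release or the paper's evaluation setup.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Triplet:
    """One ABSA triplet. Field names are hypothetical, not the dataset schema."""
    aspect_term: str      # the span in the text, e.g. "pasta"
    aspect_category: str  # a coarse category label (illustrative values below)
    polarity: str         # "positive", "negative", or "neutral"


def exact_match_f1(gold, pred):
    """F1 over triplets, counting a prediction as correct only if all
    three fields match a gold triplet exactly. This is a commonly used
    ABSA metric, assumed here rather than confirmed by the source."""
    gold_set, pred_set = set(gold), set(pred)
    true_positives = len(gold_set & pred_set)
    if true_positives == 0:
        return 0.0
    precision = true_positives / len(pred_set)
    recall = true_positives / len(gold_set)
    return 2 * precision * recall / (precision + recall)


# Made-up example sentence and labels.
sentence = "The pasta was delicious but the service was slow."
gold = [
    Triplet("pasta", "food quality", "positive"),
    Triplet("service", "service general", "negative"),
]
pred = [
    Triplet("pasta", "food quality", "positive"),
    Triplet("service", "service general", "neutral"),  # wrong polarity
]
score = exact_match_f1(gold, pred)
```

In this sketch one of the two predicted triplets matches the gold annotation on all three fields, so precision and recall are both one half.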
Language: English
DOI: arXiv:2502.11824
Source: arXiv
Indexed by: PPRN.PPRN
WOS accession number: PPRN:121697529
WOS category: Computer Science, Interdisciplinary Applications
Funding: ERC Consolidator Grant DIALECT [101043235]; Characteristic Innovation Projects of Guangdong Colleges and Universities [2018KTSCX049]; National Natural Science Foundation of China [32371114]; Guangdong Basic and Applied Basic Research Foundation [2023A1515011370]
Document type: Preprint
Item identifier: https://kms.shanghaitech.edu.cn/handle/2MSLDSTB/514097
Collection: School of Information Science and Technology, Master's students
Corresponding author: Xue, Yun
Author affiliations:
1.South China Normal Univ, Sch Elect Sci & Engn, Sch Microelect, Guangdong Prov Key Lab Quantum Engn & Quantum Mat, Guangzhou, Peoples R China
2.LMU Munich, Munich, Germany
3.Munich Ctr Machine Learning, Munich, Germany
4.Tech Univ Munich, Munich, Germany
5.Renmin Univ China, Beijing, Peoples R China
6.Brown Univ, Providence, RI 02912, USA
7.ShanghaiTech Univ, Shanghai, Peoples R China
8.Univ Erlangen Nuremberg, Erlangen, Germany
Recommended citation:
GB/T 7714
Wu, Chengyan,Ma, Bolei,Liu, Yihong,et al. M-ABSA: A Multilingual Dataset for Aspect-Based Sentiment Analysis. 2025.