| |||||||
ShanghaiTech University Knowledge Management System
For the Underrepresented in Gender Bias Research: Chinese Name Gender Prediction with Heterogeneous Graph Attention Network | |
2023-06-27 | |
会议录名称 | PROCEEDINGS OF THE 37TH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, AAAI 2023 |
卷号 | 37 |
页码 | 14436-14443 |
发表状态 | 正式接收 |
摘要 | Achieving gender equality is an important pillar for humankind’s sustainable future. Pioneering data-driven gender bias research is based on large-scale public records such as scientific papers, patents, and company registrations, covering female researchers, inventors and entrepreneurs, and so on. Since gender information is often missing in relevant datasets, studies rely on tools to infer genders from names. However, available open-sourced Chinese gender-guessing tools are not yet suitable for scientific purposes, which may be partially responsible for female Chinese being underrepresented in mainstream gender bias research and affect their universality. Specifically, these tools focus on character-level information while overlooking the fact that the combinations of Chinese characters in multi-character names, as well as the components and pronunciations of characters, convey important messages. As a first effort, we design a Chinese Heterogeneous Graph Attention (CHGAT) model to capture the heterogeneity in component relationships and incorporate the pronunciations of characters. Our model largely surpasses current tools and also outperforms the state-of-the-art algorithm. Last but not least, the most popular Chinese name-gender dataset is single-character based with far less female coverage from an unreliable source, naturally hindering relevant studies. We open-source a more balanced multi-character dataset from an official source together with our code, hoping to help future research promoting gender equality. Copyright © 2023, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved. |
会议举办国 | Association for the Advancement of Artificial Intelligence |
会议录编者/会议主办者 | Association for the Advancement of Artificial Intelligence |
关键词 | Artificial intelligence Character level Chinese characters Data driven Gender bias Gender equality Gender predictions Heterogeneous graph Large-scales Public records Scientific papers |
会议名称 | 37th AAAI Conference on Artificial Intelligence, AAAI 2023 |
会议地点 | Washington, DC, United states |
会议日期 | February 7, 2023 - February 14, 2023 |
收录类别 | SCI ; EI |
语种 | 英语 |
出版者 | AAAI Press |
EI入藏号 | 20233414581418 |
EI主题词 | Open systems |
EI分类号 | 723.4 Artificial Intelligence |
原始文献类型 | Conference article (CA) |
文献类型 | 会议论文 |
条目标识符 | https://kms.shanghaitech.edu.cn/handle/2MSLDSTB/325825 |
专题 | 信息科学与技术学院_硕士生 信息科学与技术学院_PI研究组_张海鹏组 |
通讯作者 | Pan, Zihao; Peng, Kai; Zhang, Haipeng |
作者单位 | ShanghaiTech University, China |
第一作者单位 | 上海科技大学 |
通讯作者单位 | 上海科技大学 |
第一作者的第一单位 | 上海科技大学 |
推荐引用方式 GB/T 7714 | Pan, Zihao,Peng, Kai,Ling, Shuai,et al. For the Underrepresented in Gender Bias Research: Chinese Name Gender Prediction with Heterogeneous Graph Attention Network[C]//Association for the Advancement of Artificial Intelligence:AAAI Press,2023:14436-14443. |
条目包含的文件 | ||||||
文件名称/大小 | 文献类型 | 版本类型 | 开放类型 | 使用许可 |
修改评论
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。