Where Did the President Visit Last Week? Detecting Celebrity Trips from News Articles
2023-10-09
Status: Published
Abstract

Celebrities' whereabouts are of pervasive importance. For instance, where politicians go, how often they visit, and whom they meet carry profound geopolitical and economic implications. Although news articles contain travel information about celebrities, large-scale and network-wise analysis has not been possible due to the lack of automatic itinerary detection tools. Designing such tools requires overcoming difficulties that arise from the heterogeneity among news articles: 1) A single article can be noisy, containing irrelevant people and locations, especially when it is long. 2) Although considering multiple articles together may help determine a particular trip, the key semantics are scattered across different articles and intertwined with various kinds of noise, making them hard to aggregate effectively. 3) Over 20% of the articles refer to celebrities' trips indirectly rather than using the exact celebrity or location names, so a large portion of trips escape regular detection algorithms. We model the text content across articles related to each candidate location as a graph to better associate essential information and cancel out the noise. In addition, we design a special pooling layer based on the attention mechanism and node similarity, which reduces irrelevant information from longer articles. To make up for the information missing due to indirect mentions, we construct knowledge sub-graphs for named entities (person, organization, facility, etc.). Specifically, we dynamically update the embeddings of event entities such as the G7 summit from news descriptions, since the properties (date and location) of an event change each time it is held, which pre-trained event representations do not capture. The proposed CeleTrip jointly trains these modules; it outperforms all baseline models and achieves an F1 score of 82.53%.
By open-sourcing the first tool and a carefully curated dataset for such a new task, we hope to facilitate relevant research in celebrity itinerary mining as well as the social and political analysis built upon the extracted trips.
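For readers unfamiliar with the F1 metric reported above, the following is a minimal sketch of how detected trips could be scored against a reference set. It assumes each trip is a hashable (person, location) record and that a prediction counts only on exact match; the evaluation protocol actually used for CeleTrip may differ. The function name `f1_score` and the example records are hypothetical and are not taken from the paper or its dataset.

```python
def f1_score(predicted, gold):
    """Return the F1 score between predicted and gold trip records.

    Both arguments are iterables of hashable records; here we assume
    (person, location) tuples, which is an illustrative choice.
    """
    predicted, gold = set(predicted), set(gold)
    true_positives = len(predicted & gold)
    if true_positives == 0:
        # Covers empty inputs and the case of no overlap at all.
        return 0.0
    precision = true_positives / len(predicted)
    recall = true_positives / len(gold)
    # F1 is the harmonic mean of precision and recall.
    return 2 * precision * recall / (precision + recall)


# Made-up records for illustration only.
gold = {("Person A", "Paris"), ("Person A", "Tokyo"), ("Person B", "Berlin")}
predicted = {("Person A", "Paris"), ("Person B", "Berlin"), ("Person B", "Rome")}
score = f1_score(predicted, gold)
```

In this toy case two of the three predictions are correct and two of the three reference trips are recovered, so precision and recall are both 2/3 and the F1 score is also 2/3.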

DOI: arXiv:2307.08721
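Source: arXiv
WOS accession number: PPRN:73995315
WOS category: Computer Science, Artificial Intelligence
Document type: Preprint
Item identifier: https://kms.shanghaitech.edu.cn/handle/2MSLDSTB/348016
Collection: School of Information Science and Technology_Master's Students
School of Information Science and Technology_PI Research Groups_Zhang Haipeng Group
Author affiliation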
ShanghaiTech Univ, Shanghai, Peoples R China
Recommended citation
GB/T 7714
Peng, Kai,Zhang, Ying,Ling, Shuai,et al. Where Did the President Visit Last Week? Detecting Celebrity Trips from News Articles. 2023.
 

Unless otherwise stated, all content in this system is protected by copyright, with all rights reserved.