Microservice Deployment for Satellite Edge AI Inference via Deep Reinforcement Learning
2024
会议录名称2024 IEEE 35TH INTERNATIONAL SYMPOSIUM ON PERSONAL, INDOOR AND MOBILE RADIO COMMUNICATIONS (PIMRC)
ISSN2166-9570
发表状态已发表
DOI10.1109/PIMRC59610.2024.10817161
摘要

Artificial intelligence (AI) is critical in evolving 5G and developing 6G networks, running on edge devices, and solving resource management challenges. The burgeoning number of edge devices draws attention to the potential of low-earth orbit (LEO) satellite networks with their onboard computing capabilities for edge inference. This paper explores LEO scenarios where multiple remote sensing edge AI inference tasks concurrently process data from a single source. However, due to there being parts with the same functions between different AI applications, traditional monolithic edge AI architecture must be deployed repeatedly and falls short in efficiently harnessing the heterogeneous resources of LEO satellite networks. To solve this problem, we utilize the microservice architecture to decouple a single AI application into several independent microservices to reuse these same functions. However, due to the high latency caused by multiple microservices’ communication, we need to design a deployment strategy to fully utilize resources to reduce the service latency. We present a microservice deployment model to minimize the total service latency across all AI applications and meet resource constraints with the constraints of hardware, energy, and memory limitations. This latency optimization problem is rewritten as a Markov decision process (MDP) to effectively deal with the challenge posed by the time-varying transmission rate caused by satellite mobility. To increase the training data utilization, we employ a Proximal Policy Optimization (PPO) based reinforcement learning algorithm to meet the dynamic environment challenge. Finally, we obtain a sub-optimal solution with minimal accuracy loss and an acceptable solution time.

会议地点Valencia, Spain
会议日期2-5 Sept. 2024
URL查看原文
语种英语
来源库IEEE
文献类型会议论文
条目标识符https://kms.shanghaitech.edu.cn/handle/2MSLDSTB/467846
专题信息科学与技术学院
信息科学与技术学院_PI研究组_石远明组
信息科学与技术学院_PI研究组_周勇组
信息科学与技术学院_硕士生
信息科学与技术学院_博士生
信息科学与技术学院_PI研究组_刘鑫组
作者单位
1.School of Information Science and Technology, ShanghaiTech University, Shanghai, China
2.Department of Electrical and Computer Engineering, Aarhus University, Denmark
3.Automatic Control Laboratory, École Polytechnique Fédérale de Lausanne (EPFL), Switzerland
第一作者单位信息科学与技术学院
第一作者的第一单位信息科学与技术学院
推荐引用方式
GB/T 7714
Zhiyong Yu,Hei Victor Cheng,Zhanpeng Yang,et al. Microservice Deployment for Satellite Edge AI Inference via Deep Reinforcement Learning[C],2024.
条目包含的文件
文件名称/大小 文献类型 版本类型 开放类型 使用许可
个性服务
查看访问统计
谷歌学术
谷歌学术中相似的文章
[Zhiyong Yu]的文章
[Hei Victor Cheng]的文章
[Zhanpeng Yang]的文章
百度学术
百度学术中相似的文章
[Zhiyong Yu]的文章
[Hei Victor Cheng]的文章
[Zhanpeng Yang]的文章
必应学术
必应学术中相似的文章
[Zhiyong Yu]的文章
[Hei Victor Cheng]的文章
[Zhanpeng Yang]的文章
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。