ShanghaiTech University Knowledge Management System
Microservice Deployment for Satellite Edge AI Inference via Deep Reinforcement Learning | |
2024 | |
会议录名称 | 2024 IEEE 35TH INTERNATIONAL SYMPOSIUM ON PERSONAL, INDOOR AND MOBILE RADIO COMMUNICATIONS (PIMRC)
![]() |
ISSN | 2166-9570 |
发表状态 | 已发表 |
DOI | 10.1109/PIMRC59610.2024.10817161 |
摘要 | Artificial intelligence (AI) is critical in evolving 5G and developing 6G networks, running on edge devices, and solving resource management challenges. The burgeoning number of edge devices draws attention to the potential of low-earth orbit (LEO) satellite networks with their onboard computing capabilities for edge inference. This paper explores LEO scenarios where multiple remote sensing edge AI inference tasks concurrently process data from a single source. However, due to there being parts with the same functions between different AI applications, traditional monolithic edge AI architecture must be deployed repeatedly and falls short in efficiently harnessing the heterogeneous resources of LEO satellite networks. To solve this problem, we utilize the microservice architecture to decouple a single AI application into several independent microservices to reuse these same functions. However, due to the high latency caused by multiple microservices’ communication, we need to design a deployment strategy to fully utilize resources to reduce the service latency. We present a microservice deployment model to minimize the total service latency across all AI applications and meet resource constraints with the constraints of hardware, energy, and memory limitations. This latency optimization problem is rewritten as a Markov decision process (MDP) to effectively deal with the challenge posed by the time-varying transmission rate caused by satellite mobility. To increase the training data utilization, we employ a Proximal Policy Optimization (PPO) based reinforcement learning algorithm to meet the dynamic environment challenge. Finally, we obtain a sub-optimal solution with minimal accuracy loss and an acceptable solution time. |
会议地点 | Valencia, Spain |
会议日期 | 2-5 Sept. 2024 |
URL | 查看原文 |
语种 | 英语 |
来源库 | IEEE |
文献类型 | 会议论文 |
条目标识符 | https://kms.shanghaitech.edu.cn/handle/2MSLDSTB/467846 |
专题 | 信息科学与技术学院 信息科学与技术学院_PI研究组_石远明组 信息科学与技术学院_PI研究组_周勇组 信息科学与技术学院_硕士生 信息科学与技术学院_博士生 信息科学与技术学院_PI研究组_刘鑫组 |
作者单位 | 1.School of Information Science and Technology, ShanghaiTech University, Shanghai, China 2.Department of Electrical and Computer Engineering, Aarhus University, Denmark 3.Automatic Control Laboratory, École Polytechnique Fédérale de Lausanne (EPFL), Switzerland |
第一作者单位 | 信息科学与技术学院 |
第一作者的第一单位 | 信息科学与技术学院 |
推荐引用方式 GB/T 7714 | Zhiyong Yu,Hei Victor Cheng,Zhanpeng Yang,et al. Microservice Deployment for Satellite Edge AI Inference via Deep Reinforcement Learning[C],2024. |
条目包含的文件 | ||||||
文件名称/大小 | 文献类型 | 版本类型 | 开放类型 | 使用许可 |
修改评论
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。