Microservice Deployment for Satellite Edge AI Inference via Deep Reinforcement Learning

doi:10.1109/PIMRC59610.2024.10817161

	Microservice Deployment for Satellite Edge AI Inference via Deep Reinforcement Learning
	Zhiyong Yu1 ; Hei Victor Cheng 2; Zhanpeng Yang1 ; Xin Liu1 ; Yuning Jiang 3; Yong Zhou1 ; Yuanming Shi1
	2024
会议录名称	2024 IEEE 35TH INTERNATIONAL SYMPOSIUM ON PERSONAL, INDOOR AND MOBILE RADIO COMMUNICATIONS (PIMRC)
ISSN	2166-9570
发表状态	已发表
DOI	10.1109/PIMRC59610.2024.10817161
摘要	Artificial intelligence (AI) is critical in evolving 5G and developing 6G networks, running on edge devices, and solving resource management challenges. The burgeoning number of edge devices draws attention to the potential of low-earth orbit (LEO) satellite networks with their onboard computing capabilities for edge inference. This paper explores LEO scenarios where multiple remote sensing edge AI inference tasks concurrently process data from a single source. However, due to there being parts with the same functions between different AI applications, traditional monolithic edge AI architecture must be deployed repeatedly and falls short in efficiently harnessing the heterogeneous resources of LEO satellite networks. To solve this problem, we utilize the microservice architecture to decouple a single AI application into several independent microservices to reuse these same functions. However, due to the high latency caused by multiple microservices’ communication, we need to design a deployment strategy to fully utilize resources to reduce the service latency. We present a microservice deployment model to minimize the total service latency across all AI applications and meet resource constraints with the constraints of hardware, energy, and memory limitations. This latency optimization problem is rewritten as a Markov decision process (MDP) to effectively deal with the challenge posed by the time-varying transmission rate caused by satellite mobility. To increase the training data utilization, we employ a Proximal Policy Optimization (PPO) based reinforcement learning algorithm to meet the dynamic environment challenge. Finally, we obtain a sub-optimal solution with minimal accuracy loss and an acceptable solution time.
会议地点	Valencia, Spain
会议日期	2-5 Sept. 2024
URL	查看原文
语种	英语
来源库	IEEE
文献类型	会议论文
条目标识符	https://kms.shanghaitech.edu.cn/handle/2MSLDSTB/467846
专题	信息科学与技术学院信息科学与技术学院_PI研究组_石远明组信息科学与技术学院_PI研究组_周勇组信息科学与技术学院_硕士生信息科学与技术学院_博士生信息科学与技术学院_PI研究组_刘鑫组
作者单位	1.School of Information Science and Technology, ShanghaiTech University, Shanghai, China 2.Department of Electrical and Computer Engineering, Aarhus University, Denmark 3.Automatic Control Laboratory, École Polytechnique Fédérale de Lausanne (EPFL), Switzerland
第一作者单位	信息科学与技术学院
第一作者的第一单位	信息科学与技术学院
推荐引用方式 GB/T 7714	Zhiyong Yu,Hei Victor Cheng,Zhanpeng Yang,et al. Microservice Deployment for Satellite Edge AI Inference via Deep Reinforcement Learning[C],2024.