Cascaded Code Distributed Computing With Low Complexity and Improved Flexibility
2024-01-28
状态已发表
摘要

Coded distributed computing, proposed by Li et al., offers significant potential for reducing the communication load in MapReduce computing systems. In the setting of the cascaded coded distributed computing that consisting of K nodes, N input files, and Q output functions, the objective is to compute each output function through s > 1 nodes with a computation load r > 1, enabling the application of coding techniques during the Shuffle phase to achieve minimum communication load. However, for most existing coded distributed computing schemes, a major limitation lies in their demand for splitting the original data into an exponentially growing number of input files in terms of N/(K) â N and requiring r an exponentially large number of output functions Q/(K ) â N, which imposes stringent requirements s for implementation and results in significant coding complexity when K is large. In this paper, we focus on the cascaded case of K/s â N, deliberately designing the strategy of data placement and output functions assignment based on a grouping method, such that a low-complexity Shuffle strategy of two rounds is achievable. The main advantages of the proposed scheme include: 1) the multicast gains equal to (r + s â 1)(1â 1/s) and r + s â 1 which is approximate to r + s â 1 when s is relatively large, and the communication load is quite approximate to or surprisingly better than the optimal state -of -the -art scheme proposed by Li et al.; 2) the proposed scheme requires significantly less number of input files and output functions; 3) all the operations are implemented over the minimum binary field F2 in the one-shot fashion. Finally, we derive a new information-theoretic converse bound for the cascaded coded distributed computing framework, under the given strategies of data placement and output functions assignment. We demonstrate that the communication load of the proposed scheme is order optimal within a factor of 2; and is also approximately optimal when K is sufficiently large for a given r.

关键词Cascaded coded distributed computing MapReduce computation-communication tradeoff
语种英语
DOIarXiv:2307.14927
相关网址查看原文
出处Arxiv
收录类别PPRN.PPRN
WOS记录号PPRN:74129293
WOS类目Computer Science, Information Systems ; Mathematics
文献类型预印本
条目标识符https://kms.shanghaitech.edu.cn/handle/2MSLDSTB/381330
专题信息科学与技术学院
信息科学与技术学院_PI研究组_吴幼龙组
通讯作者Zhang, Mingming
作者单位
1.Guangxi Normal Univ, Guangxi Key Lab Multisource Informat Min & Secur, Guilin 541004, Peoples R China
2.ShanghaiTech Univ, Sch Informat Sci & Technol, Shanghai 201210, Peoples R China
3.Guangxi Normal Univ, Sch Math & Stat, Guilin 541006, Peoples R China
推荐引用方式
GB/T 7714
Zhang, Mingming,Wu, Youlong,Cheng, Minquan,et al. Cascaded Code Distributed Computing With Low Complexity and Improved Flexibility. 2024.
条目包含的文件
文件名称/大小 文献类型 版本类型 开放类型 使用许可
个性服务
查看访问统计
谷歌学术
谷歌学术中相似的文章
[Zhang, Mingming]的文章
[Wu, Youlong]的文章
[Cheng, Minquan]的文章
百度学术
百度学术中相似的文章
[Zhang, Mingming]的文章
[Wu, Youlong]的文章
[Cheng, Minquan]的文章
必应学术
必应学术中相似的文章
[Zhang, Mingming]的文章
[Wu, Youlong]的文章
[Cheng, Minquan]的文章
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。