消息
×
loading..
FMS-FPS: An FPGA-Based Fast Multi-Stage Farthest Point Sampling Accelerator with Reusable Modular Architecture and Efficient Search
2025-03
发表期刊IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I: REGULAR PAPERS (IF:5.2[JCR-2023],4.5[5-Year])
ISSN1558-0806
发表状态已投递待接收
摘要

Farthest Point Sampling (FPS) is a widely used uniform downsampling algorithm in point cloud object detection and classification. However, its high computational complexity and the large size of modern point clouds make real-time performance challenging. Existing FPS accelerators fail to fully optimize their algorithms and hardware architectures for large scale point clouds. In this paper, we propose FMS-FPS, an FPGA based Fast Multi-Stage FPS accelerator that incorporates three key innovations: First, we introduce a density-adaptive multi stage FPS algorithm that adaptively adjusts search strategies based on point cloud density. This approach reduces memory accesses by around 90% during coarse-grain sampling and computation time by around 50% during fine-grain sampling. Second, we design a high-efficiency reusable modular architecture that reuses a limited set of deeply pipelined hardware modules across all stages. This significantly reduces resource utilization while enabling highly parallel processing of large-scale point clouds. Third, we develop an ultra-low-latency neighboring voxel selector that employs a two-level compressed lookup table to reduce storage requirements by around 60% and achieve a one-clock-cycle latency for voxel selection. Additionally, a multi pointer FIFO mechanism ensures efficient processing of non empty voxels and eliminates redundant computations. Experi ments show that FMS-FPS achieves a runtime of 4.06 ms for 120K-point clouds in the KITTI dataset, providing a 340.2× speedup over GPU implementations. Compared to the state-of the-art ASIC accelerator, FuseFPS, FMS-FPS achieves 1.47× and 1.83× speedups for 16K and 120K-point clouds, respectively. To the best of our knowledge, FMS-FPS is the first FPS accelerator implemented on FPGA rather than in simulation.

关键词FPGA hardware acceleration farthest point sampling point cloud algorithm-architecture co-design
语种英语
文献类型期刊论文
条目标识符https://kms.shanghaitech.edu.cn/handle/2MSLDSTB/496997
专题信息科学与技术学院_硕士生
信息科学与技术学院_PI研究组_哈亚军组
信息科学与技术学院_博士生
通讯作者Hao, Sun; Yajun, Ha
作者单位
1.ShanghaiTech University
2.AMD AECG Group
3.Southeast University
第一作者单位上海科技大学
通讯作者单位上海科技大学
第一作者的第一单位上海科技大学
推荐引用方式
GB/T 7714
Chunxu, Guo,Shaoyi, Chen,Yunhao, Hu,et al. FMS-FPS: An FPGA-Based Fast Multi-Stage Farthest Point Sampling Accelerator with Reusable Modular Architecture and Efficient Search[J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I: REGULAR PAPERS,2025.
APA Chunxu, Guo,Shaoyi, Chen,Yunhao, Hu,Zhiqi, Zhou,Hao, Sun,&Yajun, Ha.(2025).FMS-FPS: An FPGA-Based Fast Multi-Stage Farthest Point Sampling Accelerator with Reusable Modular Architecture and Efficient Search.IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I: REGULAR PAPERS.
MLA Chunxu, Guo,et al."FMS-FPS: An FPGA-Based Fast Multi-Stage Farthest Point Sampling Accelerator with Reusable Modular Architecture and Efficient Search".IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I: REGULAR PAPERS (2025).
条目包含的文件
文件名称/大小 文献类型 版本类型 开放类型 使用许可
个性服务
查看访问统计
谷歌学术
谷歌学术中相似的文章
[Chunxu, Guo]的文章
[Shaoyi, Chen]的文章
[Yunhao, Hu]的文章
百度学术
百度学术中相似的文章
[Chunxu, Guo]的文章
[Shaoyi, Chen]的文章
[Yunhao, Hu]的文章
必应学术
必应学术中相似的文章
[Chunxu, Guo]的文章
[Shaoyi, Chen]的文章
[Yunhao, Hu]的文章
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。