KMS

浏览/检索结果: 共23条,第1-10条 帮助

已选(0)清除 条数/页:   排序方式:
Lookup Table Refactoring: Towards Efficient Logarithmic Number System Addition for Large Language Models 会议论文
2025 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE (DATE), Lyon, France, 31 March-2 April 2025
作者:  Xinkuang Geng;  Siting Liu;  Hui Wang;  Jie Han;  Honglan Jiang
Adobe PDF(663Kb)  |  收藏  |  浏览/下载:17/1  |  提交时间:2025/05/26
QuantTPM: Efficient Mixed-Precision Quantization Framework for Tractable Probabilistic Models 期刊论文
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2025, 卷号: PP, 期号: 99
作者:  Shen Zhang;  Bin Ning;  Guangyao Yan;  Xinzhe Liu;  Weixiong Jiang
Adobe PDF(9186Kb)  |  收藏  |  浏览/下载:92/2  |  提交时间:2025/03/03
Pushing the Limit of Post-Training Quantization 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2025, 卷号: PP, 期号: 99
作者:  Ruihao Gong;  Xianglong Liu;  Yuhang Li;  Yunqiang Fan;  Xiuying Wei
Adobe PDF(3735Kb)  |  收藏  |  浏览/下载:55/3  |  提交时间:2025/03/29
COMET: Towards Practical W4A4KV4 LLMs Serving 会议论文
INTERNATIONAL CONFERENCE ON ARCHITECTURAL SUPPORT FOR PROGRAMMING LANGUAGES AND OPERATING SYSTEMS - ASPLOS, Rotterdam, Netherlands, March 30, 2025 - April 3, 2025
作者:  Liu, Lian;  Cheng, Long;  Ren, Haimeng;  Xu, Zhaohui;  Pan, Yudong
Adobe PDF(2187Kb)  |  收藏  |  浏览/下载:34/1  |  提交时间:2025/05/09
Content-Aware Radiance Fields: Aligning Model Complexity with Scene Intricacy Through Learned Bitwidth Quantization 预印本
2024
作者:  Liu, Weihang;  Zheng, Xue Xian;  Yu, Jingyi;  Lou, Xin
收藏  |  浏览/下载:209/0  |  提交时间:2024/12/04
Selective Focus: Investigating Semantics Sensitivity in Post-training Quantization for Lane Detection 会议论文
PROCEEDINGS OF THE AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, Vancouver, BC, Canada, February 20, 2024 - February 27, 2024
作者:  Fan, Yunqian;  Wei, Xiuying;  Gong, Ruihao;  Ma, Yuqing;  Zhang, Xiangguo
Adobe PDF(574Kb)  |  收藏  |  浏览/下载:412/1  |  提交时间:2024/04/26
Task-Oriented Sensing, Computation, and Communication Integration for Multi-Device Edge AI 期刊论文
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2024, 卷号: 23, 期号: 3, 页码: 2486-2502
作者:  Dingzhu Wen;  Peixi Liu;  Guangxu Zhu;  Yuanming Shi;  Jie Xu
Adobe PDF(7791Kb)  |  收藏  |  浏览/下载:549/10  |  提交时间:2023/10/07
QUQ: Quadruplet Uniform Quantization for Efficient Vision Transformer Inference 会议论文
PROCEEDINGS - DESIGN AUTOMATION CONFERENCE, San Francisco, CA, United states, June 23, 2024 - June 27, 2024
作者:  Geng, Xinkuang;  Liu, Siting;  Liu, Leibo;  Han, Jie;  Jiang, Honglan
Adobe PDF(860Kb)  |  收藏  |  浏览/下载:212/2  |  提交时间:2024/12/27
Compact Powers-of-Two: An Efficient Non-Uniform Quantization for Deep Neural Networks 会议论文
2024 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE), Valencia, Spain, 25-27 March 2024
作者:  Xinkuang Geng;  Siting Liu;  Jianfei Jiang;  Kai Jiang;  Honglan Jiang
Adobe PDF(407Kb)  |  收藏  |  浏览/下载:287/2  |  提交时间:2024/06/17
Drift: Leveraging Distribution-based Dynamic Precision Quantization for Efficient Deep Neural Network Acceleration 会议论文
PROCEEDINGS - DESIGN AUTOMATION CONFERENCE, San Francisco, CA, United states, June 23, 2024 - June 27, 2024
作者:  Liu, Lian;  Xu, Zhaohui;  He, Yintao;  Wang, Ying;  Li, Huawei
Adobe PDF(809Kb)  |  收藏  |  浏览/下载:145/3  |  提交时间:2024/12/27
  • 首页
  • 上一页
  • 1
  • 2
  • 3
  • 下一页
  • 末页