KMS

浏览/检索结果: 共17条,第1-10条 帮助

已选(0)清除 条数/页:   排序方式:
Lookup Table Refactoring: Towards Efficient Logarithmic Number System Addition for Large Language Models 会议论文
2025 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE (DATE), Lyon, France, 31 March-2 April 2025
作者:  Xinkuang Geng;  Siting Liu;  Hui Wang;  Jie Han;  Honglan Jiang
Adobe PDF(663Kb)  |  收藏  |  浏览/下载:19/1  |  提交时间:2025/05/26
Verification of Bit-Flip Attacks against Quantized Neural Networks 期刊论文
PROCEEDINGS OF THE ACM ON PROGRAMMING LANGUAGES, 2025, 卷号: 9, 期号: OOPSLA1
作者:  Zhang, Yedi;  Huang, Lei;  Gao, Pengfei;  Song, Fu;  Sun, Jun
Adobe PDF(1010Kb)  |  收藏  |  浏览/下载:15/1  |  提交时间:2025/05/30
QuantTPM: Efficient Mixed-Precision Quantization Framework for Tractable Probabilistic Models 期刊论文
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2025, 卷号: PP, 期号: 99
作者:  Shen Zhang;  Bin Ning;  Guangyao Yan;  Xinzhe Liu;  Weixiong Jiang
Adobe PDF(9186Kb)  |  收藏  |  浏览/下载:95/2  |  提交时间:2025/03/03
Pushing the Limit of Post-Training Quantization 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2025, 卷号: PP, 期号: 99
作者:  Ruihao Gong;  Xianglong Liu;  Yuhang Li;  Yunqiang Fan;  Xiuying Wei
Adobe PDF(3735Kb)  |  收藏  |  浏览/下载:62/3  |  提交时间:2025/03/29
Certified Quantization Strategy Synthesis for Neural Networks 会议论文
LECTURE NOTES IN COMPUTER SCIENCE (INCLUDING SUBSERIES LECTURE NOTES IN ARTIFICIAL INTELLIGENCE AND LECTURE NOTES IN BIOINFORMATICS), Milan, Italy, September 9, 2024 - September 13, 2024
作者:  Zhang, Yedi;  Chen, Guangke;  Song, Fu;  Sun, Jun;  Dong, Jin Song
收藏  |  浏览/下载:373/0  |  提交时间:2024/10/11
COMET: Towards Practical W4A4KV4 LLMs Serving 会议论文
INTERNATIONAL CONFERENCE ON ARCHITECTURAL SUPPORT FOR PROGRAMMING LANGUAGES AND OPERATING SYSTEMS - ASPLOS, Rotterdam, Netherlands, March 30, 2025 - April 3, 2025
作者:  Liu, Lian;  Cheng, Long;  Ren, Haimeng;  Xu, Zhaohui;  Pan, Yudong
Adobe PDF(2187Kb)  |  收藏  |  浏览/下载:38/1  |  提交时间:2025/05/09
Selective Focus: Investigating Semantics Sensitivity in Post-training Quantization for Lane Detection 会议论文
PROCEEDINGS OF THE AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, Vancouver, BC, Canada, February 20, 2024 - February 27, 2024
作者:  Fan, Yunqian;  Wei, Xiuying;  Gong, Ruihao;  Ma, Yuqing;  Zhang, Xiangguo
Adobe PDF(574Kb)  |  收藏  |  浏览/下载:416/1  |  提交时间:2024/04/26
QUQ: Quadruplet Uniform Quantization for Efficient Vision Transformer Inference 会议论文
PROCEEDINGS - DESIGN AUTOMATION CONFERENCE, San Francisco, CA, United states, June 23, 2024 - June 27, 2024
作者:  Geng, Xinkuang;  Liu, Siting;  Liu, Leibo;  Han, Jie;  Jiang, Honglan
Adobe PDF(860Kb)  |  收藏  |  浏览/下载:218/2  |  提交时间:2024/12/27
A Customized Model for Defensing Against Adversarial Attacks 会议论文
2024 CONFERENCE OF SCIENCE AND TECHNOLOGY FOR INTEGRATED CIRCUITS (CSTIC), Shanghai, China, 17-18 March 2024
作者:  Jiang Sun;  Pingqiang Zhou
Adobe PDF(630Kb)  |  收藏  |  浏览/下载:214/4  |  提交时间:2024/06/03
Compact Powers-of-Two: An Efficient Non-Uniform Quantization for Deep Neural Networks 会议论文
2024 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE), Valencia, Spain, 25-27 March 2024
作者:  Xinkuang Geng;  Siting Liu;  Jianfei Jiang;  Kai Jiang;  Honglan Jiang
Adobe PDF(407Kb)  |  收藏  |  浏览/下载:290/2  |  提交时间:2024/06/17
  • 首页
  • 上一页
  • 1
  • 2
  • 下一页
  • 末页