×
验证码:
换一张
忘记密码?
记住我
×
统一认证登录
登录
中文版
|
English
上海科技大学知识管理系统
ShanghaiTech University Knowledge Management System
统一认证登录
登录
注册
ALL
ORCID
题名
作者
发表日期
关键词
文献类型
DOI
出处
存缴日期
收录类别
出版者
学习讨论厅
图片搜索
粘贴图片网址
首页
研究单元&专题
作者
文献类型
学科分类
知识图谱
知识整合
学习讨论厅
在结果中检索
研究单元&专题
信息科学与技术学院 [4]
作者
任海蒙 [2]
徐兆辉 [2]
周平强 [1]
娄鑫 [1]
虞晶怡 [1]
万浩川 [1]
更多...
文献类型
会议论文 [4]
发表日期
2025 [3]
2024 [1]
出处
2025 IEEE ... [1]
INTERNATIO... [1]
LECTURE NO... [1]
PROCEEDING... [1]
语种
英语 [4]
资助项目
Central Gu... [1]
National K... [1]
National K... [1]
National N... [1]
资助机构
收录类别
EI [4]
CPCI-S [2]
状态
已发表 [3]
×
知识图谱
KMS
反馈留言
浏览/检索结果:
共4条,第1-4条
帮助
已选(
0
)
清除
条数/页:
5
10
15
20
25
30
35
40
45
50
55
60
65
70
75
80
85
90
95
100
排序方式:
请选择
期刊影响因子升序
期刊影响因子降序
提交时间升序
提交时间降序
题名升序
题名降序
作者升序
作者降序
发表日期升序
发表日期降序
WOS被引频次升序
WOS被引频次降序
Make LLM Inference Affordable to Everyone: Augmenting GPU Memory with NDP-DIMM
会议论文
2025 IEEE INTERNATIONAL SYMPOSIUM ON HIGH PERFORMANCE COMPUTER ARCHITECTURE (HPCA), Las Vegas, NV, USA, 1-5 March 2025
作者:
Lian Liu
;
Shixin Zhao
;
Bing Li
;
Haimeng Ren
;
Zhaohui Xu
Adobe PDF(1300Kb)
|
收藏
|
浏览/下载:34/1
|
提交时间:2025/04/14
Analog storage
Computer graphics equipment
Graphics processing unit
Neurons
Problem oriented languages
Static random access storage
'current
Computational loads
Cost effective
Data processing units
Language model
Model inference
Modeling parameters
Performance
Real- time
Weight parameters
Accelerating Mini-batch HGNN Training by Reducing CUDA Kernels
会议论文
LECTURE NOTES IN COMPUTER SCIENCE (INCLUDING SUBSERIES LECTURE NOTES IN ARTIFICIAL INTELLIGENCE AND LECTURE NOTES IN BIOINFORMATICS), Macau, China, October 29, 2024 - October 31, 2024
作者:
Wu, Meng
;
Qiu, Jingkai
;
Yan, Mingyu
;
Li, Wenming
;
Zhang, Yang
收藏
|
浏览/下载:339/0
|
提交时间:2025/03/14
Computer graphics equipment - Digital storage - Graphics processing unit - Heterogeneous networks
Feature matrices - Graph neural networks - Heterogeneous graph - Heterogeneous graph neural network - Memory bounds - Neural networks trainings - Semantics Information - Single kernel - Structure information - Time bound
COMET: Towards Practical W4A4KV4 LLMs Serving
会议论文
INTERNATIONAL CONFERENCE ON ARCHITECTURAL SUPPORT FOR PROGRAMMING LANGUAGES AND OPERATING SYSTEMS - ASPLOS, Rotterdam, Netherlands, March 30, 2025 - April 3, 2025
作者:
Liu, Lian
;
Cheng, Long
;
Ren, Haimeng
;
Xu, Zhaohui
;
Pan, Yudong
Adobe PDF(2187Kb)
|
收藏
|
浏览/下载:37/1
|
提交时间:2025/05/09
Cache memory
Compaction
Computer graphics equipment
Graphics processing unit
Integrated circuit design
Modeling languages
Problem oriented languages
Algorithm
system co
design
Bit weight
Co
designs
Language model
Large language model serving
Large language model quantization
Mixed precision
Modeling quantizations
Quantisation
ZeroTetris: A Spacial Feature Similarity-based Sparse MLP Engine for Neural Volume Rendering
会议论文
PROCEEDINGS - DESIGN AUTOMATION CONFERENCE, San Francisco, CA, United states, June 23, 2024 - June 27, 2024
作者:
Wan, Haochuan
;
Ma, Linjie
;
Li, Antong
;
Zhou, Pingqiang
;
Yu, Jingyi
Adobe PDF(1288Kb)
|
收藏
|
浏览/下载:204/3
|
提交时间:2024/12/27
Computer graphics equipment
Interactive computer graphics
Matrix algebra
Multilayer neural networks
Particle accelerators
Rendering (computer graphics)
Computational requirements
Hardware accelerators
MAtrix multiplication
Multilayers perceptrons
Neural volume rendering
Neural-networks
Photorealistic rendering
Sparse matrices
Sparse matrix multiplication
Virtual worlds
首页
上一页
1
下一页
末页