×
验证码:
换一张
忘记密码?
记住我
×
统一认证登录
登录
中文版
|
English
上海科技大学知识管理系统
ShanghaiTech University Knowledge Management System
统一认证登录
登录
注册
ALL
ORCID
题名
作者
发表日期
关键词
文献类型
DOI
出处
存缴日期
收录类别
出版者
学习讨论厅
图片搜索
粘贴图片网址
首页
研究单元&专题
作者
文献类型
学科分类
知识图谱
知识整合
学习讨论厅
在结果中检索
研究单元&专题
信息科学与技术学院 [9]
生命科学与技术学院 [1]
作者
殷树 [3]
娄鑫 [2]
吴天元 [2]
杨易为 [2]
李冠呈 [2]
任海蒙 [2]
更多...
文献类型
会议论文 [6]
期刊论文 [3]
发表日期
2025 [3]
2024 [2]
2023 [1]
2022 [3]
出处
IEEE TRANS... [2]
2024 IEEE ... [1]
2024 IEEE ... [1]
2025 IEEE ... [1]
IEEE TRANS... [1]
INTERNATIO... [1]
更多...
语种
英语 [9]
资助项目
National K... [1]
National N... [1]
Natural Sc... [1]
资助机构
收录类别
EI [9]
SCI [3]
CPCI-S [2]
SCIE [1]
SCOPUS [1]
状态
已发表 [8]
×
知识图谱
KMS
反馈留言
浏览/检索结果:
共9条,第1-9条
帮助
已选(
0
)
清除
条数/页:
5
10
15
20
25
30
35
40
45
50
55
60
65
70
75
80
85
90
95
100
排序方式:
请选择
作者升序
作者降序
期刊影响因子升序
期刊影响因子降序
发表日期升序
发表日期降序
WOS被引频次升序
WOS被引频次降序
提交时间升序
提交时间降序
题名升序
题名降序
Make LLM Inference Affordable to Everyone: Augmenting GPU Memory with NDP-DIMM
会议论文
2025 IEEE INTERNATIONAL SYMPOSIUM ON HIGH PERFORMANCE COMPUTER ARCHITECTURE (HPCA), Las Vegas, NV, USA, 1-5 March 2025
作者:
Lian Liu
;
Shixin Zhao
;
Bing Li
;
Haimeng Ren
;
Zhaohui Xu
Adobe PDF(1300Kb)
|
收藏
|
浏览/下载:31/1
|
提交时间:2025/04/14
Analog storage
Computer graphics equipment
Graphics processing unit
Neurons
Problem oriented languages
Static random access storage
'current
Computational loads
Cost effective
Data processing units
Language model
Model inference
Modeling parameters
Performance
Real- time
Weight parameters
Accelerating Mini-batch HGNN Training by Reducing CUDA Kernels
会议论文
LECTURE NOTES IN COMPUTER SCIENCE (INCLUDING SUBSERIES LECTURE NOTES IN ARTIFICIAL INTELLIGENCE AND LECTURE NOTES IN BIOINFORMATICS), Macau, China, October 29, 2024 - October 31, 2024
作者:
Wu, Meng
;
Qiu, Jingkai
;
Yan, Mingyu
;
Li, Wenming
;
Zhang, Yang
收藏
|
浏览/下载:334/0
|
提交时间:2025/03/14
Computer graphics equipment - Digital storage - Graphics processing unit - Heterogeneous networks
Feature matrices - Graph neural networks - Heterogeneous graph - Heterogeneous graph neural network - Memory bounds - Neural networks trainings - Semantics Information - Single kernel - Structure information - Time bound
COMET: Towards Practical W4A4KV4 LLMs Serving
会议论文
INTERNATIONAL CONFERENCE ON ARCHITECTURAL SUPPORT FOR PROGRAMMING LANGUAGES AND OPERATING SYSTEMS - ASPLOS, Rotterdam, Netherlands, March 30, 2025 - April 3, 2025
作者:
Liu, Lian
;
Cheng, Long
;
Ren, Haimeng
;
Xu, Zhaohui
;
Pan, Yudong
Adobe PDF(2187Kb)
|
收藏
|
浏览/下载:35/1
|
提交时间:2025/05/09
Cache memory
Compaction
Computer graphics equipment
Graphics processing unit
Integrated circuit design
Modeling languages
Problem oriented languages
Algorithm
system co
design
Bit weight
Co
designs
Language model
Large language model serving
Large language model quantization
Mixed precision
Modeling quantizations
Quantisation
An FPGA Accelerator for 3D Cone-beam Sparse-view Computed Tomography Reconstruction
会议论文
2024 IEEE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE CIRCUITS AND SYSTEMS, Abu Dhabi, United Arab Emirates, 22-25 April 2024
作者:
Gu YH(顾雨涵)
;
Wu Q(吴晴)
;
Yuan ZC(袁哲晨)
;
Zhang XY(张湘煜)
;
Su WY(苏文艳)
Adobe PDF(2638Kb)
|
收藏
|
浏览/下载:326/3
|
提交时间:2024/05/30
Computer graphics
Computerized tomography
Energy efficiency
Graphics processing unit
Image reconstruction
Ionizing radiation
Medical imaging
Program processors
3d self-supervised projection network
Computed tomography
Cone beam
Cross sectional image
FPGA accelerator
Human bodies
Imaging method
Projection network
Sparse-view computed tomography
Tomography reconstruction
Portus: Efficient DNN Checkpointing to Persistent Memory with Zero-Copy
会议论文
2024 IEEE 44TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS), Jersey City, NJ, USA, 23-26 July 2024
作者:
Yuanhao Li
;
Tianyuan Wu
;
Guancheng Li
;
Yanjie Song
;
Shu Yin
Adobe PDF(1359Kb)
|
收藏
|
浏览/下载:332/12
|
提交时间:2024/08/26
Graphics processing unit
Memory architecture
Problem oriented languages
Static random access storage
Check pointing
Index structure
Model training
Performance
Persistence memory
Persistent memory
RDMA
System for AI
Three-level
Zero copy
An Energy-Efficient Stream-Based FPGA Implementation of Feature Extraction Algorithm for LiDAR Point Clouds With Effective Local-Search
期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I: REGULAR PAPERS, 2023, 卷号: 70, 期号: 1, 页码: 1-13
作者:
Sun, Hao
;
Deng, Qi
;
Liu, Xinzhe
;
Shu, Yuhao
;
Ha, Yajun
Adobe PDF(3454Kb)
|
收藏
|
浏览/下载:542/0
|
提交时间:2022/11/25
Computer graphics
Energy efficiency
Extraction
Feature extraction
Field programmable gate arrays (FPGA)
Graphics processing unit
Matrix algebra
Program processors
Robotics
Features extraction
Field programmable gate array
Field programmables
Local search
Localization and mappings
Point cloud compression
Point-clouds
Programmable gate array
Simultaneously localization and mapping
Sparse matrices
Task analysis
Critique of MemXCT: memory-centric X-ray CT reconstruction with massive parallelization by SCC Team from ShanghaiTech University
期刊论文
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2022, 卷号: 33, 期号: 9, 页码: 2047-2049
作者:
Yuchen Liu
;
Yixuan Meng
;
Kaiyuan Xu
;
Zijun Xu
;
Tianyuan Wu
Adobe PDF(946Kb)
|
收藏
|
浏览/下载:523/0
|
提交时间:2021/12/17
Computer graphics
Computerized tomography
Graphics processing unit
Image reconstruction
Iterative methods
Matrix algebra
Memory architecture
Program processors
Graphic processing unit
Graphics processing
MemXCT
Performances evaluation
Processing units
Reproducibilities
Reproducibility of result
Sparse matrices
Spstudent cluster challenge
Cache-locality Based Adaptive Warp Scheduling for Neural Network Acceleration on GPGPUs
会议论文
INTERNATIONAL SYSTEM ON CHIP CONFERENCE, Belfast, Northern Ireland, United kingdom, September 5, 2022 - September 8, 2022
作者:
Hu, Weiming
;
Zhou, Yi
;
Quan, Ying
;
Wang, Yuanfeng
;
Lou, Xin
Adobe PDF(1352Kb)
|
收藏
|
浏览/下载:367/0
|
提交时间:2022/11/11
Convolution
Graphics processing unit
Multilayer neural networks
Network layers
Neural network models
Program processors
Scheduling
Cache locality
Convolutional neural network
General purpose graph processing unit
Graph processing
Neural network model
Performance
Processing units
Scheduling policies
Warp scheduling
Critique of A Parallel Framework for Constraint-Based Bayesian Network Learning via Markov Blanket Discovery by SCC Team from ShanghaiTech University
期刊论文
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2022, 卷号: 34, 期号: 6, 页码: 1-4
作者:
Li, Guancheng
;
Cao, Songhui
;
Zhao, Chuyi
;
Zhang, Siyuan
;
Ji, Yuchen
Adobe PDF(409Kb)
|
收藏
|
浏览/下载:383/9
|
提交时间:2022/09/30
Benchmarking
Computer graphics
Graphics processing unit
Multitasking
Program processors
Random access storage
Statistical tests
Bayes method
Benchmark testing
Biological system modeling
Case study in scientific application
Case-studies
Noise measurements
Non-volatile memory
Nonvolatile memory
Scientific applications
Task analysis
Usability testing
首页
上一页
1
下一页
末页