ShanghaiTech University Knowledge Management System
Browse/search results: 12 records in total, showing records 1-10
Research on Agents Decision-making Ability based on Adaptive Deep Learning
Conference paper
ACM INTERNATIONAL CONFERENCE PROCEEDING SERIES, Singapore, Singapore, August 7, 2024 - August 9, 2024
Authors: Chen, Qianyu
Adobe PDF(252Kb) | Views/Downloads: 18/1 | Submitted: 2025/04/25
Contrastive Learning
Convolutional neural networks
Deep learning
Efficiency
Federated learning
Reinforcement learning
Adaptive decision making
Adaptive deep learning
Changing environment
Convolutional neural network
Decision-making ability
Decisions makings
Experience replay
Intelligence decision
Policy optimization
Research results
Adaptive rescheduling of rail transit services with short-turnings under disruptions via a multi-agent deep reinforcement learning approach
Journal article
TRANSPORTATION RESEARCH PART B: METHODOLOGICAL, 2024, Volume: 188
Authors: Ying, Chengshuo; Chow, Andy H.F.; Yan, Yimo; Kuo, Yong-Hong; Wang, Shouyang
Adobe PDF(8136Kb) | Views/Downloads: 323/2 | Submitted: 2024/09/20
Light rail transit
Multilayer neural networks
Railroad transportation
Reinforcement learning
Markov Decision Processes
Multi agent
Multi-agent deep reinforcement learning
Policy optimization
Proximal policy optimization
Rail transit
Reinforcement learnings
Short-turning
Train rescheduling
Transit services
An intelligent process parameters optimization approach for directed energy deposition of nickel-based alloys using deep reinforcement learning
Journal article
JOURNAL OF MANUFACTURING PROCESSES, 2024, Volume: 120, Pages: 1130-1140
Authors: Shuai, Shi; Xuewen, Liu; Zhongan, Wang; Hai, Chang; Yingna, Wu
Adobe PDF(3961Kb) | Views/Downloads: 442/3 | Submitted: 2024/05/16
Cost effectiveness
Deep learning
Deposition
Heuristic methods
Nickel alloys
Optimization
Vickers hardness
Deep reinforcement learning
Directed energy
Directed energy deposition
Energy depositions
Policy optimization
Process parameters
Proximal policy optimization
Reinforcement learnings
Temperature simulator
Vickers hardness measurements
Multi-Level Progressive Reinforcement Learning for Control Policy in Physical Simulations
Conference paper
2024 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), Yokohama, Japan, 13-17 May 2024
Authors: Kefei Wu; Xuming He; Yang Wang; Xiaopei Liu
Adobe PDF(2701Kb) | Views/Downloads: 391/4 | Submitted: 2024/08/26
Intelligent virtual agents
Reinforcement learning
Control policy
Fluid simulations
Model free
Multilevels
Physical simulation
Real-world scenario
Reinforcement learning algorithms
Reinforcement learnings
Simulation took
Training model
A Deep Reinforcement Learning Approach to Efficient Distributed Optimization
Preprint
2023
Authors: Zhu, Daokuan; Lu, Jie
Adobe PDF(482Kb) | Views/Downloads: 437/2 | Submitted: 2024/01/09
Distributed optimization
reinforcement learning
learning to optimize
proximal policy optimization
Online Nonstochastic Control with Adversarial and Static Constraints
Conference paper
PROCEEDINGS OF MACHINE LEARNING RESEARCH, Honolulu, HI, United States, July 23, 2023 - July 29, 2023
Authors: Liu, Xin; Yang, Zixian; Ying, Lei
Adobe PDF(1822Kb) | Views/Downloads: 181/0 | Submitted: 2023/11/10
Constrained optimization
Linear control systems
Machine learning
Subroutines
Constraint violation
Control policy
Control problems
Cumulative cost
Linear controls
Non-stochastic
Online convex optimizations
State of the art
Static constraints
Sublinear
TRUST REGION POLICY OPTIMISATION IN MULTI-AGENT REINFORCEMENT LEARNING
Conference paper
ICLR 2022 - 10TH INTERNATIONAL CONFERENCE ON LEARNING REPRESENTATIONS, Virtual, Online, April 25, 2022 - April 29, 2022
Authors: Kuba, Jakub Grudzien; Chen, Ruiqing; Wen, Muning; Wen, Ying; Sun, Fanglei
Adobe PDF(1257Kb) | Views/Downloads: 316/0 | Submitted: 2023/04/28
Fertilizers
Game theory
Multi agent systems
Software agents
Heterogeneous agents
Learn+
Monotonics
Multi agent
Multi-agent reinforcement learning
Policy optimization
Property
Reinforcement learning agent
Trust region
Trust-region methods
SEMI: Self-supervised Exploration via Multisensory Incongruity
Conference paper
PROCEEDINGS - IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, Philadelphia, PA, United States, May 23, 2022 - May 27, 2022
Authors: Jianren Wang; Ziwen Zhuang; Hang Zhao
Adobe PDF(788Kb) | Views/Downloads: 449/98 | Submitted: 2022/09/09
Reinforcement learning
Exploration policy
Extrinsic rewards
Input and outputs
Intrinsic rewards
Learn+
Multisensory
Policy model
Reinforcement learnings
Sensory input
Standing problems
'Could You Describe the Reason for the Transfer?': A Reinforcement Learning Based Voice-Enabled Bot Protecting Customers from Financial Frauds
Conference paper
INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, PROCEEDINGS, Virtual, Online, Australia, November 1, 2021 - November 5, 2021
Authors: Wang, Zihao; Wang, Fudong; Zhang, Haipeng; Yang, Minghui; Cao, Shaosheng
Adobe PDF(1897Kb) | Views/Downloads: 289/0 | Submitted: 2021/12/03
Finance
Losses
Reinforcement learning
Sales
Speech processing
Dialog policy
Dialog risk detection
Dialogue systems
E-payments
Financial fraud
Financial loss
Outbound bot
Reinforcement learnings
Risk detections
Users' experiences
Sparse Gaussian Processes-based Black-Box Data-efficient Policy Search for Robotics
Conference paper
2021 20TH INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS, ICAR 2021, Ljubljana, Slovenia, December 6, 2021 - December 10, 2021
Authors: Chunyan Rong; Jingyi Huang; Andre Rosendo
Adobe PDF(341Kb) | Views/Downloads: 374/0 | Submitted: 2022/12/09
Deep learning
Drops
Gaussian distribution
Gaussian noise (electronic)
Reinforcement learning
Robotics
Black boxes
Gaussian Processes
Learn+
Learning problem
Policy search
Process-based
Reinforcement learnings
Search Algorithms
Search spaces
Sparse Gaussian process