×
验证码:
换一张
忘记密码?
记住我
×
统一认证登录
登录
中文版
|
English
上海科技大学知识管理系统
ShanghaiTech University Knowledge Management System
统一认证登录
登录
注册
ALL
ORCID
题名
作者
发表日期
关键词
文献类型
DOI
出处
存缴日期
收录类别
出版者
学习讨论厅
图片搜索
粘贴图片网址
首页
研究单元&专题
作者
文献类型
学科分类
知识图谱
知识整合
学习讨论厅
在结果中检索
研究单元&专题
信息科学与技术学院 [3]
创业与管理学院 [1]
作者
刘鑫 [2]
周勇 [1]
汪寿阳 [1]
文献类型
会议论文 [2]
期刊论文 [2]
发表日期
2024 [1]
2022 [2]
2021 [1]
出处
IEEE INTER... [1]
INTERNATIO... [1]
PROCEEDING... [1]
TRANSPORTA... [1]
语种
英语 [4]
资助项目
NSF[ [2]
资助机构
收录类别
EI [4]
CPCI-S [2]
CPCI [1]
SCIE [1]
状态
已发表 [4]
×
知识图谱
KMS
反馈留言
浏览/检索结果:
共4条,第1-4条
帮助
已选(
0
)
清除
条数/页:
5
10
15
20
25
30
35
40
45
50
55
60
65
70
75
80
85
90
95
100
排序方式:
请选择
提交时间升序
提交时间降序
WOS被引频次升序
WOS被引频次降序
题名升序
题名降序
发表日期升序
发表日期降序
作者升序
作者降序
期刊影响因子升序
期刊影响因子降序
Adaptive rescheduling of rail transit services with short-turnings under disruptions via a multi-agent deep reinforcement learning approach
期刊论文
TRANSPORTATION RESEARCH PART B: METHODOLOGICAL, 2024, 卷号: 188
作者:
Ying, Chengshuo
;
Chow, Andy H.F.
;
Yan, Yimo
;
Kuo, Yong-Hong
;
Wang, Shouyang
Adobe PDF(8136Kb)
|
收藏
|
浏览/下载:309/2
|
提交时间:2024/09/20
Light rail transit
Multilayer neural networks
Railroad transportation
Reinforcement learning
Markov Decision Processes
Multi agent
Multi-agent deep reinforcement learning
Policy optimization
Proximal policy optimization
Rail transit
Reinforcement learnings
Short-turning
Train rescheduling
Transit services
A Provably-Efficient Model-Free Algorithm for Infinite-Horizon Average-Reward Constrained Markov Decision Processes
会议论文
PROCEEDINGS OF THE 36TH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, AAAI 2022, Virtual, Online, February 22, 2022 - March 1, 2022
作者:
Wei, Honghao
;
Liu, Xin
;
Ying, Lei
Adobe PDF(450Kb)
|
收藏
|
浏览/下载:245/0
|
提交时间:2023/03/10
Behavioral research
Markov processes
Reinforcement learning
Average reward
Constrained Markov decision process
Constraint violation
Infinite horizons
Model free
Model-free algorithms
Number of state
Reinforcement learning algorithms
Triple-Q: A Model-Free Algorithm for Constrained Reinforcement Learning with Sublinear Regret and Zero Constraint Violation
会议论文
INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151, null,null,ELECTR NETWORK, MAR 28-30, 2022
作者:
Wei, Honghao
;
Liu, Xin
;
Ying, Lei
Adobe PDF(3032Kb)
|
收藏
|
浏览/下载:361/0
|
提交时间:2022/08/23
Learning algorithms
Markov processes
Constrained Markov decision process
Constraint violation
Model free
Model-free algorithms
Q-functions
Q-values
Reinforcement learning algorithms
Reinforcement learnings
Sublinear
Value functions
Optimizing Information Freshness for Cooperative IoT Systems with Stochastic Arrivals
期刊论文
IEEE INTERNET OF THINGS JOURNAL, 2021, 卷号: 8, 期号: 19, 页码: 14485-14500
作者:
Bohai Li
;
Qian Wang
;
He Chen
;
Yong Zhou
;
Yonghui Li
Adobe PDF(1649Kb)
|
收藏
|
浏览/下载:887/188
|
提交时间:2021/12/03
Markov processes
Stochastic systems
Transmissions
Closed
form expression
Error probabilities
Internet of Things (IOT)
Markov Decision Processes
Near
optimal performance
Optimal scheduling
Relaying protocols
Stochastic arrivals
首页
上一页
1
下一页
末页