×
验证码:
换一张
忘记密码?
记住我
×
统一认证登录
登录
中文版
|
English
上海科技大学知识管理系统
ShanghaiTech University Knowledge Management System
统一认证登录
登录
注册
ALL
ORCID
题名
作者
发表日期
关键词
文献类型
DOI
出处
存缴日期
收录类别
出版者
学习讨论厅
图片搜索
粘贴图片网址
首页
研究单元&专题
作者
文献类型
学科分类
知识图谱
知识整合
学习讨论厅
在结果中检索
研究单元&专题
信息科学与技术学院 [7]
作者
刘鑫 [7]
郭亨铨 [3]
朱琪 [1]
文献类型
会议论文 [7]
发表日期
2024 [2]
2023 [2]
2022 [3]
出处
PROCEEDING... [2]
INTERNATIO... [1]
LEARNING F... [1]
NEURIPS 20... [1]
PROCEEDING... [1]
PROCEEDING... [1]
更多...
语种
英语 [7]
资助项目
NSF[ [2]
Shanghai S... [1]
资助机构
收录类别
EI [7]
CPCI-S [5]
CPCI [1]
状态
已发表 [7]
×
知识图谱
KMS
反馈留言
浏览/检索结果:
共7条,第1-7条
帮助
已选(
0
)
清除
条数/页:
5
10
15
20
25
30
35
40
45
50
55
60
65
70
75
80
85
90
95
100
排序方式:
请选择
题名升序
题名降序
发表日期升序
发表日期降序
期刊影响因子升序
期刊影响因子降序
提交时间升序
提交时间降序
WOS被引频次升序
WOS被引频次降序
作者升序
作者降序
Stochastic Constrained Contextual Bandits via Lyapunov Optimization Based Estimation to Decision Framework
会议论文
PROCEEDINGS OF MACHINE LEARNING RESEARCH, Edmonton, AB, Canada, June 30, 2024 - July 3, 2024
作者:
Guo, Hengquan
;
Liu, Xin
Adobe PDF(1646Kb)
|
收藏
|
浏览/下载:306/3
|
提交时间:2024/10/11
Constrained optimization
Cost functions
Lyapunov functions
Lyapunov methods
Regression analysis
Constraint violation
Contextual banditti
Decision framework
Function class
General functions
Off-line problems
Optimisations
Realizability conditions
Slater condition
Stochastics
Safe Reinforcement Learning with Instantaneous Constraints: The Role of Aggressive Exploration
会议论文
PROCEEDINGS OF THE AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, Vancouver, BC, Canada, February 20, 2024 - February 27, 2024
作者:
Wei, Honghao
;
Liu, Xin
;
Ying, Lei
Adobe PDF(518Kb)
|
收藏
|
浏览/下载:390/45
|
提交时间:2024/04/26
Learning systems
Reinforcement learning
Action sets
Constraint violation
Cost-function
Functions approximations
Hard constraints
Information gain
Linear functions
Matchings
Reinforcement learnings
Reproducing Kernel Hilbert spaces
Rectified Pessimistic-Optimistic Learning for Stochastic Continuum-armed Bandit with Constraints.
会议论文
LEARNING FOR DYNAMICS AND CONTROL CONFERENCE, VOL 211, Philadelphia, PA, United states, June 15, 2023 - June 16, 2023
作者:
Guo Hengquan
;
Zhu Qi
;
Liu Xin
Adobe PDF(3079Kb)
|
收藏
|
浏览/下载:364/11
|
提交时间:2023/03/25
Stochastic models
Bayesian optimization
Black boxes
Constraint functions
Constraint violation
Cumulative constraints
Hard constraints
Optimistics
Reward function
Stochastic continuum-armed bandit
Stochastics
Hard constraint
Online Nonstochastic Control with Adversarial and Static Constraints
会议论文
PROCEEDINGS OF MACHINE LEARNING RESEARCH, Honolulu, HI, United states, July 23, 2023 - July 29, 2023
作者:
Liu, Xin
;
Yang, Zixian
;
Ying, Lei
Adobe PDF(1822Kb)
|
收藏
|
浏览/下载:176/0
|
提交时间:2023/11/10
Constrained optimization
Linear control systems
Machine learning
Subroutines
Constraint violation
Control policy
Control problems
Cumulative cost
Linear controls
Non-stochastic
Online convex optimizations
State of the art
Static constraints
Sublinear
Online Convex Optimization with Hard Constraints: Towards the Best of Two Worlds and Beyond.
会议论文
NEURIPS 2022, New Orleans, LA, United states, November 28, 2022 - December 9, 2022
作者:
Guo Hengquan
;
Liu X(刘鑫)
;
Wei Honghao
;
Ying Lei
Adobe PDF(2556Kb)
|
收藏
|
浏览/下载:422/5
|
提交时间:2023/03/25
Constraint violation
Hard constraints
Loss functions
Online convex optimizations
Online optimization algorithms
Soft constraint
Time varying
A Provably-Efficient Model-Free Algorithm for Infinite-Horizon Average-Reward Constrained Markov Decision Processes
会议论文
PROCEEDINGS OF THE 36TH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, AAAI 2022, Virtual, Online, February 22, 2022 - March 1, 2022
作者:
Wei, Honghao
;
Liu, Xin
;
Ying, Lei
Adobe PDF(450Kb)
|
收藏
|
浏览/下载:245/0
|
提交时间:2023/03/10
Behavioral research
Markov processes
Reinforcement learning
Average reward
Constrained Markov decision process
Constraint violation
Infinite horizons
Model free
Model-free algorithms
Number of state
Reinforcement learning algorithms
Triple-Q: A Model-Free Algorithm for Constrained Reinforcement Learning with Sublinear Regret and Zero Constraint Violation
会议论文
INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151, null,null,ELECTR NETWORK, MAR 28-30, 2022
作者:
Wei, Honghao
;
Liu, Xin
;
Ying, Lei
Adobe PDF(3032Kb)
|
收藏
|
浏览/下载:360/0
|
提交时间:2022/08/23
Learning algorithms
Markov processes
Constrained Markov decision process
Constraint violation
Model free
Model-free algorithms
Q-functions
Q-values
Reinforcement learning algorithms
Reinforcement learnings
Sublinear
Value functions
首页
上一页
1
下一页
末页