ShanghaiTech University Knowledge Management System

Browse/search results: 7 items in total, showing items 1-7

Stochastic Constrained Contextual Bandits via Lyapunov Optimization Based Estimation to Decision Framework
    Conference paper. PROCEEDINGS OF MACHINE LEARNING RESEARCH, Edmonton, AB, Canada, June 30, 2024 - July 3, 2024
    Authors: Guo, Hengquan; Liu, Xin
    Adobe PDF (1646 Kb) | Views/downloads: 325/3 | Submitted: 2024/10/11
    Keywords: Constrained optimization; Cost functions; Lyapunov functions; Lyapunov methods; Regression analysis; Constraint violation; Contextual banditti; Decision framework; Function class; General functions; Off-line problems; Optimisations; Realizability conditions; Slater condition; Stochastics
Safe Reinforcement Learning with Instantaneous Constraints: The Role of Aggressive Exploration
    Conference paper. PROCEEDINGS OF THE AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, Vancouver, BC, Canada, February 20, 2024 - February 27, 2024
    Authors: Wei, Honghao; Liu, Xin; Ying, Lei
    Adobe PDF (518 Kb) | Views/downloads: 409/52 | Submitted: 2024/04/26
    Keywords: Learning systems; Reinforcement learning; Action sets; Constraint violation; Cost-function; Functions approximations; Hard constraints; Information gain; Linear functions; Matchings; Reinforcement learnings; Reproducing Kernel Hilbert spaces
Rectified Pessimistic-Optimistic Learning for Stochastic Continuum-armed Bandit with Constraints.
    Conference paper. LEARNING FOR DYNAMICS AND CONTROL CONFERENCE, VOL 211, Philadelphia, PA, United States, June 15, 2023 - June 16, 2023
    Authors: Guo Hengquan; Zhu Qi; Liu Xin
    Adobe PDF (3079 Kb) | Views/downloads: 376/11 | Submitted: 2023/03/25
    Keywords: Stochastic models; Bayesian optimization; Black boxes; Constraint functions; Constraint violation; Cumulative constraints; Hard constraints; Optimistics; Reward function; Stochastic continuum-armed bandit; Stochastics; Hard constraint
Online Nonstochastic Control with Adversarial and Static Constraints
    Conference paper. PROCEEDINGS OF MACHINE LEARNING RESEARCH, Honolulu, HI, United States, July 23, 2023 - July 29, 2023
    Authors: Liu, Xin; Yang, Zixian; Ying, Lei
    Adobe PDF (1822 Kb) | Views/downloads: 181/0 | Submitted: 2023/11/10
    Keywords: Constrained optimization; Linear control systems; Machine learning; Subroutines; Constraint violation; Control policy; Control problems; Cumulative cost; Linear controls; Non-stochastic; Online convex optimizations; State of the art; Static constraints; Sublinear
Online Convex Optimization with Hard Constraints: Towards the Best of Two Worlds and Beyond.
    Conference paper. NEURIPS 2022, New Orleans, LA, United States, November 28, 2022 - December 9, 2022
    Authors: Guo Hengquan; Liu X(刘鑫); Wei Honghao; Ying Lei
    Adobe PDF (2556 Kb) | Views/downloads: 435/5 | Submitted: 2023/03/25
    Keywords: Constraint violation; Hard constraints; Loss functions; Online convex optimizations; Online optimization algorithms; Soft constraint; Time varying
A Provably-Efficient Model-Free Algorithm for Infinite-Horizon Average-Reward Constrained Markov Decision Processes
    Conference paper. PROCEEDINGS OF THE 36TH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, AAAI 2022, Virtual, Online, February 22, 2022 - March 1, 2022
    Authors: Wei, Honghao; Liu, Xin; Ying, Lei
    Adobe PDF (450 Kb) | Views/downloads: 250/0 | Submitted: 2023/03/10
    Keywords: Behavioral research; Markov processes; Reinforcement learning; Average reward; Constrained Markov decision process; Constraint violation; Infinite horizons; Model free; Model-free algorithms; Number of state; Reinforcement learning algorithms
Triple-Q: A Model-Free Algorithm for Constrained Reinforcement Learning with Sublinear Regret and Zero Constraint Violation
    Conference paper. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151, ELECTR NETWORK, MAR 28-30, 2022
    Authors: Wei, Honghao; Liu, Xin; Ying, Lei
    Adobe PDF (3032 Kb) | Views/downloads: 370/0 | Submitted: 2022/08/23
    Keywords: Learning algorithms; Markov processes; Constrained Markov decision process; Constraint violation; Model free; Model-free algorithms; Q-functions; Q-values; Reinforcement learning algorithms; Reinforcement learnings; Sublinear; Value functions
