关键词云

成果统计

合作作者[TOP 5]

  • 郭亨铨

    合作成果数:9

  • Ying, Lei

    合作成果数:9

  • Wei, Honghao

    合作成果数:7

  • 石远明

    合作成果数:5

  • 何静竹

    合作成果数:3

访问统计


  总访问量
 607

  访问来源
    内部: 26
    外部: 581
    国内: 515
    国外: 92

  年访问量
 109

  访问来源
    内部: 4
    外部: 105
    国内: 94
    国外: 15

  月访问量
 0

  访问来源
    内部: 0
    外部: 0
    国内: 0
    国外: 0

访问量

访问量

1. Exploration. Exploitation, and Engagement in Multi-Armed Bandits w.. [986]
2. Federated Reinforcement Learning for Electric Vehicles Charging Co.. [555]
3. POBO: Safe and optimal resource management for cloud microservices [505]
4. Large-System Insensitivity of Zero-Waiting Load Balancing Algorith.. [456]
5. Online Convex Optimization with Hard Constraints: Towards the Best.. [410]
6. Neural Constrained Combinatorial Bandits [384]
7. Safe Reinforcement Learning with Instantaneous Constraints: The Ro.. [363]
8. Rectified Pessimistic-Optimistic Learning for Stochastic Continuum.. [348]
9. Adversarially Trained Actor Critic for offline CMDPs [338]
10. Triple-Q: A Model-Free Algorithm for Constrained Reinforcement Lea.. [321]
11. Learning to Schedule Online Tasks with Bandit Feedback [317]
12. QueueFlower: Orchestrating Microservice Workflows via Dynamic Queu.. [312]
13. Large-System Insensitivity of Zero-Waiting Load Balancing Algorith.. [290]
14. Stochastic Constrained Contextual Bandits via Lyapunov Optimizatio.. [285]
15. Toward Electrical Vehicle Charging for Demand Response Using Mean-.. [281]
16. Universal Scaling of Distributed Queues Under Load Balancing in th.. [244]
17. A Provably-Efficient Model-Free Algorithm for Infinite-Horizon Ave.. [233]
18. Microservice Deployment for Satellite Edge AI Inference via Deep R.. [208]
19. Enhancing Safety in Reinforcement Learning with Human Feedback via.. [201]
20. Sample Efficient Reinforcement Learning in Mixed Systems through A.. [171]
21. Optimistic Joint Flow Control and Link Scheduling with Unknown Uti.. [167]
22. Scalable and Sample Efficient Distributed Policy Gradient Algorith.. [163]
23. Online Nonstochastic Control with Adversarial and Static Constrain.. [162]
24. A Reinforcement Learning and Prediction-Based Lookahead Policy for.. [161]
25. 微服务系统的资源分配优化方法、系统、存储介质及设备 [93]
26. DAAS: Dependency-aware Auto-scaling for Efficient Microservice Wor.. [55]
27. Microservice Deployment in Space Computing Power Networks via Robu.. [42]
28. Safe Learning in Stochastic Continuum-armed Bandit with Constraint.. [29]
29. Neural Constrained Combinatorial Bandits [28]
30. Exploration, Exploitation, and Engagement in Multi-Armed Bandits w.. [27]
31. On stochastic contextual bandits with knapsacks in small budget re.. [24]
32. Microservice Migration in Hybrid Satellite-Terrestrial Networks fo.. [24]
33. Adversarially Trained Weighted Actor-Critic for Safe Offline Reinf.. [22]
34. Learning universal knowledge graph embedding for predicting biomed.. [16]
35. On Preference-based Stochastic Linear Contextual Bandits with Knap.. [14]

下载量

1. Large-System Insensitivity of Zero-Waiting Load Balancing Algorith.. [201]
2. Exploration. Exploitation, and Engagement in Multi-Armed Bandits w.. [197]
3. Federated Reinforcement Learning for Electric Vehicles Charging Co.. [117]
4. Safe Reinforcement Learning with Instantaneous Constraints: The Ro.. [36]
5. Optimistic Joint Flow Control and Link Scheduling with Unknown Uti.. [34]
6. Rectified Pessimistic-Optimistic Learning for Stochastic Continuum.. [11]
7. QueueFlower: Orchestrating Microservice Workflows via Dynamic Queu.. [9]
8. Online Convex Optimization with Hard Constraints: Towards the Best.. [5]
9. POBO: Safe and optimal resource management for cloud microservices [5]
10. DAAS: Dependency-aware Auto-scaling for Efficient Microservice Wor.. [5]
11. Universal Scaling of Distributed Queues Under Load Balancing in th.. [3]
12. Adversarially Trained Actor Critic for offline CMDPs [3]
13. Stochastic Constrained Contextual Bandits via Lyapunov Optimizatio.. [3]
14. Neural Constrained Combinatorial Bandits [2]
15. Learning to Schedule Online Tasks with Bandit Feedback [2]
16. Large-System Insensitivity of Zero-Waiting Load Balancing Algorith.. [2]
17. 微服务系统的资源分配优化方法、系统、存储介质及设备 [2]
18. Neural Constrained Combinatorial Bandits [2]
19. Sample Efficient Reinforcement Learning in Mixed Systems through A.. [1]
20. Microservice Deployment for Satellite Edge AI Inference via Deep R.. [1]
21. On stochastic contextual bandits with knapsacks in small budget re.. [1]
22. Exploration, Exploitation, and Engagement in Multi-Armed Bandits w.. [1]
23. Adversarially Trained Weighted Actor-Critic for Safe Offline Reinf.. [1]
24. Learning universal knowledge graph embedding for predicting biomed.. [1]
25. Microservice Migration in Hybrid Satellite-Terrestrial Networks fo.. [1]