KMS

Browse/search results: 3 items in total, showing items 1-3

What Are Step-Level Reward Models Rewarding? Counterintuitive Findings from MCTS-Boosted Mathematical Reasoning (Conference paper)
AAAI 2025
Authors: Ma, Yiran; Chen, Zui; Liu, Tianqiao; Tian, Mi; Liu, Zhuo
Adobe PDF (656Kb) | Views/Downloads: 15/2 | Submitted: 2025/03/09
Advancing Math Reasoning in Language Models: The Impact of Problem-Solving Data, Data Synthesis Methods, and Training Stages (Preprint)
2025
Authors: Chen, Zui; Liu, Tianqiao; Tian, Mi; Tong, Qing; Luo, Weiqi
Views/Downloads: 5/0 | Submitted: 2025/03/25
What Are Step-Level Reward Models Rewarding? Counterintuitive Findings from MCTS-Boosted Mathematical Reasoning (Preprint)
2024
Authors: Ma, Yiran; Chen, Zui; Liu, Tianqiao; Tian, Mi; Liu, Zhuo
Views/Downloads: 14/0 | Submitted: 2025/02/12