[Feature Request] omegaPRM reproduced by openR: data used to synthesize process reward models #1279

zjrwtx · 2024-12-05T09:02:15Z

Required prerequisites

I have searched the Issue Tracker and Discussions to confirm that this hasn't already been reported. (+1 or comment there if it has.)
Consider asking first in a Discussion.

Motivation

OmegaPRM, as reproduced by OpenR, generates the process-supervision data used to train process reward models (PRMs). This request is to add it to datagen.
Reference: https://github.com/openreasoner/openr/tree/main/data
Paper: https://arxiv.org/abs/2406.06592

Solution

Reference: https://github.com/openreasoner/openr/tree/main/data
Paper: https://arxiv.org/abs/2406.06592

Alternatives

A first experimental version is already done.

Additional context

A first experimental version is already done.

zjrwtx · 2024-12-05T09:06:11Z

I have already opened a PR: #1280

zjrwtx added the enhancement label Dec 5, 2024

zjrwtx self-assigned this Dec 5, 2024

zjrwtx added the Data and research labels Dec 5, 2024

Wendong-Fan added this to Project Camel Dec 15, 2024

Wendong-Fan added this to the Sprint 18 milestone Dec 15, 2024

Wendong-Fan linked a pull request Dec 15, 2024 that will close this issue

feat:omegaPRM reproduced by openR: Process-supervision Data Generation（PRM） #1280

Draft

13 tasks
