-
Notifications
You must be signed in to change notification settings - Fork 1.2k
[WIP] Agent Training with Remote Service + Gym like protocal #973
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
rebase main
@sunjin-k The amazon team will be figuring out the right way to contribute back to the main. |
Hello, has there been any further progress on this work? |
@ChrisRBXiong I am rebasing this PR into |
Add full authors of SGLang RL team. Thanks!
Implement logics for support agent training:
The implementation is based on the environment implemented in https://github.com/HMJiangGatech/verl_agent_env_examples
The environment supports:
The communication with the environment is through chat message lists in openai format.
The implementation needed in Verl are:
Experimental Script: