Added new Post-training an LLM using GRPO with TRL
recipe 🧑🍳️
#707
Run time
Learn about OS # on GitHub ActionsJob | Run time |
---|---|
34m 47s | |
34m 47s |
Post-training an LLM using GRPO with TRL
recipe 🧑🍳️
#707
Job | Run time |
---|---|
34m 47s | |
34m 47s |