Finally generated a checkpoint; how do I use it now to see if the training succeeded? #69

Open
jaganrvce1 opened this issue Feb 14, 2025 · 1 comment

Comments

@jaganrvce1

  • Is there sample code someone can point me to that loads the checkpoint, so I can see if the training worked?
  • There seem to be two checkpoints, one for the critic and one for the actor. How do I combine them in a single program?

I would appreciate it if someone could point me to a piece of code that shows how the actor and critic are used together. Thanks!

@prvnsmpth

I think you can just load the actor model using Hugging Face Transformers and run inference. The critic model only exists to evaluate the actor model's outputs during training, so it is not needed at inference time.
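Something along these lines might work as a starting point. It is only a sketch and assumes the actor checkpoint was saved in Hugging Face format (a directory containing `config.json`, the weights, and the tokenizer files). The helper names, the checkpoint path, and the prompt are all made up for illustration, so adjust them to your setup.

```python
from pathlib import Path


def looks_like_hf_checkpoint(ckpt_dir: str) -> bool:
    """Return True if the directory contains a Hugging Face config.json."""
    return (Path(ckpt_dir) / "config.json").is_file()


def generate_from_actor(ckpt_dir: str, prompt: str, max_new_tokens: int = 128) -> str:
    """Load the actor checkpoint as a causal LM and generate a completion."""
    # Imported lazily so the check above works without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    if not looks_like_hf_checkpoint(ckpt_dir):
        raise FileNotFoundError(f"no config.json found in {ckpt_dir}")

    # Assumption: the tokenizer was saved next to the model weights.
    tokenizer = AutoTokenizer.from_pretrained(ckpt_dir)
    model = AutoModelForCausalLM.from_pretrained(ckpt_dir)
    model.eval()

    inputs = tokenizer(prompt, return_tensors="pt")
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)

    # Keep only the newly generated tokens, dropping the echoed prompt.
    new_tokens = output_ids[0, inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)


# Example call (hypothetical path and prompt):
# print(generate_from_actor("path/to/actor/checkpoint", "What is 12 * 7?"))
```

To judge whether training worked, you could run the same prompts through the base model and through the trained actor and compare the outputs. The critic checkpoint is not loaded anywhere here.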
