Issues: Jiayi-Pan/TinyZero
Issues list
During training, my think process becomes all invalid output (reread).
#95
opened Mar 11, 2025 by
royal-dargon
ValueError: Model architectures ['Qwen2ForCausalLM'] are not supported for now.
#88
opened Feb 28, 2025 by
zhufz
Performance Bottlenecks and Optimization in Multi-GPU Parallel Training
#87
opened Feb 27, 2025 by
Jinyi6
After I trained for 500 steps, the length of think became smaller and smaller, and even disappeared.
#85
opened Feb 22, 2025 by
yaxundai
When will TinyZero support multi-node training with the GRPO algorithm?
#70
opened Feb 14, 2025 by
echo-valor
Finally generated a checkpoint; how do I use it now to see if the training succeeded?
#69
opened Feb 14, 2025 by
jaganrvce1
Questions about OOM when running Qwen2.5-0.5B, 1.5B, and 3B on RTX4090 graphics cards
#64
opened Feb 12, 2025 by
patrickstar-sjh
Everything is normal in the beginning, until the answer after <think> suddenly becomes all "!"
#63
opened Feb 12, 2025 by
momo4826
ray: Fatal Python error: Floating point exception when running on H20
#61
opened Feb 11, 2025 by
AkaliKong