Jiayi-Pan / TinyZero Public

Notifications You must be signed in to change notification settings
Fork 1.4k
Star 11.1k

Code
Issues 57
Pull requests 10
Actions
Projects
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Actions
Projects
Security
Insights

Issues: Jiayi-Pan/TinyZero

New issue

Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? # to your account

57 Open 19 Closed

Author

Filter by author

Label

Filter by label

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Milestones

Filter by milestone

Assignee

Filter by who’s assigned

Assigned to nobody

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Issues list

In the process of training, my think process will be all invalid output (reread).

#95 opened Mar 11, 2025 by royal-dargon

Error: No available node types can fulfill resource request defaultdict(<class 'float'>, {'CPU': 2.0, 'GPU': 2.0}). Add suitable node types to this cluster to resolve this issue.

#94 opened Mar 6, 2025 by pzs19

how to run main_generation and main_eval?

#92 opened Mar 5, 2025 by Makesomethingbetter

Minimal RAM requirements?

#91 opened Mar 3, 2025 by shinandrew

Some weights of Qwen2ForTokenClassification were not initialized from the model checkpoint at /mnt/usercache/huggingface/Qwen2.5-3B-Instruct and are newly initialized: ['score.bias', 'score.weight']

#90 opened Mar 2, 2025 by MJinXiang

Question on WandB Experiments for GRPO

#89 opened Feb 28, 2025 by ArEsKay3

ValueError: Model architectures ['Qwen2ForCausalLM'] are not supported for now.

#88 opened Feb 28, 2025 by zhufz

Performance Bottlenecks and Optimization in Multi-GPU Parallel Training

#87 opened Feb 27, 2025 by Jinyi6

How to debug parallel ray

#86 opened Feb 24, 2025 by HarideP

After I trained for 500 steps, the length of think became smaller and smaller, and even disappeared.

#85 opened Feb 22, 2025 by yaxundai

Shape matching error

#83 opened Feb 21, 2025 by lixiaochuan2020

Missing trainer_state.json

#82 opened Feb 21, 2025 by anavarroa

内存最小需要多少呢？为什么500G内存还是会不够？

#79 opened Feb 19, 2025 by iaoxuesheng

模型只会加减，不会乘除。。。

#77 opened Feb 18, 2025 by LianhaoXue

ray start timeout

#75 opened Feb 16, 2025 by heningsu

Ray OOM

#74 opened Feb 16, 2025 by aivolcano

PPO vs GRPO time and space efficiency

#73 opened Feb 15, 2025 by Lineark

When will the TinyZero be able to support multi-node train of the GRPO algorithm?

#70 opened Feb 14, 2025 by echo-valor

Finally generated a checkpoint, how do I see use the checkpoint now to see if the training succeeded?

#69 opened Feb 14, 2025 by jaganrvce1

[Bug?] Evaluation failed

#68 opened Feb 14, 2025 by XuhuiZhou

Why am I generating a lot of endoftext here

#66 opened Feb 12, 2025 by ZhengChenYang

Questions about OOM when running Qwen2.5-0.5B, 1.5B, and 3B on RTX4090 graphics cards OOM

#64 opened Feb 12, 2025 by patrickstar-sjh

everything is normal in the beginning until answer after <think> suddenly all become !

#63 opened Feb 12, 2025 by momo4826

Minimum VRAM requirements

#62 opened Feb 11, 2025 by soulde

ray: Fatal Python error: Floating point exception when running on H20

#61 opened Feb 11, 2025 by AkaliKong

Previous 1 2 3 Next

Previous Next

ProTip! Follow long discussions with comments:>50.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly