Actions: huggingface/trl

All workflows

Showing runs from all workflows
20,469 workflow runs

Are there any tips and tricks about GRPO reward function design?
Hugging Face Issue Labeler #134: Issue #2832 opened by MohamedAliRashad
February 11, 2025 20:27 55s
GRPO Environments for custom multi-step rollouts (vLLM-only)
Build PR Documentation #6554: Pull request #2810 synchronize by willccbb
February 11, 2025 19:26 Action required willccbb:grpo-envs
GRPO Environments for custom multi-step rollouts (vLLM-only)
Tests #7420: Pull request #2810 synchronize by willccbb
February 11, 2025 19:26 Action required willccbb:grpo-envs
Upload PR Documentation
Upload PR Documentation #4726: completed by August-murr
February 11, 2025 18:12 24s
Simple Agentic framework with batch generation
Hugging Face Issue Labeler #133: Issue #2830 opened by August-murr
February 11, 2025 18:04 37s
🥾 Allow bootstrap GRPO (#2829)
Build documentation #1126: Commit 7347c29 pushed by qgallouedec
February 11, 2025 17:56 4m 16s main
🥾 Allow bootstrap GRPO (#2829)
Secret Leaks #2412: Commit 7347c29 pushed by qgallouedec
February 11, 2025 17:56 18s main
🥾 Allow bootstrap GRPO (#2829)
Slow tests (on push) #503: Commit 7347c29 pushed by qgallouedec
February 11, 2025 17:56 25m 43s main
🥾 Allow bootstrap GRPO (#2829)
Tests #7418: Commit 7347c29 pushed by qgallouedec
February 11, 2025 17:56 33m 52s main
pages build and deployment
pages-build-deployment #1137: by qgallouedec
February 11, 2025 17:56 37s main
👴 Update tokenizer parameter to processing_class in tests (#2828)
Secret Leaks #2411: Commit 2106b31 pushed by August-murr
February 11, 2025 17:00 15s agentic-grpo
Add generation caching in TextEnvironment and fix bugs in TextEnvironment
Build PR Documentation #6552: Pull request #2556 synchronize by konrad-gerlach
February 11, 2025 14:29 Action required konrad-gerlach:text_environment_caching
Upload PR Documentation
Upload PR Documentation #4725: completed by kashif
February 11, 2025 14:25 43s
🥾 Allow bootstrap GRPO
Build PR Documentation #6551: Pull request #2829 synchronize by kashif
February 11, 2025 14:21 3m 43s bootstrap-grpo
🥾 Allow bootstrap GRPO
Tests #7416: Pull request #2829 synchronize by kashif
February 11, 2025 14:21 36m 51s bootstrap-grpo
Secret Leaks
Secret Leaks #2410: by kashif
February 11, 2025 14:19 17s bootstrap-grpo
Upload PR Documentation
Upload PR Documentation #4724: completed by qgallouedec
February 11, 2025 13:33 29s
🥾 Allow bootstrap GRPO
Build PR Documentation #6550: Pull request #2829 opened by qgallouedec
February 11, 2025 13:30 3m 35s bootstrap-grpo
🥾 Allow bootstrap GRPO
Tests #7415: Pull request #2829 opened by qgallouedec
February 11, 2025 13:30 36m 51s bootstrap-grpo
allow bootstrap grpo
Secret Leaks #2409: Commit 3680d55 pushed by qgallouedec
February 11, 2025 13:29 16s bootstrap-grpo
📤 GRPO refactor loading the model weights to vllm (#2817)
Secret Leaks #2408: Commit b9df810 pushed by qgallouedec
February 11, 2025 13:29 13s bootstrap-grpo
👴 Update tokenizer parameter to processing_class in tests (#2828)
Build documentation #1125: Commit 2106b31 pushed by qgallouedec
February 11, 2025 10:46 3m 31s main