Skip to content

Commit

Permalink
Merge branch 'main' of https://github.com/huggingface/cookbook into l…
Browse files Browse the repository at this point in the history
…lm-grpo-trl
  • Loading branch information
sergiopaniego committed Jan 31, 2025
2 parents 0574739 + 8273c06 commit 9300986
Show file tree
Hide file tree
Showing 3 changed files with 405 additions and 2 deletions.
2 changes: 2 additions & 0 deletions notebooks/en/_toctree.yml
Original file line number Diff line number Diff line change
Expand Up @@ -110,6 +110,8 @@
title: Smol Multimodal RAG, Building with ColSmolVLM and SmolVLM on Colab's Free-Tier GPU
- local: fine_tuning_vlm_dpo_smolvlm_instruct
title: Fine-tuning SmolVLM using direct preference optimization (DPO) with TRL on a consumer GPU
- local: structured_generation_vision_language_models
title: Structured Generation from Images or Documents Using Vision Language Models

- title: Search Recipes
isExpanded: false
Expand Down
3 changes: 1 addition & 2 deletions notebooks/en/index.md
Original file line number Diff line number Diff line change
Expand Up @@ -8,11 +8,10 @@ applications and solving various machine learning tasks using open-source tools
Check out the recently added notebooks:

- [Post-training an LLM using GRPO with TRL](fine_tuning_llm_grpo_trl)
- [Structured Generation from Images or Documents Using Vision Language Models](structured_generation_vision_language_models)
- [Vector Search on Hugging Face with the Hub as Backend](vector_search_with_hub_as_backend)
- [Multi-Agent Order Management System with MongoDB](mongodb_smolagents_multi_micro_agents)
- [Scaling Test-Time Compute for Longer Thinking in LLMs](search_and_learn)
- [Signature-Aware Model Serving from MLflow with Ray Serve](mlflow_ray_serve)


You can also check out the notebooks in the cookbook's [GitHub repo](https://github.com/huggingface/cookbook).

Expand Down
Loading

0 comments on commit 9300986

Please # to comment.