Simple Agentic framework with batch generation #2830

August-murr · 2025-02-11T18:04:56Z

we need a simple agentic framework that can handle tool calls, run them, and then provide the output back in the chat for follow-up responses, especially focusing on Batch generations to maximize GPU utility.

For the initial version, we'll use a Transformers model, and later on, we'll switch to a VLLM version for better efficiency.

I’ve been avoiding Langchain and smolagents since they rely heavily on pre-written prompts, which overcomplicates and limits training, plus they aren’t designed for batch generation.

If there’s a way to achieve this with existing libraries like Langchain or smolagents, I’d love to hear your thoughts!

github-actions bot added the ✨ enhancement New feature or request label Feb 11, 2025

August-murr added the 🏋 GRPO Related to GRPO label Feb 11, 2025

August-murr mentioned this issue Feb 11, 2025

simple agentic framework utils file #2831

Draft

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Simple Agentic framework with batch generation #2830

Simple Agentic framework with batch generation #2830

August-murr commented Feb 11, 2025

Simple Agentic framework with batch generation #2830

Simple Agentic framework with batch generation #2830

Comments

August-murr commented Feb 11, 2025