🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
-
Updated
Sep 3, 2025 - Python
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
Unified Multimodal Model for image generation/editing/understanding
🤗A Unified Cache Acceleration Toolbox for All DiTs in Diffusers: Qwen-Image-Edit, Qwen-Image, FLUX, Wan, etc.
✨✨latest advancements of RL in generative ai
为 Qwen-Image 和 Qwen-Image-Edit 封装 OpenAI 兼容接口的 Web 服务。A web service that encapsulates OpenAI-compatible interfaces for Qwen-Image and Qwen-Image-Edit.
An enhanced launcher for Qwen-Image, with client-server architecture and queueing of the tasks
🐙 ComfyUI_RH_Qwen-Image delivers high-quality image generation with Qwen-Image, excels at Chinese text rendering and supports multiple aspect ratios.
The VLM Framework is an extensible, production-ready system for working with Vision-Language Models. It provides a unified interface for different VLM models while maintaining flexibility for future extensions.
SECourses Musubi Tuner - 1-Click to Install App for LoRA Training and Full Fine Tuning Qwen Image, Qwen Image Edit, Wan 2.1 and Wan 2.2 Models with Musubi Tuner with Ready Presets
Add a description, image, and links to the qwen-image topic page so that developers can more easily learn about it.
To associate your repository with the qwen-image topic, visit your repo's landing page and select "manage topics."