Real-time inference for Stable Diffusion - 0.88s latency. Covers AITemplate, nvFuser, TensorRT, FlashAttention. Join our Discord communty: https://discord.com/invite/TgHXuSJEk6
-
Updated
Dec 4, 2023 - Jupyter Notebook
Real-time inference for Stable Diffusion - 0.88s latency. Covers AITemplate, nvFuser, TensorRT, FlashAttention. Join our Discord communty: https://discord.com/invite/TgHXuSJEk6
Scripts from Neural network inference on Pytorch with tools like ONNX, TensorRT, nvFuser, TorchDynamo, Triton
Add a description, image, and links to the nvfuser topic page so that developers can more easily learn about it.
To associate your repository with the nvfuser topic, visit your repo's landing page and select "manage topics."