📚200+ Tensor/CUDA Cores Kernels, ⚡️flash-attn-mma, ⚡️hgemm with WMMA, MMA and CuTe (98%~100% TFLOPS of cuBLAS/FA2 🎉🎉).
-
Updated
Feb 24, 2025 - Cuda
📚200+ Tensor/CUDA Cores Kernels, ⚡️flash-attn-mma, ⚡️hgemm with WMMA, MMA and CuTe (98%~100% TFLOPS of cuBLAS/FA2 🎉🎉).
Deep learning in Rust, with shape checked tensors and neural networks
Thin, unified, C++-flavored wrappers for the CUDA APIs
Safe rust wrapper around CUDA toolkit
A simple bash script for switching between installed versions of CUDA.
Julia support for native CUDA programming
GitHub Action to install CUDA
VGG-19 deep learning model trained using ISCX 2012 IDS Dataset
Ubuntu 18.04 How to install Nvidia driver + CUDA + CUDNN + build tensorflow for gpu step by step command line
CUDAfy .NET allows easy development of high performance GPGPU applications completely from the .NET. It's developed in C#.
Install CUDA on Windows11 using WSL2
Tutorial to install NVIDIA Drivers, CUDA 11.4 and cuDNN for deep learning programming on Ubuntu 20.04.
Multi-Instrument music generation using C-RNN-GAN with MIDI format input 🎼
Installation guide for NVIDIA driver, CUDA, cuDNN and TensorRT
HTML/JS port of CUDA Occupancy Calculator
A light-weighted and flexible C++ differentiable programming library. Just replace float and double with it, and it does Auto-Grad for you...
CUDA Programming Practices
A basic jupyterhub with Nvidia GPU accessibility.
Generate and explore fractals with Python and CUDA
Add a description, image, and links to the cuda-toolkit topic page so that developers can more easily learn about it.
To associate your repository with the cuda-toolkit topic, visit your repo's landing page and select "manage topics."