AI Fundermentals

深入理解 GPU 架构

在准备在 GPU 上运行的应用程序时，了解 GPU 硬件设计的主要特性并了解与 CPU 的相似之处和不同之处会很有帮助。本路线图适用于那些对 GPU 比较陌生或只是想了解更多有关 GPU 中计算机技术的人。不需要特定的并行编程经验，练习基于 CUDA 工具包中包含的标准 NVIDIA 示例程序。

GPU 特性
GPU 内存
GPU Example: Tesla V100
GPUs on Frontera: RTX 5000
练习：
- Exercise: Device Query
- Exercise: Device Bandwidth

GPU 架构和编程模型介绍

GPU Architecture and Programming — An Introduction

其他相关知识点

深入理解 Nvidia CUDA 核心（vs. Tensor Cores vs. RT Cores)

CUDA 学习材料

快速入门

参考资料

监控与运维

性能分析与调优

LLM 基础

Article & Video

eBook

AI Infra

深度学习/机器学习

动手实践

DeepSeek

Useful Projects

unstructured:Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
MinerU:A high-quality tool for convert PDF to Markdown and JSON.
markitdown: Python tool for converting files and office documents to Markdown.
unsloth: About Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory!
ktransformers: A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations

Name		Name	Last commit message	Last commit date
Latest commit History 48 Commits
AISystem @ f3d74d4		AISystem @ f3d74d4
cuda		cuda
deepseek		deepseek
gpu_architecture		gpu_architecture
gpu_programming		gpu_programming
img		img
llm		llm
ops		ops
profiling		profiling
.gitignore		.gitignore
.gitmodules		.gitmodules
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AI Fundermentals

相关硬件知识

深入理解 GPU 架构

GPU 架构和编程模型介绍

其他相关知识点

CUDA 学习材料

快速入门

参考资料

监控与运维

性能分析与调优

LLM 基础

Article & Video

eBook

AI Infra

深度学习/机器学习

动手实践

DeepSeek

Useful Projects

RAG

Fine-Tuning

LLM 训练

从零开始训练 70B 模型

About

Releases

Packages

Languages

License

ForceInjection/AI-fundermentals

Folders and files

Latest commit

History

Repository files navigation

AI Fundermentals

相关硬件知识

深入理解 GPU 架构

GPU 架构和编程模型介绍

其他相关知识点

CUDA 学习材料

快速入门

参考资料

监控与运维

性能分析与调优

LLM 基础

Article & Video

eBook

AI Infra

深度学习/机器学习

动手实践

DeepSeek

Useful Projects

RAG

Fine-Tuning

LLM 训练

从零开始训练 70B 模型

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages