Skip to content

Latest commit

 

History

History
10 lines (8 loc) · 278 Bytes

README.md

File metadata and controls

10 lines (8 loc) · 278 Bytes

GEMM_CUDA_study

Enviroments

  • Windows 10 laptop
  • CPU 11th Gen Intel(R) Core(TM) i7-11375H @ 3.30GHz (cpu)
  • NVIDIA GeForce RTX 3060 Laptop GPU (gpu)

CUDA GEMM Convolution

  • GEMM Convolution
  • process : im2col kernel -> Matrix Multiplication(cublas) -> col2im kernel