# transformers-architecture

Here are 4 public repositories matching this topic...


A clean, ground-up implementation of the Transformer architecture in PyTorch, including positional encoding, multi-head attention, encoder-decoder layers, and masking. Great for learning or building upon the core model.

  • Updated May 30, 2025
  • Jupyter Notebook
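
The listing shows only this repo's description, not its code, so the following is a rough, generic sketch of two of the components it names: sinusoidal positional encoding and masked multi-head attention, in the standard formulation from "Attention Is All You Need". All class and parameter names here (`PositionalEncoding`, `MultiHeadAttention`, `d_model`, `num_heads`) are illustrative assumptions, not taken from the repository itself.

```python
import math
import torch
import torch.nn as nn


class PositionalEncoding(nn.Module):
    """Sinusoidal positional encoding (assumes an even d_model)."""

    def __init__(self, d_model: int, max_len: int = 5000):
        super().__init__()
        position = torch.arange(max_len, dtype=torch.float).unsqueeze(1)
        div_term = torch.exp(
            torch.arange(0, d_model, 2, dtype=torch.float)
            * (-math.log(10000.0) / d_model)
        )
        pe = torch.zeros(max_len, d_model)
        pe[:, 0::2] = torch.sin(position * div_term)  # even dims: sine
        pe[:, 1::2] = torch.cos(position * div_term)  # odd dims: cosine
        self.register_buffer("pe", pe.unsqueeze(0))  # (1, max_len, d_model)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, seq_len, d_model); add the encoding for each position
        return x + self.pe[:, : x.size(1)]


class MultiHeadAttention(nn.Module):
    """Scaled dot-product attention split across several heads."""

    def __init__(self, d_model: int, num_heads: int):
        super().__init__()
        assert d_model % num_heads == 0
        self.d_k = d_model // num_heads
        self.num_heads = num_heads
        self.q_proj = nn.Linear(d_model, d_model)
        self.k_proj = nn.Linear(d_model, d_model)
        self.v_proj = nn.Linear(d_model, d_model)
        self.out_proj = nn.Linear(d_model, d_model)

    def forward(self, q, k, v, mask=None):
        batch = q.size(0)

        def split(x, proj):
            # Project, then reshape to (batch, heads, seq_len, d_k)
            return proj(x).view(batch, -1, self.num_heads, self.d_k).transpose(1, 2)

        q, k, v = split(q, self.q_proj), split(k, self.k_proj), split(v, self.v_proj)
        scores = q @ k.transpose(-2, -1) / math.sqrt(self.d_k)
        if mask is not None:
            # Positions where mask == 0 are hidden from attention
            scores = scores.masked_fill(mask == 0, float("-inf"))
        attn = scores.softmax(dim=-1)
        out = attn @ v  # (batch, heads, seq_len, d_k)
        out = out.transpose(1, 2).contiguous().view(batch, -1, self.num_heads * self.d_k)
        return self.out_proj(out)


if __name__ == "__main__":
    x = torch.randn(2, 10, 64)                     # (batch, seq_len, d_model)
    pe = PositionalEncoding(64)
    mha = MultiHeadAttention(64, num_heads=8)
    causal = torch.tril(torch.ones(10, 10))        # decoder-style causal mask
    y = mha(pe(x), pe(x), pe(x), mask=causal)
    print(y.shape)                                 # torch.Size([2, 10, 64])
```

A full encoder-decoder layer would wrap this attention block with residual connections, layer normalization, and a position-wise feed-forward network, as the description suggests the repository does.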
