
Fix RoPE inner product equation & add note on the difference in implementation #265


Open

wants to merge 1 commit into master

Conversation

thanhtcptit

Hi, thank you for your work. I noticed an error in the RoPE inner product equation. Additionally, this implementation uses a different feature-pairing strategy for rotating the feature subspaces than the original paper does, which I believe is worth noting to avoid confusion; see the sketches below.
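
Without knowing the exact slip in the text, here is a minimal LaTeX statement of the identity the corrected equation should satisfy: the relative-position property from the RoFormer paper (Su et al., 2021), where \(R_{\Theta,m}\) denotes the block-diagonal rotation matrix at position \(m\):

```latex
% RoPE's defining property: rotating queries and keys by their absolute
% positions m and n leaves an inner product that depends only on the
% offset n - m, because rotation matrices satisfy
% R_{\Theta,m}^\top R_{\Theta,n} = R_{\Theta,n-m}.
\langle R_{\Theta,m}\,\mathbf{q}_m,\; R_{\Theta,n}\,\mathbf{k}_n\rangle
  = \mathbf{q}_m^\top R_{\Theta,m}^\top R_{\Theta,n}\,\mathbf{k}_n
  = \mathbf{q}_m^\top R_{\Theta,n-m}\,\mathbf{k}_n
```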
Ref: https://github.com/pytorch/torchtune/blob/main/torchtune/modules/position_embeddings.py#L117
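
To make the pairing difference concrete, here is a minimal sketch of the two conventions (my own illustration, not this repo's code; the function names are hypothetical). `rope_interleaved` rotates adjacent pairs \((x_{2i}, x_{2i+1})\) as in the original paper, while `rope_half_split` pairs dimension \(i\) with \(i + d/2\), the "rotate_half" style used by several implementations. The two differ only by a fixed permutation of the feature dimensions, so attention scores agree as long as queries and keys use the same convention:

```python
import torch

def rope_interleaved(x: torch.Tensor, pos: int, base: float = 10_000.0) -> torch.Tensor:
    """Paper-style pairing: rotate adjacent pairs (x0, x1), (x2, x3), ..."""
    d = x.shape[-1]
    inv_freq = base ** (-torch.arange(0, d, 2).float() / d)   # (d/2,) frequencies
    cos, sin = torch.cos(pos * inv_freq), torch.sin(pos * inv_freq)
    x1, x2 = x[..., 0::2], x[..., 1::2]                       # adjacent pairs
    out = torch.empty_like(x)
    out[..., 0::2] = x1 * cos - x2 * sin
    out[..., 1::2] = x1 * sin + x2 * cos
    return out

def rope_half_split(x: torch.Tensor, pos: int, base: float = 10_000.0) -> torch.Tensor:
    """'rotate_half'-style pairing: rotate pairs (x_i, x_{i + d/2})."""
    d = x.shape[-1]
    inv_freq = base ** (-torch.arange(0, d, 2).float() / d)
    cos, sin = torch.cos(pos * inv_freq), torch.sin(pos * inv_freq)
    x1, x2 = x[..., : d // 2], x[..., d // 2:]                # front half / back half
    return torch.cat([x1 * cos - x2 * sin, x1 * sin + x2 * cos], dim=-1)

# The two conventions coincide up to a fixed permutation of dimensions:
q = torch.randn(8)
perm = torch.cat([torch.arange(0, 8, 2), torch.arange(1, 8, 2)])  # [0,2,4,6,1,3,5,7]
assert torch.allclose(rope_interleaved(q, pos=3)[perm],
                      rope_half_split(q[perm], pos=3), atol=1e-6)
```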

Cheers,
