-
-
Notifications
You must be signed in to change notification settings - Fork 426
New issue
Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? # to your account
Small paper ideas to be added #262
Comments
@RyanKim17920 so the first paper is already in the repository and even cited i do like the second paper, and can try it out before adding it the third paper, i like as well, but may be outside the scope of this repo |
@RyanKim17920 someone also shared with me https://arxiv.org/abs/2312.07987 which could be an improvement from MoA |
@RyanKim17920 the switchhead paper is pretty good will run the experiments tomorrow morning, and if all goes well, it will probably in the repository by week's end |
@lucidrains What do you think of https://www.arxiv.org/abs/2408.14915, in particular the DRA activation function for Continuous Transformers? |
@lucidrains If you confirm, I can also open a PR for DRA. |
@Baran-phys hey Baran, thanks for sharing your paper. it is interesting but i will probably not accept as it is not relevant for this repository. periodic activation functions is something i've been meaning to look into once the right problem presents |
Here's some papers I've read that would be nice to have, I'll try to implement them if I can:
https://arxiv.org/pdf/2010.04245
https://arxiv.org/abs/2210.05144
(Probably should add FFN MoE as well)
https://arxiv.org/pdf/2404.02258
(Probably will be hard to make work with other features)
The text was updated successfully, but these errors were encountered: