GPT vs BERT: under the same computation and data resources, which one is better for downstream tasks like GLUE? #276

Open
guotong1988 opened this issue Sep 30, 2020 · 1 comment

Comments

@guotong1988

Thank you very much.

@LifeIsStrange

@guotong1988 Generally speaking, XLNet is the best pretrained model, period.
The original implementation in this repository is abandonware, which is sad.
You should use the Hugging Face implementation instead: https://huggingface.co/transformers/model_doc/xlnet.html
