-
Notifications
You must be signed in to change notification settings - Fork 45
Pull requests: quic/efficient-transformers
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Update PyTorch to 2.7.0+cpu, Torchvision to 0.22.0+cpu, and Python Requirement to >=3.9
#524
opened Jul 28, 2025 by
abukhoy
Loading…
Add Support for Frequency Penalties in On Device Sampling
#523
opened Jul 24, 2025 by
quic-sanising
•
Draft
[QEff. Finetune] Updated handling of custom dataset in FT. Updated finetune.md readme file.
#520
opened Jul 21, 2025 by
quic-meetkuma
Loading…
Logger module in Efficient Transformers
1.21.0
wip
Work in progress
#517
opened Jul 11, 2025 by
quic-hemagnih
•
Draft
[Olmo2]: Add Support for Olmo2 CausalLM Model in QEff
1.21.0
enhancement
New feature or request
#509
opened Jul 9, 2025 by
vbaddi
Loading…
Reading mxfp6_matmul for QNN Compilation path from compile API arguments
1.21.0
#499
opened Jul 7, 2025 by
shubhagr-qc
Loading…
[Llama4]: Add support for padding num_patches
1.21.0
enhancement
New feature or request
#486
opened Jul 1, 2025 by
vbaddi
Loading…
Changing the hashing methodology for cache folder creation of models.
1.21.0
#481
opened Jun 24, 2025 by
quic-dhirajku
Loading…
adding Context Length Specialization (CCL)
1.21.0
#466
opened Jun 19, 2025 by
quic-vjanfaza
Loading…
[Tests]: Adding dummy causal models for testing in regular CI run
1.21.0
ready for review
#427
opened May 29, 2025 by
abukhoy
Loading…
feat: Add option to pass QAICInferenceSession to TextGeneration
#356
opened Apr 11, 2025 by
quic-shagun
Loading…
Previous Next
ProTip!
Type g p on any issue or pull request to go back to the pull request listing page.