Skip to content

add bf16_int8 support for invokeLayerLLaMA API #470

New issue

Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? # to your account

Open
wants to merge 1 commit into
base: main
Choose a base branch
from

Conversation

miaojinc
Copy link
Contributor

invokeLayerLLaMA API enhancement:

  1. Add bf16_int8 dtype support
  2. Add kvcache dtype argument
  3. Add Rope type argument

Signed-off-by: Jincheng Miao <jincheng.miao@intel.com>
# for free to join this conversation on GitHub. Already have an account? # to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant