Skip to content

Add static libraries for batch manager #2

New issue

Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? # to your account

Merged
merged 1 commit into from
Sep 21, 2023

Conversation

kaiyux
Copy link
Member

@kaiyux kaiyux commented Sep 21, 2023

No description provided.

@kaiyux kaiyux self-assigned this Sep 21, 2023
@juney-nvidia
Copy link
Collaborator

LGTM, thanks for the quick fix.

@juney-nvidia juney-nvidia merged commit 9b563ba into main Sep 21, 2023
@kaiyux kaiyux deleted the kaiyu/add_static_libraries branch September 21, 2023 03:52
liuyhwangyh pushed a commit to liuyhwangyh/TensorRT-LLM that referenced this pull request Mar 21, 2024
# This is the 1st commit message:

add download models form www.modelscope.cn

# This is the commit message NVIDIA#2:

debug

# This is the commit message NVIDIA#3:

debug
yingcanw added a commit that referenced this pull request Jan 2, 2025
* Fix model name mapping (#2)
nv-guomingz pushed a commit that referenced this pull request Jan 24, 2025
* Add README

* Add unified converter (#1)

* init v3 lite feat

* fix moe topk method

* fix noaux_tc logic

* fix deepseek v3 normal rope

* refactor

* wo conversion ok debugging build

* add quantize for attn.dense

* add unified converter support

* testing unified converter

* add convert checkpoint and update docs

---------

Co-authored-by: Zeyu Wang <zeyuw@nvidia.com>

* update README

* add FP8 notes

* Update run.py result

* Update V3 README

* Update usages of FP8 to BF16 instruction

* fix model name mapping (#2)

* Update HF ckpt BF16 conversion.

* fix config of deepseek kv cache

* Remove source code

* Deepseek V3 FP8 Support

---------

Co-authored-by: jershi425 <83951930+jershi425@users.noreply.github.com>
Co-authored-by: Zeyu Wang <zeyuw@nvidia.com>
Co-authored-by: Hanyue He <hanyueh@nvidia.com>
Co-authored-by: root <root@h20-2.cm.cluster>
# for free to join this conversation on GitHub. Already have an account? # to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants