This repository was archived by the owner on Oct 25, 2024. It is now read-only.

[NeuralChat] Support Assisted Generation on Multi-nodes #1283

Draft: wants to merge 6 commits into base: main
letonghan (Contributor) commented Feb 19, 2024

Type of Change

feature
API added:

  • /v1/assist/chat
  • /v1/assist/decode
  • /v1/assist/data_transfer
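
As a rough illustration of how a client might address these routes on several nodes, the sketch below only builds the full URLs. The route paths are the ones listed above; the node addresses and the helper name `assist_urls` are hypothetical, and the request and response formats are not described in this PR, so none are shown.

```python
from urllib.parse import urljoin

# Route paths exactly as listed in this PR description.
ASSIST_ROUTES = {
    "chat": "/v1/assist/chat",
    "decode": "/v1/assist/decode",
    "data_transfer": "/v1/assist/data_transfer",
}


def assist_urls(base_url: str) -> dict:
    """Return the full URL of each assist route for one node (hypothetical helper)."""
    return {name: urljoin(base_url, path) for name, path in ASSIST_ROUTES.items()}


# Hypothetical node addresses, for illustration only.
nodes = ["http://node-a:8000", "http://node-b:8000"]
for node in nodes:
    print(assist_urls(node))
```

Keeping the paths in one mapping means that a later change to a route name only needs to be made in one place.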

Description

Adds support for assisted generation across multiple nodes.
This PR implements the code framework only; the remaining details will be completed by Wangyi's team.
JIRA: https://jira.devtools.intel.com/browse/NLPTOOLKIU-1126

Expected Behavior & Potential Risk

The assisted generation RESTful API will be able to run across multiple nodes.

How has this PR been tested?

Tested locally. The PR is still a draft.

Dependency Change?

None.

letonghan and others added 6 commits February 19, 2024 10:48
Signed-off-by: LetongHan <letong.han@intel.com>