Skip to content

Support for remote inference #302

New issue

Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? # to your account

Open
wants to merge 1 commit into
base: main
Choose a base branch
from

Conversation

pawelknes
Copy link

This is a proposal to add possible support for remote inference other than only using HF.

The basis for this would be RemoteInferenceInterface which implementation would differ between platforms. The function remote_inference is responsible for preparing and performing inference depending on the platform specified in args.inference_platform (default is hf, which is the old approach to ensure compatibility).

This also includes implementation for ibm-watsonx-ai (WxInference), which can be used by setting required env variables.

Changes were tested for both watsonx.ai and the default approach using HF.

# for free to join this conversation on GitHub. Already have an account? # to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant