An OpenAI-compatible API proxy that filters models.
This is a simple Python server that solves one very specific problem.
Sometimes you might use an OpenAI-like API provider that serves dozens, if not hundreds, of different models. When you point your frontend at that provider, you're forced to wade through all those models, even if you only use a handful!
This proxy server will filter the models returned from the `/v1/models` endpoint, either through exact matching (i.e. an explicit allow list) or regexp matching.
All other endpoints are proxied to the real server.
An alternative is to use something like LiteLLM to map "virtual models" (for lack of a better term) to "real" models located at potentially different providers. This is the solution I use today, but I... am a lazy person.
- Regexp and exact-match filtering of the `/v1/models` endpoint
- Defaults to case-insensitive matching for both regexp and exact-match filters
- Basic support for prefixing the IDs of the returned models. Sorry, only supports 0 or 1 prefixes, no routing yet!
- `/v1/chat/completions` and `/v1/completions` endpoints are proxied, supporting streaming and non-streaming requests
- `/v1/embeddings` is proxied
- If present, the `Authorization` header (which contains your API key) is simply passed through to the proxied service
- Simple TTL caching of the filtered `/v1/models` output
- Dockerfile
- Robustness. Right now, failing authorization leaves the client in the dark with a generic 5xx error. It would be better if the proxy's endpoints simply mirrored the HTTP status code/message of the underlying service.
- If there's an API endpoint you want to add, please open a PR or issue. It should be easy, since we proxy everything but `/v1/models`.
- Wouldn't it be cool if it could proxy multiple backends? Assign a model prefix to each backend, and maybe give each backend its own set of filters. The proxy would route based on the prefix of the received `model` parameter. Querying `/v1/models` would query all backends and return the union of all models (prefixes added) that passed their respective filters. Of course, this might be challenging with the `Authorization` header pass-through: how do we ensure the wrong header is never sent to a service? A rough sketch of the routing idea follows below.
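As a purely hypothetical sketch of that idea (the `Backend` class, the `BACKENDS` table, and the per-backend `api_key` field are all invented for illustration; nothing like this exists in the proxy yet):

```python
# Hypothetical sketch of prefix-based routing across multiple backends.
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass
class Backend:
    base_url: str
    # Configuring a key per backend (instead of passing the client's
    # Authorization header through) is one way to guarantee the wrong
    # header is never sent to a service.
    api_key: Optional[str] = None


BACKENDS = {
    "openai": Backend("https://api.openai.com/v1", api_key="YOUR_OPENAI_KEY"),
    "local": Backend("http://localhost:11434/v1"),
}


def route(model: str) -> Tuple[Backend, str]:
    """Split a prefixed model ID into (backend, unprefixed model name)."""
    prefix, sep, real_model = model.partition("/")
    if not sep or prefix not in BACKENDS:
        raise ValueError(f"unroutable model {model!r}: unknown or missing prefix")
    return BACKENDS[prefix], real_model


# e.g. route("local/llama3") returns the "local" backend and "llama3"
```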
```
pip install -r requirements.txt
```

or if you prefer unpinned dependencies:

```
pip install -r requirements.in
```
You can set the following environment variables in lieu of CLI arguments:

- `LLM_MF_CONFIG`: Config file. Defaults to `config.yaml` in the current directory.
- `LLM_MF_HOST`: Host address to bind to. Defaults to 127.0.0.1.
- `LLM_MF_PORT`: Port to listen on. Defaults to 8080.
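For example, to run with a non-default config path and port (both values here are illustrative):

```
LLM_MF_CONFIG=/etc/llm-mf/config.yaml LLM_MF_PORT=9000 python model_filter.py
```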
CLI arguments (all optional) override the environment variables:

```
python model_filter.py --config <path/to/config> --host <host> --port <port>
```
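The config file itself is YAML. As a rough sketch, a minimal configuration without prefixing might look like the following; `model_filter`, `base_url`, and `simple` are taken from the prefixing example below, while the `regex` key name for the regexp filters is an assumption:

```yaml
model_filter:
  base_url: http://localhost:8080/v1
  # exact-match allow list
  simple:
    - "some_model"
  # regexp filters (key name is an assumption)
  regex:
    - "^llama-"
```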
You can optionally prepend a prefix to all returned model IDs. Put the usual configuration under an additional YAML mapping key, like so:
```yaml
model_filter:
  my_arbitrary_prefix:
    base_url: http://localhost:8080/v1
    simple:
      - "some_model"
```
All matching models will then be prefixed with `my_arbitrary_prefix/`, e.g. `my_arbitrary_prefix/some_model`.
The proxied endpoints `/v1/chat/completions`, `/v1/completions`, and `/v1/embeddings` expect the received `model` parameter to be prefixed in the same manner. (It won't be the end of the world if it isn't, but bear in mind that routing decisions may be based on the prefix in the future, and `model` params that lack a prefix will be unroutable...)
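For example, a chat completion request through the proxy might look like this (host and port are illustrative, and the `Authorization` header is only needed if the backend requires one):

```
curl http://127.0.0.1:9000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $API_KEY" \
  -d '{"model": "my_arbitrary_prefix/some_model", "messages": [{"role": "user", "content": "Hello!"}]}'
```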
Licensed under Apache 2.0.