Skip to content
New issue

Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? # to your account

Error: "POST /generate HTTP/1.1" 404 Not Found when running Locust tool against vLLM model server #777

Open
Edwinhr716 opened this issue Aug 14, 2024 · 0 comments

Comments

@Edwinhr716
Copy link
Member

Whenever running the locust benchmarking tool against a vLLM model server, I get this error

POST /generate HTTP/1.1" 404 Not Found

After some investigation, it looks like it was caused by #694, which switched the entrypoint from vllm.entrypoints.api_server to vllm.entrypoints.openai.api_server. the openAI entrypoint does not have a /generate endpoint, which breaks the locust benchmarking tool.

Looks like it was called out here #694 (comment) but never addressed

# for free to join this conversation on GitHub. Already have an account? # to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant