v0.1.3
This release encompasses several enhancements to usability and code refactoring. The primary changes include:
- Skip compilation: You can skip compilation of llama.cpp shared library when running server with
--install-pkgs
. Just add--skip-compile
option. - Removed auto process kill feature: Killing process when unloading model, was introduced to prevent the program from memory leak, but this sometimes make the program exit for no reason. So this feature is removed.
- API key checker: API key checker will be activated if you start the server with option
--api-key YOUR_API_KEY
. Client must includeAuthorization
header withBearer YOUR_API_KEY
.