Cache predictions #154

Benjoyo · 2024-04-23T19:48:23Z

To improve latency and reduce costs, LLM completions, OCR results, and other predictions should be cached (disk-backed).
The cache should take into account all parameters that should invalidate it (e.g. a different model or a changed prompt) or disable it (e.g. temperature > 0).
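
A minimal sketch of how such a cache could behave, not the implementation that shipped in 1.3.0. The class `PredictionCache`, its method names, and the one-JSON-file-per-entry layout are all hypothetical. The key is a hash over model, prompt, and parameters, so changing any of them produces a cache miss; calls with temperature > 0 bypass the cache entirely. A missing `temperature` is treated as 0 here, which is an assumption and may not match a given provider's default.

```python
import hashlib
import json
from pathlib import Path
from typing import Any, Callable


class PredictionCache:
    """Disk-backed cache for deterministic predictions (hypothetical helper)."""

    def __init__(self, directory: str) -> None:
        self.directory = Path(directory)
        self.directory.mkdir(parents=True, exist_ok=True)

    @staticmethod
    def make_key(model: str, prompt: str, params: dict) -> str:
        # Hash everything that influences the output. A different model,
        # prompt, or parameter set yields a different key, which acts as
        # invalidation without deleting old entries.
        payload = json.dumps(
            {"model": model, "prompt": prompt, "params": params},
            sort_keys=True,
        )
        return hashlib.sha256(payload.encode("utf-8")).hexdigest()

    def get_or_compute(
        self,
        model: str,
        prompt: str,
        params: dict,
        compute: Callable[[], Any],
    ) -> Any:
        # Sampling with temperature > 0 is non-deterministic, so the cache
        # is disabled: nothing is read and nothing is written.
        # Assumption: a missing temperature counts as 0.
        if params.get("temperature", 0) > 0:
            return compute()

        path = self.directory / f"{self.make_key(model, prompt, params)}.json"
        if path.exists():
            return json.loads(path.read_text(encoding="utf-8"))["result"]

        # Cache miss: compute, then persist. The result must be JSON-serializable.
        result = compute()
        path.write_text(json.dumps({"result": result}), encoding="utf-8")
        return result
```

Usage would look like `cache.get_or_compute("some-model", prompt, {"temperature": 0}, lambda: call_llm(prompt))`, where `call_llm` stands in for whatever function actually produces the prediction.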

Benjoyo added the "Type: enhancement" label Apr 23, 2024

Benjoyo self-assigned this Apr 23, 2024

Benjoyo added this to the 1.3.0 milestone Apr 23, 2024

Benjoyo added a commit that referenced this issue Apr 29, 2024

1.3.0: closes #80, #154, #155

cb907f1

Benjoyo closed this as completed Apr 29, 2024
