Skip to content
New issue

Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? # to your account

Cache predictions #154

Closed
Benjoyo opened this issue Apr 23, 2024 · 0 comments
Closed

Cache predictions #154

Benjoyo opened this issue Apr 23, 2024 · 0 comments
Assignees
Labels
Type: enhancement New feature or request
Milestone

Comments

@Benjoyo
Copy link
Member

Benjoyo commented Apr 23, 2024

To improve latency and costs, LLM completions, OCR results and other predictions should be cached (disk backed).
Taking into account all kinds of parameters that should invalidate (different model or changed prompt) or disable (e.g. temperature > 0) a cache.

@Benjoyo Benjoyo added the Type: enhancement New feature or request label Apr 23, 2024
@Benjoyo Benjoyo self-assigned this Apr 23, 2024
@Benjoyo Benjoyo added this to the 1.3.0 milestone Apr 23, 2024
Benjoyo added a commit that referenced this issue Apr 29, 2024
@Benjoyo Benjoyo closed this as completed Apr 29, 2024
# for free to join this conversation on GitHub. Already have an account? # to comment
Labels
Type: enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant