Roadmap & Contribution Guide
Zach Nagengast edited this page Jan 30, 2024 · 2 revisions
WhisperKit is looking forward to your contributions! We purposefully released it before it was feature-complete so that we could build this project with the open-source community and stress-test how extensible WhisperKit really is.
Concretely, we left TODO stubs for Whisper functionality that exists in the OpenAI reference Python implementation but not yet in WhisperKit. Here is the current list of features needed to reach parity:
- Suppress Tokens
- No speech thresholding
- Word-level timestamps
- Beam Search
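
For contributors new to these features, the first two items can be illustrated with a minimal Python sketch. It is not WhisperKit code and does not mirror any WhisperKit API; the function names are hypothetical. The default thresholds (0.6 and -1.0) are assumed to match the defaults of the OpenAI reference implementation.

```python
import math

def suppress_tokens(logits, suppressed_ids):
    """Return a copy of `logits` with the given token ids masked to -inf,
    so they can never be selected during sampling or argmax."""
    masked = list(logits)
    for token_id in suppressed_ids:
        masked[token_id] = -math.inf
    return masked

def should_skip_segment(no_speech_prob, avg_logprob,
                        no_speech_threshold=0.6, logprob_threshold=-1.0):
    """Assumed no-speech rule: treat a segment as silence when the
    no-speech probability is high, unless the decoded text is confident
    enough (average log-probability above `logprob_threshold`)."""
    skip = no_speech_prob > no_speech_threshold
    if avg_logprob > logprob_threshold:
        skip = False
    return skip

def greedy_pick(logits):
    """Index of the largest logit (greedy decoding step)."""
    return max(range(len(logits)), key=lambda i: logits[i])

# Token 2 has the highest logit, but suppressing it makes token 0 win.
logits = [1.5, 0.2, 3.0, -0.7]
best_raw = greedy_pick(logits)
best_masked = greedy_pick(suppress_tokens(logits, [2]))
```

Masking with negative infinity rather than removing entries keeps token indices aligned with the vocabulary, which is why the sketch copies and overwrites the list.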
On the other hand, we are actively working on the following features ourselves. They are harder to contribute because supporting them efficiently requires improvements to the models themselves:
- Multi-token `TextDecoder` forward pass for `<|startofprev|>`-based prompting (the current `TextDecoder` can only decode autoregressively)
- Speculative Decoding
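
To show why speculative decoding depends on a multi-token forward pass, here is a minimal greedy sketch in Python. It is an illustration under stated assumptions, not WhisperKit's design: `target_next` and `draft_next` are hypothetical callables that return the next token for a context, and the per-token verification loop stands in for what a real model would do in a single multi-token pass.

```python
def speculative_greedy_decode(target_next, draft_next, prompt,
                              max_new_tokens, k=4):
    """Greedy speculative decoding sketch.

    A cheap draft model proposes up to `k` tokens; the target model
    checks them and keeps the longest prefix it agrees with, replacing
    the first mismatch with its own token. The result is identical to
    plain greedy decoding with the target model.
    """
    tokens = list(prompt)
    produced = 0
    while produced < max_new_tokens:
        # 1. Draft proposes tokens one at a time (cheap model).
        proposal = []
        ctx = list(tokens)
        for _ in range(min(k, max_new_tokens - produced)):
            t = draft_next(ctx)
            proposal.append(t)
            ctx.append(t)

        # 2. Target verifies the proposal. A real implementation would
        #    score all proposed positions in one multi-token forward pass.
        accepted = []
        ctx = list(tokens)
        for t in proposal:
            expected = target_next(ctx)
            if expected != t:
                accepted.append(expected)  # correct the first mismatch
                break
            accepted.append(t)
            ctx.append(t)

        tokens.extend(accepted)
        produced += len(accepted)
    return tokens

# Toy models over digits: the target counts upward modulo 10;
# the draft agrees except after a 3, where it wrongly predicts 0.
def toy_target(ctx):
    return (ctx[-1] + 1) % 10

def toy_draft(ctx):
    return 0 if ctx[-1] == 3 else (ctx[-1] + 1) % 10

result = speculative_greedy_decode(toy_target, toy_draft, [0], 6, k=4)
```

Each round yields at least one token, so the loop always terminates. The speedup in practice comes from step 2 costing roughly one target pass regardless of how many draft tokens are accepted.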