Releases: k2-fsa/sherpa-onnx
v1.9.13
What's Changed
- add WebAssembly for Kws by @lovemefan in #648
- small fixes to wasm kws. by @csukuangfj in #672
- Fix argument for beam search Decoding by @bhaswa in #673
- Add HotwordsFile and HotwordsScore fields to OnlineRecognizerConfig in C# API by @foreversimon in #675
- Add tts play example for .Net. by @csukuangfj in #676
- support whisper language by @wxbool in #678
- Support whisper language/task in various language bindings. by @csukuangfj in #679
- Add nuget package for Windows x86 by @csukuangfj in #683
- Update MainActivity.kt by @mablue in #687
- Refactor github actions tests by @csukuangfj in #688
- Add hotwords support to .Net by @csukuangfj in #689
- Configurable low_freq, high_freq, dithering by @KarelVesely84 in #664
- Fix building c api examples with alsa by @csukuangfj in #691
- Add timestamps and tokens for .Net's online models. by @csukuangfj in #690
- Update MainActivity.kt by @mablue in #693
- Use manylinux in CI test by @csukuangfj in #692
New Contributors
- @lovemefan made their first contribution in #648
- @bhaswa made their first contribution in #673
- @foreversimon made their first contribution in #675
- @wxbool made their first contribution in #678
- @mablue made their first contribution in #687
Full Changelog: v1.9.12...v1.9.13
v1.9.12
What's Changed
- Add Python API for keyword spotting by @pkufool in #576
- Fix building wheels by @csukuangfj in #620
- Add WebAssembly demo for streaming trilingual Paraformer (Chinese+Cantonese+English) by @csukuangfj in #618
- Support Android NNAPI. by @csukuangfj in #622
- Fix spelling of variable num_trailing_blanks by @lonngxiang in #623
- Support VITS models from icefall. by @csukuangfj in #625
- support onnxruntime v1.17.1 by @csukuangfj in #624
- Refactor WebAssembly for nodejs by @csukuangfj in #626
- Add WebAssembly for NodeJS. by @csukuangfj in #628
- Fix building wheels from source. by @csukuangfj in #632
- Support Ubuntu 18.04 and CentOS 7 by @csukuangfj in #635
- Allow not using pre-installed onnxruntime libs. by @csukuangfj in #636
- Support using T-head-Semi/csi-nn2 for RISC-V by @csukuangfj in #637
- Fix deletions at the tail when using hotwords in streaming model by @pkufool in #638
- Add Python ASR examples with alsa by @csukuangfj in #646
- Add microphone streaming ASR example for C API by @csukuangfj in #650
- Expose 'language', 'task', 'tailPaddings' in OfflineWhisperModelConfig by @GaryLaurenceauAva in #643
- Support user provided data in tts callback. by @csukuangfj in #653
- Use onnxruntime 1.17.1 for iOS. by @csukuangfj in #654
- C++ API for keyword spotter by @xinhecuican in #642
- Print the time about the first message in tts. by @csukuangfj in #655
- Release v1.9.12 by @csukuangfj in #661
New Contributors
- @lonngxiang made their first contribution in #623
- @GaryLaurenceauAva made their first contribution in #643
- @xinhecuican made their first contribution in #642
Full Changelog: v1.9.11...v1.9.12
Release v1.9.11
What's Changed
- Use curl to replace wget for Windows. by @csukuangfj in #558
- Update sherpa-onnx /scripts/whisper/test.py by @20246688 in #560
- C++ API demo for speaker identification with portaudio. by @csukuangfj in #561
- Add fine-tuned whisper model on aishell by @csukuangfj in #565
- Use sequential layout for OfflineTtsConfig in C# by @csukuangfj in #567
- Add context biasing for mobile by @ductranminh in #568
- Update README by @csukuangfj in #572
- Support WebAssembly by @csukuangfj in #577
- Update onnxruntime from 1.16.3 to 1.17.0 by @csukuangfj in #581
- Support building debug version on Windows by @csukuangfj in #583
- updated the icon on TTS engine for android by @paolo-caroni in #579
- Download android onnxruntime libs from github. by @csukuangfj in #584
- Use onnxruntime static lib compiled with gcc8 on ubuntu 20.04 by @csukuangfj in #587
- Increase the right chunk size of streaming paraformer to 3 by @csukuangfj in #588
- Support MinSizeRel and RelWithDebInfo build on Windows. by @csukuangfj in #586
- Add more Chinese TTS models (Mandarin and Cantonese) by @csukuangfj in #589
- Fix CI warnings by @csukuangfj in #590
- Add missing "start_time" to Python API by @vsd-vector in #591
- Download ios-onnxruntime from github instead of huggingface. by @csukuangfj in #593
- Link the math lib. by @csukuangfj in #592
- Fix CI tests. by @csukuangfj in #596
- Refactor WebAssembly build script. by @csukuangfj in #598
- Publish wasm tts to model scope. by @csukuangfj in #599
- Install wasm tts in a separate directory by @csukuangfj in #600
- Add WebAssembly for ASR by @csukuangfj in #604
- Use alsa to read microphone in speaker identification demo. by @csukuangfj in #605
- Support RISC-V by @csukuangfj in #609
- Fix #608 by @csukuangfj in #610
- Support using alsa to access the microphone with non-streaming ASR models by @csukuangfj in #517
- support using xnnpack as execution provider by @csukuangfj in #612
- Use hub.nuaa.cf to replace huggingface URL to download dependencies. by @csukuangfj in #614
- Track token scores by @KarelVesely84 in #571
- Fix computing features for whisper. by @csukuangfj in #617
New Contributors
- @20246688 made their first contribution in #560
- @ductranminh made their first contribution in #568
- @paolo-caroni made their first contribution in #579
- @vsd-vector made their first contribution in #591
Full Changelog: v1.9.10...v1.9.11
Release v1.9.10
What's Changed
- Fix CI tests for Python and JNI. by @csukuangfj in #554
- Add a new Persian tts model by @csukuangfj in #555
- Add TTS demo for C# API by @csukuangfj in #557
Full Changelog: v1.9.9...v1.9.10
Release v1.9.9
What's Changed
- Fix kws ci by @pkufool in #540
- Fix cmake variables to point to the project root directory. by @csukuangfj in #545
- add blank_penalty for offline transducer by @chiiyeh in #542
- add hotwords docstring to offline_recognizer and online_recognizer by @chiiyeh in #546
- add blank_penalty for online transducer by @chiiyeh in #548
- Fixes issue #535, fix hexa 1-char tokens in ASR output. by @vesis84 in #550
- Ensure input for speaker ID is a valid number. by @csukuangfj in #552
- Run TTS engine service without starting the app. by @csukuangfj in #553
Full Changelog: v1.9.8...v1.9.9
Release v1.9.8
What's Changed
- Add missing field for two-pass APK. by @csukuangfj in #511
- Fix Byte BPE string results for Python. by @csukuangfj in #512
- Fix #510 by @csukuangfj in #513
- Use high_freq -400 in computing fbank features. by @csukuangfj in #515
- Use NDK 22.1 for android build by @csukuangfj in #518
- Add runtime support for wespeaker models by @csukuangfj in #516
- Support exporting models to onnx from 3D-Speaker by @csukuangfj in #522
- Fix publishing nuget packages. by @csukuangfj in #525
- Add C++ runtime for models from 3d-speaker by @csukuangfj in #523
- Export speaker verification models from NeMo to ONNX by @csukuangfj in #526
- Add C++ runtime for speaker verification models from NeMo by @csukuangfj in #527
- Android TTS APKs for Persian by @csukuangfj in #529
- Fix setting speaker ID for Android TTS Engine. by @csukuangfj in #530
- Add a Persian and a Slovenian model from Piper for Android TTS. by @csukuangfj in #531
- Add Python API examples for speaker recognition with VAD and ASR. by @csukuangfj in #532
- Refactor the UI of Android TTS engine by @csukuangfj in #533
- decoder for open vocabulary keyword spotting by @pkufool in #505
- Change model url from modelscope to github by @pkufool in #538
- Add Android demo for speaker recognition by @csukuangfj in #536
Full Changelog: v1.9.7...v1.9.8
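A note on the `high_freq -400` entry (#515): in Kaldi-style fbank configuration, a `high_freq` value that is zero or negative is conventionally read as an offset from the Nyquist frequency rather than as an absolute cutoff. The sketch below illustrates that convention only; it assumes the change follows the Kaldi convention, and `effective_high_freq` is a hypothetical helper, not a function from sherpa-onnx.

```python
def effective_high_freq(sample_rate_hz: float, high_freq: float) -> float:
    """Resolve a Kaldi-style high_freq setting to an absolute cutoff in Hz.

    Hypothetical helper for illustration; not part of sherpa-onnx.
    Values <= 0 are treated as an offset from the Nyquist frequency.
    """
    nyquist = sample_rate_hz / 2.0
    return nyquist + high_freq if high_freq <= 0 else high_freq


# For 16 kHz audio, -400 resolves to 400 Hz below the 8000 Hz Nyquist limit.
print(effective_high_freq(16000, -400))  # 7600.0
```

Under this reading, the same setting adapts to the sample rate: with 8 kHz audio, `-400` would resolve to 3600 Hz.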
kws-models
Refactor the UI of Android TTS engine (#533)
Release v1.9.7
What's Changed
- Replace Android system TTS engine by @csukuangfj in #508
- Build text-to-speech engine APKs by @csukuangfj in #509
Full Changelog: v1.9.5...v1.9.7
Release v1.9.5
What's Changed
- Fix building wheels for Linux. by @csukuangfj in #484
- Fix CI by @csukuangfj in #485
- Print informative error messages for sherpa-onnx-alsa on errors. by @csukuangfj in #486
- Keep multiple threads from calling into espeak-ng at the same time by @csukuangfj in #489
- Fix whisper test script for the latest onnxruntime. by @csukuangfj in #494
- Release Python GIL in C++ class constructor by @csukuangfj in #493
- Support streaming zipformer CTC by @csukuangfj in #496
Full Changelog: v1.9.4...v1.9.5
Release v1.9.4
What's Changed
- Give an informative log for whisper on exceptions. by @csukuangfj in #473
- convert wespeaker models to sherpa-onnx by @csukuangfj in #475
- Fix releasing go packages by @csukuangfj in #476
- Support playing audio while it is being generated for Android by @csukuangfj in #477
- Fix android tts audio buffer size and fix CI. by @csukuangfj in #478
- Add two GLaDOS TTS models by @csukuangfj in #481
- Play generated audio using alsa for TTS by @csukuangfj in #482
Full Changelog: v1.9.1...v1.9.4