Releases: k2-fsa/sherpa-onnx

Release v1.9.1

08 Dec 04:12

csukuangfj

What's Changed

Remove the 30-second constraint from whisper. by @csukuangfj in #471
Support distil-small.en whisper by @csukuangfj in #472

Full Changelog: v1.9.0...v1.9.1

Contributors

csukuangfj

Speaker recognition models

08 Dec 10:06

speaker-recongition-models

This release contains speaker recognition models for sherpa-onnx.

Each model has its own license. Please see the corresponding repository for the specific license of a given model.

Release v1.9.0

06 Dec 11:48

csukuangfj

What's Changed

Build building for iOS by @csukuangfj in #430
Judge before UseCachedDecoderOut by @HieDean in #431
Build MFC examples for Windows x86 (Win32) by @csukuangfj in #434
Replace Clone() with View() by @HieDean in #432
Refactor CI scripts about building wheels by @csukuangfj in #436
support nodejs by @csukuangfj in #438
Add Swift API for TTS by @csukuangfj in #439
Text-to-speech for iOS by @csukuangfj in #443
Lock before push_back the deque for thread safety by @HieDean in #445
Update to onnxruntime 1.16.3 by @csukuangfj in #446
Fix reading tokens.txt on Windows by @csukuangfj in #448
Fix nodejs on Windows by @csukuangfj in #450
Release GIL to support multithreading in Python websocket servers. by @csukuangfj in #451
Support piper-phonemize by @csukuangfj in #452
Use piper-phonemize to convert text to token IDs by @csukuangfj in #453
Fix CI by @csukuangfj in #456
Play generated audio as it is generating. by @csukuangfj in #457
Break text into sentences for tts. by @csukuangfj in #460
Support playing generated audio as it is generating for MFC. by @csukuangfj in #462
Fix building for .Net by @csukuangfj in #463
Use espeak-ng for coqui-ai/TTS VITS English models. by @csukuangfj in #466
Support Ukrainian VITS models from coqui-ai/TTS by @csukuangfj in #469
Release v1.9.0 by @csukuangfj in #470

New Contributors

@HieDean made their first contribution in #431

Full Changelog: v1.8.10...v1.9.0

Contributors

csukuangfj and HieDean

tts-models

21 Nov 07:43

This release contains pre-trained TTS models.

Please refer to
https://k2-fsa.github.io/sherpa/onnx/tts/pretrained_models/index.html
for more models.

Pre-built Android APKs are available at https://k2-fsa.github.io/sherpa/onnx/tts/apk.html

You can try all of the models in the following Hugging Face Space:
https://huggingface.co/spaces/k2-fsa/text-to-speech

asr-models

21 Nov 09:42

csukuangfj

This release contains pre-trained ASR models.

Please refer to
https://k2-fsa.github.io/sherpa/onnx/pretrained_models/index.html
for more models.

Release v1.8.10

16 Nov 06:40

csukuangfj

What's Changed

Fix punctuations in tts for Chinese by @csukuangfj in #417
Build Android APKs for VITS models from Coqui-ai/TTS by @csukuangfj in #419
Add a C++ example to show streaming VAD + non-streaming ASR. by @csukuangfj in #420
Update onnxruntime from v1.16.1 to v1.16.2 by @csukuangfj in #421
Resize circular buffer on overflow by @csukuangfj in #422
Add scripts to export ASR models from wenet to ONNX by @csukuangfj in #425
Support non-streaming WeNet CTC models. by @csukuangfj in #426
Support streaming conformer CTC models from wenet by @csukuangfj in #427
Add Python APIs for WeNet CTC models by @csukuangfj in #428

Full Changelog: v1.8.9...v1.8.10

TTS APKs

Please see
https://k2-fsa.github.io/sherpa/onnx/tts/apk.html

Contributors

csukuangfj

v1.8.9

10 Nov 08:59

What's Changed

Support VITS TTS models from coqui-ai/TTS by @csukuangfj in #416

Full Changelog: v1.8.8...v1.8.9

TTS APKs

Please see
https://k2-fsa.github.io/sherpa/onnx/tts/apk.html

Contributors

csukuangfj

Release v1.8.8

07 Nov 08:22

csukuangfj

What's Changed

Add C# TTS API by @LKZMuZiLi in #399
Upload TTS APKs to huggingface by @csukuangfj in #400
Support static linking onnxruntime lib for 32-bit arm by @csukuangfj in #401
Support static linking onnxruntime for 64-bit ARM by @csukuangfj in #402
Support linking onnxruntime statically for macOS by @csukuangfj in #403
Use a single static lib file for onnxruntime on Windows by @csukuangfj in #404
Update to onnxruntime v1.16.1 by @csukuangfj in #406
Support text normalization via rule FST by @csukuangfj in #407
Catch exception from whisper by @csukuangfj in #408
Support Chinese polyphones in TTS by @csukuangfj in #409
support reading rule FST for Android TTS by @csukuangfj in #410
Support distil-whisper by @csukuangfj in #411
add --tts-rule-fsts argument at offline-tts.py by @longshiming in #413
Release v1.8.8 by @csukuangfj in #414

New Contributors

@LKZMuZiLi made their first contribution in #399

Full Changelog: v1.8.7...v1.8.8

Contributors

longshiming, csukuangfj, and LKZMuZiLi

Release v1.8.7

28 Oct 14:23

csukuangfj

What's Changed

Support German TTS by @csukuangfj in #394
Support German umlauts in splitting UTF8 strings. by @csukuangfj in #395
Support Spanish in TTS by @csukuangfj in #396
Support French in TTS by @csukuangfj in #397

Full Changelog: v1.8.6...v1.8.7

Contributors

csukuangfj

Release v1.8.6

26 Oct 06:54

csukuangfj

What's Changed

Fix utf8 splitting for English by @csukuangfj in #386
include cstdint (debian, gcc-13.2) by @rouseabout in #388
Fix splitting words containing ', e.g., I've by @csukuangfj in #389
Support vits models from piper by @csukuangfj in #390
Release v1.8.6 by @csukuangfj in #391

New Contributors

@rouseabout made their first contribution in #388

Full Changelog: v1.8.5...v1.8.6

Contributors

csukuangfj and rouseabout
