onnx-community

500 models • 11 total models in database

Sort by:

Kokoro-82M-v1.0-ONNX

Kokoro is a frontier TTS model for its size of 82 million parameters (text in/audio out). - Usage - JavaScript - Python - Voices/Samples - Quantizations First, install the `kokoro-js` library from NPM using: ```python import os import numpy as np from onnxruntime import InferenceSession You can generate token ids as follows: 1. Convert input text to phonemes using https://github.com/hexgrad/misaki 2. Map phonemes to ids using https://huggingface.co/hexgrad/Kokoro-82M/blob/785407d1adfa7ae8fbef8ffd85f34ca127da3039/config.json#L34-L148 tokens = [50, 157, 43, 135, 16, 53, 135, 46, 16, 43, 102, 16, 56, 156, 57, 135, 6, 16, 102, 62, 61, 16, 70, 56, 16, 138, 56, 156, 72, 56, 61, 85, 123, 83, 44, 83, 54, 16, 53, 65, 156, 86, 61, 62, 131, 83, 56, 4, 16, 54, 156, 43, 102, 53, 16, 156, 72, 61, 53, 102, 112, 16, 70, 56, 16, 138, 56, 44, 156, 76, 158, 123, 56, 16, 62, 131, 156, 43, 102, 54, 46, 16, 102, 48, 16, 81, 47, 102, 54, 16, 54, 156, 51, 158, 46, 16, 70, 16, 92, 156, 135, 46, 16, 54, 156, 43, 102, 48, 4, 16, 81, 47, 102, 16, 50, 156, 72, 64, 83, 56, 62, 16, 156, 51, 158, 64, 83, 56, 16, 44, 157, 102, 56, 16, 44, 156, 76, 158, 123, 56, 4] Context length is 512, but leave room for the pad token 0 at the start & end assert len(tokens) Life is like a box of chocolates. You never know what you're gonna get. | Name | Nationality | Gender | Sample | | ------------ | ----------- | ------ | --------------------------------------------------------------------------------------------------------------------------------------- | | afheart | American | Female | | | afalloy | American | Female | | | afaoede | American | Female | | | afbella | American | Female | | | afjessica | American | Female | | | afkore | American | Female | | | afnicole | American | Female | | | afnova | American | Female | | | afriver | American | Female | | | afsarah | American | Female | | | afsky | American | Female | | | amadam | American | Male | | | amecho | American | Male | | | americ | American | Male | | | amfenrir | American | Male | | | amliam | American | Male | | | ammichael | American | Male | | | amonyx | American | Male | | | ampuck | American | Male | | | amsanta | American | Male | | | bfalice | British | Female | | | bfemma | British | Female | | | bfisabella | British | Female | | | bflily | British | Female | | | bmdaniel | British | Male | | | bmfable | British | Male | | | bmgeorge | British | Male | | | bmlewis | British | Male | | The model is resilient to quantization, enabling efficient high-quality speech synthesis at a fraction of the original model size. > How could I know? It's an unanswerable question. Like asking an unborn child if they'll lead a good life. They haven't even been born. | Model | Size (MB) | Sample | |------------------------------------------------|-----------|-----------------------------------------------------------------------------------------------------------------------------------------| | model.onnx (fp32) | 326 | | | modelfp16.onnx (fp16) | 163 | | | modelquantized.onnx (8-bit) | 92.4 | | | modelq8f16.onnx (Mixed precision) | 86 | | | modeluint8.onnx (8-bit & mixed precision) | 177 | | | modeluint8f16.onnx (Mixed precision) | 114 | | | modelq4.onnx (4-bit matmul) | 305 | | | modelq4f16.onnx (4-bit matmul & fp16 weights) | 154 | |

onnx-community

Kokoro-82M-v1.0-ONNX

gemma-4-E2B-it-ONNX

Medical-NER-ONNX

xlm-roberta-base-squad2-distilled-ONNX

t5-base-grammar-correction-ONNX

whisper-base

Kokoro-82M-ONNX

embeddinggemma-300m-ONNX

moonshine-base-ONNX

whisper-large-v3-turbo

Qwen3-Embedding-0.6B-ONNX

granite-docling-258M-ONNX

nanochat-d32-ONNX

depth-anything-v2-small

whisper-small

gpt-oss-20b-ONNX

granite-4.0-1b-speech-ONNX

ormbg-ONNX

granite-4.0-1b-ONNX-web

granite-4.0-micro-ONNX-web

Qwen3-0.6B-ONNX

whisper-medium-ONNX

whisper-base_timestamped

granite-4.0-350m-ONNX-web

BEN2-ONNX

whisper-large-v3-turbo_timestamped

gemma-3-270m-it-ONNX

whisper-base.en

gemma-4-E4B-it-ONNX

Supertonic-TTS-ONNX

whisper-tiny.en

whisper-tiny

gliner_base

dinov3-vits16-pretrain-lvd1689m-ONNX

cohere-transcribe-03-2026-ONNX

FastVLM-0.5B-ONNX

Janus-Pro-1B-ONNX

mobilenetv4_conv_small.e2400_r224_in1k

Qwen2.5-0.5B-Instruct

Qwen3.5-0.8B-ONNX

SmolLM2-135M-ONNX

Kokoro-82M-v1.0-ONNX-timestamped

DeepSeek-R1-Distill-Qwen-1.5B-ONNX

Florence-2-base-ft

pyannote-segmentation-3.0

gliner_small-v2.1

bge-reranker-v2-m3-ONNX

functiongemma-270m-it-ONNX

siglip2-base-patch16-256-ONNX

moonshine-tiny-ONNX

dinov3-vits16-pretrain-lvd1689m-ONNX-MHA-scores

Phi-3.5-mini-instruct-onnx-web

LFM2-350M-ONNX

chatterbox-multilingual-ONNX

gemma-3-1b-it-ONNX-GQA

gte-multilingual-reranker-base

Qwen3.5-2B-ONNX

Llama-3.2-1B-Instruct-q4f16

kitten-tts-nano-0.1-ONNX

LFM2-24B-A2B-ONNX

gte-multilingual-base

whisper-small_timestamped

LFM2-1.2B-ONNX

language_detection-ONNX

Qwen3-1.7B-ONNX

Voxtral-Mini-3B-2507-ONNX

gemma-3-1b-it-ONNX

Falcon-H1-Tiny-90M-Instruct-ONNX

depth-anything-v2-base

yolov10x

Qwen3.5-4B-ONNX

yolov10m

Llama-3.2-1B-Instruct

LFM2-700M-ONNX

ultravox-v0_5-llama-3_2-1b-ONNX

Supertonic-TTS-2-ONNX

bge-small-en-v1.5-ONNX

deberta-v3-large-zeroshot-v2.0-c-ONNX

Qwen3-4B-ONNX