nota-ai

29 models • 1 total models in database

Sort by:

bk-sdm-base

—

933

bk-sdm-tiny

—

835

ERGO-7B

ERGO: Efficient High-Resolution Visual Understanding for Vision-Language Models ERGO (Efficient Reasoning & Guided Observation) is a large vision–language model trained with reinforcement learning on efficiency objectives, focusing on task-relevant regions to enhance accuracy and achieve up to a 3× speedup in inference. Usage > We recommend using vLLM, as its `Automatic Prefix Caching` can significantly improve inference speed.

nota-ai

bk-sdm-base

bk-sdm-tiny

ERGO-7B

bk-sdm-small

bk-sdm-v2-tiny

Qwen3-30B-A3B-NotaMoEQuant-Int4

bk-sdm-tiny-2m

GLM-4.5-Air-NotaMoeQuant-Int4

bk-sdm-small-2m

phiva-4b-hf

bk-sdm-v2-small

Solar-Open-100B-NotaMoEQuant-NVFP4

bk-sdm-v2-base

st-llama-1-5.5b-ppl

bk-sdm-base-2m

st-vicuna-v1.3-5.5b-ppl

st-vicuna-v1.3-5.5b-taylor

st-vicuna-v1.3-10.5b-ppl

cpt_st-vicuna-v1.3-3.7b-ppl

cpt-lora_st-vicuna-v1.3-3.7b-ppl

st-llama-1-5.5b-taylor

st-vicuna-v1.3-10.5b-taylor

coreml-bk-sdm

cpt_st-vicuna-v1.3-2.7b-ppl

cpt_st-vicuna-v1.3-5.5b-ppl

cpt_st-vicuna-v1.3-1.5b-ppl

Solar-Open-100B-NotaMoEQuant-Int4

cpt-lora_st-vicuna-v1.3-5.5b-ppl-q4f16_0-MLC

cpt-lora_st-vicuna-v1.3-5.5b-ppl-q4f16_1-MLC