nota-ai
29 models • 1 total models in database
Sort by:
bk-sdm-base
—
933
19
bk-sdm-tiny
—
835
28
ERGO-7B
ERGO: Efficient High-Resolution Visual Understanding for Vision-Language Models ERGO (Efficient Reasoning & Guided Observation) is a large vision–language model trained with reinforcement learning on efficiency objectives, focusing on task-relevant regions to enhance accuracy and achieve up to a 3× speedup in inference. Usage > We recommend using vLLM, as its `Automatic Prefix Caching` can significantly improve inference speed.
NaNK
license:apache-2.0
406
15
bk-sdm-small
—
282
30
bk-sdm-v2-tiny
—
171
1
Qwen3-30B-A3B-NotaMoEQuant-Int4
NaNK
license:apache-2.0
142
4
bk-sdm-tiny-2m
—
107
18
GLM-4.5-Air-NotaMoeQuant-Int4
NaNK
license:cc-by-nc-4.0
53
1
bk-sdm-small-2m
—
43
14
phiva-4b-hf
NaNK
—
43
3
bk-sdm-v2-small
—
26
0
Solar-Open-100B-NotaMoEQuant-NVFP4
NaNK
—
16
0
bk-sdm-v2-base
—
10
1
st-llama-1-5.5b-ppl
NaNK
llama
4
10
bk-sdm-base-2m
—
2
15
st-vicuna-v1.3-5.5b-ppl
NaNK
llama
1
10
st-vicuna-v1.3-5.5b-taylor
NaNK
llama
1
10
st-vicuna-v1.3-10.5b-ppl
NaNK
llama
1
9
cpt_st-vicuna-v1.3-3.7b-ppl
NaNK
llama
1
4
cpt-lora_st-vicuna-v1.3-3.7b-ppl
NaNK
llama
1
0
st-llama-1-5.5b-taylor
NaNK
llama
0
10
st-vicuna-v1.3-10.5b-taylor
NaNK
llama
0
9
coreml-bk-sdm
—
0
6
cpt_st-vicuna-v1.3-2.7b-ppl
NaNK
llama
0
5
cpt_st-vicuna-v1.3-5.5b-ppl
NaNK
llama
0
4
cpt_st-vicuna-v1.3-1.5b-ppl
NaNK
llama
0
4
Solar-Open-100B-NotaMoEQuant-Int4
NaNK
—
0
1
cpt-lora_st-vicuna-v1.3-5.5b-ppl-q4f16_0-MLC
NaNK
—
0
1
cpt-lora_st-vicuna-v1.3-5.5b-ppl-q4f16_1-MLC
NaNK
—
0
1