Ellaria-9B

by tannedbum
Language Model, OTHER, 9B params, 1 language
New, 10 downloads, Early-stage
Edge AI: Mobile, Laptop, Server (21GB+ RAM)
Quick Summary

Same reliable approach as before.

Device Compatibility

Mobile: 4-6GB RAM
Laptop: 16GB RAM
Server: GPU
Minimum Recommended: 9GB+ RAM
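
The RAM figures on this page are not explained, but they are roughly what you would expect from storing 9B weights at different precisions. The sketch below is only back-of-the-envelope arithmetic: the helper name `weight_memory_gb` and the choice of 16, 8 and 4 bits per weight are assumptions for illustration, and the estimate covers weights only, not the KV cache or runtime overhead.

```python
def weight_memory_gb(n_params: float, bits_per_weight: float) -> float:
    """Approximate memory for the weights alone, in GB (10^9 bytes)."""
    return n_params * bits_per_weight / 8 / 1e9


N_PARAMS = 9.0e9  # parameter count taken from the listing

# Assumed precisions: bfloat16 (as in the merge config) and two common quantization widths.
for label, bits in [("bfloat16", 16), ("8-bit", 8), ("4-bit", 4)]:
    print(f"{label}: {weight_memory_gb(N_PARAMS, bits):.1f} GB (weights only)")
```

Under these assumptions, the 21GB+ figure would correspond to unquantized bfloat16 weights plus overhead, and the smaller figures to quantized builds. Actual requirements depend on the runtime and context length.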

Code Examples

SillyTavern (text)
temp 0.9
top_k 30
top_p 0.75
min_p 0.2
rep_pen 1.1
smooth_factor 0.25
smooth_curve 1
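
To make the settings above more concrete, here is a minimal sketch of how temperature, top_k, top_p and min_p typically narrow the candidate tokens before sampling. It is not SillyTavern's or any backend's actual code: the function name `filter_probs` is hypothetical, the order in which samplers are applied differs between backends, and rep_pen and the smoothing parameters are left out.

```python
import math


def filter_probs(logits, temp=0.9, top_k=30, top_p=0.75, min_p=0.2):
    """Return {token_index: probability} after the assumed filtering steps."""
    # Temperature: divide logits by temp, then apply a numerically stable softmax.
    scaled = [x / temp for x in logits]
    peak = max(scaled)
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    probs = [e / total for e in exps]

    # Rank token indices from most to least likely.
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)

    # top_k: keep at most the k most likely tokens.
    candidates = order[:top_k]

    # top_p: keep the shortest prefix whose cumulative probability reaches top_p.
    kept, cumulative = [], 0.0
    for i in candidates:
        kept.append(i)
        cumulative += probs[i]
        if cumulative >= top_p:
            break

    # min_p: drop tokens less likely than min_p times the top token's probability.
    threshold = min_p * probs[order[0]]
    kept = [i for i in kept if probs[i] >= threshold]

    # Renormalize the survivors so they sum to 1.
    norm = sum(probs[i] for i in kept)
    return {i: probs[i] / norm for i in kept}


if __name__ == "__main__":
    print(filter_probs([2.0, 1.0, 0.0, -1.0]))
```

With min_p at 0.2 and top_p at 0.75, the candidate pool is usually small, so top_k 30 rarely becomes the binding limit in this sketch.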
Configuration (yaml)
slices:
  - sources:
      - model: TheDrummer/Gemmasutra-9B-v1
        layer_range: [0, 42]
      - model: princeton-nlp/gemma-2-9b-it-SimPO
        layer_range: [0, 42]
merge_method: slerp
base_model: TheDrummer/Gemmasutra-9B-v1
parameters:
  t:
    - filter: self_attn
      value: [0.2, 0.4, 0.6, 0.2, 0.4]
    - filter: mlp
      value: [0.8, 0.6, 0.4, 0.8, 0.6]
    - value: 0.4
dtype: bfloat16
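
The configuration above blends the two models with spherical linear interpolation (slerp), using a different interpolation weight `t` for attention and MLP tensors at different depths. The sketch below illustrates the idea only and is not the merge tool's implementation: both function names are hypothetical, and it assumes the five-element `value` lists are anchor points interpolated linearly across the 42 layers, with `t = 0` giving the base model and `t = 1` the other model.

```python
import math


def gradient_value(anchors, layer_idx, num_layers):
    """Linearly interpolate a list of anchor values across the layer stack (assumed behavior)."""
    if len(anchors) == 1 or num_layers <= 1:
        return anchors[0]
    pos = layer_idx / (num_layers - 1) * (len(anchors) - 1)
    lo = min(int(pos), len(anchors) - 2)
    frac = pos - lo
    return anchors[lo] * (1 - frac) + anchors[lo + 1] * frac


def slerp(t, v0, v1):
    """Spherical linear interpolation between two non-zero vectors given as lists."""
    n0 = math.sqrt(sum(x * x for x in v0))
    n1 = math.sqrt(sum(x * x for x in v1))
    dot = sum(a * b for a, b in zip(v0, v1)) / (n0 * n1)
    dot = max(-1.0, min(1.0, dot))
    # Nearly parallel vectors: fall back to plain linear interpolation.
    if abs(dot) > 0.9995:
        return [(1 - t) * a + t * b for a, b in zip(v0, v1)]
    theta = math.acos(dot)
    s = math.sin(theta)
    w0 = math.sin((1 - t) * theta) / s
    w1 = math.sin(t * theta) / s
    return [w0 * a + w1 * b for a, b in zip(v0, v1)]


if __name__ == "__main__":
    self_attn_t = [0.2, 0.4, 0.6, 0.2, 0.4]  # from the config
    for layer in (0, 10, 20, 30, 41):
        print(layer, round(gradient_value(self_attn_t, layer, 42), 3))
    print(slerp(0.5, [1.0, 0.0], [0.0, 1.0]))
```

Note that the self_attn and mlp lists in the config sum to 1.0 at each position, so under this reading, wherever attention leans toward one model the MLP leans toward the other; all remaining tensors use the fixed value 0.4.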