JOSIE-1.1-4B-Thinking
by Goekdeniz-Guelmez
License: MIT
Language Model, 4B params
New, early-stage, 76 downloads
Edge AI: Mobile, Laptop, Server (9GB+ RAM)
Quick Summary
A 4B-parameter language model with reasoning ("thinking") output, positioned for edge devices as well as laptops and servers.
Device Compatibility
Mobile: 4-6GB RAM
Laptop: 16GB RAM
Server: GPU
Minimum recommended: 4GB+ RAM
Code Examples
How to Get Started (Python, transformers)
# Using Hugging Face Transformers
from transformers import AutoModelForCausalLM, AutoTokenizer
model_name = "Goekdeniz-Guelmez/JOSIE-1.1-4B-Thinking"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(
    model_name,
    device_map="auto",
    torch_dtype="auto"
)
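The full-precision load above needs more memory than the 4-6GB quoted for mobile devices in the compatibility table. One common way to approach those smaller RAM budgets is 4-bit quantization. The sketch below is an illustration only, not part of the original card: it uses bitsandbytes with assumed settings, and requires the bitsandbytes package plus, typically, a CUDA GPU.
# Hedged sketch: load the weights in 4-bit (NF4) to cut memory use roughly 4x
# versus fp16. Settings here are illustrative assumptions, not published
# recommendations for this model.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

model_name = "Goekdeniz-Guelmez/JOSIE-1.1-4B-Thinking"

quant_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)

tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(
    model_name,
    quantization_config=quant_config,  # replaces torch_dtype="auto" from the load above
    device_map="auto",
)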
Basic Usage (Python)
# Example inference
messages = [
{"role": "user", "content": "Explain quantum entanglement in simple terms.."}
]
inputs = tokenizer.apply_chat_template(
    messages,
    add_generation_prompt=True,
    return_tensors="pt"
).to(model.device)
outputs = model.generate(
    **inputs,
    max_new_tokens=4096,
    temperature=0.6,
    top_p=0.95,
    top_k=20,
    repetition_penalty=1.1,
    do_sample=True
)
response = tokenizer.decode(outputs[0], skip_special_tokens=True)
print(response)
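The card does not document the model's output format, but "thinking"-style models commonly wrap their reasoning trace in <think>...</think> tags before the final answer. Assuming that convention holds here (an assumption, not something stated on this card), the trace and the answer in the decoded response above can be separated like this:
import re

# Hedged sketch: split a reasoning trace from the final answer, assuming the
# model emits <think>...</think> tags (a common convention for "thinking"
# models; not confirmed by this model card).
def split_thinking(text: str) -> tuple[str, str]:
    match = re.search(r"<think>(.*?)</think>", text, flags=re.DOTALL)
    if match is None:
        return "", text.strip()          # no trace found; treat everything as the answer
    thinking = match.group(1).strip()
    answer = text[match.end():].strip()  # whatever follows the closing tag
    return thinking, answer

thinking, answer = split_thinking(response)
print(answer)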
MLX Usage (Apple Silicon)
# Using MLX for optimized Apple Silicon inference
from mlx_lm.utils import load
from mlx_lm.generate import generate
from mlx_lm.sample_utils import make_logits_processors, make_sampler
model, tokenizer = load("Goekdeniz-Guelmez/JOSIE-1.1-4B-Thinking")
sampler = make_sampler(
    temp=0.6,
    top_p=0.95,
    min_p=0.0,
    top_k=20,
)
messages = [
{"role": "user", "content": "Explain quantum entanglement in simple terms.."}
]
prompt = tokenizer.apply_chat_template(
    messages,
    add_generation_prompt=True,
    tokenize=False
)
response = generate(
    model,
    tokenizer,
    prompt=prompt,
    max_tokens=4096,
    sampler=sampler,
    logits_processors=make_logits_processors(repetition_penalty=1.1)
)
print(response)
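For interactive use, mlx_lm can also stream the reply token by token instead of returning the full completion at once. The sketch below reuses the model, tokenizer, prompt, and sampler loaded above and assumes a recent mlx_lm release where stream_generate is exported from the top-level package and yields chunks exposing a .text field; older releases expose a different streaming interface.
from mlx_lm import stream_generate

# Hedged sketch: print each generated chunk as soon as it is available.
# Assumes recent mlx_lm where stream_generate yields chunks with a `.text` field.
for chunk in stream_generate(
    model,
    tokenizer,
    prompt=prompt,
    max_tokens=4096,
    sampler=sampler,
):
    print(chunk.text, end="", flush=True)
print()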