tawkeed-ocr

Name: tawkeed-ocr
Author: tawkeed-sa

179

license:apache-2.0

tawkeed-sa

Other

OTHER

2B params

New

179 downloads

Early-stage

Try on Hugging Face Add to Compare

Edge AI:

Mobile

Laptop

Server

5GB+ RAM

Mobile

Laptop

Server

Quick Summary

AI model with specialized capabilities.

Device Compatibility

Mobile

4-6GB RAM

Laptop

16GB RAM

Server

GPU

Minimum Recommended

2GB+ RAM

Code Examples

Usagepythontransformers

from transformers import AutoProcessor, AutoModelForVision2Seq
from PIL import Image

model_id = "tawkeed-sa/tawkeed-ocr"
processor = AutoProcessor.from_pretrained(model_id)
model = AutoModelForVision2Seq.from_pretrained(
    model_id,
    device_map="auto",
    torch_dtype="auto",
)

# Load an Arabic document image
image = Image.open("arabic_document.png")

# Extract text
messages = [
    {
        "role": "user",
        "content": [
            {"type": "image"},
            {"type": "text", "text": "استخرج جميع النصوص العربية من هذه الصورة"},
        ],
    }
]
inputs = processor.apply_chat_template(messages, images=[image], return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=2048)
result = processor.decode(outputs[0][inputs["input_ids"].shape[-1]:], skip_special_tokens=True)
print(result)

Batch Processingpythontransformers

from pathlib import Path
from PIL import Image
from transformers import AutoProcessor, AutoModelForVision2Seq

model_id = "tawkeed-sa/tawkeed-ocr"
processor = AutoProcessor.from_pretrained(model_id)
model = AutoModelForVision2Seq.from_pretrained(model_id, device_map="auto", torch_dtype="auto")

# Process multiple document pages
for img_path in sorted(Path("scanned_pages/").glob("*.png")):
    image = Image.open(img_path)
    messages = [
        {
            "role": "user",
            "content": [
                {"type": "image"},
                {"type": "text", "text": "Extract all text from this document page."},
            ],
        }
    ]
    inputs = processor.apply_chat_template(messages, images=[image], return_tensors="pt").to(model.device)
    outputs = model.generate(**inputs, max_new_tokens=2048)
    text = processor.decode(outputs[0][inputs["input_ids"].shape[-1]:], skip_special_tokens=True)
    print(f"--- {img_path.name} ---")
    print(text)

Deploy This Model

Production-ready deployment in minutes

Together.ai

Instant API access to this model

Fastest API

Production-ready inference API. Start free, scale to millions.

Try Free API

Replicate

One-click model deployment

Easiest Setup

Run models in the cloud with simple API. No DevOps required.

Deploy Now

Disclosure: We may earn a commission from these partners. This helps keep LLMYourWay free.