CLIP ViT Base Patch 32

Downloads
Hugging Face
20.9M
795
Context
77 tokens (small context)
License
Updated
11/3/2025
by
openai

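This model is designed for vision tasks, in particular zero-shot image classification: given an image and a set of candidate text labels, it scores how well each label matches the image. The model card's example, titled "Cat & Dog", uses the candidate labels "playing music" and "playing sports" with the sample image at https://huggingface.co/datasets/mishig/sample_images/resolve/main/cat-dog-music.png.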
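As a rough illustration of how the candidate labels and sample image fit together, the sketch below scores each label against one image and ranks the results. It is a minimal example, not the model card's official snippet. It assumes the Hugging Face transformers library (CLIPModel, CLIPProcessor), Pillow, and requests are installed, and that the hub identifier is openai/clip-vit-base-patch32. The helper names softmax, rank_labels, and classify are hypothetical and chosen here for illustration only.

```python
import math


def softmax(scores):
    """Turn raw similarity scores into probabilities that sum to 1."""
    peak = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def rank_labels(scores, labels):
    """Pair each label with its probability, most likely first."""
    probs = softmax(scores)
    return sorted(zip(labels, probs), key=lambda pair: pair[1], reverse=True)


def classify(image_url, labels, model_id="openai/clip-vit-base-patch32"):
    """Score candidate labels against one image.

    model_id is an assumed hub identifier; weights download on first use.
    """
    # Heavy imports live here so the helpers above work without them.
    import requests
    from PIL import Image
    from transformers import CLIPModel, CLIPProcessor

    model = CLIPModel.from_pretrained(model_id)
    processor = CLIPProcessor.from_pretrained(model_id)
    image = Image.open(requests.get(image_url, stream=True, timeout=30).raw)

    # truncation keeps each label within the model's text context
    inputs = processor(text=labels, images=image, return_tensors="pt",
                       padding=True, truncation=True)
    outputs = model(**inputs)

    # logits_per_image has shape (1, len(labels)): image-to-text similarity
    scores = outputs.logits_per_image[0].detach().tolist()
    return rank_labels(scores, labels)
```

Calling classify with the sample image URL above and the list ["playing music", "playing sports"] would return the two labels paired with their probabilities, highest first. The ranking helper is kept separate from the model call so it can be checked without downloading any weights.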

Image Model
PYTORCH

Quick Info

Released
3/2/2022
Framework
PYTORCH

Resources