prithivMLmods

500 models • 115 total models in database

Sort by:

open-deepfake-detection

license:apache-2.0

92,092

Common-Voice-Gender-Detection

license:apache-2.0

58,771

Deep-Fake-Detector-v2-Model

The Deep-Fake-Detector-v2-Model is a state-of-the-art deep learning model designed to detect deepfake images. It leverages the Vision Transformer (ViT) architecture, specifically the `google/vit-base-patch16-224-in21k` model, fine-tuned on a dataset of real and deepfake images. The model is trained to classify images as either "Realism" or "Deepfake" with high accuracy, making it a powerful tool for detecting manipulated media. Update : The previous model checkpoint was obtained using a smaller classification dataset. Although it performed well in evaluation scores, its real-time performance was average due to limited variations in the training set. The new update includes a larger dataset to improve the detection of fake images. | Repository | Link | |------------|------| | Deep Fake Detector v2 Model | GitHub Repository | Key Features - Architecture: Vision Transformer (ViT) - `google/vit-base-patch16-224-in21k`. - Input: RGB images resized to 224x224 pixels. - Output: Binary classification ("Realism" or "Deepfake"). - Training Dataset: A curated dataset of real and deepfake images. - Fine-Tuning: The model is fine-tuned using Hugging Face's `Trainer` API with advanced data augmentation techniques. - Performance: Achieves high accuracy and F1 score on validation and test datasets. Model Architecture The model is based on the Vision Transformer (ViT), which treats images as sequences of patches and applies a transformer encoder to learn spatial relationships. Key components include: - Patch Embedding: Divides the input image into fixed-size patches (16x16 pixels). - Transformer Encoder: Processes patch embeddings using multi-head self-attention mechanisms. - Classification Head: A fully connected layer for binary classification. Training Details - Optimizer: AdamW with a learning rate of `1e-6`. - Batch Size: 32 for training, 8 for evaluation. - Epochs: 2. - Data Augmentation: - Random rotation (±90 degrees). - Random sharpness adjustment. - Random resizing and cropping. - Loss Function: Cross-Entropy Loss. - Evaluation Metrics: Accuracy, F1 Score, and Confusion Matrix. Dataset The model is fine-tuned on the dataset, which contains: - Real Images: Authentic images of human faces. - Fake Images: Deepfake images generated using advanced AI techniques. Limitations The model is trained on a specific dataset and may not generalize well to other deepfake datasets or domains. - Performance may degrade on low-resolution or heavily compressed images. - The model is designed for image classification and does not detect deepfake videos directly. Misuse: This model should not be used for malicious purposes, such as creating or spreading deepfakes. Bias: The model may inherit biases from the training dataset. Care should be taken to ensure fairness and inclusivity. Transparency: Users should be informed when deepfake detection tools are used to analyze their content. Future Work - Extend the model to detect deepfake videos. - Improve generalization by training on larger and more diverse datasets. - Incorporate explainability techniques to provide insights into model predictions. ```bibtex @misc{Deep-Fake-Detector-v2-Model, author = {prithivMLmods}, title = {Deep-Fake-Detector-v2-Model}, initial = {21 Mar 2024}, secondupdated = {31 Jan 2025}, latestupdated = {02 Feb 2025} }

prithivMLmods

open-deepfake-detection

Common-Voice-Gender-Detection

Deep-Fake-Detector-v2-Model

Castor-3D-Sketchfab-Flux-LoRA

SmolLM2-135M-GGUF

Qwen3-VL-8B-Instruct-abliterated-v1

Qwen3-VL-8B-Instruct-abliterated

Dog-Breed-120

Watermark-Detection-SigLIP2

Canopus-LoRA-Flux-FaceRealism

chandra-ocr-2-GGUF

deepfake-detector-model-v1

Qwen2-VL-OCR-2B-Instruct

open-scene-detection

Canopus-Clothing-Flux-LoRA

Camel-Doc-OCR-080125-GGUF

Digital-Chaos-Flux-LoRA

Flux-Long-Toon-LoRA

Qwen2.5-VL-Abliterated-Caption-GGUF

Qwen3-VL-4B-Thinking-abliterated-v1

Qwen3-VL-4B-Thinking-abliterated

Street-Bokeh-Flux-LoRA

Qwen3-VL-4B-Instruct-abliterated-v1

Qwen3-VL-4B-Instruct-abliterated

Dark-Thing-Flux-LoRA

Flux-Toonic-2.5D-LoRA

Qwen3-VL-8B-Thinking-abliterated-v1

Qwen3-4B-2507-abliterated-GGUF

Realistic-Gender-Classification

QIE-2509-Object-Remover-Bbox-v3

Shadow-Projection-Flux-LoRA

Canopus-Pixar-3D-Flux-LoRA

Human-vs-NonHuman-Detection

3D-Render-Flux-LoRA

Megalodon-OCR-Sync-0713-AIO-GGUF

Gender-Classifier-Mini

Nanonets-OCR2-3B-AIO-GGUF

Qwen-Image-Edit-2511-Unblur-Upscale

Llama-SmolTalk-3.2-1B-Instruct

Canopus-LoRA-Flux-UltraRealism-2.0

Age-Classification-SigLIP2

Qwen3.5-abliterated-MAX-AIO-GGUF

Logo-Design-Flux-LoRA

DeepSeek-OCR-Latest-BF16.I64

Gliese-OCR-7B-Post2.0-final-GGUF

Retro-Pixel-Flux-LoRA

Nanonets-OCR-s-AIO-GGUF

Security-Llama3.2-3B-GGUF

SAGE-MM-Qwen3-VL-4B-SFT-GGUF

EBook-Creative-Cover-Flux-LoRA

Qwen-Image-Edit-2511-Polaroid-Photo

Camel-Doc-OCR-080125

Trash-Net

Gliese-Qwen3.5-9B-Abliterated-Caption

Gliese-OCR-7B-Post2.0-final

Qwen2.5-VL-7B-Abliterated-Caption-it

Flux-Product-Ad-Backdrop

Fashion-Hut-Modeling-LoRA

GA-Guard-AIO-GGUF

Food-101-93M

Ton618-Epic-Realism-Flux-LoRA

Aura-9999

Bone-Fracture-Detection

Qwen3-VL-8B-Thinking-Unredacted-MAX-GGUF

Alphabet-Sign-Language-Detection

IndoorOutdoorNet

Meta-Llama-3.2-1B-GGUF-QX

chandra-OCR-GGUF

Flux-GArt-LoRA

Flux-Dev-Real-Anime-LoRA

Qwen3-VL-8B-Instruct-abliterated-v2.0-GGUF

Fire-Detection-Siglip2

Canopus-LoRA-Flux-Anime

Flux.1-Dev-Movie-Boards-LoRA

Abstract-Cartoon-Flux-LoRA

Qwen3-VL-8B-Instruct-Unredacted-MAX-GGUF

Deepfake-Detect-Siglip2

Mockup-Texture-Flux-LoRA

Gliese-OCR-7B-Post1.0