Emotion Recognition

Detect emotions from facial expressions or voice - happiness, sadness, anger, surprise. Great for customer service, mental health apps, and gaming.

Models Found

Common Applications

Customer service quality monitoring

Mental health & wellness apps

Gaming & entertainment

Video conferencing insights

Education & training feedback

Top Models

20 models • Sorted by downloads

nsfw_image_detection

by Falconsai

Model Card: Fine-Tuned Vision Transformer (ViT) for NSFW Image Classification The Fine-Tuned Vision Transformer (ViT) is a variant of the transformer encoder architecture, similar to BERT, that has been adapted for image classification tasks. This specific model, named "google/vit-base-patch16-224-in21k," is pre-trained on a substantial collection of images in a supervised manner, leveraging the ImageNet-21k dataset. The images in the pre-training dataset are resized to a resolution of 224x224 pixels, making it suitable for a wide range of image recognition tasks. During the training phase, meticulous attention was given to hyperparameter settings to ensure optimal model performance. The model was fine-tuned with a judiciously chosen batch size of 16. This choice not only balanced computational efficiency but also allowed for the model to effectively process and learn from a diverse array of images. To facilitate this fine-tuning process, a learning rate of 5e-5 was employed. The learning rate serves as a critical tuning parameter that dictates the magnitude of adjustments made to the model's parameters during training. In this case, a learning rate of 5e-5 was selected to strike a harmonious balance between rapid convergence and steady optimization, resulting in a model that not only learns swiftly but also steadily refines its capabilities throughout the training process. This training phase was executed using a proprietary dataset containing an extensive collection of 80,000 images, each characterized by a substantial degree of variability. The dataset was thoughtfully curated to include two distinct classes, namely "normal" and "nsfw." This diversity allowed the model to grasp nuanced visual patterns, equipping it with the competence to accurately differentiate between safe and explicit content. The overarching objective of this meticulous training process was to impart the model with a deep understanding of visual cues, ensuring its robustness and competence in tackling the specific task of NSFW image classification. The result is a model that stands ready to contribute significantly to content safety and moderation, all while maintaining the highest standards of accuracy and reliability. Intended Uses & Limitations Intended Uses - NSFW Image Classification: The primary intended use of this model is for the classification of NSFW (Not Safe for Work) images. It has been fine-tuned for this purpose, making it suitable for filtering explicit or inappropriate content in various applications. How to use Here is how to use this model to classifiy an image based on 1 of 2 classes (normal,nsfw): - 'evalloss': 0.07463177293539047, - 'evalaccuracy': 0.980375, - 'evalruntime': 304.9846, - 'evalsamplespersecond': 52.462, - 'evalstepspersecond': 3.279 Note: It's essential to use this model responsibly and ethically, adhering to content guidelines and applicable regulations when implementing it in real-world applications, particularly those involving potentially sensitive content. For more details on model fine-tuning and usage, please refer to the model's documentation and the model hub. - Hugging Face Model Hub - Vision Transformer (ViT) Paper - ImageNet-21k Dataset Disclaimer: The model's performance may be influenced by the quality and representativeness of the data it was fine-tuned on. Users are encouraged to assess the model's suitability for their specific applications and datasets.

70.7M downloads

890 likes

PYTORCH

Emotion Recognition

Common Applications

Top Models

nsfw_image_detection

fairface_age_image_detection

mobilenetv3_small_100.lamb_in1k

XTTS-v2

resnet50.a1_in1k

vit-base-patch16-224

resnet18.a1_in1k

convnextv2_nano.fcmae_ft_in22k_in1k

emotion_text_classifier

mobilevit-small

nsfw_image_detector

gender-classification

efficientnet_b0.ra_in1k

beit-base-patch16-224-pt22k-ft22k

chatterbox

vit_small_patch16_224.augreg_in21k_ft_in1k

resnet34.a1_in1k

emotion-recognition-wav2vec2-IEMOCAP

emotion-english-distilroberta-base

vit_tiny_patch16_224.augreg_in21k_ft_in1k

Related Use Cases

Age Group Detection