keras-io

101 models • 4 total models in database

monocular-depth-estimation

—
751
16

Deeplabv3p Resnet50

Multiclass semantic segmentation using DeepLabV3+

This repo contains the model and the notebook for this Keras example on multiclass semantic segmentation using DeepLabV3+. The model is trained for demonstration purposes and does not guarantee the best results in production. For better results, follow and optimize the Keras example as needed.

## Background Information
Semantic segmentation, whose goal is to assign a semantic label to every pixel in an image, is an essential computer vision task. In this example, we implement the DeepLabV3+ model, a fully convolutional architecture that performs well on semantic segmentation benchmarks, for multi-class semantic segmentation.

## Training Data
The model is trained on a subset (10,000 images) of the Crowd Instance-level Human Parsing (CIHP) dataset, which contains 38,280 diverse human images. Each image in CIHP is labeled with pixel-wise annotations for 20 categories, as well as instance-level identification, so the dataset can be used for the "human part segmentation" task.

## Model
The model uses a ResNet50 pretrained on ImageNet as the backbone.

## References
1. Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation
2. Rethinking Atrous Convolution for Semantic Image Segmentation
3. DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs
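For multiclass segmentation, the model produces per-pixel class logits that must be reduced to a single label per pixel. A minimal numpy sketch of this decoding step (the function name and the toy logits are illustrative, not part of the repo):

```python
import numpy as np

def logits_to_mask(logits: np.ndarray) -> np.ndarray:
    """Convert per-pixel class logits of shape (H, W, num_classes)
    into a segmentation mask of shape (H, W) holding integer class
    ids, by taking the argmax over the class axis."""
    return np.argmax(logits, axis=-1)

# Toy example: a 2x2 "image" with 3 classes.
logits = np.array([
    [[0.1, 2.0, 0.3], [1.5, 0.2, 0.1]],
    [[0.0, 0.1, 3.0], [0.4, 0.4, 0.5]],
])
mask = logits_to_mask(logits)
print(mask)  # [[1 0]
             #  [2 2]]
```

The same argmax applies regardless of backbone; a DeepLabV3+ model with a 20-category head would simply yield `num_classes = 20`.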

—
198
5

Ocr For Captcha

Keras implementation of an OCR model for reading captchas 🤖🦹🏻

This repo contains the model and the notebook for this Keras example on an OCR model for reading captchas.

## Background Information
This example demonstrates a simple OCR model built with the Functional API. Apart from combining a CNN and an RNN, it also illustrates how you can instantiate a new layer and use it as an "Endpoint layer" for implementing the CTC loss. The model uses subclassing; learn more about subclassing from this guide.
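At inference time, a CTC-trained model's per-timestep predictions are turned into text by collapsing consecutive repeats and dropping the blank token. A minimal greedy-decoding sketch in pure Python (the function name and token ids are illustrative assumptions, not part of the repo):

```python
def ctc_greedy_decode(frame_ids, blank=0):
    """Greedy CTC decoding: collapse consecutive repeated ids,
    then drop the blank id. `frame_ids` is the per-timestep argmax
    of the model's softmax output."""
    decoded = []
    prev = None
    for t in frame_ids:
        if t != prev and t != blank:
            decoded.append(t)
        prev = t
    return decoded

# Suppose 'a'=1, 'b'=2, blank=0. Frames "a a - a b b" decode to "a a b":
print(ctc_greedy_decode([1, 1, 0, 1, 2, 2]))  # [1, 1, 2]
```

Note that the blank between the two `a` runs is what allows the decoded string to contain a genuine repeated character.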

—
117
81

lowlight-enhance-mirnet

—
93
35

sentiment-analysis

license:apache-2.0
54
1

Timeseries Anomaly Detection

This repo contains the model and the notebook for this Keras example on timeseries anomaly detection using an autoencoder. The script demonstrates how you can use a reconstruction convolutional autoencoder model to detect anomalies in timeseries data. We use the Numenta Anomaly Benchmark (NAB) dataset, which provides artificial timeseries data containing labeled anomalous periods of behavior. The data are ordered, timestamped, single-valued metrics.

The following hyperparameters were used during training:
- optimizer: {'name': 'Adam', 'learning_rate': 0.001, 'decay': 0.0, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-07, 'amsgrad': False}
- training_precision: float32

| Epochs | Train Loss | Validation Loss |
| --- | --- | --- |
| 1 | 0.011 | 0.014 |
| 2 | 0.011 | 0.015 |
| 3 | 0.01 | 0.012 |
| 4 | 0.01 | 0.013 |
| 5 | 0.01 | 0.012 |
| 6 | 0.009 | 0.014 |
| 7 | 0.009 | 0.013 |
| 8 | 0.009 | 0.012 |
| 9 | 0.009 | 0.012 |
| 10 | 0.009 | 0.011 |
| 11 | 0.008 | 0.01 |
| 12 | 0.008 | 0.011 |
| 13 | 0.008 | 0.009 |
| 14 | 0.008 | 0.011 |
| 15 | 0.008 | 0.009 |
| 16 | 0.008 | 0.009 |
| 17 | 0.008 | 0.009 |
| 18 | 0.007 | 0.01 |
| 19 | 0.007 | 0.009 |
| 20 | 0.007 | 0.008 |
| 21 | 0.007 | 0.009 |
| 22 | 0.007 | 0.008 |
| 23 | 0.007 | 0.008 |
| 24 | 0.007 | 0.007 |
| 25 | 0.007 | 0.008 |
| 26 | 0.006 | 0.009 |
| 27 | 0.006 | 0.008 |
| 28 | 0.006 | 0.009 |
| 29 | 0.006 | 0.008 |

## Model Plot
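The detection step itself is independent of the autoencoder: a window is flagged as anomalous when its reconstruction error exceeds a threshold derived from the training data. A minimal numpy sketch, assuming (as in the Keras example this card is based on) that the threshold is the maximum MAE observed on training windows; the function name and toy data are illustrative:

```python
import numpy as np

def detect_anomalies(x, x_reconstructed, train_mae):
    """Flag windows whose reconstruction MAE exceeds the maximum
    MAE observed on the training data. Inputs have shape
    (num_windows, timesteps, features)."""
    threshold = np.max(train_mae)
    mae = np.mean(np.abs(x_reconstructed - x), axis=(1, 2))
    return mae > threshold

# Toy data: 4 windows of 3 timesteps x 1 feature.
x = np.zeros((4, 3, 1))
x_rec = np.zeros((4, 3, 1))
x_rec[2] += 0.5                       # one window reconstructed poorly
train_mae = np.array([0.05, 0.08, 0.06])
print(detect_anomalies(x, x_rec, train_mae))  # [False False  True False]
```

Because the autoencoder is trained only on normal data, windows it reconstructs poorly are precisely those unlike anything seen during training.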

—
51
71

Image-Classification-using-EANet

license:apache-2.0
39
1

CutMix_data_augmentation_for_image_classification

—
37
2

denoising-diffusion-implicit-models

—
30
10

structured-data-classification-grn-vsn

—
29
9

timeseries_forecasting_for_weather

—
27
21

Object-Detection-RetinaNet

—
26
19

video-classification-cnn-rnn

—
26
15

bert-semantic-similarity

—
26
9

PointNet

license:apache-2.0
25
5

low-light-image-enhancement

license:apache-2.0
23
86

drug-molecule-generation-with-VAE

—
22
12

TF_Decision_Trees

license:apache-2.0
19
6

timeseries-classification-from-scratch

—
19
3

vq-vae

—
18
2

ner-with-transformers

—
18
1

super-resolution

license:mit
14
32

imbalanced_classification

—
13
8

tab_transformer

—
12
41

video-vision-transformer

license:apache-2.0
12
7

vit_small_ds_v2

license:apache-2.0
12
1

cct

—
12
0

graph-attention-nets

—
11
6

GauGAN-Image-generation

—
11
4

collaborative-filtering-movielens

—
11
3

mobile-vit-xxs

—
11
0

text-generation-miniature-gpt

license:gpl
11
0

timeseries_transformer_classification

—
10
13

CycleGAN

—
10
9

dual-encoder-image-search

—
10
8

supervised-contrastive-learning-cifar10

license:apache-2.0
10
3

bidirectional-lstm-imdb

—
10
0

ppo-cartpole

—
10
0

vit-small-ds

license:apache-2.0
10
0

structured-data-classification

—
10
0

text-classification-with-transformer

—
10
0

consistency_training_with_supervision_teacher_model

—
10
0

wgan-molecular-graphs

—
9
5

deep-dream

—
9
3

video-transformers

—
9
2

VGG19

—
9
1

conv_autoencoder

license:gpl-3.0
9
0

time-series-anomaly-detection-autoencoder

—
8
15

cifar10_metric_learning

—
8
1

learning_to_tokenize_in_ViT

—
8
1

deep-deterministic-policy-gradient

—
8
0

semantic-segmentation

—
7
16

conv-lstm

—
7
4

attention_mil

—
7
1

keras-reptile

—
7
0

speaker-recognition

—
6
9

siamese-contrastive

—
6
5

Multimodal Entailment

TensorFlow Keras implementation of multimodal entailment. This repo contains the models for multimodal entailment. In this example, we build and train a model for predicting multimodal entailment, using the multimodal entailment dataset recently introduced by Google Research.

On social media platforms, to audit and moderate content we may want to answer questions like the following in near real time: Does a given piece of information contradict the other? Does a given piece of information imply the other? In NLP, this task is called analyzing textual entailment. However, that applies only when the information comes from text content. In practice, the available information often comes not just from text, but from a multimodal combination of text, images, audio, video, etc. Multimodal entailment is simply the extension of textual entailment to a variety of new input modalities.

—
6
4

MelGAN-spectrogram-inversion

—
6
2

ctc_asr

—
6
1

addition-lstm

—
6
1

SimSiam

—
6
1

consistency_training_with_supervision_student_model

—
6
0

conditional-gan

—
5
8

pointnet_segmentation

—
5
5

swin-transformers

—
5
4

semi-supervised-classification-simclr

license:apache-2.0
5
2

semantic-image-clustering

—
5
2

conv_mixer_image_classification

—
5
1

conv_Mixer

—
5
1

randaugment

license:apache-2.0
5
0

bit

—
5
0

transformers-qa

license:apache-2.0
4
4

english-speaker-accent-recognition-using-transfer-learning

—
4
2

pixel-cnn-mnist

—
4
1

Node2Vec_MovieLens

—
3
4

shiftvit

—
3
1

involution

license:mit
3
0

3D_CNN_Pneumonia

—
3
0

adamatch-domain-adaption

—
3
0

WGAN-GP

—
2
2

char-lstm-seq2seq

—
2
1

recommender-transformers

—
2
1

convmixer

license:apache-2.0
2
0

deit

—
2
0

cl_s2s

—
2
0

MPNN-for-molecular-property-prediction

—
1
3

nerf

—
1
1

dcgan

—
1
0

simple-mnist-convnet

—
1
0

gat-node-classification

—
1
0

real_nvp

—
1
0

transformer_asr

license:apache-2.0
1
0

image-captioning

—
0
8

EDSR

license:mit
0
7

ProbabalisticBayesianModel-Wine

—
0
2

neural-decision-forest

license:apache-2.0
0
2

dcgan-to-generate-face-images

—
0
2

pix2pix-generator

—
0
1

pix2pix-discriminator

—
0
1

neural_machine_translation_with_transformer

license:apache-2.0
0
1