MiaoshouAI

6 models • 1 total models in database
Sort by:

Florence-2-large-PromptGen-v2.0

Florence-2-large-PromptGen v2.0 This upgrade is based on PromptGen 1.5 with some new features to the model: Features: Improved caption quality for \ , \ and \ . A new \ instruction, which helps the model to better understands the image composition of the input image. Memory efficient compare to other models! This is a really light weight caption model that allows you to use a little more than 1G of VRAM and produce lightening fast and high quality image captions. Designed to handle image captions for Flux model for both T5XXL CLIP and CLIPL, the Miaoshou Tagger new node called "Flux CLIP Text Encode" which eliminates the need to run two separate tagger tools for caption creation. You can easily populate both CLIPs in a single generation, significantly boosting speed when working with Flux models. Instruction prompt: \ generate prompt as danbooru style tags \ a one line caption for the image \ a structured caption format which detects the position of the subjects in the image \ a very detailed description for the image \ image composition analysis mode \ a mixed caption style of more detailed caption and tags, this is extremely useful for FLUX model when using T5XXL and CLIPL together. A new node in MiaoshouTagger ComfyUI is added to support this instruction. \ Combine the power of mixed caption with analyze. Version History: For version 2.0, you will notice the following 1. \ along with a beta node in ComfyUI for partial image analysis 2. A new instruction for \ 3. A much improve accuracy for \ , \ and \ To use this model, you can load it directly from the Hugging Face Model Hub: Use under MiaoshouAI Tagger ComfyUI If you just want to use this model, you can use it under ComfyUI-Miaoshouai-Tagger A detailed use and install instruction is already there. (If you have already installed MiaoshouAI Tagger, you need to update the node in ComfyUI Manager first or use git pull to get the latest update.)

license:mit
33,354
90

Florence-2-base-PromptGen-v2.0

Florence-2-base-PromptGen v2.0 This upgrade is based on PromptGen 1.5 with some new features to the model: Features: Improved caption quality for \ , \ and \ . A new \ instruction, which helps the model to better understands the image composition of the input image. Memory efficient compare to other models! This is a really light weight caption model that allows you to use a little more than 1G of VRAM and produce lightening fast and high quality image captions. Designed to handle image captions for Flux model for both T5XXL CLIP and CLIPL, the Miaoshou Tagger new node called "Flux CLIP Text Encode" which eliminates the need to run two separate tagger tools for caption creation. You can easily populate both CLIPs in a single generation, significantly boosting speed when working with Flux models. Instruction prompt: \ generate prompt as danbooru style tags \ a one line caption for the image \ a structured caption format which detects the position of the subjects in the image \ a very detailed description for the image \ image composition analysis mode \ a mixed caption style of more detailed caption and tags, this is extremely useful for FLUX model when using T5XXL and CLIPL together. A new node in MiaoshouTagger ComfyUI is added to support this instruction. \ Combine the power of mixed caption with analyze. Version History: For version 2.0, you will notice the following 1. \ along with a beta node in ComfyUI for partial image analysis 2. A new instruction for \ 3. A much improve accuracy for \ , \ and \ To use this model, you can load it directly from the Hugging Face Model Hub: Use under MiaoshouAI Tagger ComfyUI If you just want to use this model, you can use it under ComfyUI-Miaoshouai-Tagger A detailed use and install instruction is already there. (If you have already installed MiaoshouAI Tagger, you need to update the node in ComfyUI Manager first or use git pull to get the latest update.)

license:mit
10,255
53

Florence-2-base-PromptGen-v1.5

license:mit
1,293
96

Florence-2-large-PromptGen-v1.5

license:mit
583
67

Florence-2-base-PromptGen

license:mit
325
56

transnetv2-pytorch-weights

0
2