MBZUAI

85 models • 1 total models in database
Sort by:

MedMO-8B-Next

NaNK
license:apache-2.0
6,593
12

MedMO-4B-Next

NaNK
license:apache-2.0
5,249
3

swiftformer-xs

4,771
7

GLaMM-GranD-Pretrained

license:apache-2.0
2,997
4

MedMO-8B

NaNK
license:apache-2.0
2,238
10

LaMini-Flan-T5-248M

license:cc-by-nc-4.0
1,627
80

MedMO-4B

NaNK
license:apache-2.0
1,432
14

speecht5_tts_clartts_ar

license:cc-by-nc-4.0
1,300
24

AIN

license:mit
1,134
11

GeoPixel-7B

NaNK
license:apache-2.0
960
5

LaMini-GPT-124M

license:cc-by-nc-4.0
920
23

geochat-7B

NaNK
license:apache-2.0
881
22

LaMini-GPT-1.5B

NaNK
license:cc-by-nc-4.0
655
39

LaMini-GPT-774M

license:cc-by-nc-4.0
649
14

LaMini Flan T5 783M

This model is one of our LaMini-LM model series in paper "LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions". This model is a fine-tuned version of google/flan-t5-large on LaMini-instruction dataset that contains 2.58M samples for instruction fine-tuning. For more information about our dataset, please refer to our project repository. You can view other models of LaMini-LM series as follows. Models with ✩ are those with the best overall performance given their size/architecture, hence we recommend using them. More details can be seen in our paper. Flan-T5 LaMini-Flan-T5-77M ✩ LaMini-Flan-T5-248M ✩ LaMini-Flan-T5-783M ✩ Cerebras-GPT LaMini-Cerebras-111M LaMini-Cerebras-256M LaMini-Cerebras-590M LaMini-Cerebras-1.3B GPT-2 LaMini-GPT-124M ✩ LaMini-GPT-774M ✩ LaMini-GPT-1.5B ✩ Intended use We recommend using the model to response to human instructions written in natural language. We now show you how to load and use our model using HuggingFace `pipeline()`. We initialize with google/flan-t5-large and fine-tune it on our LaMini-instruction dataset. Its total number of parameters is 783M. The following hyperparameters were used during training: - learningrate: 0.0005 - trainbatchsize: 128 - evalbatchsize: 64 - seed: 42 - gradientaccumulationsteps: 4 - totaltrainbatchsize: 512 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08 - lrschedulertype: linear - numepochs: 5 Evaluation We conducted two sets of evaluations: automatic evaluation on downstream NLP tasks and human evaluation on user-oriented instructions. For more detail, please refer to our [paper]().

license:cc-by-nc-4.0
533
83

artst_asr_v3_qasr

license:cc-by-nc-4.0
500
4

LLaVA-Phi-3-mini-4k-instruct

license:mit
269
22

Llama-3-Nanda-10B-Chat

NaNK
llama
251
17

GeoPixel-7B-RES

📝 Description GeoPixel-7B-RES is the model specific to the Referring Remote Sensing Image Segmentation (RRSIS) task. It is finetuned on RRSIS-D dataset. 📚 Additional Resources - Paper: ArXiv. - GitHub Repository: For training and updates: GitHub - GeoPixel. - Project Page: For a detailed overview, visit our Project Page - GeoPixel.

NaNK
license:apache-2.0
241
2

LaMini-T5-738M

license:cc-by-nc-4.0
233
49

MobiLlama-05B

NaNK
llama
180
42

artst_asr_v3

license:cc-by-nc-4.0
158
0

LaMini-T5-61M

license:cc-by-nc-4.0
106
18

LaMini-Flan-T5-77M

license:cc-by-nc-4.0
105
26

GLaMM-FullScope

license:apache-2.0
105
7

MobiLlama-1B

NaNK
llama
74
18

GLaMM-RefSeg

license:apache-2.0
64
1

bactrian-x-llama-7b-merged

NaNK
llama
59
1

LaMini-Cerebras-111M

license:cc-by-nc-4.0
54
3

artst_asr_v2

license:cc-by-nc-4.0
50
2

artst_asr

license:cc-by-nc-4.0
42
2

LaMini-Neo-125M

license:cc-by-nc-4.0
28
16

swiftformer-s

26
1

MobiLlama-1B-Chat

NaNK
llama
22
25

BiMediX2-8B

NaNK
llava_llama
21
6

CoME-VL

license:apache-2.0
19
2

BiMediX2-8B-hf

NaNK
base_model:meta-llama/Llama-3.1-8B-Instruct
17
1

MobiLlama-05B-Chat

NaNK
llama
15
17

LLaVA-Meta-Llama-3-8B-Instruct-FT-S2

NaNK
llava_llama
13
4

BiMediX2-4B

NaNK
license:cc-by-nc-sa-4.0
9
1

PALO-7B

NaNK
license:apache-2.0
8
0

LaMini-Cerebras-590M

license:cc-by-nc-4.0
7
7

MobiLlama-08B

NaNK
llama
7
6

LaMini-T5-223M

license:cc-by-nc-4.0
7
3

swiftformer-l1

5
1

BiMediX2-8B-Bi

NaNK
llava_llama
5
0

LaMini-Neo-1.3B

NaNK
license:cc-by-nc-4.0
4
13

LLaVA-Meta-Llama-3-8B-Instruct-FT

NaNK
llava_llama
4
12

Video-R2

license:apache-2.0
4
0

LLaVA-Phi-3-mini-4k-instruct-FT

license:mit
3
5

swiftformer-l3

3
3

bactrian-x-llama-13b-merged

NaNK
llama
3
2

GLaMM-GCG

license:apache-2.0
3
1

artst_asr_v2_qasr

license:cc-by-nc-4.0
3
0

LaMini-Cerebras-1.3B

NaNK
license:cc-by-nc-4.0
2
3

BiMediX2-70B

BiMediX2 : Bio-Medical EXpert LMM for Diverse Medical Modalities Mohamed Bin Zayed University of Artificial Intelligence (MBZUAI), UAE [](https://github.com/mbzuai-oryx/BiMediX2) [](https://arxiv.org/abs/2412.07769) [](https://github.com/mbzuai-oryx/BiMediX/blob/main/LICENSE.txt) BiMediX2 is released under the CC-BY-NC-SA 4.0 License. For more details, please refer to the LICENSE file included in our BiMediX repository. ⚠️ Warning! This release, intended for research, is not ready for clinical or commercial use. Users are urged to employ BiMediX2 responsibly, especially when applying its outputs in real-world medical scenarios. It is imperative to verify the model's advice with qualified healthcare professionals and not rely on it for medical diagnoses or treatment decisions. Despite the overall advancements BiMediX2 shares common challenges with other language models, including hallucinations, toxicity, and stereotypes. BiMediX2's medical diagnoses and recommendations are not infallible. If you use BiMediX2 in your research, please cite our work as follows:

NaNK
llava_llama
2
2

PALO-13B

NaNK
license:apache-2.0
2
0

LLMVoX

license:cc-by-nc-sa-4.0
1
56

LLaVA-Meta-Llama-3-8B-Instruct

NaNK
llava_llama
1
12

LaMini-Cerebras-256M

license:cc-by-nc-4.0
1
4

LLaVA-Meta-Llama-3-8B-Instruct-lora

NaNK
llava_llama
1
3

LLaVA-Meta-Llama-3-8B-Instruct-pretrain

NaNK
llava_llama
1
1

GLaMM-FullScope_v0

license:apache-2.0
1
0

Video-ChatGPT-7B

NaNK
license:cc-by-4.0
0
42

TerraFM

license:apache-2.0
0
7

VideoGPT-plus_Phi3-mini-4k

license:apache-2.0
0
6

ArTST

license:cc-by-nc-4.0
0
5

bactrian-x-llama-7b-lora

NaNK
license:mit
0
4

bactrian-x-llama-13b-lora

NaNK
license:mit
0
3

MediX-R1-30B

NaNK
license:cc-by-nc-sa-4.0
0
2

LLaVA-Phi-3-mini-4k-instruct-pretrain

license:mit
0
2

ArTSTv2

license:cc-by-nc-4.0
0
2

ArTSTv3

license:cc-by-nc-4.0
0
2

Video-CoM

license:apache-2.0
0
1

MediX-R1-2B-GGUF

NaNK
license:cc-by-nc-sa-4.0
0
1

MediX-R1-8B-GGUF

NaNK
license:cc-by-nc-sa-4.0
0
1

MediX-R1-30B-GGUF

NaNK
license:cc-by-nc-sa-4.0
0
1

MediX-R1-2B

NaNK
license:cc-by-nc-sa-4.0
0
1

MediX-R1-8B

NaNK
license:cc-by-nc-sa-4.0
0
1

bactrian-x-mt5-xl-lora

license:mit
0
1

GLaMM-RegCap-RefCOCOg

license:apache-2.0
0
1

VideoGPT-plus_Phi3-mini-4k_Pretrain

license:apache-2.0
0
1

VideoGPT-plus_Vicuna-13B-4k

NaNK
0
1

STTATTS

0
1

ArTSTv1.5

license:cc-by-nc-4.0
0
1