monsterapi
gpt2_alpaca-lora
gpt2_124m_norobots
gemma-2b-lora-maths-orca-200k
Gptj-6b_alpaca-gpt4
llama2-7b-tiny-codes-code-generation
mistral_7b_DolphinCoder
codellama_7b_DolphinCoder
mistral_7b_WizardLMEvolInstruct70k
llama2_SQL_Answers_finetuned
llama7B_alpaca-lora
gemma-2-2b-hindi-translator
sd21_anime_finetuning
opt125M_alpaca
OpenPlatypus_Falcon_7b
Mistral-7B-v0.1-Dolly-15k
CodeAlpaca_LLAMA2_7B
sdxl_car_finetuning
OpenPlatypus_LLAMA2_7b
opt1.3B_codeinstruct
falcon_7b_DolphinCoder
llama2_7b_DolphinCoder
codellama7b_codealpaca20k
Llama-3_1-8B-Instruct-orca-ORPO
Model Used: meta-llama/Meta-Llama-3.1-8B-Instruct

Dataset: Intel/orca_dpo_pairs

The Intel Orca DPO Pairs dataset is a specialized version of the OpenOrca dataset, which includes ~1M GPT-4 completions and ~3.2M GPT-3.5 completions. It is tabularized to align with the distributions in the Orca paper and focuses on preference optimization by clearly labeling which responses are preferred and which are rejected. It is primarily used in natural language processing for training and evaluation.

This finetuning run was performed using MonsterAPI's LLM finetuner with ORPO (Odds Ratio Preference Optimization) to enhance preference alignment.

- Completed in 1 hour and 39 minutes for 1 epoch.
- Cost `$2.69` for the entire process.
- Epochs: 1
- Cost Per Epoch: $2.69
- Total Finetuning Cost: $2.69
- Model Path: meta-llama/Meta-Llama-3.1-8B-Instruct
- Learning Rate: 0.001
- Data Split: 90% train / 10% validation
- Gradient Accumulation Steps: 16
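A run like the one above can be sketched with Hugging Face TRL's `ORPOTrainer`. This is only an approximation of the hosted MonsterAPI job, whose internal pipeline is not public: the learning rate, gradient accumulation steps, epoch count, and 90/10 split come from the card, while the batch size, `beta`, output directory, and dataset column mapping are assumptions.

```python
# Hedged sketch of an ORPO preference-finetuning run with Hugging Face TRL.
# Hyperparameters marked "from the card" mirror this model card; everything
# else (batch size, beta, output_dir, column mapping) is an assumption.
from datasets import load_dataset
from transformers import AutoModelForCausalLM, AutoTokenizer
from trl import ORPOConfig, ORPOTrainer

model_id = "meta-llama/Meta-Llama-3.1-8B-Instruct"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

# Intel/orca_dpo_pairs provides preference triples; depending on the dataset
# schema, columns may need renaming to the prompt/chosen/rejected fields
# that ORPOTrainer expects.
dataset = load_dataset("Intel/orca_dpo_pairs", split="train")
split = dataset.train_test_split(test_size=0.1, seed=42)  # 90% train / 10% validation

config = ORPOConfig(
    output_dir="llama3_1-8b-instruct-orpo",  # assumed
    num_train_epochs=1,                      # from the card
    learning_rate=1e-3,                      # from the card
    gradient_accumulation_steps=16,          # from the card
    per_device_train_batch_size=1,           # assumed
    beta=0.1,                                # ORPO odds-ratio weight (TRL default)
)

trainer = ORPOTrainer(
    model=model,
    args=config,
    train_dataset=split["train"],
    eval_dataset=split["test"],
    processing_class=tokenizer,
)
trainer.train()
```

Unlike DPO, ORPO folds the odds-ratio preference penalty into the standard supervised loss, so no separate reference model needs to be loaded, which helps keep a single-epoch run on an 8B model this cheap.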