mmnga

293 models • 3 total models in database

Llama-3-70B-japanese-suzume-vector-v0.1

This is an experimental model. The difference between lightblue/suzume-llama-3-8B-japanese and meta-llama/Meta-Llama-3-8B-Instruct was extracted with the chat-vector approach and applied to meta-llama/Meta-Llama-3-70B-Instruct.

1. Create the difference between `meta-llama/Meta-Llama-3-8B-Instruct` and `lightblue/suzume-llama-3-8B-japanese`.
2. Since the shapes are different, upsample the difference for `meta-llama/Meta-Llama-3-70B-Instruct`.
3. Apply the first 8 layers and the last 8 layers as they are.
4. Stretch the middle layers and apply them.
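The four steps above can be sketched in plain Python as follows. This is only an illustration under stated assumptions, not the author's actual script: the card does not say how the difference is upsampled or how the middle layers are stretched, so nearest-neighbor resizing and a proportional layer mapping are assumed here, and the function names `chat_vector`, `upsample_delta`, and `map_layers` are hypothetical. The layer counts (32 for the 8B model, 80 for the 70B model) come from the published Llama 3 configurations.

```python
def chat_vector(base, tuned):
    """Step 1: element-wise difference (tuned - base) of two same-shape matrices."""
    return [[t - b for b, t in zip(b_row, t_row)]
            for b_row, t_row in zip(base, tuned)]


def upsample_delta(delta, rows, cols):
    """Step 2 (assumed method): nearest-neighbor resize of a 2-D delta to rows x cols."""
    src_rows, src_cols = len(delta), len(delta[0])
    return [[delta[r * src_rows // rows][c * src_cols // cols]
             for c in range(cols)]
            for r in range(rows)]


def map_layers(n_src, n_dst, keep=8):
    """Steps 3 and 4: for each destination layer, pick the source layer whose delta is applied.

    The first and last `keep` layers map one-to-one; the middle block of the
    source is stretched proportionally over the middle block of the destination
    (the proportional rule is an assumption).
    """
    if n_src < 2 * keep or n_dst < 2 * keep:
        raise ValueError("both models need at least 2 * keep layers")
    mid_src = n_src - 2 * keep
    mid_dst = n_dst - 2 * keep
    mapping = []
    for d in range(n_dst):
        if d < keep:                      # front layers: applied as they are
            mapping.append(d)
        elif d >= n_dst - keep:           # back layers: applied as they are
            mapping.append(n_src - (n_dst - d))
        else:                             # middle layers: stretched
            mapping.append(keep + (d - keep) * mid_src // mid_dst)
    return mapping


if __name__ == "__main__":
    mapping = map_layers(32, 80)          # 8B has 32 layers, 70B has 80
    print(mapping[:8], mapping[8:16], mapping[-8:])
```

With these assumptions, each of the 16 middle layers of the 8B model supplies the difference for four consecutive middle layers of the 70B model, while the outer 8 layers on each side are matched one-to-one.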

Llama-3-70B-japanese-suzume-vector-v0.1

plamo-2-translate-gguf

Llama-3-ELYZA-JP-8B-gguf

cyberagent-DeepSeek-R1-Distill-Qwen-14B-Japanese-gguf

ELYZA-japanese-Llama-2-7b-fast-instruct-gguf

Ninja-v1-NSFW-128k-gguf

Vecteus-v1-gguf

RakutenAI-2.0-mini-instruct-gguf

pfnet-nekomata-14b-pfn-qfin-gguf

cyberagent-Mistral-Nemo-Japanese-Instruct-2408-gguf

qwen2.5-bakeneko-32b-instruct-v2-gguf

Meta-Llama-3-70B-Instruct-gguf

Llama-3.3-70B-Instruct-gguf

tokyotech-llm-Llama-3.1-Swallow-8B-Instruct-v0.1-gguf

YuisekinAIEvol-Mistral-7B-ja-math-v0.1.1-gguf

HODACHI-Borea-Phi-3.5-mini-Instruct-Common-gguf

llm-jp-3.1-1.8b-instruct4-gguf

pfnet-Llama3-Preferred-MedSwallow-70B-gguf

Marco-o1-gguf

lightblue-suzume-llama-3-8B-multilingual-gguf

Aratako-Qwen3-30B-A3B-NSFW-JP-gguf

codegemma-1.1-2b-gguf

ELYZA-japanese-Llama-2-7b-instruct-gguf

DataPilot-ArrowPro-7B-RobinHood-gguf

ELYZA-Thinking-1.0-Qwen-32B-gguf

tokyotech-llm-Llama-3.1-Swallow-8B-Instruct-v0.3-gguf

Llama-3.1-70B-Instruct-gguf

haqishen-Llama-3-8B-Japanese-Instruct-gguf

TinySwallow-1.5B-Instruct-gguf

tokyotech-llm-Llama-3.1-Swallow-70B-Instruct-v0.1-gguf

Moonlight-16B-A3B-Instruct-gguf

cyberagent-DeepSeek-R1-Distill-Qwen-32B-Japanese-gguf

Llama-3-Swallow-70B-Instruct-v0.1-gguf

umiyuki-Umievo-itr012-Gleipnir-7B-gguf

aya-23-8B-gguf

Llama-3.1-8B-Instruct-gguf

EZO2.5-gemma-3-12b-it-Preview-gguf

Llama-3.1-8B-EZO-1.1-it-gguf

lightblue-DeepSeek-R1-Distill-Qwen-7B-Japanese-gguf

Phi-4-mini-instruct-gguf

codegemma-1.1-7b-it-gguf

HODACHI-EZO-Common-T2-2B-gemma-2-it-gguf

DataPilot-ArrowPro-7B-KUJIRA-gguf

Llama-3.1-Swallow-8B-Instruct-v0.5-gguf

Llama-3.1-70B-Japanese-Instruct-2407-gguf

aya-23-35B-gguf

sarashina2.2-0.5b-instruct-v0.1-gguf

llm-jp-13b-instruct-full-dolly-oasst-v1.0-gguf

ELYZA-japanese-Llama-2-13b-fast-instruct-gguf

ELYZA-japanese-CodeLlama-7b-instruct-gguf

webbigdata-ALMA-7B-Ja-V2-gguf

Fugaku-LLM-13B-instruct-gguf

tokyotech-llm-Swallow-13b-instruct-v0.1-gguf

aixsatoshi-Ex-karakuri-8x12B-chat-v1-gguf

gemma-2b-it-gguf

Llama-3-Swallow-8B-Instruct-v0.1-gguf

Light-R1-32B-gguf

HODACHI-EZO-Humanities-9B-gemma-2-it-gguf

karakuri-lm-32b-thinking-2501-exp-gguf

gemma-2-2b-it-gguf

RakutenAI-2.0-8x7B-instruct-gguf

lightblue-qarasu-14B-chat-plus-unleashed-gguf

rinna-nekomata-7b-instruction-gguf

rinna-llama-3-youko-70b-instruct-gguf

Phi-3-medium-128k-instruct-gguf

Llama-3.1-70B-EZO-1.1-it-gguf

ELYZA-Shortcut-1.0-Qwen-7B-gguf

tokyotech-llm-Llama-3.3-Swallow-70B-Instruct-v0.4-gguf

RakutenAI-7B-chat-gguf

RakutenAI-7B-instruct-gguf

ABEJA-QwQ32b-Reasoning-Japanese-v1.0-gguf

Reflection-Llama-3.1-70B-gguf

Qwen3-4B-Instruct-2507-gguf

Llama-4-Scout-17B-16E-Instruct-gguf

Ninja-v1-NSFW-gguf

japanese-stablelm-instruct-gamma-7b-gguf

japanese-stablelm-2-instruct-1_6b-gguf

Phi-3-mini-128k-instruct-gguf

ELYZA-japanese-Llama-2-7b-fast-gguf