# YanoljaNEXT-Rosetta-4B-2510
This model is a fine-tuned version of `google/gemma-3-4b-pt`. As it is intended solely for text generation, we extracted and utilized only the `Gemma3ForCausalLM` component from the original architecture. Unlike our previous EEVE models, this model does not feature an expanded tokenizer.

- Model Name: `yanolja/YanoljaNEXT-Rosetta-4B-2510`
- Base Model: `google/gemma-3-4b-pt`

This model is a 4-billion-parameter, decoder-only language model built on the Gemma3 architecture and fine-tuned by Yanolja NEXT. It is specifically designed to translate structured data (JSON format) while preserving the original data structure.

The model was trained on a multilingual dataset covering the following languages equally: Arabic, Bulgarian, Chinese, Czech, Danish, Dutch, English, Finnish, French, German, Greek, Gujarati, Hebrew, Hindi, Hungarian, Indonesian, Italian, Japanese, Korean, Persian, Polish, Portuguese, Romanian, Russian, Slovak, Spanish, Swedish, Tagalog, Thai, Turkish, Ukrainian, and Vietnamese. While optimized for these languages, it may also perform well on other languages supported by the base Gemma3 model.

You can use this model with the `transformers` library. The model outputs the final translation in JSON format when appropriate, or plain text for simple translations.

## Training Data

The translation datasets were synthesized from the FineWeb corpora:

- FineWeb Edu
- FineWeb2

The model was fine-tuned on this synthetic multilingual translation data to optimize performance across the supported language pairs.
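As a minimal sketch of the `transformers` usage described above: the system-prompt wording and the `build_messages`/`translate` helpers below are illustrative assumptions, not the card's official recipe.

```python
import json

def build_messages(source: dict, target_language: str) -> list:
    """Build a chat-style request asking for a structure-preserving translation.

    The prompt wording is a hypothetical example; adjust it to taste.
    """
    system = (
        f"Translate the values of the user's JSON into {target_language}. "
        "Keep every key and the overall structure unchanged."
    )
    return [
        {"role": "system", "content": system},
        {"role": "user", "content": json.dumps(source, ensure_ascii=False)},
    ]

def translate(source: dict, target_language: str,
              model_id: str = "yanolja/YanoljaNEXT-Rosetta-4B-2510") -> str:
    """Run the model on a JSON payload.

    Requires `transformers` and `torch`, plus network access to download the
    checkpoint, so it is defined here but not executed.
    """
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")
    inputs = tokenizer.apply_chat_template(
        build_messages(source, target_language),
        add_generation_prompt=True,
        return_tensors="pt",
    ).to(model.device)
    output = model.generate(inputs, max_new_tokens=512)
    return tokenizer.decode(output[0][inputs.shape[-1]:],
                            skip_special_tokens=True)
```

With a downloaded checkpoint, something like `translate({"title": "Ocean-view room"}, "Korean")` should return the same JSON shape with translated values.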
The following chrF++ scores on WMT24++ demonstrate the model's competitive performance against other state-of-the-art translation models on English-to-Korean translation:

| Model | chrF++ (WMT24++) |
|------------------------------------|--------------|
| google/gemini-2.5-flash-lite | 35.23 |
| yanolja/YanoljaNEXT-Rosetta-4B-2510 | 35.09 |
| yanolja/YanoljaNEXT-Rosetta-12B | 34.75 |
| yanolja/YanoljaNEXT-Rosetta-20B | 33.87 |
| google/gemini-2.0-flash-001 | 33.81 |
| openai/gpt-oss-120b | 31.51 |
| yanolja/YanoljaNEXT-Rosetta-4B | 31.31 |
| openai/gpt-4.1-nano | 31.15 |
| Qwen/Qwen3-235B-A22B-Instruct-2507-FP8 | 31.02 |
| openai/gpt-oss-20b | 30.56 |
| google/gemma-3-27b-it | 30.05 |
| google/gemma-3-4b-pt | 27.53 |

YanoljaNEXT-Rosetta-4B-2510 achieves competitive translation quality while maintaining the efficiency of a 4B-parameter model. Scores for the other language pairs can be found in the WMT24++ Evaluation Results.

This model is intended for translating structured data (JSON format) while preserving the original structure. It is particularly well suited to tasks such as localizing product catalogs, translating hotel reviews, or handling other structured content that requires accurate translation.

## Limitations

The model is primarily optimized for processing JSON data. Its performance on unstructured text or other data formats may vary. In some cases, the model may produce invalid JSON, repetitive output, or inaccurate translations.

## License

This model is released under the Gemma license, inherited from its base model, `google/gemma-3-4b-pt`. Please consult the official Gemma license terms for detailed usage guidelines.

## Acknowledgments

This work was supported by a Korea Creative Content Agency (KOCCA) grant funded by the Ministry of Culture, Sports and Tourism (MCST) in 2025 (Project Name: Cultivating Masters and Doctoral Experts to Lead Digital-Tech Tourism; Project Number: RS-2024-00442006; Contribution Rate: 100%).
This work utilizes several models and datasets. We would like to acknowledge the original authors for their valuable contributions to the field.
# YanoljaNEXT-Rosetta-4B-2510-GGUF
This model is a fine-tuned version of `google/gemma-3-4b-pt`. As it is intended solely for text generation, we extracted and utilized only the `Gemma3ForCausalLM` component from the original architecture. Unlike our previous EEVE models, this model does not feature an expanded tokenizer.

- Model Name: `yanolja/YanoljaNEXT-Rosetta-4B-2510`
- Base Model: `google/gemma-3-4b-pt`

This folder contains ready-to-run GGUF files for llama.cpp.

- `BF16/YanoljaNEXT-Rosetta-4B-2510-bf16.gguf`: full-precision reference model
- Quantized variants (choose one based on your device and quality needs):
  - K-family: `Q3K{S,L}`, `Q5K{S,M}`, `Q6K`, `Q80`
  - IQ-family: `IQ2{S,M}`, `IQ3{XXS,XS,S}`, `IQ4{XS}`
- For many types there are matching `IMX` folders. The files there were produced with an activation matrix (`imatrix.gguf`) and usually offer better quality at the same size. In this release, `IQ2{S,M}` and `IQ3{XXS,XS}` are IMX-only.

This model is a 4-billion-parameter, decoder-only language model built on the Gemma3 architecture and fine-tuned by Yanolja NEXT. It is specifically designed to translate structured data (JSON format) while preserving the original data structure.

The model was trained on a multilingual dataset covering the following languages equally: Arabic, Bulgarian, Chinese, Czech, Danish, Dutch, English, Finnish, French, German, Greek, Gujarati, Hebrew, Hindi, Hungarian, Indonesian, Italian, Japanese, Korean, Persian, Polish, Portuguese, Romanian, Russian, Slovak, Spanish, Swedish, Tagalog, Thai, Turkish, Ukrainian, and Vietnamese. While optimized for these languages, it may also perform well on other languages supported by the base Gemma3 model.

Use a recent build of `llama.cpp` that supports Gemma 3 models. Pick any GGUF file from this folder (a quantized variant is recommended for most users). The model is optimized to output structured JSON for translations when appropriate.
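For a scripted run, one option is the `llama-cpp-python` bindings. The sketch below is an assumption about the wiring, not an official recipe; the model path and prompt wording are placeholders.

```python
def make_system_prompt(target_language: str) -> str:
    """Hypothetical system prompt asking for a structure-preserving translation."""
    return (
        f"Translate the values of the user's JSON into {target_language}. "
        "Keep every key and the overall structure unchanged."
    )

def translate_with_gguf(gguf_path: str, source_json: str,
                        target_language: str) -> str:
    """Run a quantized GGUF via llama-cpp-python (pip install llama-cpp-python).

    Defined but not executed here, since it needs a downloaded model file.
    """
    from llama_cpp import Llama

    llm = Llama(model_path=gguf_path, n_ctx=4096)
    out = llm.create_chat_completion(
        messages=[
            {"role": "system", "content": make_system_prompt(target_language)},
            {"role": "user", "content": source_json},
        ],
        temperature=0.0,
    )
    return out["choices"][0]["message"]["content"]
```

If your build supports GPU offload, passing `n_gpu_layers=-1` to `Llama(...)` is a common way to speed this up.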
Import any of the `.gguf` files into your GUI of choice (LM Studio, KoboldCpp, text-generation-webui) and select chat mode; recent tools automatically use the chat template embedded in the GGUF.

## Training Data

The translation datasets were synthesized from the FineWeb corpora:

- FineWeb Edu
- FineWeb2

The model was fine-tuned on this synthetic multilingual translation data to optimize performance across the supported language pairs.

The following chrF++ scores on WMT24++ demonstrate the model's competitive performance against other state-of-the-art translation models on English-to-Korean translation:

| Model | chrF++ (WMT24++) |
|------------------------------------|--------------|
| google/gemini-2.5-flash-lite | 35.23 |
| yanolja/YanoljaNEXT-Rosetta-4B-2510 | 35.09 |
| yanolja/YanoljaNEXT-Rosetta-12B | 34.75 |
| yanolja/YanoljaNEXT-Rosetta-20B | 33.87 |
| google/gemini-2.0-flash-001 | 33.81 |
| openai/gpt-oss-120b | 31.51 |
| yanolja/YanoljaNEXT-Rosetta-4B | 31.31 |
| openai/gpt-4.1-nano | 31.15 |
| Qwen/Qwen3-235B-A22B-Instruct-2507-FP8 | 31.02 |
| openai/gpt-oss-20b | 30.56 |
| google/gemma-3-27b-it | 30.05 |
| google/gemma-3-4b-pt | 27.53 |

YanoljaNEXT-Rosetta-4B-2510 achieves competitive translation quality while maintaining the efficiency of a 4B-parameter model. Scores for the other language pairs can be found in the WMT24++ Evaluation Results.

This model is intended for translating structured data (JSON format) while preserving the original structure. It is particularly well suited to tasks such as localizing product catalogs, translating hotel reviews, or handling other structured content that requires accurate translation.

## Limitations

The model is primarily optimized for processing JSON data. Its performance on unstructured text or other data formats may vary. In some cases, the model may produce invalid JSON, repetitive output, or inaccurate translations.
## License

This model is released under the Gemma license, inherited from its base model, `google/gemma-3-4b-pt`. Please consult the official Gemma license terms for detailed usage guidelines.

## Acknowledgments

This work was supported by a Korea Creative Content Agency (KOCCA) grant funded by the Ministry of Culture, Sports and Tourism (MCST) in 2025 (Project Name: Cultivating Masters and Doctoral Experts to Lead Digital-Tech Tourism; Project Number: RS-2024-00442006; Contribution Rate: 100%). This work utilizes several models and datasets. We would like to acknowledge the original authors for their valuable contributions to the field.
# YanoljaNEXT-Rosetta-12B-2510-GGUF
This model is a fine-tuned version of `google/gemma-3-12b-pt`. As it is intended solely for text generation, we extracted and utilized only the `Gemma3ForCausalLM` component from the original architecture. Unlike our previous EEVE models, this model does not feature an expanded tokenizer.

- Model Name: `yanolja/YanoljaNEXT-Rosetta-12B-2510`
- Base Model: `google/gemma-3-12b-pt`

This folder contains ready-to-run GGUF files for llama.cpp.

- `BF16/YanoljaNEXT-Rosetta-12B-2510-bf16.gguf`: full-precision reference model
- Quantized variants (choose one based on your device and quality needs):
  - K-family: `Q2K`, `Q2KS`, `Q3K{S,M}`, `Q4K{S,M}`, `Q5K{S,M}`, `Q6K`, `Q80`
  - IQ-family: `IQ1{S,M}`, `IQ2{XXS,XS,S,M}`, `IQ3{XXS,XS,S,M}`, `IQ4{XS,NL}`
- `IMX` variants were produced with an activation matrix (`imatrix.gguf`) and often offer better quality at the same size. In this release, `Q2K{,S}`, `IQ1`, and all `IQ2` are IMX-only; for `IQ3`, `XXS` and `XS` are IMX-only.

This model is a 12-billion-parameter, decoder-only language model built on the Gemma3 architecture and fine-tuned by Yanolja NEXT. It is specifically designed to translate structured data (JSON format) while preserving the original data structure.

The model was trained on a multilingual dataset covering the following languages equally: Arabic, Bulgarian, Chinese, Czech, Danish, Dutch, English, Finnish, French, German, Greek, Gujarati, Hebrew, Hindi, Hungarian, Indonesian, Italian, Japanese, Korean, Persian, Polish, Portuguese, Romanian, Russian, Slovak, Spanish, Swedish, Tagalog, Thai, Turkish, Ukrainian, and Vietnamese. While optimized for these languages, it may also perform well on other languages supported by the base Gemma3 model.

Use a recent build of `llama.cpp` that supports Gemma 3 models. Pick any GGUF file from this folder (a quantized variant is recommended for most users). The model is optimized to output structured JSON for translations when appropriate.
Import any of the `.gguf` files into your GUI of choice (LM Studio, KoboldCpp, text-generation-webui) and select chat mode; recent tools automatically use the chat template embedded in the GGUF.

## Training Data

The translation datasets were synthesized from the FineWeb corpora:

- FineWeb Edu
- FineWeb2

The model was fine-tuned on this synthetic multilingual translation data to optimize performance across the supported language pairs.

The following chrF++ scores on WMT24++ demonstrate the model's competitive performance against other state-of-the-art translation models on English-to-Korean translation:

| Model | chrF++ (WMT24++) |
|------------------------------------|--------------|
| yanolja/YanoljaNEXT-Rosetta-12B-2510 | 37.36 |
| openai/gpt-4o | 36.08 |
| google/gemini-2.5-flash | 35.25 |
| yanolja/YanoljaNEXT-Rosetta-12B | 34.75 |
| yanolja/YanoljaNEXT-Rosetta-20B | 33.87 |
| google/gemini-2.0-flash-001 | 33.81 |
| openai/gpt-oss-120b | 31.51 |
| google/gemma-3-27b-it | 30.05 |
| google/gemma-3-12b-pt | 29.31 |

YanoljaNEXT-Rosetta-12B-2510 achieves competitive translation quality while maintaining the efficiency of a 12B-parameter model. Scores for the other language pairs can be found in the WMT24++ Evaluation Results.

This model is intended for translating structured data (JSON format) while preserving the original structure. It is particularly well suited to tasks such as localizing product catalogs, translating hotel reviews, or handling other structured content that requires accurate translation.

## Limitations

The model is primarily optimized for processing JSON data. Its performance on unstructured text or other data formats may vary. In some cases, the model may produce invalid JSON, repetitive output, or inaccurate translations.

## License

This model is released under the Gemma license, inherited from its base model, `google/gemma-3-12b-pt`. Please consult the official Gemma license terms for detailed usage guidelines.
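Another common setup is serving a GGUF with llama.cpp's `llama-server` and calling its OpenAI-compatible `/v1/chat/completions` endpoint. The request builder below is an illustrative sketch; the host, port, and prompt wording are assumptions.

```python
import json
import urllib.request

def build_request(source: dict, target_language: str,
                  url: str = "http://localhost:8080/v1/chat/completions"):
    """Build an OpenAI-style chat request for a local llama-server instance."""
    payload = {
        "messages": [
            {
                "role": "system",
                "content": (
                    f"Translate the values of the user's JSON into {target_language}. "
                    "Keep every key and the overall structure unchanged."
                ),
            },
            {"role": "user", "content": json.dumps(source, ensure_ascii=False)},
        ],
        "temperature": 0.0,
    }
    return urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )

# With a server running (e.g. `llama-server -m <model>.gguf`), send it like:
#   with urllib.request.urlopen(build_request({"title": "Hi"}, "Korean")) as r:
#       print(json.load(r)["choices"][0]["message"]["content"])
```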
## Acknowledgments

This work was supported by a Korea Creative Content Agency (KOCCA) grant funded by the Ministry of Culture, Sports and Tourism (MCST) in 2025 (Project Name: Cultivating Masters and Doctoral Experts to Lead Digital-Tech Tourism; Project Number: RS-2024-00442006; Contribution Rate: 100%). This work utilizes several models and datasets. We would like to acknowledge the original authors for their valuable contributions to the field.
# YanoljaNEXT-EEVE-2.8B
If you're passionate about the field of Large Language Models and wish to exchange knowledge and insights, we warmly invite you to join our Discord server. Note that Korean is the primary language used there. The LLM landscape is evolving rapidly, and without active sharing, our collective knowledge risks becoming outdated swiftly. Let's collaborate and drive greater impact together! Join us here: Discord Link.

## Our Dedicated Team (Alphabetical Order)

| Research | Engineering | Product Management | UX Design |
|-----------------|-----------------|--------------------|---------------|
| Myeongho Jeong | Geon Kim | Bokyung Huh | Eunsue Choi |
| Seungduk Kim | Rifqi Alfi | | |
| Seungtaek Choi | Sanghoon Han | | |
| | Suhyun Kang | | |

This model is a Korean vocabulary-extended version of microsoft/phi-2, fine-tuned on various Korean web-crawled datasets available on HuggingFace. Our approach was to expand the model's understanding of Korean by pre-training the embeddings for the new tokens and partially fine-tuning the `lm_head` embeddings for the already existing tokens, while preserving the original parameters of the base model.

To adapt foundational models from English to Korean, we use subword-based embedding with a seven-stage training process involving parameter freezing. This approach progressively trains from input embeddings to full parameters, efficiently extending the model's vocabulary to include Korean. Our method enhances the model's cross-linguistic applicability by carefully integrating new linguistic tokens, focusing on causal language modeling pre-training. We leverage the inherent capabilities of foundational models trained on English to efficiently transfer knowledge and reasoning to Korean, optimizing the adaptation process.

For more details, please refer to our technical report: Efficient and Effective Vocabulary Expansion Towards Multilingual Large Language Models.
Keep in mind that this model hasn't been fine-tuned with instruction-based training. While it excels in Korean language tasks, we advise careful consideration and further training for specific applications.

Our model's training was comprehensive and diverse:

- Vocabulary Expansion: We meticulously selected 8,960 Korean tokens based on their frequency in our Korean web corpus. This process involved multiple rounds of tokenizer training, manual curation, and token frequency analysis, ensuring a rich and relevant vocabulary for the model.
  1. Initial Tokenizer Training: We trained an intermediate tokenizer on a Korean web corpus with a vocabulary of 40,000 tokens.
  2. Extraction of New Korean Tokens: From the intermediate tokenizer, we identified all Korean tokens not present in the base model's tokenizer.
  3. Manual Tokenizer Construction: We then built the target tokenizer, focusing on these new Korean tokens.
  4. Frequency Analysis: Using the target tokenizer, we processed a 100 GB Korean corpus to count each token's frequency.
  5. Refinement of Token List: We removed tokens appearing fewer than 6,000 times, keeping enough token occurrences to train the model later.
  6. Inclusion of Single-Letter Characters: We counted the Korean single-letter characters still missing from the target tokenizer and added those that appeared more than 6,000 times.
  7. Iterative Refinement: We repeated steps 2 to 6 until there were no more tokens to drop or add.
  8. Training Bias Towards New Tokens: We biased the training data toward texts containing the new tokens, for more effective learning.

This rigorous approach ensured a comprehensive and contextually rich Korean vocabulary for the model.
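Steps 4 and 5 above (frequency counting and pruning) can be sketched roughly as follows. The tokenizer callable and corpus iterator are stand-ins; only the counting and threshold logic reflects the procedure described in the card.

```python
from collections import Counter
from typing import Callable, Iterable, List

MIN_FREQUENCY = 6_000  # threshold named in the card

def count_token_frequencies(corpus: Iterable[str],
                            tokenize: Callable[[str], List[str]]) -> Counter:
    """Step 4: count how often each token occurs across the corpus."""
    counts = Counter()
    for document in corpus:
        counts.update(tokenize(document))
    return counts

def prune_rare_tokens(candidates: List[str], counts: Counter) -> List[str]:
    """Step 5: drop candidate tokens seen fewer than MIN_FREQUENCY times."""
    return [t for t in candidates if counts[t] >= MIN_FREQUENCY]
```

In the real pipeline, `tokenize` would be the target tokenizer and `corpus` a streaming reader over the 100 GB Korean corpus; steps 2 to 6 then repeat until the candidate list stabilizes.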
# YanoljaNEXT-Rosetta-12B-2510
This model is a fine-tuned version of `google/gemma-3-12b-pt`. As it is intended solely for text generation, we extracted and utilized only the `Gemma3ForCausalLM` component from the original architecture. Unlike our previous EEVE models, this model does not feature an expanded tokenizer.

- Model Name: `yanolja/YanoljaNEXT-Rosetta-12B-2510`
- Base Model: `google/gemma-3-12b-pt`

This model is a 12-billion-parameter, decoder-only language model built on the Gemma3 architecture and fine-tuned by Yanolja NEXT. It is specifically designed to translate structured data (JSON format) while preserving the original data structure.

The model was trained on a multilingual dataset covering the following languages equally: Arabic, Bulgarian, Chinese, Czech, Danish, Dutch, English, Finnish, French, German, Greek, Gujarati, Hebrew, Hindi, Hungarian, Indonesian, Italian, Japanese, Korean, Persian, Polish, Portuguese, Romanian, Russian, Slovak, Spanish, Swedish, Tagalog, Thai, Turkish, Ukrainian, and Vietnamese. While optimized for these languages, it may also perform well on other languages supported by the base Gemma3 model.

You can use this model with the `transformers` library. The model outputs the final translation in JSON format when appropriate, or plain text for simple translations.

## Training Data

The translation datasets were synthesized from the FineWeb corpora:

- FineWeb Edu
- FineWeb2

The model was fine-tuned on this synthetic multilingual translation data to optimize performance across the supported language pairs.
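Since the model returns JSON when appropriate and plain text otherwise, a small post-processing helper can normalize both cases. This is an illustrative sketch, not part of the model release; the code-fence stripping is a defensive assumption.

```python
import json
from typing import Union

def parse_translation(raw_output: str) -> Union[dict, list, str]:
    """Return parsed JSON when the model emitted valid JSON,
    otherwise fall back to the raw plain-text translation."""
    text = raw_output.strip()
    # Strip a ```json fence if the model wrapped its answer in one.
    if text.startswith("```"):
        text = text.strip("`")
        text = text[len("json"):] if text.startswith("json") else text
    try:
        return json.loads(text)
    except json.JSONDecodeError:
        return raw_output.strip()
```

Callers then get a `dict`/`list` for structured translations and a plain `str` for simple ones, without branching on the model's output mode themselves.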
The following chrF++ scores on WMT24++ demonstrate the model's competitive performance against other state-of-the-art translation models on English-to-Korean translation:

| Model | chrF++ (WMT24++) |
|------------------------------------|--------------|
| yanolja/YanoljaNEXT-Rosetta-12B-2510 | 37.36 |
| openai/gpt-4o | 36.08 |
| google/gemini-2.5-flash | 35.25 |
| yanolja/YanoljaNEXT-Rosetta-12B | 34.75 |
| yanolja/YanoljaNEXT-Rosetta-20B | 33.87 |
| google/gemini-2.0-flash-001 | 33.81 |
| openai/gpt-oss-120b | 31.51 |
| google/gemma-3-27b-it | 30.05 |
| google/gemma-3-12b-pt | 29.31 |

YanoljaNEXT-Rosetta-12B-2510 achieves competitive translation quality while maintaining the efficiency of a 12B-parameter model. Scores for the other language pairs can be found in the WMT24++ Evaluation Results.

This model is intended for translating structured data (JSON format) while preserving the original structure. It is particularly well suited to tasks such as localizing product catalogs, translating hotel reviews, or handling other structured content that requires accurate translation.

## Limitations

The model is primarily optimized for processing JSON data. Its performance on unstructured text or other data formats may vary. In some cases, the model may produce invalid JSON, repetitive output, or inaccurate translations.

## License

This model is released under the Gemma license, inherited from its base model, `google/gemma-3-12b-pt`. Please consult the official Gemma license terms for detailed usage guidelines.

## Acknowledgments

This work was supported by a Korea Creative Content Agency (KOCCA) grant funded by the Ministry of Culture, Sports and Tourism (MCST) in 2025 (Project Name: Cultivating Masters and Doctoral Experts to Lead Digital-Tech Tourism; Project Number: RS-2024-00442006; Contribution Rate: 100%). This work utilizes several models and datasets. We would like to acknowledge the original authors for their valuable contributions to the field.
# YanoljaNEXT-EEVE-10.8B
If you're passionate about the field of Large Language Models and wish to exchange knowledge and insights, we warmly invite you to join our Discord server. Note that Korean is the primary language used there. The LLM landscape is evolving rapidly, and without active sharing, our collective knowledge risks becoming outdated swiftly. Let's collaborate and drive greater impact together! Join us here: Discord Link.

## Our Dedicated Team (Alphabetical Order)

| Research | Engineering | Product Management | UX Design |
|-----------------|-----------------|--------------------|---------------|
| Myeongho Jeong | Geon Kim | Bokyung Huh | Eunsue Choi |
| Seungduk Kim | Rifqi Alfi | | |
| Seungtaek Choi | Sanghoon Han | | |
| | Suhyun Kang | | |

This model is a Korean vocabulary-extended version of upstage/SOLAR-10.7B-v1.0, fine-tuned on various Korean web-crawled datasets available on HuggingFace. Our approach was to expand the model's understanding of Korean by pre-training the embeddings for the new tokens and partially fine-tuning the `lm_head` embeddings for the already existing tokens, while preserving the original parameters of the base model.

To adapt foundational models from English to Korean, we use subword-based embedding with a seven-stage training process involving parameter freezing. This approach progressively trains from input embeddings to full parameters, efficiently extending the model's vocabulary to include Korean. Our method enhances the model's cross-linguistic applicability by carefully integrating new linguistic tokens, focusing on causal language modeling pre-training. We leverage the inherent capabilities of foundational models trained on English to efficiently transfer knowledge and reasoning to Korean, optimizing the adaptation process.

For more details, please refer to our technical report: Efficient and Effective Vocabulary Expansion Towards Multilingual Large Language Models.
Keep in mind that this model hasn't been fine-tuned with instruction-based training. While it excels in Korean language tasks, we advise careful consideration and further training for specific applications.

Our model's training was comprehensive and diverse:

- Vocabulary Expansion: We meticulously selected 8,960 Korean tokens based on their frequency in our Korean web corpus. This process involved multiple rounds of tokenizer training, manual curation, and token frequency analysis, ensuring a rich and relevant vocabulary for the model.
  1. Initial Tokenizer Training: We trained an intermediate tokenizer on a Korean web corpus with a vocabulary of 40,000 tokens.
  2. Extraction of New Korean Tokens: From the intermediate tokenizer, we identified all Korean tokens not present in the original SOLAR tokenizer.
  3. Manual Tokenizer Construction: We then built the target tokenizer, focusing on these new Korean tokens.
  4. Frequency Analysis: Using the target tokenizer, we processed a 100 GB Korean corpus to count each token's frequency.
  5. Refinement of Token List: We removed tokens appearing fewer than 6,000 times, keeping enough token occurrences to train the model later.
  6. Inclusion of Single-Letter Characters: We counted the Korean single-letter characters still missing from the target tokenizer and added those that appeared more than 6,000 times.
  7. Iterative Refinement: We repeated steps 2 to 6 until there were no more tokens to drop or add.
  8. Training Bias Towards New Tokens: We biased the training data toward texts containing the new tokens, for more effective learning.

This rigorous approach ensured a comprehensive and contextually rich Korean vocabulary for the model.
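Step 6 above (filling in missing single-letter Korean characters) can be sketched like this; the vocabulary set and frequency table are stand-ins for the real tokenizer vocabulary and corpus counts.

```python
from collections import Counter
from typing import List, Set

MIN_FREQUENCY = 6_000  # threshold named in the card

def missing_hangul_singles(vocab: Set[str], counts: Counter) -> List[str]:
    """Find single Hangul syllables (U+AC00..U+D7A3) absent from the
    vocabulary but frequent enough in the corpus to be worth adding."""
    singles = (chr(cp) for cp in range(0xAC00, 0xD7A4))
    return [ch for ch in singles
            if ch not in vocab and counts[ch] > MIN_FREQUENCY]
```

Each character this returns would be appended to the target tokenizer before the next refinement pass (step 7).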
# YanoljaNEXT-Rosetta-4B
This model is a fine-tuned version of `google/gemma-3-4b-pt`. As it is intended solely for text generation, we extracted and utilized only the `Gemma3ForCausalLM` component from the original architecture. Unlike our previous EEVE models, this model does not feature an expanded tokenizer.

- Model Name: `yanolja/YanoljaNEXT-Rosetta-4B`
- Base Model: `google/gemma-3-4b-pt`

This model is a 4-billion-parameter, decoder-only language model built on the Gemma3 architecture and fine-tuned by Yanolja NEXT. It is specifically designed to translate structured data (JSON format) while preserving the original data structure.

The model was trained on a multilingual dataset covering English, Spanish, French, German, Portuguese, Japanese, Korean, Chinese, Arabic, Russian, and Hindi. While optimized for these languages, it may also perform well on other languages supported by the base Gemma3 model.

You can use this model with the `transformers` library. The model outputs the final translation in JSON format when appropriate, or plain text for simple translations.

## Training Data

The translation datasets were compiled from several sources, including:

- AI Hub
- Europarl

The model was fine-tuned on multilingual translation data to optimize performance across the supported language pairs.
The language distribution of the training data is as follows:

| Language | Portion (%) | Language | Portion (%) |
|----------|-------------|----------|-------------|
| Korean | 24.2 | French | 2.8 |
| English | 16.2 | German | 2.5 |
| Japanese | 5.8 | Russian | 2.4 |
| Italian | 5.3 | Arabic | 2.3 |
| Chinese | 4.4 | Other | 30.2 |
| Spanish | 3.9 | | |

The following chrF++ scores on WMT24++ demonstrate the model's competitive performance against other state-of-the-art translation models on English-to-Korean translation:

| Model | chrF++ (WMT24++) |
|------------------------------------|--------------|
| yanolja/YanoljaNEXT-Rosetta-12B | 34.75 |
| yanolja/YanoljaNEXT-Rosetta-20B | 33.87 |
| google/gemini-2.0-flash-001 | 33.81 |
| openai/gpt-oss-120b | 31.51 |
| yanolja/YanoljaNEXT-Rosetta-4B | 31.31 |
| openai/gpt-4.1-nano | 31.15 |
| Qwen/Qwen3-235B-A22B-Instruct-2507-FP8 | 31.02 |
| openai/gpt-oss-20b | 30.56 |
| google/gemma-3-27b-it | 30.05 |
| google/gemma-3-4b-pt | 27.53 |

YanoljaNEXT-Rosetta-4B achieves competitive translation quality while maintaining the efficiency of a 4B-parameter model.

This model is intended for translating structured data (JSON format) while preserving the original structure. It is particularly well suited to tasks such as localizing product catalogs, translating hotel reviews, or handling other structured content that requires accurate translation.

## Limitations

The model's primary focus is JSON data. Performance on unstructured text or other data formats may vary.

## License

This model is released under the Gemma license, inherited from its base model, `google/gemma-3-4b-pt`. Please consult the official Gemma license terms for detailed usage guidelines.

This work utilizes several models and datasets. We would like to acknowledge the original authors for their valuable contributions to the field.
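Because the card stresses structure preservation, a quick check that a translated payload kept the source's shape can be useful in a pipeline. This helper is an illustrative sketch, not part of the model release:

```python
def same_structure(source, translated) -> bool:
    """True when `translated` has the same shape as `source`: matching dict
    keys and list lengths, recursively. Leaf values (the translated strings)
    are allowed to differ."""
    if isinstance(source, dict):
        return (isinstance(translated, dict)
                and source.keys() == translated.keys()
                and all(same_structure(source[k], translated[k])
                        for k in source))
    if isinstance(source, list):
        return (isinstance(translated, list)
                and len(source) == len(translated)
                and all(same_structure(s, t)
                        for s, t in zip(source, translated)))
    return True  # leaf values may differ after translation
```

A caller could retry the translation (or fall back to a larger model) whenever this check fails.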
# YanoljaNEXT-Rosetta-27B-2511
This model is a fine-tuned version of `google/gemma-3-27b-pt`. As it is intended solely for text generation, we extracted and utilized only the `Gemma3ForCausalLM` component from the original architecture. Unlike our previous EEVE models, this model does not feature an expanded tokenizer.

- Model Name: `yanolja/YanoljaNEXT-Rosetta-27B-2511`
- Base Model: `google/gemma-3-27b-pt`

This model is a 27-billion-parameter, decoder-only language model built on the Gemma3 architecture and fine-tuned by Yanolja NEXT. It is specifically designed to translate structured data (JSON, YAML, and XML formats) while preserving the original data structure.

The model was trained on a multilingual dataset covering the following languages equally: Arabic, Bulgarian, Chinese, Czech, Danish, Dutch, English, Finnish, French, German, Greek, Gujarati, Hebrew, Hindi, Hungarian, Indonesian, Italian, Japanese, Korean, Persian, Polish, Portuguese, Romanian, Russian, Slovak, Spanish, Swedish, Tagalog, Thai, Turkish, Ukrainian, and Vietnamese. While optimized for these languages, it may also perform well on other languages supported by the base Gemma3 model.

You can use this model with the `transformers` library. The model outputs the final translation in the same structured format as the input (JSON, YAML, or XML) when appropriate, or plain text for simple translations.

## Training Data

The translation datasets were synthesized from the FineWeb corpora:

- FineWeb Edu
- FineWeb2

The model was fine-tuned on this synthetic multilingual translation data to optimize performance across the supported language pairs.
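Since this model accepts JSON, YAML, and XML and answers in the input's format, a caller may want to know which parser to hand the output to. The sniffing heuristic below is a naive illustrative sketch (not part of the model release) and will misclassify edge cases such as YAML documents that begin with `{`:

```python
def detect_format(text: str) -> str:
    """Guess whether a structured payload is XML, JSON, or YAML.

    A deliberately simple heuristic: XML starts with '<', JSON objects and
    arrays start with '{' or '['; anything else is treated as YAML.
    """
    s = text.lstrip()
    if s.startswith("<"):
        return "xml"
    if s.startswith("{") or s.startswith("["):
        return "json"
    return "yaml"
```

With the format known, the output can be routed to `json.loads`, a YAML parser, or `xml.etree.ElementTree` for the structure check.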
The following chrF++ scores on WMT24++ demonstrate the model's competitive performance against other state-of-the-art translation models on English-to-Korean translation:

| Model | chrF++ (WMT24++) |
|------------------------------------|--------------|
| yanolja/YanoljaNEXT-Rosetta-27B-2511 | 37.21 |
| yanolja/YanoljaNEXT-Rosetta-4B-2511 | 35.64 |
| google/gemini-2.5-flash-lite | 35.23 |
| yanolja/YanoljaNEXT-Rosetta-20B | 33.87 |
| google/gemini-2.0-flash-001 | 33.81 |
| openai/gpt-oss-120b | 31.51 |
| openai/gpt-4.1-nano | 31.15 |
| Qwen/Qwen3-235B-A22B-Instruct-2507-FP8 | 31.02 |
| openai/gpt-oss-20b | 30.56 |
| google/gemma-3-27b-it | 30.05 |

YanoljaNEXT-Rosetta-27B-2511 achieves strong translation quality while maintaining efficient inference for its parameter size. Scores for the other language pairs can be found in the WMT24++ Evaluation Results.

This model is intended for translating structured data (JSON, YAML, and XML formats) while preserving the original structure. It is particularly well suited to tasks such as localizing product catalogs, translating hotel reviews, or handling other structured content that requires accurate translation.

## Limitations

The model is primarily optimized for processing structured data (JSON, YAML, XML). Its performance on unstructured text or other data formats may vary. In some cases, the model may produce invalid structured output, repetitive output, or inaccurate translations.

## License

This model is released under the Gemma license, inherited from its base architecture. Please consult the official Gemma license terms for detailed usage guidelines.

## Acknowledgments

This work was supported by a Korea Creative Content Agency (KOCCA) grant funded by the Ministry of Culture, Sports and Tourism (MCST) in 2025 (Project Name: Cultivating Masters and Doctoral Experts to Lead Digital-Tech Tourism; Project Number: RS-2024-00442006; Contribution Rate: 100%). This work utilizes several models and datasets.
We would like to acknowledge the original authors for their valuable contributions to the field.