ltg
gpt-bert-babylm-base
norbert4-base
norbert3-xs
norbert4-xsmall
norbert3-large
norbert4-large
norbert3-small
norbert4-xlarge
norbert4-small
norbert3-base
deberta-xxlarge-fixed
This is deberta-v2-xxlarge updated to support the `AutoModelForCausalLM` class, enabling it to generate text. This implementation is based on our paper "BERTs are Generative In-Context Learners". This repository also fixes three bugs in the original Hugging Face implementation of DeBERTa: 1. we corrected the name of the output embedding weights in the checkpoint file; 2. we fixed the implementation of the enhanced mask decoder (EMD), following the original GitHub repository; 3. we clamp the positional embeddings so that they work with long sequence lengths. If you find DeBERTa useful for your work, please cite the paper above.
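A minimal usage sketch with the standard transformers API is shown below. The repository id and the need for `trust_remote_code=True` are assumptions on my part, since the generative wrapper is shipped as custom code rather than part of the stock DeBERTa classes.

```python
from transformers import AutoTokenizer, AutoModelForCausalLM

model_id = "ltg/deberta-xxlarge-fixed"  # assumed repository id
tokenizer = AutoTokenizer.from_pretrained(model_id)
# trust_remote_code is assumed to be needed because the causal-LM head is custom code
model = AutoModelForCausalLM.from_pretrained(model_id, trust_remote_code=True)

inputs = tokenizer("The capital of Norway is", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=10)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```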
norbert3-fine-absa
norbert2
norbert3-base_sentence-sentiment
norbert
ltg-bert-bnc
nort5-base-en-no-translation
norbert3-large_sentence-sentiment
norbert3-coarse-absa
flan-t5-definition-en-xl
mt0-definition-ru-xl-axolotl24st
nort5-base
gpt-bert-babylm-small
norbert3-fine-absa-full
nort5-large
aya-definition-fi-axolotl24st
This model is a version of CohereLabs/aya-101, fine-tuned on datasets of Finnish usage examples and definitions. It generates definitions of Finnish words in context. Its input is the usage example followed by the instruction question "Mitä tarkoittaa ?" ("What does ... mean?"). GitHub repository: MultilingualDefGen. Paper: EMNLP 2025 Findings. The model is intended for research purposes, as a source of contextualized dictionary-like lexical definitions. The fine-tuning datasets were limited to Finnish. Although the original model is multilingual, we did not evaluate its ability to generate definitions in other languages. Generated definitions can contain all sorts of biases and stereotypes, stemming from the underlying language model and raw dictionary data.
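A hedged sketch of generating a definition with this model: aya-101 is an encoder-decoder model, so the fine-tuned checkpoint is loaded with `AutoModelForSeq2SeqLM`. The repository id, the example sentence, and the exact placement of the target word inside the instruction question are assumptions, since the card above omits the placeholder.

```python
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

model_id = "ltg/aya-definition-fi-axolotl24st"  # assumed repository id
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForSeq2SeqLM.from_pretrained(model_id)

usage_example = "Kissa istui ikkunalaudalla ja katseli lintuja."
# Hypothetical prompt layout: usage example, then the instruction question with the target word
prompt = f"{usage_example} Mitä tarkoittaa kissa?"

inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=60)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```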
aya-definition-fi-axolotl24st_dbnary
This model is a version of CohereLabs/aya-101, fine-tuned on datasets of Finnish usage examples and definitions. It generates definitions of Finnish words in context. Its input is the usage example followed by the instruction question "Mitä tarkoittaa ?" ("What does ... mean?"). GitHub repository: MultilingualDefGen. Paper: EMNLP 2025 Findings. The model is intended for research purposes, as a source of contextualized dictionary-like lexical definitions. The fine-tuning datasets were limited to Finnish. Although the original model is multilingual, we did not evaluate its ability to generate definitions in other languages. Generated definitions can contain all sorts of biases and stereotypes, stemming from the underlying language model and raw dictionary data.
nort5-xs
mt0-definition-de-xl-dbnary
This model is a version of bigscience/mt0-xl, fine-tuned on datasets of German usage examples and definitions. It generates definitions of German words in context. Its input is the usage example followed by the instruction question "Was ist die Definition von ?" ("What is the definition of ...?"). GitHub repository: MultilingualDefGen. Paper: EMNLP 2025 Findings. The model is intended for research purposes, as a source of contextualized dictionary-like lexical definitions. The fine-tuning datasets were limited to German. Although the original model is multilingual, we did not evaluate its ability to generate definitions in other languages. Generated definitions can contain all sorts of biases and stereotypes, stemming from the underlying language model and raw dictionary data.
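Since mt0-xl is also a text-to-text model, the same pattern applies; a compact sketch using the `text2text-generation` pipeline is below. The repository id and the position of the target word in the prompt are assumptions.

```python
from transformers import pipeline

# Assumed repository id for the German mt0 definition model
generator = pipeline("text2text-generation", model="ltg/mt0-definition-de-xl-dbnary")

usage_example = "Die Katze saß auf der Fensterbank und beobachtete die Vögel."
prompt = f"{usage_example} Was ist die Definition von Katze?"  # hypothetical prompt layout
print(generator(prompt, max_new_tokens=60)[0]["generated_text"])
```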
norbert3-coarse-absa-full
nort5-small
aya-definition-ru-axolotl24st
mt0-definition-fi-xl-axolotl24st
mt0-definition-ru-xl-axolotl24st_dbnary
This model is a version of bigscience/mt0-xl, fine-tuned on datasets of Russian usage examples and definitions. It generates definitions of Russian words in context. Its input is the usage example followed by the instruction question "Что такое ?" ("What is ...?"). GitHub repository: MultilingualDefGen. Paper: EMNLP 2025 Findings. The model is intended for research purposes, as a source of contextualized dictionary-like lexical definitions. The fine-tuning datasets were limited to Russian. Although the original model is multilingual, we did not evaluate its ability to generate definitions in other languages. Generated definitions can contain all sorts of biases and stereotypes, stemming from the underlying language model and raw dictionary data.
tower-definition-ru-axolotl24st_dbnary
This model is a version of Unbabel/TowerInstruct-7B-v0.2, fine-tuned on datasets of Russian usage examples and definitions. It generates definitions of Russian words in context. Its input is the usage example followed by the instruction question "Что такое ?" ("What is ...?"). GitHub repository: MultilingualDefGen. Paper: EMNLP 2025 Findings. The model is intended for research purposes, as a source of contextualized dictionary-like lexical definitions. The fine-tuning datasets were limited to Russian. Although the original model is multilingual, we did not evaluate its ability to generate definitions in other languages. Generated definitions can contain all sorts of biases and stereotypes, stemming from the underlying language model and raw dictionary data.
tower-definition-de-dbnary
This model is a version of Unbabel/TowerInstruct-7B-v0.2, fine-tuned on datasets of German usage examples and definitions. It generates definitions of German words in context. Its input is the usage example followed by the instruction question "Was ist die Definition von ?" ("What is the definition of ...?"). GitHub repository: MultilingualDefGen. Paper: EMNLP 2025 Findings. The model is intended for research purposes, as a source of contextualized dictionary-like lexical definitions. The fine-tuning datasets were limited to German. Although the original model is multilingual, we did not evaluate its ability to generate definitions in other languages. Generated definitions can contain all sorts of biases and stereotypes, stemming from the underlying language model and raw dictionary data.
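Unlike the aya and mt0 variants, the Tower-based models are decoder-only, so they are loaded with `AutoModelForCausalLM`. The sketch below is hedged: the repository id, the use of the TowerInstruct chat template, and the placement of the target word in the prompt are all assumptions, not documented behavior.

```python
from transformers import AutoTokenizer, AutoModelForCausalLM

model_id = "ltg/tower-definition-de-dbnary"  # assumed repository id
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

usage_example = "Die Katze saß auf der Fensterbank und beobachtete die Vögel."
question = "Was ist die Definition von Katze?"  # hypothetical placement of the target word
# Assumes the fine-tuned model still expects the TowerInstruct chat format
messages = [{"role": "user", "content": f"{usage_example} {question}"}]
input_ids = tokenizer.apply_chat_template(messages, add_generation_prompt=True, return_tensors="pt")

outputs = model.generate(input_ids, max_new_tokens=60, do_sample=False)
# Decode only the newly generated tokens after the prompt
print(tokenizer.decode(outputs[0][input_ids.shape[-1]:], skip_special_tokens=True))
```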
tower-definition-ru-axolotl24st
This model is a version of Unbabel/TowerInstruct-7B-v0.2, fine-tuned on datasets of Russian usage examples and definitions. It generates definitions of Russian words in context. Its input is the usage example followed by the instruction question "Что такое ?" ("What is ...?"). GitHub repository: MultilingualDefGen. Paper: EMNLP 2025 Findings. The model is intended for research purposes, as a source of contextualized dictionary-like lexical definitions. The fine-tuning datasets were limited to Russian. Although the original model is multilingual, we did not evaluate its ability to generate definitions in other languages. Generated definitions can contain all sorts of biases and stereotypes, stemming from the underlying language model and raw dictionary data.
aya-definition-ru-axolotl24st_dbnary
flan-t5-definition-en-base
aya-definition-de-dbnary
mt0-definition-fi-xl-axolotl24st_dbnary
tower-definition-fi-axolotl24st_dbnary
This model is a version of Unbabel/TowerInstruct-7B-v0.2, fine-tuned on datasets of Finnish usage examples and definitions. It generates definitions of Finnish words in context. Its input is the usage example followed by the instruction question "Mitä tarkoittaa ?" ("What does ... mean?"). GitHub repository: MultilingualDefGen. Paper: EMNLP 2025 Findings. The model is intended for research purposes, as a source of contextualized dictionary-like lexical definitions. The fine-tuning datasets were limited to Finnish. Although the original model is multilingual, we did not evaluate its ability to generate definitions in other languages. Generated definitions can contain all sorts of biases and stereotypes, stemming from the underlying language model and raw dictionary data.
tower-definition-fi-axolotl24st
This model is a version of Unbabel/TowerInstruct-7B-v0.2, fine-tuned on datasets of Finnish usage examples and definitions. It generates definitions of Finnish words in context. Its input is the usage example followed by the instruction question "Mitä tarkoittaa ?" ("What does ... mean?"). GitHub repository: MultilingualDefGen. Paper: EMNLP 2025 Findings. The model is intended for research purposes, as a source of contextualized dictionary-like lexical definitions. The fine-tuning datasets were limited to Finnish. Although the original model is multilingual, we did not evaluate its ability to generate definitions in other languages. Generated definitions can contain all sorts of biases and stereotypes, stemming from the underlying language model and raw dictionary data.