Alibaba-NLP

68 models

gte-large-en-v1.5

English sentence-embedding model (tags: sentence-transformers, gte, mteb, transformers.js, sentence-similarity). Library: transformers. Trained on allenai/c4. License: apache-2.0. Sample MTEB result, AmazonCounterfactualClassification (en), test split: accuracy 73.01, AP 35.05.

3,963,021
228

gte-multilingual-base

Multilingual sentence-embedding model (tags: mteb, sentence-transformers, transformers, multilingual, sentence-similarity, text-embeddings-inference). License: apache-2.0. Supported languages include af, ar, az, be, bg, bn, ca, ceb, cs, cy, da, de, el, en, es, et, eu, fa, fi, fr, gl, gu, he, hi, hr, ht, hu, hy, id, is, it, ja, jv, ka, kk, km, kn, ko, ky, lo, lt, lv, mk, ml, mn, mr, ms, my, ne, nl, no, pa, pl, pt, qu, ro, ru, si, sk, sl, so, sq, sr, sv, sw, ta, te, th, tl, …

license:apache-2.0
1,412,178
329

gte-reranker-modernbert-base

English text-ranking model fine-tuned from answerdotai/ModernBERT-base (tags: sentence-transformers, transformers.js, text-embeddings-inference). Library: transformers. License: apache-2.0.

license:apache-2.0
598,665
77

gte-base-en-v1.5

English sentence-embedding model (tags: sentence-transformers, gte, mteb, transformers.js, sentence-similarity). Library: transformers. License: apache-2.0. Sample MTEB result, AmazonCounterfactualClassification (en), test split: accuracy 74.79, AP 37.05.

license:apache-2.0
335,754
69

gte-Qwen2-1.5B-instruct

gte-Qwen2-1.5B-instruct is the latest model in the gte (General Text Embedding) model family. The model is built on the Qwen2-1.5B LLM and uses the same training data and strategies as the gte-Qwen2-7B-instruct model.

- Integration of bidirectional attention mechanisms, enriching its contextual understanding.
- Instruction tuning, applied solely on the query side for streamlined efficiency.
- Comprehensive training across a vast, multilingual text corpus spanning diverse domains and scenarios. This training leverages both weakly supervised and supervised data, ensuring the model's applicability across numerous languages and a wide array of downstream tasks.

Model Information
- Model Size: 1.5B
- Embedding Dimension: 1536
- Max Input Tokens: 32k

See `config_sentence_transformers.json` for all pre-built prompt names. Otherwise, you can use `model.encode(queries, prompt="Instruct: ...\nQuery: ")` with a custom prompt of your choice. You can use `scripts/eval_mteb.py` to reproduce the following results of gte-Qwen2-1.5B-instruct on MTEB (English) / C-MTEB (Chinese):

| Model Name | MTEB(56) | C-MTEB(35) | MTEB-fr(26) | MTEB-pl(26) |
|:----:|:---------:|:----------:|:----------:|:----------:|
| bge-base-en-1.5 | 64.23 | - | - | - |
| bge-large-en-1.5 | 63.55 | - | - | - |
| gte-large-en-v1.5 | 65.39 | - | - | - |
| gte-base-en-v1.5 | 64.11 | - | - | - |
| mxbai-embed-large-v1 | 64.68 | - | - | - |
| acge_text_embedding | - | 69.07 | - | - |
| stella-mrl-large-zh-v3.5-1792d | - | 68.55 | - | - |
| gte-large-zh | - | 66.72 | - | - |
| multilingual-e5-base | 59.45 | 56.21 | - | - |
| multilingual-e5-large | 61.50 | 58.81 | - | - |
| e5-mistral-7b-instruct | 66.63 | 60.81 | - | - |
| gte-Qwen1.5-7B-instruct | 67.34 | 69.52 | - | - |
| NV-Embed-v1 | 69.32 | - | - | - |
| gte-Qwen2-7B-instruct | 70.24 | 72.05 | 68.25 | 67.86 |
| gte-Qwen2-1.5B-instruct | 67.16 | 67.65 | 66.60 | 64.04 |

The gte series has consistently released two types of models: encoder-only models (based on the BERT architecture) and decoder-only models (based on the LLM architecture).

| Models | Language | Max Sequence Length | Dimension | Model Size (Memory Usage, fp32) |
|:---:|:---:|:---:|:---:|:---:|
| GTE-large-zh | Chinese | 512 | 1024 | 1.25GB |
| GTE-base-zh | Chinese | 512 | 512 | 0.41GB |
| GTE-small-zh | Chinese | 512 | 512 | 0.12GB |
| GTE-large | English | 512 | 1024 | 1.25GB |
| GTE-base | English | 512 | 512 | 0.21GB |
| GTE-small | English | 512 | 384 | 0.10GB |
| GTE-large-en-v1.5 | English | 8192 | 1024 | 1.74GB |
| GTE-base-en-v1.5 | English | 8192 | 768 | 0.51GB |
| GTE-Qwen1.5-7B-instruct | Multilingual | 32000 | 4096 | 26.45GB |
| GTE-Qwen2-7B-instruct | Multilingual | 32000 | 3584 | 26.45GB |
| GTE-Qwen2-1.5B-instruct | Multilingual | 32000 | 1536 | 6.62GB |

In addition to the open-source GTE series models, GTE models are also available as commercial API services on Alibaba Cloud.
- Embedding Models: Three versions of the text embedding models are available: text-embedding-v1/v2/v3, with v3 being the latest API service.
- ReRank Models: The gte-rerank model service is available.

Note that the models behind the commercial APIs are not entirely identical to the open-source models. GTE models can be fine-tuned with the third-party framework SWIFT. If you find our paper or models helpful, please consider citing it.
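The card above passes a custom instruct prompt to `model.encode` and retrieves by embedding similarity. As a minimal sketch of the downstream step, the snippet below ranks documents by cosine similarity against a query embedding; the small vectors are stand-ins for what `SentenceTransformer("Alibaba-NLP/gte-Qwen2-1.5B-instruct").encode(...)` would return, so the model call itself appears only in comments (an assumption about typical sentence-transformers usage, not run here):

```python
import math

def cosine(u, v):
    # Cosine similarity between two equal-length vectors.
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv)

def rank_documents(query_vec, doc_vecs):
    # Sort document indices by descending cosine similarity to the query.
    scores = [cosine(query_vec, d) for d in doc_vecs]
    return sorted(range(len(doc_vecs)), key=lambda i: scores[i], reverse=True)

# In real use (hypothetical sketch, not executed here):
#   model = SentenceTransformer("Alibaba-NLP/gte-Qwen2-1.5B-instruct")
#   query_vec = model.encode(query, prompt="Instruct: ...\nQuery: ")
#   doc_vecs = model.encode(documents)  # no prompt on the document side
query_vec = [1.0, 0.0, 1.0]
doc_vecs = [[0.9, 0.1, 0.8],   # close to the query
            [0.0, 1.0, 0.0],   # nearly orthogonal
            [1.0, 0.0, 0.9]]   # closest
print(rank_documents(query_vec, doc_vecs))
```

Note that only queries get the instruction prefix; documents are encoded as-is, which is the asymmetry the "instruction tuning on the query side" bullet describes.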

license:apache-2.0
103,573
225

gte-Qwen2-7B-instruct

license:apache-2.0
94,139
471

gme-Qwen2-VL-2B-Instruct

We are excited to present the `GME-Qwen2VL` series of unified multimodal embedding models, built on the advanced Qwen2-VL multimodal large language models (MLLMs). The `GME` models support three types of input: text, image, and image-text pair, all of which produce universal vector representations with powerful retrieval performance.

- Unified Multimodal Representation: GME models can process both single-modal and combined-modal inputs, resulting in a unified vector representation. This enables versatile retrieval scenarios (Any2Any Search), supporting tasks such as text retrieval, image retrieval from text, and image-to-image search.
- High Performance: Achieves state-of-the-art (SOTA) results on our universal multimodal retrieval benchmark (UMRB) and demonstrates strong evaluation scores on the Massive Text Embedding Benchmark (MTEB).
- Dynamic Image Resolution: Benefiting from `Qwen2-VL` and our training data, GME models support dynamic-resolution image input.
- Strong Visual Retrieval Performance: Enhanced by the Qwen2-VL model series, our models excel in visual document retrieval tasks that require a nuanced understanding of document screenshots. This capability is particularly beneficial for complex document-understanding scenarios, such as multimodal retrieval-augmented generation (RAG) applications focused on academic papers.

Paper: GME: Improving Universal Multimodal Retrieval by Multimodal LLMs

Model List

| Models | Model Size | Max Seq. Length | Dimension | MTEB-en | MTEB-zh | UMRB |
|:-----:|:-----:|:-----:|:-----:|:-----:|:-----:|:-----:|
| `gme-Qwen2-VL-2B` | 2.21B | 32768 | 1536 | 65.27 | 66.92 | 64.45 |
| `gme-Qwen2-VL-7B` | 8.29B | 32768 | 3584 | 67.48 | 69.73 | 67.44 |

The remote code has some issues with `transformers>=4.52.0`; please downgrade or use `sentence-transformers`. The `encode` function accepts a `str`, or a `dict` with key(s) in `{'text', 'image', 'prompt'}`. Do not pass `prompt` as an argument to `encode`; pass the input as a `dict` with a `prompt` key.

We validated performance on our universal multimodal retrieval benchmark (UMRB; see Release UMRB), among others. Tasks are grouped as single-modal (T→T, I→I), cross-modal (T→I, T→VD, I→T), and fused-modal (T→IT, IT→T, IT→I, IT→IT); the number of datasets per task is given in parentheses.

| Model | Size | T→T (16) | I→I (1) | T→I (4) | T→VD (10) | I→T (4) | T→IT (2) | IT→T (5) | IT→I (2) | IT→IT (3) | Avg. (47) |
|---|---|:---:|:---:|:---:|:---:|:---:|:---:|:---:|:---:|:---:|:---:|
| VISTA | 0.2B | 55.15 | 31.98 | 32.88 | 10.12 | 31.23 | 45.81 | 53.32 | 8.97 | 26.26 | 37.32 |
| CLIP-SF | 0.4B | 39.75 | 31.42 | 59.05 | 24.09 | 62.95 | 66.41 | 53.32 | 34.9 | 55.65 | 43.66 |
| One-Peace | 4B | 43.54 | 31.27 | 61.38 | 42.9 | 65.59 | 42.72 | 28.29 | 6.73 | 23.41 | 42.01 |
| DSE | 4.2B | 48.94 | 27.92 | 40.75 | 78.21 | 52.54 | 49.62 | 35.44 | 8.36 | 40.18 | 50.04 |
| E5-V | 8.4B | 52.41 | 27.36 | 46.56 | 41.22 | 47.95 | 54.13 | 32.9 | 23.17 | 7.23 | 42.52 |
| GME-Qwen2-VL-2B | 2.2B | 55.93 | 29.86 | 57.36 | 87.84 | 61.93 | 76.47 | 64.58 | 37.02 | 66.47 | 64.45 |
| GME-Qwen2-VL-7B | 8.3B | 58.19 | 31.89 | 61.35 | 89.92 | 65.83 | 80.94 | 66.18 | 42.56 | 73.62 | 67.44 |

The English tab of the MTEB Leaderboard shows the text-embedding performance of our model. More detailed experimental results can be found in the paper.

Limitations
- Single Image Input: In `Qwen2-VL`, an image can be converted into a very large number of visual tokens. We limit the number of visual tokens to 1024 to obtain good training efficiency. Due to the lack of relevant data, our models and evaluations use a single image.
- English-only Training: Our models are trained on English data only. Although the `Qwen2-VL` models are multilingual, multilingual-multimodal embedding performance is not guaranteed.

We will extend to multi-image input, image-text interleaved data, and multilingual data in a future version.

We encourage and value diverse applications of GME models and continuous enhancements to the models themselves.
- If you distribute or make GME models (or any derivative works) available, or if you create a product or service (including another AI model) that incorporates them, you must prominently display `Built with GME` on your website, user interface, blog post, About page, or product documentation.
- If you utilize GME models or their outputs to develop, train, fine-tune, or improve an AI model that is distributed or made available, you must prefix the name of any such AI model with `GME`.

In addition to the open-source GME series models, GME models are also available as commercial API services on Alibaba Cloud.
- MultiModal Embedding Models: The `multimodal-embedding-v1` model service is available.

Note that the models behind the commercial APIs are not entirely identical to the open-source models.

We have open positions for Research Interns and Full-Time Researchers to join our team at Tongyi Lab. We are seeking passionate individuals with expertise in representation learning, LLM-driven information retrieval, retrieval-augmented generation (RAG), and agent-based systems. Our team is located in Beijing and Hangzhou, offering a collaborative and dynamic work environment where you can contribute to cutting-edge advancements in artificial intelligence and machine learning. If you are driven by curiosity and eager to make a meaningful impact through your work, we would love to hear from you. Please submit your resume along with a brief introduction to [email protected].

Citation: If you find our paper or models helpful, please consider citing it.
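The card states that `encode` accepts either a plain string or a dict with keys in `{'text', 'image', 'prompt'}`, and that a prompt must travel inside the dict rather than as a separate argument. A minimal, illustrative sketch of assembling such inputs (pure Python; the helper name is mine, not part of the GME API, and the real `gme.encode(...)` call is only indicated in a comment):

```python
def make_gme_input(text=None, image=None, prompt=None):
    """Build an encode input: a bare string for text-only,
    otherwise a dict restricted to the documented keys."""
    item = {k: v for k, v in
            {"text": text, "image": image, "prompt": prompt}.items()
            if v is not None}
    if not item:
        raise ValueError("need at least one of text/image/prompt")
    # Text-only inputs may be passed as a plain string.
    if set(item) == {"text"}:
        return text
    return item

# Text-only query:
q = make_gme_input(text="find diagrams of transformer attention")
# Image plus instruction: the prompt lives inside the dict,
# never as a keyword argument to encode.
doc = make_gme_input(image="paper_page3.png",
                     prompt="Represent the document screenshot for retrieval.")
# Hypothetical real call: embeddings = gme.encode([q, doc])
print(q, doc)
```

This mirrors the card's warning: the only supported way to attach a prompt is as a `prompt` key inside the input dict.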

license:apache-2.0
64,400
114

gte-multilingual-reranker-base

The gte-multilingual-reranker-base model is the first reranker model in the GTE family of models, featuring several key attributes: - High Performance: Achieves state-of-the-art (SOTA) results in m...

license:apache-2.0
49,026
162

gte-modernbert-base

license:apache-2.0
43,086
183

Tongyi-DeepResearch-30B-A3B

We present Tongyi DeepResearch, an agentic large language model featuring 30 billion total parameters, with only 3 billion activated per token. Developed by Tongyi Lab, the model is specifically designed for long-horizon, deep information-seeking tasks. Tongyi-DeepResearch demonstrates state-of-the-art performance across a range of agentic search benchmarks, including Humanity's Last Exam, BrowseComp, BrowseComp-ZH, WebWalkerQA, GAIA, xbench-DeepSearch, and FRAMES.

- ⚙️ Fully automated synthetic data generation pipeline: We design a highly scalable data-synthesis pipeline that is fully automatic and empowers agentic pre-training, supervised fine-tuning, and reinforcement learning.
- 🔄 Large-scale continual pre-training on agentic data: Leveraging diverse, high-quality agentic interaction data to extend model capabilities, maintain freshness, and strengthen reasoning performance.
- 🔁 End-to-end reinforcement learning: We employ a strictly on-policy RL approach based on a customized Group Relative Policy Optimization framework, with token-level policy gradients, leave-one-out advantage estimation, and selective filtering of negative samples to stabilize training in a non-stationary environment.
- 🤖 Agent inference paradigm compatibility: At inference, Tongyi-DeepResearch is compatible with two paradigms: ReAct, for rigorously evaluating the model's core intrinsic abilities, and an IterResearch-based "Heavy" mode, which uses a test-time scaling strategy to unlock the model's maximum performance ceiling.

You can download the model and then run the inference scripts in https://github.com/Alibaba-NLP/DeepResearch.
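The RL bullet above mentions leave-one-out advantage estimation over a group of sampled rollouts. As an illustrative sketch (my formulation of the standard leave-one-out baseline, not the exact DeepResearch implementation): each rollout's advantage is its reward minus the mean reward of the other rollouts in the same group.

```python
def leave_one_out_advantages(rewards):
    # For each rollout i, baseline_i = mean of the other rewards in the
    # group; advantage_i = r_i - baseline_i. Advantages sum to zero.
    n = len(rewards)
    if n < 2:
        raise ValueError("need at least two rollouts per group")
    total = sum(rewards)
    return [r - (total - r) / (n - 1) for r in rewards]

# Binary task rewards for a group of 4 rollouts:
print(leave_one_out_advantages([1.0, 0.0, 0.0, 1.0]))
```

Because the baseline for rollout i excludes r_i itself, the estimate stays unbiased while still being computed from the group alone, with no learned value function.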

license:apache-2.0
13,249
757

gme-Qwen2-VL-7B-Instruct

license:apache-2.0
3,445
61

Simulation_LLM_google_14B_V2

819
1

gte-en-mlm-large

license:apache-2.0
671
7

Simulation_LLM_google_7B_V2

481
1

gte-en-mlm-base

license:apache-2.0
367
7

gte-multilingual-mlm-base

license:apache-2.0
303
15

gte-Qwen1.5-7B-instruct

license:apache-2.0
212
107

E2Rank-0.6B

E2Rank: Your Text Embedding can Also be an Effective and Efficient Listwise Reranker

🤖 Website | 📄 Arxiv Paper | 🤗 Huggingface Collection | 🚩 Citation

We introduce E2Rank, meaning Efficient Embedding-based Ranking (also Embedding-to-Rank), which extends a single text embedding model to perform both high-quality retrieval and listwise reranking, achieving strong effectiveness with remarkable efficiency. Using cosine similarity between query and document embeddings as a unified ranking function, the listwise ranking prompt, constructed from the original query and its candidate documents, serves as an enhanced query enriched with signals from the top-K documents, akin to pseudo-relevance feedback (PRF) in traditional retrieval models. This design preserves the efficiency and representational quality of the base embedding model while significantly improving its reranking performance.

Empirically, E2Rank achieves state-of-the-art results on the BEIR reranking benchmark and demonstrates competitive performance on the reasoning-intensive BRIGHT benchmark, with very low reranking latency. We also show that the ranking training process improves embedding performance on the MTEB benchmark. Our findings indicate that a single embedding model can effectively unify retrieval and reranking, offering both computational efficiency and competitive ranking accuracy, and highlight the potential of single embedding models to serve as unified retrieval-reranking engines: a practical, efficient, and accurate alternative to complex multi-stage ranking systems.

| Supported Task | Model Name | Size | Layers | Sequence Length | Embedding Dimension | Instruction Aware |
|---|---|---|---|---|---|---|
| Embedding + Reranking | Alibaba-NLP/E2Rank-0.6B | 0.6B | 28 | 32K | 1024 | Yes |
| Embedding + Reranking | Alibaba-NLP/E2Rank-4B | 4B | 36 | 32K | 2560 | Yes |
| Embedding + Reranking | Alibaba-NLP/E2Rank-8B | 8B | 36 | 32K | 4096 | Yes |
| Embedding Only | Alibaba-NLP/E2Rank-0.6B-Embedding-Only | 0.6B | 28 | 32K | 1024 | Yes |
| Embedding Only | Alibaba-NLP/E2Rank-4B-Embedding-Only | 4B | 36 | 32K | 2560 | Yes |
| Embedding Only | Alibaba-NLP/E2Rank-8B-Embedding-Only | 8B | 36 | 32K | 4096 | Yes |

> Note:
> - `Embedding Only` indicates that the model is trained only with contrastive learning and supports only embedding tasks, while `Embedding + Reranking` indicates the full E2Rank model trained with both embedding and reranking objectives (for more details, please refer to the [paper]()).
> - `Instruction Aware` notes whether the model supports customizing the input instruction for different tasks.
> - The `Listwise Reranking` baselines below are supervised fine-tuned from the Qwen3 models in the RankGPT paradigm and support only the reranking task.

The usage of E2Rank as an embedding model is similar to Qwen3-Embedding. The only difference is that Qwen3-Embedding automatically appends an EOS token, while E2Rank requires users to manually append the special token ` ` at the end of each input text. To use E2Rank as a reranker, you only need to perform additional processing on the query, adding (part of) the documents to be reranked to the listwise prompt; the rest is the same as using the embedding model. Since E2Rank extends a single text embedding model to perform both high-quality retrieval and listwise reranking, you can directly use it to build an end-to-end search system. By reusing the embeddings computed during the retrieval stage, E2Rank only needs to compute the pseudo query's embedding and can efficiently rerank the retrieved documents with minimal additional computational overhead.

BEIR reranking results:

| | Covid | NFCorpus | Touche | DBPedia | SciFact | Signal | News | Robust | Avg. |
|---|:---:|:---:|:---:|:---:|:---:|:---:|:---:|:---:|:---:|
| BM25 | 59.47 | 30.75 | 44.22 | 31.80 | 67.89 | 33.05 | 39.52 | 40.70 | 43.43 |
| Zero-shot Listwise Reranker | | | | | | | | | |
| RankGPT-4o | 83.41 | 39.67 | 32.26 | 45.56 | 77.41 | 34.20 | 51.92 | 60.25 | 53.09 |
| RankGPT-4o-mini | 80.03 | 38.73 | 30.91 | 44.54 | 73.14 | 33.64 | 50.91 | 57.41 | 51.16 |
| RankQwen3-14B | 84.45 | 38.94 | 38.30 | 44.52 | 78.64 | 33.58 | 51.24 | 59.66 | 53.67 |
| RankQwen3-32B | 83.48 | 39.22 | 37.13 | 45.00 | 78.22 | 32.12 | 51.08 | 60.74 | 53.37 |
| Fine-tuned Listwise Reranker based on Qwen3 | | | | | | | | | |
| RankQwen3-0.6B | 78.35 | 36.41 | 37.54 | 39.19 | 71.01 | 30.96 | 44.43 | 46.31 | 48.03 |
| RankQwen3-4B | 83.91 | 39.88 | 32.66 | 43.91 | 76.37 | 32.15 | 50.81 | 59.36 | 52.38 |
| RankQwen3-8B | 85.37 | 40.05 | 31.73 | 45.44 | 78.96 | 32.48 | 52.36 | 60.72 | 53.39 |
| Ours | | | | | | | | | |
| E2Rank-0.6B | 79.17 | 38.60 | 41.91 | 41.96 | 73.43 | 35.26 | 52.75 | 53.67 | 52.09 |
| E2Rank-4B | 83.30 | 39.20 | 43.16 | 42.95 | 77.19 | 34.48 | 52.71 | 60.16 | 54.14 |
| E2Rank-8B | 84.09 | 39.08 | 42.06 | 43.44 | 77.49 | 34.01 | 54.25 | 60.34 | 54.35 |

MTEB results:

| Models | Retr. | Rerank. | Clust. | PairClass. | Class. | STS | Summ. | Avg. |
|---|:---:|:---:|:---:|:---:|:---:|:---:|:---:|:---:|
| Instructor-xl | 49.26 | 57.29 | 44.74 | 86.62 | 73.12 | 83.06 | 32.32 | 61.79 |
| BGE-large-en-v1.5 | 54.29 | 60.03 | 46.08 | 87.12 | 75.97 | 83.11 | 31.61 | 64.23 |
| GritLM-7B | 53.10 | 61.30 | 48.90 | 86.90 | 77.00 | 82.80 | 29.40 | 64.70 |
| E5-Mistral-7b-v1 | 52.78 | 60.38 | 47.78 | 88.47 | 76.80 | 83.77 | 31.90 | 64.56 |
| Echo-Mistral-7b-v1 | 55.52 | 58.14 | 46.32 | 87.34 | 77.43 | 82.56 | 30.73 | 64.68 |
| LLM2Vec-Mistral-7B | 55.99 | 58.42 | 45.54 | 87.99 | 76.63 | 84.09 | 29.96 | 64.80 |
| LLM2Vec-Meta-LLaMA-3-8B | 56.63 | 59.68 | 46.45 | 87.80 | 75.92 | 83.58 | 30.94 | 65.01 |
| E2Rank-0.6B | 51.74 | 55.97 | 40.85 | 83.93 | 73.66 | 81.41 | 30.90 | 61.25 |
| E2Rank-4B | 55.33 | 59.10 | 44.27 | 87.14 | 77.08 | 84.03 | 30.06 | 64.47 |
| E2Rank-8B | 56.89 | 59.58 | 44.75 | 86.96 | 76.81 | 84.52 | 30.23 | 65.03 |

> Note: For baselines, we only compare with models trained on public datasets.

If you have any questions, feel free to contact us via qiliu6777[AT]gmail.com or create an issue.
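The reranking recipe described in the E2Rank card turns the query plus its top-K candidates into an enriched "pseudo query" whose embedding is compared against the cached document embeddings. A schematic sketch of that flow follows; the prompt template and the `<EMB>` token are my illustrative stand-ins (the card leaves the actual special token and prompt format to its usage examples):

```python
def build_listwise_prompt(query, candidates, k=3, embed_token="<EMB>"):
    # Pseudo query: the original query followed by the top-k candidate
    # texts, closed by the model's special embedding token (placeholder
    # here; the real token is defined by the E2Rank tokenizer).
    lines = [f"Query: {query}"]
    for i, doc in enumerate(candidates[:k], start=1):
        lines.append(f"Doc {i}: {doc}")
    return "\n".join(lines) + embed_token

def rerank(doc_embeddings, pseudo_query_embedding, similarity):
    # Reuse the retrieval-stage document embeddings: one new embedding
    # call (the pseudo query), then a similarity sort.
    return sorted(range(len(doc_embeddings)),
                  key=lambda i: similarity(pseudo_query_embedding,
                                           doc_embeddings[i]),
                  reverse=True)

prompt = build_listwise_prompt("what is PRF?",
                               ["doc a", "doc b", "doc c", "doc d"], k=2)
print(prompt)
```

Because only the pseudo query needs a fresh forward pass, the reranking cost is one embedding call plus a sort, which is the efficiency argument the card makes.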

license:apache-2.0
201
6

WebSailor-32B

license:apache-2.0
201
0

WebDancer-32B

This model was presented in the paper WebDancer: Towards Autonomous Information Seeking Agency. You can download the model and then run the inference scripts in https://github.com/Alibaba-NLP/WebAgent.

- A native agentic-search reasoning model using the ReAct framework, aimed at autonomous information-seeking agency and Deep Research-like capability.
- We introduce a four-stage training paradigm comprising browsing-data construction, trajectory sampling, supervised fine-tuning for an effective cold start, and reinforcement learning for improved generalization, enabling the agent to autonomously acquire search and reasoning skills.
- Our data-centric approach integrates trajectory-level supervised fine-tuning and reinforcement learning (DAPO) to develop a scalable pipeline for training agentic systems via SFT or RL.
- WebDancer achieves a Pass@3 score of 61.1% on GAIA and 54.6% on WebWalkerQA.

license:mit
60
57

E2Rank-4B

(Same E2Rank model card as E2Rank-0.6B above.)

license:apache-2.0
60
2

WebSailor-3B

You can download the model and then run the inference scripts in https://github.com/Alibaba-NLP/WebAgent.

- WebSailor is a complete post-training methodology designed to teach LLM agents sophisticated reasoning for complex web-navigation and information-seeking tasks. It addresses the challenge of extreme uncertainty in vast information landscapes, a capability where previous open-source models lagged behind proprietary systems.
- We classify information-seeking tasks into three difficulty levels, where Level 3 represents problems with both high uncertainty and a complex, non-linear path to a solution. To generate these challenging tasks, we introduce SailorFog-QA, a novel data-synthesis pipeline that constructs intricate knowledge graphs and then applies information obfuscation. This process creates questions with high initial uncertainty that demand creative exploration and transcend simple, structured reasoning patterns.
- Our training process begins by generating expert trajectories and then reconstructing the reasoning to create concise, action-oriented supervision signals, avoiding the stylistic and verbosity issues of teacher models. The agent is first given a "cold start" using rejection-sampling fine-tuning (RFT) on a small set of high-quality examples to establish a baseline capability. This is followed by an efficient agentic reinforcement-learning stage using our Duplicating Sampling Policy Optimization (DUPO) algorithm, which refines the agent's exploratory strategies.
- WebSailor establishes a new state-of-the-art for open-source agents, achieving outstanding results on difficult benchmarks like BrowseComp-en and BrowseComp-zh. Notably, our smaller models like WebSailor-7B outperform agents built on much larger backbones, highlighting the efficacy of our training paradigm. Ultimately, WebSailor closes the performance gap with proprietary systems, achieving results on par with agents like Doubao-Search.

license:apache-2.0
53
74

GVE-3B

> One Embedder for All Video Retrieval Scenarios
> Queries of text, image, video, or any combination of modalities: GVE understands them all for representations, zero-shot, without in-domain training.

GVE is the first video embedding model that generalizes across 9 abilities, including 3 diverse retrieval tasks and 6 domains, from coarse text-to-video to fine-grained spatial/temporal queries, composed (text+image) queries, and long-context retrieval, all evaluated on our new Universal Video Retrieval Benchmark (UVRB). Built on Qwen2.5-VL and trained only with LoRA on 13M collected and synthesized multimodal data, GVE achieves SOTA zero-shot performance over its competitors.

| Capability | Existing Works | GVE |
|---|---|---|
| Query Flexibility | Only text | ✅ Text, ✅ Image, ✅ Video, ✅ Text+Image, ✅ Text+Video |
| Fine-grained Understanding | Weak on spatial-temporal details | S: 0.821, T: 0.469 (SOTA) |
| Training Data | Uses in-domain test data (e.g., MSRVTT) | Synthesized data: true zero-shot |
| Performance | Unite-7B (8.3B): 55.9 | GVE-3B (3.8B): 0.571, better at half the size; GVE-7B: 0.600 |

- TXT: Textual Video Retrieval
- CMP: Composed Video Retrieval
- VIS: Visual Video Retrieval
- CG: Coarse-grained Video Retrieval
- FG: Fine-grained Video Retrieval
- LC: Long-Context Video Retrieval
- S: Spatial Video Retrieval
- T: Temporal Video Retrieval
- PR: Partially Relevant Video Retrieval

> For each column: the highest score is bolded, the second-highest is underlined.
| Model | AVG | TXT | CMP | VIS | CG | FG | LC | S | T | PR |
|-------|-----|-----|-----|-----|-----|-----|-----|-----|-----|-----|
| CLIP4Clip | 0.416 | 0.401 | 0.178 | 0.714 | 0.380 | 0.360 | 0.463 | 0.559 | 0.285 | 0.236 |
| ViCLIP | 0.375 | 0.336 | 0.263 | 0.640 | 0.380 | 0.315 | 0.313 | 0.484 | 0.289 | 0.171 |
| VideoCLIP-XL | 0.510 | 0.550 | 0.227 | 0.632 | 0.558 | 0.493 | 0.600 | 0.787 | 0.381 | 0.310 |
| LanguageBind | 0.508 | 0.543 | 0.231 | 0.645 | 0.539 | 0.479 | 0.610 | 0.723 | 0.378 | 0.336 |
| InternVideo2-1B | 0.420 | 0.422 | 0.248 | 0.581 | 0.480 | 0.403 | 0.383 | 0.606 | 0.413 | 0.189 |
| InternVideo2-6B | 0.445 | 0.448 | 0.220 | 0.660 | 0.504 | 0.417 | 0.423 | 0.631 | 0.400 | 0.220 |
| GME-2B | 0.416 | 0.539 | 0.345 | 0.597 | 0.461 | 0.471 | 0.685 | 0.716 | 0.349 | 0.347 |
| Unite-2B | 0.507 | 0.536 | 0.242 | 0.654 | 0.455 | 0.471 | 0.681 | 0.725 | 0.347 | 0.341 |
| VLM2Vec-V2 | 0.538 | 0.587 | 0.263 | 0.613 | 0.498 | 0.502 | 0.762 | 0.809 | 0.348 | 0.348 |
| BGE-VL | 0.480 | 0.497 | 0.268 | 0.622 | 0.448 | 0.406 | 0.636 | 0.664 | 0.292 | 0.261 |
| UniME-7B | 0.542 | 0.561 | 0.308 | 0.702 | 0.500 | 0.518 | 0.664 | 0.785 | 0.396 | 0.373 |
| B3-7B | 0.538 | 0.570 | 0.270 | 0.678 | 0.482 | 0.505 | 0.722 | 0.797 | 0.364 | 0.355 |
| GME-7B | 0.562 | 0.604 | 0.341 | 0.615 | 0.518 | 0.507 | 0.788 | 0.749 | 0.373 | 0.398 |
| Unite-7B | 0.559 | 0.609 | 0.254 | 0.666 | 0.541 | 0.539 | 0.746 | 0.779 | 0.412 | 0.425 |
| GVE-3B | 0.571 | 0.619 | 0.304 | 0.647 | 0.552 | 0.541 | 0.764 | 0.816 | 0.430 | 0.377 |
| GVE-7B | 0.600 | 0.657 | 0.312 | 0.657 | 0.587 | 0.570 | 0.814 | 0.821 | 0.469 | 0.419 |

license:apache-2.0
46
11

E2Rank-0.6B-Embedding-Only

37
1

E2Rank-8B

license:apache-2.0
36
2

E2Rank-4B-Embedding-Only

30
1

E2Rank-8B-Embedding-Only

30
1

ERank-4B

license:apache-2.0
27
10

GVE-7B

license:apache-2.0
27
8

Simulation_LLM_wiki_7B_V2

17
1

ERank-32B

license:apache-2.0
15
3

WebSailor-7B

license:apache-2.0
14
11

Simulation_LLM_wiki_3B_V2

14
1

ERank-14B

ERank: Fusing Supervised Fine-Tuning and Reinforcement Learning for Effective and Efficient Text Reranking

We introduce ERank, a highly effective and efficient pointwise reranker built from a reasoning LLM, which excels across diverse relevance scenarios with low latency. Surprisingly, it also outperforms recent listwise rerankers on the most challenging reasoning-intensive tasks.

ERank is trained with a novel two-stage training pipeline: Supervised Fine-Tuning (SFT) and Reinforcement Learning (RL). In the SFT stage, unlike traditional pointwise rerankers that train the LLM for binary relevance classification, we encourage the LLM to generatively output fine-grained integer scores. In the RL stage, we introduce a novel listwise-derived reward, which instills global ranking awareness into the efficient pointwise architecture.

We provide the trained reranking models in various sizes (4B, 14B, and 32B), all of which support customizing the input instruction for different tasks.

| Model | Size | Layers | Sequence Length | Instruction Aware |
|------------------------------------------|------|--------|-----------------|-------------------|
| ERank-4B | 4B | 36 | 32K | Yes |
| ERank-14B | 14B | 40 | 128K | Yes |
| ERank-32B | 32B | 64 | 128K | Yes |

We evaluate ERank on both reasoning-intensive benchmarks (BRIGHT and FollowIR) and traditional semantic relevance benchmarks (BEIR and TREC DL). All methods use the original queries without hybrid scores.
| Paradigm | Method | Average | BRIGHT | FollowIR | BEIR | TREC DL |
| :--- | :--- | :--- | :--- | :--- | :--- | :--- |
| - | First-stage retriever | 25.9 | 13.7 | 0 | 40.8 | 49.3 |
| Listwise | Rank-R1-7B | 34.6 | 15.7 | 3.6 | 49.0 | 70.0 |
| Listwise | Rearank-7B | 35.3 | 17.4 | 2.3 | 49.0 | 72.5 |
| Pointwise | JudgeRank-8B | 32.1 | 17.0 | 9.9 | 39.1 | 62.6 |
| Pointwise | Rank1-7B | 34.6 | 18.2 | 9.1 | 44.2 | 67.1 |
| Pointwise | ERank-4B (Ours) | 36.8 | 22.7 | 11.0 | 44.8 | 68.9 |
| Pointwise | ERank-14B (Ours) | 36.9 | 23.1 | 10.3 | 47.1 | 67.1 |
| Pointwise | ERank-32B (Ours) | 38.1 | 24.4 | 12.1 | 47.7 | 68.1 |

On the most challenging BRIGHT benchmark, with the top-100 documents retrieved by ReasonIR-8B using GPT-4 reason-queries, ERank with the BM25 hybrid achieves state-of-the-art nDCG@10.

| Method | nDCG@10 |
| :--- | :--- |
| ReasonIR-8B | 30.5 |
| Rank-R1-7B | 24.1 |
| Rank1-7B | 24.3 |
| Rearank-7B | 27.5 |
| JudgeRank-8B | 20.2 |
| + BM25 hybrid | 22.7 |
| Rank-R1-32B-v0.2 | 37.7 |
| + BM25 hybrid | 40.0 |
| ERank-4B (Ours) | 30.5 |
| + BM25 hybrid | 38.7 |
| ERank-14B (Ours) | 31.8 |
| + BM25 hybrid | 39.3 |
| ERank-32B (Ours) | 32.8 |
| + BM25 hybrid | 40.2 |

Since ERank is a pointwise reranker, it has low latency compared with listwise models. We have implemented inference code based on Transformers and vLLM, respectively. Please refer to the `examples` directory for details, which also provides the instructions used in the prompt during evaluation.

Citation

If you find our work helpful, please consider citing us.
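The "+ BM25 hybrid" rows fuse the reranker's score with a lexical BM25 score. The card does not spell out the fusion rule, so the sketch below uses a common approach: min-max normalize each score list per query, then interpolate. The weight `alpha` is an illustrative assumption, not ERank's actual setting.

```python
def min_max_norm(scores):
    """Rescale a list of scores to [0, 1]; constant lists map to 0."""
    lo, hi = min(scores), max(scores)
    if hi == lo:
        return [0.0 for _ in scores]
    return [(s - lo) / (hi - lo) for s in scores]

def hybrid_scores(rerank_scores, bm25_scores, alpha=0.5):
    """Interpolate normalized reranker and BM25 scores for one query's candidates."""
    r = min_max_norm(rerank_scores)
    b = min_max_norm(bm25_scores)
    return [alpha * ri + (1 - alpha) * bi for ri, bi in zip(r, b)]

# A document that is mediocre lexically but strong for the reranker
# (and vice versa) ends up with a balanced hybrid score.
print(hybrid_scores([0.0, 5.0, 10.0], [10.0, 0.0, 5.0], alpha=0.5))
```

Normalizing before interpolation matters because raw BM25 scores and LLM-generated integer scores live on very different scales.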

license:apache-2.0
12
3

WebWatcher-7B

WebWatcher: Breaking New Frontier of Vision-Language Deep Research Agent

🥇 Introduction

In this paper, we introduce WebWatcher, a multimodal agent for deep research that possesses enhanced visual-language reasoning capabilities. Our work presents a unified framework that combines complex vision-language reasoning with multi-tool interaction.

- BrowseComp-VL Benchmark: We propose a new benchmark, BrowseComp-VL, to evaluate the capabilities of multimodal agents. This challenging dataset is designed for in-depth multimodal reasoning and strategic planning, mirroring the complexity of BrowseComp but extending it into the visual domain. It emphasizes tasks that require both visual perception and advanced information-gathering abilities.
- Automated Trajectory Generation: To provide robust tool-use capabilities, we developed an automated pipeline to generate high-quality, multi-step reasoning trajectories. These trajectories, which are grounded in actual tool-use behavior and reflect procedural decision-making, are used for efficient cold-start training and further optimization via reinforcement learning. The agent is equipped with several tools, including Web Image Search, Web Text Search, Webpage Visit, Code Interpreter, and an internal OCR tool.
- Superior Performance: WebWatcher significantly outperforms proprietary baselines, RAG workflows, and other open-source agents across four challenging VQA benchmarks: Humanity's Last Exam (HLE)-VL, BrowseComp-VL, LiveVQA, and MMSearch. The WebWatcher-32B model, in particular, achieves an average score of 18.2% on HLE, surpassing the GPT-4o-based OmniSearch baseline. It also achieves top-tier performance on LiveVQA (58.7%) and MMSearch (55.3%), demonstrating stable and superior results on demanding, real-world visual search benchmarks.

1. Complex Reasoning (HLE-VL): On Humanity's Last Exam (HLE-VL), a benchmark for multi-step complex reasoning, WebWatcher achieved a commanding lead with a Pass@1 score of 13.6%, substantially outperforming representative models including GPT-4o (9.8%), Gemini2.5-flash (9.2%), and Qwen2.5-VL-72B (8.6%).
2. Information Retrieval (MMSearch): In the MMSearch evaluation, WebWatcher demonstrated exceptional retrieval accuracy with a Pass@1 score of 55.3%, significantly surpassing Gemini2.5-flash (43.9%) and GPT-4o (24.1%), showcasing superior precision in retrieval tasks and robust information aggregation in complex scenarios.
3. Knowledge-Retrieval Integration (LiveVQA): On the LiveVQA benchmark, WebWatcher achieved a Pass@1 score of 58.7%, outperforming Gemini2.5-flash (41.3%), Qwen2.5-VL-72B (35.7%), and GPT-4o (34.0%).
4. Information Optimization and Aggregation (BrowseComp-VL): On BrowseComp-VL, the most comprehensively challenging benchmark, WebWatcher dominated with an average score of 27.0%, more than doubling the performance of mainstream models including GPT-4o (13.4%), Gemini2.5-flash (13.0%), and Claude-3.7 (11.2%).

You can download WebWatcher from 🤗 Hugging Face.

Before running inference, the test-set images need to be downloaded to the `infer/scriptseval/images` folder. This can be done by running `infer/scriptseval/downloadimage.py`. If you encounter issues downloading images from our provided OSS URLs, please obtain the images from the original dataset source and place them in the corresponding `infer/scriptseval/images` folder.

Run `infer/scriptseval/scripts/eval.sh` with the following required parameters:

- benchmark: Name of the dataset to test. Available options: `'hle'`, `'gaia'`, `'livevqa'`, `'mmsearch'`, `'simplevqa'`, `'bcvlv1'`, `'bcvlv2'`. These test sets should be pre-stored in `infer/vlsearchr1/evaldata` with a naming convention like `hle.jsonl`. We have provided format examples for some datasets in `infer/vlsearchr1/evaldata`. If extending to new datasets, please ensure consistent formatting.
- EXPERIMENTNAME: Name for this experiment (user-defined)
- MODELPATH: Path to the trained model
- DASHSCOPEAPIKEY: GPT API key
- IMGSEARCHKEY: Google SerpApi key for image search
- JINAAPIKEY: Jina API key
- SCRAPERAPIKEY: Scraper API key
- QWENSEARCHKEY: Google SerpApi key for text search

Note: For the image search tool, if you need to upload searched images to OSS, the following are also required:

- ALIBABACLOUDACCESSKEYID: Alibaba Cloud OSS access key ID
- ALIBABACLOUDACCESSKEYSECRET: Alibaba Cloud OSS access key secret

Run `infer/vlsearchr1/pass3.sh` to use LLM-as-judge for evaluating the Pass@3 and Pass@1 metrics. Parameters:

- DIRECTORY: Path to the folder containing the JSONL files generated from inference
- DASHSCOPEAPIKEY: GPT API key

```bibtex
@article{geng2025webwatcher,
  title={WebWatcher: Breaking New Frontiers of Vision-Language Deep Research Agent},
  author={Geng, Xinyu and Xia, Peng and Zhang, Zhen and Wang, Xinyu and Wang, Qiuchen and Ding, Ruixue and Wang, Chenxi and Wu, Jialong and Zhao, Yida and Li, Kuan and others},
  journal={arXiv preprint arXiv:2508.05748},
  year={2025}
}
```
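The Pass@1 and Pass@3 numbers produced by the LLM-as-judge step can be computed from per-attempt verdicts as sketched below. This is a generic empirical Pass@k calculation under the assumption that the judge emits one boolean verdict per sampled attempt; it is not the repo's actual script or JSONL schema.

```python
def pass_at_k(judgments, k):
    """Empirical Pass@k.

    judgments: list of per-question lists of boolean judge verdicts,
    one verdict per sampled attempt, in sampling order.
    Returns the fraction of questions with at least one correct
    answer among the first k attempts.
    """
    hits = sum(1 for attempts in judgments if any(attempts[:k]))
    return hits / len(judgments)

# Three questions, three judged attempts each.
verdicts = [
    [False, True, False],   # passes within 3 attempts, not on the first
    [False, False, False],  # never passes
    [True, True, True],     # passes on the first attempt
]
print(pass_at_k(verdicts, 1))
print(pass_at_k(verdicts, 3))
```

Pass@3 is always at least Pass@1, since adding attempts can only turn a miss into a hit.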

11
10

Simulation_LLM_wiki_14B_V2

11
1

ZeroSearch_wiki_V2_Qwen2.5_7B_Instruct

11
0

Simulation_LLM_google_3B_V2

9
1

WebShaper-32B

license:mit
9
1

WebWatcher-32B

8
12

ZeroSearch_google_V1_Qwen2.5_7B

license:apache-2.0
8
2

Simulation_LLM_google_14B_V1

8
1

Simulation_LLM_google_7B_V1

8
1

Simulation_LLM_google_3B_V1

8
0

ZeroSearch_google_V2_Qwen2.5_7B

6
0

OmniSearch-Qwen-VL-Chat-en

license:apache-2.0
5
2

ZeroSearch_google_V2_Llama_3.2_3B_Instruct

llama
5
0

ZeroSearch_google_V2_Llama_3.2_3B

llama
4
0

ZeroSearch_wiki_V2_Qwen2.5_3B

4
0

ZeroSearch_wiki_V2_Qwen2.5_3B_Instruct

4
0

ZeroSearch_google_V2_Qwen2.5_3B_Instruct

3
0

ZeroSearch_google_V2_Qwen2.5_3B

3
0

ZeroSearch_wiki_V2_Llama_3.2_3B_Instruct

llama
3
0

ZeroSearch_google_V1_Qwen2.5_7B_Instruct

license:apache-2.0
2
10

ZeroSearch_google_V2_Qwen2.5_7B_Instruct

2
2

ZeroSearch_wiki_V2_Llama_3.2_3B

llama
2
0

ZeroSearch_google_v1_Qwen2.5_3B_Instruct

license:apache-2.0
1
3

ZeroSearch_google_v1_Llama_3.2_3B

llama
1
1

ZeroSearch_google_v1_Llama_3.2_3B_Instruct

llama
1
1

ZeroSearch_google_v1_Qwen2.5_3B

license:apache-2.0
1
0

ZeroSearch_wiki_V2_Qwen2.5_7B

1
0

WebSailor

More details are presented at https://github.com/Alibaba-NLP/WebAgent.

license:apache-2.0
0
22

new-impl

license:apache-2.0
0
18

LaSER-Qwen3-0.6B

license:mit
0
3

qwen2-impl

license:apache-2.0
0
3

LaSER-Qwen3-4B

license:mit
0
2

UVRB

license:apache-2.0
0
2

LaSER-Qwen3-8B

license:mit
0
1