huawei-noah
25 models
TinyBERT_General_4L_312D

TinyBERT: Distilling BERT for Natural Language Understanding

TinyBERT is 7.5x smaller and 9.4x faster at inference than BERT-base, and it achieves competitive performance on natural language understanding tasks. It performs a novel Transformer distillation at both the pre-training and the task-specific learning stages. In general distillation, we use the original BERT-base without fine-tuning as the teacher and a large-scale text corpus as the learning data. By performing the Transformer …
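The TinyBERT checkpoints are ordinary BERT-style encoders, so they load with the standard Hugging Face transformers API. A minimal sketch, assuming transformers and torch are installed (the sentence and variable names are illustrative):

```python
# Minimal sketch: load the general-distilled 4-layer TinyBERT checkpoint
# as a plain BERT encoder via Hugging Face transformers.
from transformers import AutoModel, AutoTokenizer

model_id = "huawei-noah/TinyBERT_General_4L_312D"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModel.from_pretrained(model_id)

# Encode a sentence and take the [CLS] hidden state; its width is 312,
# matching the "312D" in the model name.
inputs = tokenizer("TinyBERT is small and fast.", return_tensors="pt")
outputs = model(**inputs)
cls_embedding = outputs.last_hidden_state[:, 0]  # shape: (1, 312)
print(cls_embedding.shape)
```

For the task-specific stage described above, the same checkpoint would typically be wrapped in a sequence-classification head and fine-tuned (or distilled from a fine-tuned teacher) on the downstream task.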
| Model | License | Downloads | Likes |
|---|---|---:|---:|
| TinyBERT_General_4L_312D | — | 83,872 | 68 |
| TinyBERT_General_6L_768D | — | 6,195 | 8 |
| TinyBERT_4L_zh | — | 1,418 | 16 |
| EntityCS-39-MLM-xlmr-base | apache-2.0 | 15 | 0 |
| TinyBERT_6L_zh | — | 12 | 7 |
| DynaBERT_MNLI | — | 9 | 1 |
| JABERv2 | apache-2.0 | 9 | 0 |
| DynaBERT_SST-2 | — | 4 | 1 |
| TernaryBERT_MNLI | — | 4 | 0 |
| TernaryBERT_SST-2 | — | 4 | 0 |
| AutoTinyBERT-S4 | — | 3 | 0 |
| JABERv2-6L | apache-2.0 | 3 | 0 |
| MOASpec-Llama-3-8B-Instruct | — | 2 | 5 |
| EntityCS-39-WEP-xlmr-base | apache-2.0 | 2 | 2 |
| pangu-CodeCLM-300m | — | 2 | 1 |
| AT5Sv2 | apache-2.0 | 2 | 0 |
| pycodegpt-CodeCLM-partial-100m | — | 2 | 0 |
| pangu-CodeCLM-full-300m | — | 1 | 3 |
| AutoTinyBERT-S1 | — | 1 | 0 |
| AutoTinyBERT-S3 | — | 1 | 0 |
| AutoTinyBERT-KD-S3 | — | 1 | 0 |
| AutoTinyBERT-KD-S4 | — | 1 | 0 |
| pycodegpt-CodeCLM-100m | — | 1 | 0 |
| Grad-TTS | — | 0 | 2 |
| AT5B | apache-2.0 | 0 | 1 |