gigant
romanian-wav2vec2
---
language:
- ro
license: apache-2.0
tags:
- automatic-speech-recognition
- hf-asr-leaderboard
- robust-speech-event
datasets:
- mozilla-foundation/common_voice_8_0
- gigant/romanian_speech_synthesis_0_8_1
base_model: facebook/wav2vec2-xls-r-300m
model-index:
- name: wav2vec2-ro-300m_01
  results:
  - task:
      type: automatic-speech-recognition
      name: Automatic Speech Recognition
    dataset:
      name: Robust Speech Event
      type: speech-recognition-community-v2/dev_data
      args: ro
    metrics:
    - type: wer
      value: 46.9
whisper-medium-romanian
This model is a fine-tuned version of openai/whisper-medium on the Common Voice 11.0 dataset and the Romanian speech synthesis corpus. It achieves the following results on the evaluation set:

- eval_loss: 0.06453
- eval_wer: 4.717
- epoch: 7.03
- step: 3500

The architecture is the same as openai/whisper-medium.

The model was trained on the Common Voice 11.0 dataset (`train+validation+other` splits) and the Romanian speech synthesis corpus, and was tested on the `test` split of the Common Voice 11.0 dataset.

The following hyperparameters were used during training:

- learning_rate: 1e-05
- train_batch_size: 32
- eval_batch_size: 32
- seed: 42
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- lr_scheduler_type: linear
- lr_scheduler_warmup_steps: 500
- training_steps: 5000
- mixed_precision_training: Native AMP

Framework versions:

- Transformers 4.26.0.dev0
- Pytorch 1.13.0+cu117
- Datasets 2.7.1.dev0
- Tokenizers 0.13.2
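The learning-rate settings listed above (peak rate 1e-05, linear scheduler, 500 warmup steps, 5000 training steps) can be illustrated with a small sketch. It assumes that "linear" means a linear ramp from zero during warmup followed by a linear decay to zero, which is how Transformers' `get_linear_schedule_with_warmup` behaves; the function name `linear_lr` is hypothetical and is not part of the original training script.

```python
def linear_lr(step, peak_lr=1e-5, warmup_steps=500, total_steps=5000):
    """Learning rate at an optimizer step under linear warmup and linear decay.

    Assumption: this mirrors a standard linear schedule with warmup; the
    defaults are the hyperparameter values reported in this model card.
    """
    if step < warmup_steps:
        # Warmup phase: ramp linearly from 0 up to the peak learning rate.
        return peak_lr * step / max(1, warmup_steps)
    # Decay phase: fall linearly from the peak to 0 at total_steps.
    remaining = max(0, total_steps - step)
    return peak_lr * remaining / max(1, total_steps - warmup_steps)


if __name__ == "__main__":
    # Print the rate at a few points, including step 3500 (the reported checkpoint).
    for step in (0, 250, 500, 3500, 5000):
        print(f"step {step:>4}: lr = {linear_lr(step):.3e}")
```

Under this assumption, the reported checkpoint at step 3500 would have been reached with the learning rate already decayed to one third of its peak value.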