IbrahimAmin
Egyptian Arabic Wav2vec2 Xlsr 53
šŖšŖš¬ Egyptian Arabic ASR ā wav2vec2-large-xlsr-53 Fine-tuned This model is a fine-tuned version of omarxadel/wav2vec2-large-xlsr-53-arabic-egyptian, enhancing Egyptian Arabic, Modern Standard Arabic (MSA) and Gulf / Levantine Arabic for Automatic Speech Recognition. It was trained on a diverse combination of publicly available and custom-collected Arabic speech datasets, including: - šŗ YouTube Egyptian Arabic Speech (custom-curated) - š§ MASC (Media Arabic Speech Corpus) - š Common Voice 15 - Arabic - š» MGB-3 Broadcast Speech - šļø Arabic Speech Corpus - š Focused on real-life Egyptian Arabic speech (YouTube, spontaneous, conversational) - š Supports MSA and other Arabic dialects. - š Trained on both scripted and natural speech | Dialect | Coverage | | ---------------------------- | ------------ | | Egyptian Arabic | ā Primary | | Modern Standard Arabic (MSA) | ā Supported | | Gulf / Levantine | ā Supported | š£ļø Model Comparison on Common Voice 17.0 Arabic Subset (Test Set) | Model | WER (%) | | -------------------------------------------------- | ----------: | | `IbrahimAmin/egyptian-arabic-wav2vec2-xlsr-53` | 27.20 | | `jonatasgrosman/wav2vec2-large-xlsr-53-arabic` | 45.55 | | `AndrewMcDowell/wav2vec2-xls-r-300m-arabic` | 47.22 | | `openai/whisper-large-v3` | 52.36 | | `Ahmed107/hamsa-v0.6Q` | 53.27 | | `nadsoft/hamsa-v0.1-beta` | 65.60 | | `openai/whisper-medium` | 67.75 | | `openai/whisper-small` | 74.16 | | `omarxadel/wav2vec2-large-xlsr-53-arabic-egyptian` | 91.82 | | `arbml/wav2vec2-large-xlsr-53-arabic-egyptian` | 93.92 | | `mboushaba/whisper-large-v3-turbo-arabic` | 96.90 | \: Whisper models were decoded using beam search (`beamsize = 5`) and evaluated using `BasicTextNormalizer` with `removediacritics=False` and `splitletters=False`, applied to both predictions and reference text.