Wav2Vec2 Large XLSR 53 Japanese

Downloads
Hugging Face
3.8M
42
License
Updated
11/3/2025
by
jonatasgrosman

This model is designed for automatic speech recognition in Japanese, utilizing the Wav2Vec2 architecture. It is trained on the Common Voice dataset and evaluates performance using Word Error Rate (WER) and Character Error Rate (CER) metrics. The model is tagged for audio processing and fine-tuning in speech recognition tasks.

Audio Model
PYTORCH

Quick Info

Released
3/2/2022
Framework
PYTORCH

Resources