Wav2vec2 Large Xlsr Cantonese
Fine-tuned facebook/wav2vec2-large-xlsr-53 on Cantonese using the Common Voice. When using this model, make sure that your speech input is sampled at 16kHz.
The model can be used directly (without a language model) as follows:
The model can be evaluated as follows on the Chinese (Hong Kong) test data of Common Voice.
The Common Voice `train`, `validation` were used for training.