MahmoudAshraf
3 models • 1 total models in database
Sort by:
mms-300m-1130-forced-aligner
Forced Alignment with Hugging Face CTC Models This Python package provides an efficient way to perform forced alignment between text and audio using Hugging Face's pretrained models. it also features an improved implementation to use much less memory than TorchAudio forced alignment API. The model checkpoint uploaded here is a conversion from torchaudio to HF Transformers for the MMS-300M checkpoint trained on forced alignment dataset
—
4,856,983
63
acft-whisper-large-v3-turbo
license:apache-2.0
14
2
acft-whisper-large-v3
NaNK
license:apache-2.0
5
0