Cnam-LMSSC

27 models

wav2vec2-french-phonemizer · license:mit · 38,422 downloads · 8 likes
wav2vec2-french-phonemizer-v2 · license:mit · 2,146 downloads · 1 like
wav2vec2-italian-phonemizer · license:mit · 65 downloads · 1 like
phonemizer_headset_microphone · license:mit · 12 downloads · 4 likes
phonemizer_temple_vibration_pickup · license:mit · 7 downloads · 2 likes
EBEN_soft_in_ear_microphone · license:mit · 7 downloads · 2 likes
phonemizer_throat_microphone · license:mit · 6 downloads · 2 likes
phonemizer_forehead_accelerometer · license:mit · 4 downloads · 2 likes
phonemizer_rigid_in_ear_microphone · license:mit · 3 downloads · 2 likes
EBEN_temple_vibration_pickup · license:mit · 3 downloads · 2 likes
EBEN_reverse_temple_vibration_pickup · license:mit · 3 downloads · 0 likes
EBEN_reverse_forehead_accelerometer · license:mit · 3 downloads · 0 likes
EBEN_noisy_forehead_accelerometer · license:mit · 3 downloads · 0 likes
phonemizer_soft_in_ear_microphone · license:mit · 2 downloads · 2 likes
EBEN_reverse_rigid_in_ear_microphone · license:mit · 2 downloads · 0 likes
EBEN_noisy_rigid_in_ear_microphone · license:mit · 2 downloads · 0 likes
EBEN_noisy_temple_vibration_pickup · license:mit · 2 downloads · 0 likes
EBEN_forehead_accelerometer · license:mit · 1 download · 2 likes
EBEN_rigid_in_ear_microphone · license:mit · 1 download · 2 likes
EBEN_throat_microphone · license:mit · 1 download · 2 likes
EBEN_reverse_soft_in_ear_microphone · license:mit · 1 download · 0 likes
EBEN_noisy_throat_microphone · license:mit · 1 download · 0 likes

Vibravox EBEN Models

license:mit · 0 downloads · 5 likes

Master Model Card: Vibravox Audio Bandwidth Extension Models

This master model card serves as an entry point for exploring multiple audio bandwidth extension (BWE) models trained on different sensor data from the Vibravox dataset. These models are designed to enhance the audio quality of body-conducted speech by denoising it and regenerating the mid and high frequencies from the low-frequency content alone. Each model is trained on a specific sensor to address various audio-capture scenarios using body-conducted sound and vibration sensors.

Disclaimer

Each of these models has been trained for a specific non-conventional speech sensor and is intended to be used with in-domain data. Please be advised that using these models outside their intended sensor data may result in suboptimal performance.

Usage

All models are trained using Configurable EBEN (see the publication in IEEE TASLP - arXiv link) and adapted to different sensor inputs. They are intended to be used at a sample rate of 16 kHz; a hedged loading sketch follows this card.

Training Procedure

Detailed instructions for reproducing the experiments are available in the jhauret/vibravox GitHub repository and in the Vibravox paper on arXiv. The following models are available, each trained on a different sensor using either the `speech_clean` subset or a synthetic mix of the `speech_clean` and `speechless_noisy` subsets of https://huggingface.co/datasets/Cnam-LMSSC/vibravox:

| Transducer | EBEN configuration | Model trained on `speech_clean` | Model trained on mixed `speech_clean` and `speechless_noisy` |
|:---------------------------|:---------------------|:---------------------|:---------------------|
| In-ear comply foam-embedded microphone | M=4, P=2, Q=4 | EBEN_soft_in_ear_microphone | EBEN_noisy_soft_in_ear_microphone |
| In-ear rigid earpiece-embedded microphone | M=4, P=2, Q=4 | EBEN_rigid_in_ear_microphone | EBEN_noisy_rigid_in_ear_microphone |
| Forehead miniature vibration sensor | M=4, P=4, Q=4 | EBEN_forehead_accelerometer | EBEN_noisy_forehead_accelerometer |
| Temple vibration pickup | M=4, P=1, Q=4 | EBEN_temple_vibration_pickup | EBEN_noisy_temple_vibration_pickup |
| Laryngophone | M=4, P=2, Q=4 | EBEN_throat_microphone | EBEN_noisy_throat_microphone |
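The sketch below illustrates the 16 kHz constraint from the Usage section: it resamples a body-conducted recording and passes it through one of the bandwidth-extension checkpoints. It is a minimal sketch, not the card's official snippet: the import path `vibravox.torch_modules.dnn.eben_generator.EBENGenerator`, its `from_pretrained` constructor, the calling convention, and the file name `my_throat_mic_recording.wav` are assumptions here; consult the individual model cards and the jhauret/vibravox repository for the exact entry point.

```python
# Hedged sketch: enhancing a body-conducted recording with an EBEN BWE model.
# Assumes the vibravox package is installed and exposes the generator class
# below with a `from_pretrained` constructor (verify against the model cards).
import torch
import torchaudio

# Assumed import path, based on the jhauret/vibravox repository layout.
from vibravox.torch_modules.dnn.eben_generator import EBENGenerator

# Pick the checkpoint matching the sensor that produced the recording
# (using out-of-domain sensor data degrades results, as the Disclaimer notes).
model = EBENGenerator.from_pretrained("Cnam-LMSSC/EBEN_throat_microphone")
model.eval()

# "my_throat_mic_recording.wav" is a placeholder path.
waveform, sample_rate = torchaudio.load("my_throat_mic_recording.wav")

# The models are intended to run at 16 kHz, so resample if needed.
if sample_rate != 16_000:
    waveform = torchaudio.functional.resample(
        waveform, orig_freq=sample_rate, new_freq=16_000
    )

with torch.no_grad():
    # Assumed calling convention: (batch, channel, time) in, enhanced audio out.
    output = model(waveform.unsqueeze(0))

# Some EBEN variants may also return auxiliary band decompositions.
enhanced = output[0] if isinstance(output, tuple) else output

torchaudio.save("enhanced_16kHz.wav", enhanced.squeeze(0), 16_000)
```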

vibravox_phonemizers · license:mit · 0 downloads · 4 likes
wav2vec2-spanish-phonemizer · license:mit · 0 downloads · 1 like
vibravox-phonemes-tokenizer · license:mit · 0 downloads · 1 like
EBEN_noisy_soft_in_ear_microphone · license:mit · 0 downloads · 1 like