Using Wav2Vec 2.0 from HuggingFace and Fairseq with SpeechBrain
This tutorial describes how to combine (use and finetune) pretrained wav2vec2 models coming both from fairseq or HuggingFace. Any wav2vec 2.0 model trained with the fairseq toolkit or integrated to the transformers interface of HuggingFace can be then plugged to SpeechBrain to approach a speech-related task: automatic speech recognition, speaker recognition, spoken language understanding ...
Open in Google Colab