Fine-tuning XLS-R for Multi-Lingual ASR with 🤗 Transformers
-
Updated
Jul 12, 2022 - Jupyter Notebook
Fine-tuning XLS-R for Multi-Lingual ASR with 🤗 Transformers
AI model for speech disorder detection
Fine-tuning Multilingual Large Speech Recognition Models: Wav2vec and Whisper
This repository demonstrates development of Hindi ASR model using transformers.
A natural language processing and machine learning project for a low resource langauge in Zambia.
Application to search for similar sound effects by voice and title.
Speaker recognition task using wav2vec2 model.
Speech Assessment API in NextJS
SER and audio classification using both a Wav2Vec2 based model and an ASR->Bert pipeline, as well as utilizing a multimodal late-fusion model
A simple Speech Emotion Recognition (SER) project based on Wav2Vec2.
This repository contains code/papers/research on Speech or Audio Classification
This repository contains the implementation of our published paper titled 'Improving Automatic Speech Recognition with Dialect-Specific Language Models,' presented at SPECOM'23.
Python Colab for speech recognition with wav2vec2. Since wav2vec2 requires heavy GPU I've come up with a way to run this on Google Colab as well as local machines with minimum GPU.
Spoken NER implementation based on Wav2Vec2-XLS-R with experiments on transfer learning
Pytorch implementation of INTEGRATED PARAMETER-EFFICIENT TUNING FOR GENERAL-PURPOSE AUDIO MODELS
A modular codebase to process audio dataset, generate custom tokenizer, finetune and infer wav2vec2 model on custom dataset.
Speech to text implementation using transformers in PyTorch.
Add a description, image, and links to the wav2vec2 topic page so that developers can more easily learn about it.
To associate your repository with the wav2vec2 topic, visit your repo's landing page and select "manage topics."