#

wav2vec2

Here are 103 public repositories matching this topic...

lectly / wav2vec2-large-xlsr-53-egyptian-arabic

Fine-tuning XLS-R for Multi-Lingual ASR with 🤗 Transformers

automatic-speech-recognition wav2vec2 xlsr

Updated Jul 12, 2022
Jupyter Notebook

keshavbhandari / Audioneme

AI model for speech disorder detection

wav2vec2 speech-disorder speech-disorder-detection child-speech

Updated Sep 7, 2022
Python

aitor-alvarez / large-speech-models

Fine-tuning Multilingual Large Speech Recognition Models: Wav2vec and Whisper

whisper asr asr-model speech-recognition-model wav2vec2 arabic-speech-recognition large-speech-models finetuning-wav2vec finetuning-whisper

Updated Jan 23, 2024
Python

subhasis-ai / Hindi-ASR-Wav2Vec2

This repository demonstrates development of Hindi ASR model using transformers.

transformers hindi language-model asr wav2vec2

Updated Jan 17, 2022
Jupyter Notebook

kalindasiaminwe / ChitongaASR

A natural language processing and machine learning project for a low resource langauge in Zambia.

nlp whisper nlp-machine-learning low-resource-languages asr-model zambian-developers wav2vec2 low-resource-nlp

Updated Dec 21, 2022
Jupyter Notebook

nomnomnonono / SoundEffect-Search

Application to search for similar sound effects by voice and title.

python sound-effects machine-learning deep-learning scraping poetry pytorch artificial-intelligence gradio bert vector-search huggingface wav2vec2

Updated Apr 29, 2023
Python

seb5433 / wav2vec2-speaker-recognition

Speaker recognition task using wav2vec2 model.

speaker-recognition fine-tuning speaker-recognition-systems wav2vec2

Updated Apr 25, 2024
Python

aryanxxvii / lark

Speech Assessment API in NextJS

machine-learning nextjs pronunciation speech-recognition prisma huggingface phoneme-recognition wav2vec2 llm

Updated May 26, 2024
TypeScript

viksit-siddhant / compare2023

SER and audio classification using both a Wav2Vec2 based model and an ASR->Bert pipeline, as well as utilizing a multimodal late-fusion model

transformers audio-classification bert asr speech-emotion-recognition multimodal wav2vec2

Updated Jul 4, 2023
Python

JingleCate / SpeechEmotionRecog

A simple Speech Emotion Recognition (SER) project based on Wav2Vec2.

audio classification wav2vec2

Updated Apr 22, 2024
Python

manthanthakker / AudioClassification

This repository contains code/papers/research on Speech or Audio Classification

audioclassification huggingface-transformers wav2vec2 gujarati-transliteration

Updated Feb 1, 2023
Jupyter Notebook

RajGothi / Improving-Automatic-Speech-Recognition-with-Dialect-Specific-Language-Models

This repository contains the implementation of our published paper titled 'Improving Automatic Speech Recognition with Dialect-Specific Language Models,' presented at SPECOM'23.

pytorch transformer automatic-speech-recognition language-model asr indian-language self-supervised-learning wav2vec2 bengali-asr bhojpuri-asr dialect-lm

Updated Dec 18, 2023
Jupyter Notebook

ahammedrohit / Speech-Recognition-using-wav2vec2-with-minimum-GPU

Python Colab for speech recognition with wav2vec2. Since wav2vec2 requires heavy GPU I've come up with a way to run this on Google Colab as well as local machines with minimum GPU.

text-to-speech speech-recognition wav2vec2

Updated Jan 19, 2022
Jupyter Notebook

moncefbenaicha / SpokenNER

Spoken NER implementation based on Wav2Vec2-XLS-R with experiments on transfer learning

speech-recognition transfer-learning ner asr spoken-language-understanding wav2vec2 xlsr spoken-ner

Updated May 7, 2024
Python

egorsmkv / wav2vec2-hidet

transformers pytorch wav2vec2 hidet

Updated Oct 4, 2023
Python

slinusc / speaker_identification_evaluation

Evaluating the Effectiveness of Transformer Layers in Wav2Vec 2.0, XLS-R, and Whisper for Speaker Identification Tasks

whisper wav2vec2 xls-r

Updated Jun 25, 2024
Jupyter Notebook

fatou1526 / ASR_wav2vec2

This repo contains codes about loading audio data, training wav2vec2 model with custom language dataset

audio docker dataset gradio fastapi huggingface wav2vec2 wolof

Updated Sep 25, 2023
Jupyter Notebook

wngh1187 / IPET

Pytorch implementation of INTEGRATED PARAMETER-EFFICIENT TUNING FOR GENERAL-PURPOSE AUDIO MODELS

transfer-learning wav2vec2 general-purpose-audio-model audio-spectrogram-transformer prompt-based-learning

Updated Sep 14, 2023
Python

appledora / wav2vec2_scripts

A modular codebase to process audio dataset, generate custom tokenizer, finetune and infer wav2vec2 model on custom dataset.

end-to-end inference speech-to-text fine-tuning huggingface wav2vec2

Updated Nov 12, 2023
Python

nisheethjaiswal / Speech-to-Text

Speech to text implementation using transformers in PyTorch.

transformers pytorch speech-to-text wav2vec2

Updated Apr 30, 2021
Jupyter Notebook

Improve this page

Add a description, image, and links to the wav2vec2 topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the wav2vec2 topic, visit your repo's landing page and select "manage topics."