wav442letter

Fully convolutional speech-to-text model based on Facebook's Wav2Letter. Developed alongside Andrew Schallwig and Matt Palazzolo for EECS 442 at the University of Michigan.

The original paper can be found here.

Our results are summarized below, with Facebook's original results on the left and ours on the right. Our goal was to replicate Facebook's results with far fewer computational resources. We did not match the original numbers, but we achieved a reasonable approximation given that we used 0.3% of the training data and 30% of the trainable parameters of the original model.

The model was built in PyTorch and trained on the dev-clean subset of the LibriSpeech ASR corpus, available here.
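
For readers unfamiliar with the approach, the following is a minimal sketch of what a fully convolutional speech-to-text model can look like in PyTorch. It is not the architecture from the notebook: the class name `TinyConvASR`, the layer sizes, the 40-dimensional input features, and the 29-symbol output alphabet (blank, space, apostrophe, and 26 letters) are all assumptions chosen for illustration. The loss is also an assumption: the Wav2Letter paper proposes its own ASG criterion and compares it against CTC, and this sketch uses CTC only because PyTorch ships it as `nn.CTCLoss`.

```python
import torch
import torch.nn as nn


class TinyConvASR(nn.Module):
    """Hypothetical Wav2Letter-style model: a stack of 1-D convolutions that
    maps feature frames to per-frame character log-probabilities."""

    def __init__(self, n_features: int = 40, n_classes: int = 29):
        super().__init__()
        self.layers = nn.Sequential(
            # Strided first layer halves the time resolution.
            nn.Conv1d(n_features, 128, kernel_size=11, stride=2, padding=5),
            nn.ReLU(),
            nn.Conv1d(128, 128, kernel_size=7, padding=3),
            nn.ReLU(),
            # A 1x1 convolution acts as the per-frame classifier.
            nn.Conv1d(128, n_classes, kernel_size=1),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, n_features, time) -> (batch, n_classes, time_out)
        return self.layers(x).log_softmax(dim=1)


model = TinyConvASR()
features = torch.randn(2, 40, 200)       # dummy batch, not real audio
log_probs = model(features)              # shape (2, 29, 100)

# nn.CTCLoss expects log-probabilities shaped (time, batch, classes).
ctc = nn.CTCLoss(blank=0)
targets = torch.randint(1, 29, (2, 20))  # dummy character indices (no blanks)
input_lengths = torch.full((2,), log_probs.shape[2], dtype=torch.long)
target_lengths = torch.full((2,), 20, dtype=torch.long)
loss = ctc(log_probs.permute(2, 0, 1), targets, input_lengths, target_lengths)
```

Because every layer is a convolution over time, the same weights apply to utterances of any length, which is the main appeal of the fully convolutional design.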

