Innovating Speech Synthesis: Hierarchical Variational Approach in HierSpeech++
In this tutorial, we look at HierSpeech++, one of the newest speech synthesis models, on Paperspace.
In this article, we introduce and break down Distil Whisper: a new release that offers up to a 6x speedup when running the Whisper model for audio transcription.
In this tutorial, we show how to clone voices with TorToise TTS, and discuss the steps needed to get the best cloning results.
This tutorial walks through the MusicGen demo and shows how to run it in a Gradient Notebook.
In this tutorial, we examine the Data2Vec model from Meta AI and show how to train your own model with a ready-to-use codebase in a Gradient Notebook.
In this article, we looked at the novel VALL-E TTS model, and showed how to train it within a Gradient Notebook using Libri Light and our own voice recordings.
In this tutorial, we show how Whisper can be used with MoviePy to automatically generate and overlay translated subtitles on any video sample. We then walk through setting up this process to run both within a Notebook and from an application served with Gradient Deployments.
In this article, we looked at the basic elements of an end-to-end Automatic Speech Recognition pipeline, the major challenges encountered with these pipelines, and some of the potential solutions.
We sit down with Tunebat and Specterr founder Oliver Reznik, who is using machine learning to build powerful applications for DJs and musicians.