Audio Generators

Examples #

At Aalto University #

SOPI Research Group
Deep Learning of Audio GitHub

Speech #

Speech-to-Text #

Whisper, 2022 #

Whisper Website

Music #

Magenta #

Magenta Website

MusicLM #

MusicLM
Paper
MusicCaps Dataset
Exploring MusicCaps, the evaluation data released to accompany Google’s MusicLM text-to-music model
- Browsable and searchable interface for exploring the MusicCaps dataset

OpenAI Jukebox, 2020 #

A neural net that generates music, including rudimentary singing, as raw audio in a variety of genres and artist styles.

OpenAI Jukebox Website

OpenAI MuseNet, 2019 #

“We’ve created MuseNet, a deep neural network that can generate 4-minute musical compositions with 10 different instruments, and can combine styles from country to Mozart to the Beatles. MuseNet was not explicitly programmed with our understanding of music, but instead discovered patterns of harmony, rhythm, and style by learning to predict the next token in hundreds of thousands of MIDI files. MuseNet uses the same general-purpose unsupervised technology as GPT-2, a large-scale transformer model trained to predict the next token in a sequence, whether audio or text.”

OpenAI MuseNet Website
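
The "learning to predict the next token" idea in the quote above can be illustrated with a toy sketch. This is not MuseNet's method: MuseNet uses a large transformer, while the sketch below swaps in a simple bigram count model so the idea fits in a few lines. The corpus of MIDI note numbers and the function names (`train_bigram`, `predict_next`, `generate`) are hypothetical, chosen only for illustration.

```python
from collections import Counter, defaultdict


def train_bigram(sequences):
    """Count, for each token, how often each other token follows it."""
    counts = defaultdict(Counter)
    for seq in sequences:
        for current, following in zip(seq, seq[1:]):
            counts[current][following] += 1
    return counts


def predict_next(counts, token):
    """Return the most frequent follower of `token`, or None if unseen."""
    followers = counts.get(token)
    if not followers:
        return None
    return followers.most_common(1)[0][0]


def generate(counts, start, length):
    """Greedily extend `start` by up to `length` predicted tokens."""
    out = [start]
    for _ in range(length):
        nxt = predict_next(counts, out[-1])
        if nxt is None:
            break
        out.append(nxt)
    return out


# Hypothetical toy corpus: each melody is a list of MIDI note numbers
# (60 = middle C). Real training data would be far larger and would
# also encode timing, velocity, and instrument.
melodies = [
    [60, 62, 64, 65, 67],
    [60, 62, 64, 62, 60],
    [64, 65, 67, 65, 64],
]

model = train_bigram(melodies)
print(generate(model, 60, 4))
```

A count-based model only looks one token back, so it cannot capture the long-range harmony and style the quote describes; a transformer conditions on a much longer context, but the training objective (predict the next token) is the same in spirit.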

Amadeus Code #

Amadeus Code

Unprocessed links #

https://www.audiocipher.com/
https://www.reddit.com/r/dalle2/comments/ul7p0p/is_there_anything_like_dalle_for_music/
https://melobytes.com/en/
https://melobytes.com/en/app/image2music
https://www.youtube.com/watch?v=YJu0iXn-T_U
Uberduck Open Source Voice AI Community
https://thesoundofaiosr.github.io/
https://experiments.withgoogle.com/drum-machine
https://experiments.withgoogle.com/ai/sound-maker/view/