Audio Generators

Audio Generators

Examples #

At Aalto University #

Speech #

Speech-to-Text #

Whisper, 2022 #

Music #

Magenta #

MusicLM #

OpenAI Jukebox, 2020 #

A neural net that generates music, including rudimentary singing, as raw audio in a variety of genres and artist styles.

OpenAI MuseNet, 2019 #

“We’ve created MuseNet, a deep neural network that can generate 4-minute musical compositions with 10 different instruments, and can combine styles from country to Mozart to the Beatles. MuseNet was not explicitly programmed with our understanding of music, but instead discovered patterns of harmony, rhythm, and style by learning to predict the next token in hundreds of thousands of MIDI files. MuseNet uses the same general-purpose unsupervised technology as GPT-2, a large-scale transformer model trained to predict the next token in a sequence, whether audio or text.”

Amadeus Code #