Wednesday, November 15, 2023
Show HN: Vocal timing conditioned audio diffusion in real-time https://ift.tt/iQPXq0o
We've been cooking up a new experiment: record yourself singing or talking, and the app generates vocals that match your words and timings. It's backed by an end-to-end latent diffusion model that generates audio conditioned on both the style and the lyric timings, and it's quite fast. Your actual voice and melody are not used, only the transcription, and we don't store the recording. We've found it's a really natural way to control the output and dream up a song concept. Curious to hear what you think! https://ift.tt/57hljbV November 15, 2023 at 11:33PM