Building a text-to-ambient-sound generator with AudioLDM2
A text-to-ambient-sound generator powered by AudioLDM2
Type a description like βgentle rain on a window with distant thunderβ and get a generated ambient soundscape.
Live Demo on Hugging Face Spaces β
This is a learning-in-public project documenting my journey into Generative AI for Audio. Iβm building a text-to-ambient-sound app while studying the underlying models, papers, and techniques.
Features:
Follow the build process on the project blog:
A curated list of papers and resources on generative AI for audio and music β Papers & Resources
ambientgen/
βββ blog/ # Blog posts (Markdown)
βββ app/ # Gradio application code
βββ experiments/ # Colab notebooks & experiment logs
βββ docs/ # Papers reading list & resources
βββ README.md
MIT