AmbientGen

Building a text-to-ambient-sound generator with AudioLDM2

View the Project on GitHub my-sonicase/ambientgen

AmbientGen 🌧️

A text-to-ambient-sound generator powered by AudioLDM2

Type a description like β€œgentle rain on a window with distant thunder” and get a generated ambient soundscape.

πŸš€ Try it

Live Demo on Hugging Face Spaces β†’

🎯 What is this?

This is a learning-in-public project documenting my journey into Generative AI for Audio. I’m building a text-to-ambient-sound app while studying the underlying models, papers, and techniques.

Features:

πŸ“ Blog

Follow the build process on the project blog:

  1. What is AudioLDM2 and why I’m using it
  2. First Sounds β€” What AudioLDM2 Can and Can’t Do
  3. Prompt Engineering for Audio β€” What Actually Works
  4. Building AmbientGen β€” From Notebook to Product

πŸ“š Reading List

A curated list of papers and resources on generative AI for audio and music β†’ Papers & Resources

πŸ› οΈ Tech Stack

Project Structure

ambientgen/
β”œβ”€β”€ blog/           # Blog posts (Markdown)
β”œβ”€β”€ app/            # Gradio application code
β”œβ”€β”€ experiments/    # Colab notebooks & experiment logs
β”œβ”€β”€ docs/           # Papers reading list & resources
└── README.md

License

MIT