How will Google’s Gemini generate music?

Question

Hans Steiner · Accepted Answer

A new audio capability lands in the Gemini app

Google has added a music-generation feature to Gemini, powered by DeepMind’s Lyria 3 audio model. The feature is available in beta inside the chatbot app and can produce short, 30-second pieces of music from a variety of prompts — text descriptions, images or video clips — and can also remix an existing track.

Lyria 3’s rollout is intentionally limited at first: Google positioned the capability as a way to quickly produce short musical snippets rather than full-length, polished songs. The company published the feature with supporting creative assets — Nano Banana cover art in some demos — and said the capability is available in multiple languages while the beta progresses.

What this change means for creators and platforms

The integration matters because it puts generative audio into a mainstream conversational surface, where users can produce musical ideas without specialist tools. Practical implications include:

Faster prototyping: Musicians, video producers and creators can generate quick musical beds and mood pieces without leaving the Gemini chat interface.
Copyright and rights questions: Short, AI-generated clips can still echo training data; the limits of how those clips may be used or monetized remain an open issue for rights holders and platforms.
Quality and scope: At 30 seconds, outputs are ideal for demos, background cues, or social clips, but not a replacement for fully produced tracks — at least in this early beta.

Google’s move signals that audio generation is becoming a first-class feature for major AI assistants. Expect iterative improvements in quality, longer outputs, and more explicit licensing and attribution controls as the feature matures.