Smiles Davis, Veo 3 with native audio is now in the Gemini API

Veo 3 is now available in the Gemini API

You can now generate high-fidelity, 720p videos with native audio using Veo 3 and Veo 3 Fast, available in paid preview in the Gemini API.

Veo 3 brings your prompts to life, creating 8-second videos with native audio. It lets you generate video from a text prompt, an initial image, or a combination of both to guide the style and starting frame. It can create a wide range of visual styles and natively generate dialogue in multiple languages, as well as sound effects and ambient noise.

You can use our SDKs for Python, JavaScript, and Go, or make a direct REST API call with cURL. Get your API key and start generating.

import time

from google import genai

# The client reads the API key from the environment (for example GEMINI_API_KEY)
client = genai.Client()

prompt = """A close up of two people staring at a cryptic drawing on a wall, torchlight flickering.
A man murmurs, 'This must be it. That's the secret code.' The woman looks at him and whispers excitedly, 'What did you find?'"""

# Start the generation job
operation = client.models.generate_videos(
    model="veo-3.0-generate-preview",
    prompt=prompt,
)

# Poll for the result
while not operation.done:
    print("Waiting for video generation to complete...")
    time.sleep(10)
    operation = client.operations.get(operation)

# Download the final video
video = operation.response.generated_videos[0]
client.files.download(file=video.video)  # fetch the video bytes so save() can write them
video.video.save("dialogue_example.mp4")
print("Generated video saved to dialogue_example.mp4")
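
The polling loop above waits indefinitely if a job never finishes. As a rough sketch only, a small wrapper can add a time limit. The helper name wait_for_operation and its parameters are hypothetical and not part of the SDK; it assumes only that the operation object exposes a done attribute and that a refresh callable (such as client.operations.get) returns the updated operation.

```python
import time
from typing import Any, Callable


def wait_for_operation(
    operation: Any,
    refresh: Callable[[Any], Any],
    poll_seconds: float = 10.0,
    timeout_seconds: float = 600.0,
    sleep: Callable[[float], None] = time.sleep,
) -> Any:
    """Poll until operation.done is true, or raise TimeoutError.

    Hypothetical helper for illustration; not part of the google-genai SDK.
    `refresh` is any callable that takes the current operation and returns
    its latest state.
    """
    deadline = time.monotonic() + timeout_seconds
    while not operation.done:
        # Give up once the time budget is spent instead of looping forever
        if time.monotonic() >= deadline:
            raise TimeoutError(
                f"operation not done after {timeout_seconds} seconds"
            )
        sleep(poll_seconds)
        operation = refresh(operation)
    return operation
```

With the client from the sample above, the while loop could then be replaced by operation = wait_for_operation(operation, client.operations.get). The default 10-second interval matches the sample; the 600-second limit is an arbitrary choice, not a documented bound on generation time.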

Select a billing-enabled Google Cloud project for your API key and get started with Veo 3.

The Google AI Studio team
