- Google Developers Blog

OCT. 15, 2025 / AI

Introducing Veo 3.1 and new creative capabilities in the Gemini API

Google is releasing Veo 3.1 and Veo 3.1 Fast, an updated video generation model, in paid preview via the Gemini API. This version offers richer native audio, greater narrative control, and enhanced image-to-video capabilities. New features include guiding generation with reference images, extending existing Veo videos, and generating transitions between frames. Companies like Promise Studios, Latitude, and Whering are already using Veo 3.1 for various applications.

OCT. 2, 2025 / AI

Gemini 2.5 Flash Image now ready for production with new aspect ratios

Our state-of-the-art image generation and editing model which has captured the imagination of the wo...

SEPT. 8, 2025 / AI

Veo 3 and Veo 3 Fast – new pricing, new configurations and better resolution

Today's Veo updates include support for vertical format (9:16) and 1080p HD outputs, along with new, lower pricing for Veo 3 ($0.40/second) and Veo 3 Fast ($0.15/second). These models are now stable for production use in the Gemini API. The MediaSim demo app showcases how Gemini's multimodal capabilities combine with Veo 3 for media simulations.

AUG. 28, 2025 / AI

How to prompt Gemini 2.5 Flash Image Generation for the best results

Detailed prompting techniques and best practices for various applications, including photorealistic scenes, stylized illustrations, product mockups, and more using Google's newly released Gemini 2.5 Flash Image; a natively multimodal model capable of generating, editing, and composing images using text, supporting capabilities like text-to-image, image editing, style transfer, and multi-image composition.

AUG. 26, 2025 / Gemini

Introducing Gemini 2.5 Flash Image, our state-of-the-art image model

Gemini 2.5 Flash Image is a new state-of-the-art image generation and editing model that allows for blending multiple images, maintaining character consistency, and targeted transformations using natural language, leveraging Gemini's world knowledge, now available through the Gemini API, Google AI Studio, and Vertex AI.

AUG. 18, 2025 / Gemini

URL context tool for Gemini API now generally available

The Gemini API's URL Context tool is now generally available, allowing developers to ground prompts using web content instead of manual uploads. This release expands support to PDFs and images.

AUG. 15, 2025 / Google AI Studio

Announcing Imagen 4 Fast and the general availability of the Imagen 4 family in the Gemini API

Google announces the general availability of Imagen 4, its advanced text-to-image model, in the Gemini API and Google AI Studio, featuring significant improvements in text rendering. The new Imagen 4 Fast model, designed for speed and rapid image generation, is now available alongside Imagen 4 and Imagen 4 Ultra, with Imagen 4 and Imagen 4 Ultra also supporting up to 2K resolution image generation.

Imagen 4 Fast and the generally availability of the Imagen 4 family in the Gemini API

JULY 31, 2025 / AI

Veo 3 Fast and new image-to-video capabilities

Google introduces Veo 3 Fast, an optimized model for speed and price, along with new image-to-video capabilities for both Veo 3 and Veo 3 Fast, enabling developers to efficiently create high-quality video content from text or still images, with varying pricing based on the model and audio inclusion, now available in the Gemini API.

JULY 17, 2025 / Gemini

Build with Veo 3, now available in the Gemini API

Veo 3, Google’s latest AI video generation model, is now available in paid preview via the Gemini API and Google AI Studio. Unveiled at Google I/O 2025, Veo 3 can generate both video and synchronized audio, including dialogue, background sounds, and even animal noises. This model delivers realistic visuals, natural lighting, and physics, with accurate lip syncing and sound that matches on-screen action.

JUNE 24, 2025 / Gemini

Imagen 4 is now available in the Gemini API and Google AI Studio

Imagen 4, Google's advanced text-to-image model, is now available in paid preview via the Gemini API and Google AI Studio, offering significant quality improvements, especially for text generation within images. The Imagen 4 family includes Imagen 4 for general tasks and Imagen 4 Ultra for high-precision prompt adherence, with all generated images featuring a non-visible SynthID watermark.

Posts by Alisa Fortin

Content Type

Product

Technology