- Google Developers Blog

JULY 22, 2025 / Gemini

Gemini 2.5 Flash-Lite is now stable and generally available

Gemini 2.5 Flash-Lite, previously in preview, is now stable and generally available. This cost-efficient model is ~1.5x faster than 2.0 Flash-Lite and 2.0 Flash, offers high quality, and includes 2.5 family features like a 1 million-token context window and multimodality.

Gemini 2.5 Flash is making it easy to build with the Gemini API in Google AI Studio

JULY 21, 2025 / Gemini

Conversational image segmentation with Gemini 2.5

Gemini's advanced capability for conversational image segmentation allows intuitive interaction with visual data by understanding complex phrases, conditional logic, and abstract concepts, streamlining developer experience and opening doors for new applications in media editing, safety monitoring, and damage assessment.

JULY 17, 2025 / Gemini

Build with Veo 3, now available in the Gemini API

Veo 3, Google’s latest AI video generation model, is now available in paid preview via the Gemini API and Google AI Studio. Unveiled at Google I/O 2025, Veo 3 can generate both video and synchronized audio, including dialogue, background sounds, and even animal noises. This model delivers realistic visuals, natural lighting, and physics, with accurate lip syncing and sound that matches on-screen action.

JULY 16, 2025 / Gemini

Simplify your Agent "vibe building" flow with ADK and Gemini CLI

The updated Agent Development Kit (ADK) simplifies and accelerates the process of building AI agents by providing the CLI with a deep, cost-effective understanding of the ADK framework, allowing developers to quickly ideate, generate, test, and improve functional agents through conversational prompts, eliminating friction and keeping them in a productive "flow" state.

ADK + Gemini CLI: Supercharge Your Agent Building Vibe

JULY 14, 2025 / Gemini

Gemini Embedding now generally available in the Gemini API

The Gemini Embedding text model is now generally available in the Gemini API and Vertex AI. This versatile model has consistently ranked #1 on the MTEB Multilingual leaderboard since its experimental launch in March, supports over 100 languages, has a 2048 maximum input token length, and is priced at $0.15 per 1M input tokens.

JULY 10, 2025 / Gemini

Announcing GenAI Processors: Build powerful and flexible Gemini applications

GenAI Processors is a new open-source Python library from Google DeepMind designed to simplify the development of AI applications, especially those handling multimodal input and requiring real-time responsiveness, by providing a consistent "Processor" interface for all steps from input handling to model calls and output processing, for seamless chaining and concurrent execution.

Announcing GenAI Processors: Streamline your Gemini application development

JULY 7, 2025 / Gemini

Batch Mode in the Gemini API: Process more for less

The new batch mode in the Gemini API is designed for high-throughput, non-latency-critical AI workloads, simplifying large jobs by handling scheduling and processing, and making tasks like data analysis, bulk content creation, and model evaluation more cost-effective and scalable, so developers can process large volumes of data efficiently.

Scale your AI workloads with batch mode in the Gemini API

JUNE 25, 2025 / Gemini

Simulating a neural operating system with Gemini 2.5 Flash-Lite

A research prototype simulating a neural operating system generates UI in real-time adapting to user interactions with Gemini 2.5 Flash-Lite, using interaction tracing for contextual awareness, streaming the UI for responsiveness, and achieving statefulness with an in-memory UI graph.

Behind the prototype: Simulating a neural operating system with Gemini

JUNE 24, 2025 / Gemini

Supercharge your notebooks: The new AI-first Google Colab is now available to everyone

The new AI-first Google Colab enhances productivity with improvements powered by features like iterative querying for conversational coding, a next-generation Data Science Agent for autonomous workflows, and effortless code transformation. Early adopters report a dramatic productivity boost, accelerating ML projects, debugging code faster, and effortlessly creating high-quality visualizations.