- Google Developers Blog

JULY 10, 2025 / Gemini

Announcing GenAI Processors: Build powerful and flexible Gemini applications

GenAI Processors is a new open-source Python library from Google DeepMind designed to simplify the development of AI applications, especially those handling multimodal input and requiring real-time responsiveness, by providing a consistent "Processor" interface for all steps from input handling to model calls and output processing, for seamless chaining and concurrent execution.

Announcing GenAI Processors: Streamline your Gemini application development

JULY 7, 2025 / Gemini

Batch Mode in the Gemini API: Process more for less

The new batch mode in the Gemini API is designed for high-throughput, non-latency-critical AI workloads, simplifying large jobs by handling scheduling and processing, and making tasks like data analysis, bulk content creation, and model evaluation more cost-effective and scalable, so developers can process large volumes of data efficiently.

Scale your AI workloads with batch mode in the Gemini API

JUNE 24, 2025 / Gemini

Imagen 4 is now available in the Gemini API and Google AI Studio

Imagen 4, Google's advanced text-to-image model, is now available in paid preview via the Gemini API and Google AI Studio, offering significant quality improvements, especially for text generation within images. The Imagen 4 family includes Imagen 4 for general tasks and Imagen 4 Ultra for high-precision prompt adherence, with all generated images featuring a non-visible SynthID watermark.

Imagen 4 is now available on Gemini API and Google AI Studio

JUNE 24, 2025 / Gemini

Gemini 2.5 for robotics and embodied intelligence

Gemini 2.5 Pro and Flash are transforming robotics by enhancing coding, reasoning, and multimodal capabilities, including spatial understanding. These models are used for semantic scene understanding, code generation for robot control, and building interactive applications with the Live API, with a strong emphasis on safety improvements and community applications.

MAY 28, 2025 / Gemini

Exploring the Magic Mirror: an interactive experience powered by the Gemini models

The Magic Mirror project utilizes the Gemini API, including the Live API, Function Calling, and Grounding with Google Search, to create an interactive and dynamic experience, demonstrating the power of the Gemini models to generate visuals, tell stories, and provide real-time information through a familiar object.

MAY 9, 2025 / DeepMind

Advancing the frontier of video understanding with Gemini 2.5

Gemini 2.5 marks a major leap in video understanding, achieving state-of-the-art performance on key video understanding benchmarks and being able to seamlessly use audio-visual information with code and other data formats.

MAY 8, 2025 / Gemini

Gemini 2.5 Models now support implicit caching

The rollout of implicit caching in the Gemini API expands on the existing explicit caching API, providing an "always on" caching system which offers automatic cost savings to developers using Gemini 2.5 models and continued availability of the explicit caching API for guaranteed savings.

APRIL 29, 2025 / Gemini

How It’s Made: Little Language Lessons uses Gemini’s multilingual capabilities to personalize language learning

Little Language Lessons, a project leveraging Gemini's API and Cloud services to generate content, translate, and provide text-to-speech functionalities, includes vocabulary lessons, slang practice, and object recognition for language learning.

APRIL 23, 2025 / Mobile

Get ready for Google I/O: Program lineup revealed

Google I/O's agenda is live, with keynotes and sessions scheduled for May 20-21, focusing on AI advancements, Android development, and web technologies. Register now to explore the full program, join us during the event for livestreams, on-demand sessions, and codelabs.

APRIL 17, 2025 / Gemini

Start building with Gemini 2.5 Flash

Gemini 2.5 Flash is in preview, offering improved reasoning capabilities through a "thinking" process that developers can control for cost and latency tradeoffs. This updated version aims to provide a cost-effective solution for complex tasks, balancing performance and price.

Search

Content Type

Product

Technology