Home - Google Developers Blog

Featured articles

Get ready for Google I/O: Livestream schedule revealed

Google I/O returns May 19–20 to showcase major updates in AI, Android, Chrome, and Cloud, beginning with a keynote on the "agentic era" of development. The event will focus on new tools designed to automate complex workflows and simplify the creation of high-quality, AI-ready applications. Attendees can register to access live sessions, technical demos, and professional development resources both live and on-demand.

Get ready for Google I/O 2026

Google I/O returns May 19–20. Watch the livestreams for updates on Android, AI, Chrome, and Cloud. Registration is open on the Google I/O website.

Gemini 3 Flash is now available in Gemini CLI

Gemini 3 Flash is now available in Gemini CLI. It delivers Pro-grade coding performance at low latency and lower cost, matching Gemini 3 Pro's SWE-bench Verified score of 76%. It significantly outperforms Gemini 2.5 Pro, improving auto-routing and agentic coding. It is well suited to high-frequency development tasks: complex code generation, large-context work (such as processing 1,000-comment pull requests), and quick, reliable generation of load-testing scripts.

Build with Google Antigravity, our new agentic development platform

Introducing Google Antigravity, a new agentic development platform for orchestrating code. It combines an AI-powered Editor View with a Manager Surface to deploy agents that autonomously plan, execute, and verify complex tasks across your editor, terminal, and browser. Agents communicate progress via Artifacts (screenshots, recordings) for easy verification. Available now in public preview.

Real-World Agent Examples with Gemini 3

Gemini 3 is powering the next generation of reliable, production-ready AI agents. This post highlights 6 open-source framework collaborations (ADK, Agno, Browser Use, Eigent, Letta, mem0), demonstrating practical agentic workflows for tasks like deep search, multi-agent systems, browser and enterprise automation, and stateful agents with advanced memory. Clone the examples and start building today.

5 things to try with Gemini 3 Pro in Gemini CLI

Gemini 3 Pro is now integrated into Gemini CLI, unlocking state-of-the-art reasoning, agentic coding, and advanced tool use for enhanced developer productivity. It's available now for Google AI Ultra and paid Gemini API key subscribers (upgrade CLI to 0.16.x). Features include generating 3D apps and code from visual sketches, running complex shell commands, creating documentation, and debugging live Cloud Run services.

LiteRT: The Universal Framework for On-Device AI

LiteRT, the evolution of TFLite, is now the universal framework for on-device AI. It delivers up to 1.4x faster GPU performance, new NPU support, and streamlined GenAI deployment for models like Gemma.

Latest blogs

MAY 4, 2026

Supercharging LLM inference on Google TPUs: Achieving 3X speedups with diffusion-style speculative decoding

Researchers at UCSD have implemented DFlash, a block-diffusion speculative decoding method, on Google TPUs to bypass the sequential bottleneck of traditional autoregressive drafting. By "painting" entire blocks of candidate tokens in a single forward pass rather than predicting them one by one, the system achieved average speedups of 3.13x, with peak performance nearly double that of existing methods like EAGLE-3. This open-source integration into the vLLM ecosystem makes better use of TPU hardware by leveraging "free" parallel verification and high-quality draft predictions for complex reasoning tasks.
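
The draft-then-verify loop behind speculative decoding can be sketched in a few lines. This is a toy illustration under stated assumptions, not DFlash or the vLLM integration: `toy_target` and `toy_draft` are hypothetical stand-ins for the target and draft models, tokens are small integers, decoding is greedy, and verification runs in a Python loop where a real system would score the whole block in one parallel pass.

```python
from typing import Callable, List

Token = int
NextToken = Callable[[List[Token]], Token]
DraftBlock = Callable[[List[Token], int], List[Token]]


def greedy_decode(target_next: NextToken, prompt: List[Token], n: int) -> List[Token]:
    """Baseline: ask the target model for one token at a time."""
    out = list(prompt)
    for _ in range(n):
        out.append(target_next(out))
    return out[len(prompt):]


def speculative_decode(
    target_next: NextToken,
    draft_block: DraftBlock,
    prompt: List[Token],
    max_new_tokens: int,
    block_size: int = 4,
) -> List[Token]:
    """Greedy speculative decoding: draft a block, keep the verified prefix."""
    out = list(prompt)
    final_len = len(prompt) + max_new_tokens
    while len(out) < final_len:
        k = min(block_size, final_len - len(out))
        draft = draft_block(out, k)  # whole block proposed in one call
        if not draft:
            out.append(target_next(out))  # fall back so the loop always advances
            continue
        for tok in draft[:k]:
            expected = target_next(out)  # a real system verifies these in parallel
            if tok != expected:
                out.append(expected)  # replace the first wrong token, drop the rest
                break
            out.append(tok)  # draft token accepted
    return out[len(prompt):]


def toy_target(prefix: List[Token]) -> Token:
    """Hypothetical target model: a fixed deterministic rule."""
    return (prefix[-1] * 3 + 1) % 11 if prefix else 0


def toy_draft(prefix: List[Token], k: int) -> List[Token]:
    """Hypothetical draft model: follows the target rule but errs periodically."""
    ctx = list(prefix)
    block = []
    for _ in range(k):
        tok = toy_target(ctx)
        if len(ctx) % 5 == 0:
            tok = (tok + 1) % 11  # deliberate mistake to exercise rejection
        block.append(tok)
        ctx.append(tok)
    return block


if __name__ == "__main__":
    baseline = greedy_decode(toy_target, [1], 12)
    speculative = speculative_decode(toy_target, toy_draft, [1], 12)
    print(baseline == speculative)
```

Because every emitted token is either an accepted draft token or the target's own correction, the output matches plain greedy decoding of the target; any speedup comes from how many draft tokens survive verification per block.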

APRIL 30, 2026

Building with Gemini Embedding 2: Agentic multimodal RAG and beyond

Google has announced the general availability of Gemini Embedding 2, a unified model that maps text, images, video, audio, and documents into a single semantic space. This model allows developers to process interleaved multimodal inputs in a single request, significantly improving performance for tasks like agentic RAG, visual search, and content moderation. By supporting over 100 languages and offering features like task-specific prefixes and Matryoshka dimensionality reduction, the model provides a highly efficient and accurate foundation for building complex AI agents.
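
As a rough illustration of Matryoshka dimensionality reduction: embeddings trained this way are meant to stay useful when only their leading components are kept. The sketch below shows the client-side idea of truncating a vector and re-normalizing it before comparison. It assumes nothing about the Gemini API itself; the function names and the example vectors are made up for illustration, and a real service may offer its own way to request smaller outputs.

```python
import math
from typing import List


def truncate_embedding(vector: List[float], dims: int) -> List[float]:
    """Keep the first `dims` components and L2-normalize what remains."""
    if not 0 < dims <= len(vector):
        raise ValueError("dims must be between 1 and len(vector)")
    head = vector[:dims]
    norm = math.sqrt(sum(x * x for x in head))
    if norm == 0.0:
        raise ValueError("cannot normalize a zero vector")
    return [x / norm for x in head]


def cosine_similarity(a: List[float], b: List[float]) -> float:
    """Cosine similarity between two vectors of equal length."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


if __name__ == "__main__":
    # Made-up vectors standing in for full-size embeddings.
    query = [3.0, 4.0, 12.0, 0.5]
    doc = [3.0, 4.0, -1.0, 2.0]
    small_query = truncate_embedding(query, 2)
    small_doc = truncate_embedding(doc, 2)
    print(small_query)
    print(cosine_similarity(small_query, small_doc))
```

Re-normalizing after truncation keeps cosine and dot-product scores comparable across vectors; the trade-off is a smaller index and faster search against some loss of accuracy at lower dimensions.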

APRIL 29, 2026 / Cloud

Speeding Up AI: Bringing Google Colossus to PyTorch via GCSFS and Rapid Bucket

Google Cloud has introduced a high-performance integration that connects Rapid Storage directly to PyTorch via the fsspec interface to eliminate AI training bottlenecks. Built on Google’s Colossus architecture and bidirectional gRPC streaming, the solution offers up to 15 TiB/s of aggregate throughput and significantly lower latency. These improvements let developers speed up total training by 23% with no code changes beyond updating the storage bucket type.

AI

Supercharging LLM inference on Google TPUs: Achieving 3X speedups with diffusion-st…

Building with Gemini Embedding 2: Agentic multimodal RAG and beyond

Building real-world on-device AI with LiteRT and NPU

Agents CLI in Agent Platform: create to production in one CLI

Production-Ready AI Agents: 5 Lessons from Refactoring a Monolith

A2UI v0.9: The New Standard for Portable, Framework-Agnostic Generative UI

MaxText Expands Post-Training Capabilities: Introducing SFT and RL on Single-Host T…

Subagents have arrived in Gemini CLI

Build Better AI Agents: 5 Developer Tips from the Agent Bake-Off

Mobile

Building real-world on-device AI with LiteRT and NPU

A2UI v0.9: The New Standard for Portable, Framework-Agnostic Generative UI

New enhancements for merchant initiated transactions with the Google Pay API

Bring state-of-the-art agentic skills to the edge with Gemma 4

Jump to play: Building with Gemini & MediaPipe

On-Device Function Calling in Google AI Edge Gallery

LiteRT: The Universal Framework for On-Device AI

Introducing A2UI: An open project for agent-driven interfaces

MediaTek NPU and LiteRT: Powering the next generation of on-device AI

Web

A2UI v0.9: The New Standard for Portable, Framework-Agnostic Generative UI

New enhancements for merchant initiated transactions with the Google Pay API

Get ready for Google I/O: Livestream schedule revealed

Bring state-of-the-art agentic skills to the edge with Gemma 4

Supporting Google Account username change in your app

Jump to play: Building with Gemini & MediaPipe

Turn creative prompts into interactive XR experiences with Gemini

LiteRT: The Universal Framework for On-Device AI

Introducing A2UI: An open project for agent-driven interfaces

Cloud

Supercharging LLM inference on Google TPUs: Achieving 3X speedups with diffusion-st…

Speeding Up AI: Bringing Google Colossus to PyTorch via GCSFS and Rapid Bucket

Agents CLI in Agent Platform: create to production in one CLI

Production-Ready AI Agents: 5 Lessons from Refactoring a Monolith

MaxText Expands Post-Training Capabilities: Introducing SFT and RL on Single-Host T…

Build Better AI Agents: 5 Developer Tips from the Agent Bake-Off

TorchTPU: Running PyTorch Natively on TPUs at Google Scale

Developer’s Guide to Building ADK Agents with Skills

ADK Go 1.0 Arrives!