  • JULY 16, 2025 / AI

    Unlock Gemini’s reasoning: A step-by-step guide to logprobs on Vertex AI

    The `logprobs` feature is now officially available in the Gemini API on Vertex AI, providing insight into the model's decision-making by exposing probability scores for chosen and alternative tokens. This step-by-step guide walks you through enabling and interpreting the feature and applying it to powerful use cases such as confident classification, dynamic autocomplete, and quantitative RAG evaluation.

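    A minimal sketch of enabling and reading logprobs with the google-genai SDK on Vertex AI; the project ID, model name, and prompt are placeholders, and the exact field names should be checked against the current SDK docs.

    ```python
    # Sketch: request log probabilities in a generate_content call (google-genai SDK).
    # Project, location, model name, and prompt below are placeholder assumptions.
    from google import genai
    from google.genai import types

    client = genai.Client(vertexai=True, project="your-project-id", location="us-central1")

    response = client.models.generate_content(
        model="gemini-2.5-flash",
        contents="Classify the sentiment of: 'The battery life is fantastic.'",
        config=types.GenerateContentConfig(
            response_logprobs=True,  # return log probabilities for the chosen tokens
            logprobs=3,              # also return the top 3 alternatives at each step
        ),
    )

    # Each chosen token carries a log probability that can be compared against the
    # alternatives to gauge how confident the model was at that step.
    for chosen in response.candidates[0].logprobs_result.chosen_candidates:
        print(chosen.token, chosen.log_probability)
    ```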
  • JULY 16, 2025 / Cloud

    Stanford’s Marin foundation model: The first fully open model developed using JAX

    The Marin project aims to expand the definition of 'open' in AI to include the entire scientific process, not just the model itself, by making the complete development journey accessible and reproducible. This effort, powered by the JAX framework and the Levanter library, allows foundation models to be deeply scrutinized, trusted, and built upon, fostering a more transparent future for AI research.

  • JULY 16, 2025 / Gemini

    Simplify your Agent "vibe building" flow with ADK and Gemini CLI

    The updated Agent Development Kit (ADK) simplifies and accelerates agent building by giving the Gemini CLI a deep, cost-effective understanding of the ADK framework. Developers can quickly ideate, generate, test, and improve functional agents through conversational prompts, eliminating friction and staying in a productive "flow" state.

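    For context, a minimal sketch of the kind of ADK agent such a session might produce; the agent name, model, and weather tool are illustrative assumptions, not details from the post.

    ```python
    # Sketch: a small ADK agent with a single Python-function tool (names hypothetical).
    from google.adk.agents import Agent

    def get_weather(city: str) -> str:
        """Return a canned weather report for the given city (placeholder tool)."""
        return f"It is sunny in {city} today."

    root_agent = Agent(
        name="weather_agent",
        model="gemini-2.5-flash",
        instruction="Answer weather questions by calling the get_weather tool.",
        tools=[get_weather],
    )
    ```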
  • JULY 14, 2025 / Cloud

    Enterprise truth in action: Apigee API hub fueling powerful Developer Portals

    The Apigee API hub and Developer Portals are distinct but interconnected parts of the Apigee platform that help organizations discover and manage APIs for different personas, unlocking API potential and accelerating innovation.

  • JULY 14, 2025 / Gemini

    Gemini Embedding now generally available in the Gemini API

    The Gemini Embedding text model is now generally available in the Gemini API and Vertex AI. This versatile model has consistently ranked #1 on the MTEB Multilingual leaderboard since its experimental launch in March, supports over 100 languages and a maximum input length of 2,048 tokens, and is priced at $0.15 per 1M input tokens.

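    A minimal sketch of calling the model through the google-genai SDK; the model ID and the reduced output dimensionality are assumptions meant only to illustrate the API shape.

    ```python
    # Sketch: embed a couple of strings with the Gemini Embedding model (google-genai SDK).
    from google import genai
    from google.genai import types

    client = genai.Client()  # expects GEMINI_API_KEY in the environment

    result = client.models.embed_content(
        model="gemini-embedding-001",  # assumed GA model ID; check the docs
        contents=["What is the meaning of life?", "How do embeddings work?"],
        config=types.EmbedContentConfig(output_dimensionality=768),  # optional truncation
    )

    for embedding in result.embeddings:
        print(len(embedding.values))  # one vector per input string
    ```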
  • JULY 10, 2025 / Gemini

    Announcing GenAI Processors: Build powerful and flexible Gemini applications

    GenAI Processors is a new open-source Python library from Google DeepMind designed to simplify the development of AI applications, especially those handling multimodal input and requiring real-time responsiveness. It provides a consistent "Processor" interface for every step, from input handling to model calls and output processing, enabling seamless chaining and concurrent execution.

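    A conceptual sketch of the Processor idea under stated assumptions: the module paths and class names (streams.stream_content, genai_model.GenaiModel) are recalled from the library's README and should be verified against its documentation before use.

    ```python
    # Conceptual sketch: a Processor consumes and produces an async stream of content
    # parts; module paths, class names, and arguments here are assumptions.
    import asyncio

    from genai_processors import streams
    from genai_processors.core import genai_model

    async def main():
        # A model call is itself a Processor; Processors can be chained with `+`
        # so that successive steps run concurrently over the same stream.
        model = genai_model.GenaiModel(api_key="YOUR_API_KEY", model_name="gemini-2.0-flash")

        input_stream = streams.stream_content(["Summarize GenAI Processors in one line."])

        async for part in model(input_stream):
            print(part.text)

    asyncio.run(main())
    ```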
  • JULY 10, 2025 / Cloud

    Advancing agentic AI development with Firebase Studio

    Updates in Firebase Studio include new Agent modes, foundational support for the Model Context Protocol (MCP), and Gemini CLI integration, all designed to redefine AI-assisted development, allowing developers to create full-stack applications from a single prompt and integrate powerful AI capabilities directly into their workflow.

  • JULY 9, 2025 / Gemma

    T5Gemma: A new collection of encoder-decoder Gemma models

    T5Gemma is a new family of encoder-decoder LLMs developed by converting and adapting pretrained decoder-only models based on the Gemma 2 framework, offering superior performance and efficiency compared to its decoder-only counterparts, particularly for tasks requiring deep input understanding, like summarization and translation.

  • JULY 7, 2025 / Gemini

    Batch Mode in the Gemini API: Process more for less

    The new batch mode in the Gemini API is designed for high-throughput, non-latency-critical AI workloads. It simplifies large jobs by handling scheduling and processing, making tasks like data analysis, bulk content creation, and model evaluation more cost-effective and scalable, so developers can process large volumes of data efficiently.

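    A minimal sketch of submitting an inline batch job with the google-genai SDK; the model ID, config keys, and polling states are assumptions based on the announcement, and large jobs would typically use an uploaded file rather than inline requests.

    ```python
    # Sketch: submit a small inline batch job and poll until it finishes.
    # Model ID, config keys, and state names are assumptions; verify against the docs.
    import time

    from google import genai

    client = genai.Client()  # expects GEMINI_API_KEY in the environment

    inline_requests = [
        {"contents": [{"role": "user", "parts": [{"text": "Summarize the history of the transistor."}]}]},
        {"contents": [{"role": "user", "parts": [{"text": "Write a haiku about batch processing."}]}]},
    ]

    job = client.batches.create(
        model="models/gemini-2.5-flash",
        src=inline_requests,
        config={"display_name": "example-batch-job"},
    )

    # Batch jobs run asynchronously, so poll until the job reaches a terminal state.
    terminal = {"JOB_STATE_SUCCEEDED", "JOB_STATE_FAILED", "JOB_STATE_CANCELLED"}
    while client.batches.get(name=job.name).state.name not in terminal:
        time.sleep(30)
    ```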
  • JUNE 26, 2025 / Gemma

    Introducing Gemma 3n: The developer guide

    The Gemma 3n model has been fully released, building on the success of previous Gemma models and bringing advanced on-device multimodal capabilities to edge devices with unprecedented performance. Explore Gemma 3n's innovations, including its mobile-first architecture, MatFormer technology, Per-Layer Embeddings, KV Cache Sharing, and new audio and MobileNet-V5 vision encoders, and how developers can start building with it today.
