- Google Developers Blog

JUNE 10, 2026 / AI

DiffusionGemma: The Developer Guide

DiffusionGemma is an experimental text-generation model built on the Gemma 4 architecture that uses diffusion-based parallel generation instead of token-by-token autoregression, enabling much faster inference, bidirectional context awareness, and real-time self-correction while remaining deployable on consumer GPUs. Its architecture generates and refines 256-token blocks in parallel through iterative denoising, allowing it to handle complex constraint-based tasks such as Sudoku more effectively than traditional language models and demonstrating strong gains from fine-tuning. The model integrates with vLLM and other popular inference frameworks, giving developers access to a new non-autoregressive approach that combines high performance, efficient long-context scaling, and straightforward customization and deployment.
JUNE 3, 2026 / AI

Gemma 4 12B: The Developer Guide

The newly released Gemma 4 12B is a dense, multimodal model designed for high-performance local AI execution on consumer devices. By introducing a novel, encoder-free architecture, it bypasses traditional visual and audio encoders to feed multimodal data directly into the LLM backbone.
JUNE 26, 2025 / Gemma

Introducing Gemma 3n: The developer guide

The Gemma 3n model has been fully released, building on the success of previous Gemma models and bringing advanced on-device multimodal capabilities to edge devices with unprecedented performance. Explore Gemma 3n's innovations, including its mobile-first architecture, MatFormer technology, Per-Layer Embeddings, KV Cache Sharing, and new audio and MobileNet-V5 vision encoders, and how developers can start building with it today.
MARCH 12, 2025 / Gemma

Introducing Gemma 3: The Developer Guide

Gemma 3 is a new, advanced version of the Gemma open-model family featuring multimodality, longer context windows, and improved language capabilities, with various sizes and deployment options for developers to experiment.
FEB. 19, 2025 / Gemma

Introducing PaliGemma 2 mix: A vision-language model for multiple tasks

PaliGemma 2 mix, an upgraded vision-language model, is now available, offering capabilities like image captioning, OCR, and object detection in various sizes.

Posts by Omar Sanseviero

Content Type

Product

Technology

DiffusionGemma: The Developer Guide

Gemma 4 12B: The Developer Guide

Introducing Gemma 3n: The developer guide

Introducing Gemma 3: The Developer Guide

Introducing PaliGemma 2 mix: A vision-language model for multiple tasks

Content Type

Product

Technology