- Google Developers Blog

OCT. 15, 2025 / AI

Introducing Veo 3.1 and new creative capabilities in the Gemini API

Google is releasing Veo 3.1 and Veo 3.1 Fast, an updated video generation model, in paid preview via the Gemini API. This version offers richer native audio, greater narrative control, and enhanced image-to-video capabilities. New features include guiding generation with reference images, extending existing Veo videos, and generating transitions between frames. Companies like Promise Studios, Latitude, and Whering are already using Veo 3.1 for various applications.

AUG. 15, 2025 / Google AI Studio

Announcing Imagen 4 Fast and the general availability of the Imagen 4 family in the Gemini API

Google announces the general availability of Imagen 4, its advanced text-to-image model, in the Gemini API and Google AI Studio, featuring significant improvements in text rendering. The new Imagen 4 Fast model, designed for speed and rapid image generation, is now available alongside Imagen 4 and Imagen 4 Ultra, with Imagen 4 and Imagen 4 Ultra also supporting up to 2K resolution image generation.

Imagen 4 Fast and the generally availability of the Imagen 4 family in the Gemini API

AUG. 12, 2025 / Kaggle

Train a GPT2 model with JAX on TPU for free

Build and train a GPT2 model from scratch using JAX on Google TPUs, with a complete Python notebook for free-tier Colab or Kaggle. Learn how to define a hardware mesh, partition model parameters and input data for data parallelism, and optimize the model training process.

JULY 17, 2025 / Gemini

Build with Veo 3, now available in the Gemini API

Veo 3, Google’s latest AI video generation model, is now available in paid preview via the Gemini API and Google AI Studio. Unveiled at Google I/O 2025, Veo 3 can generate both video and synchronized audio, including dialogue, background sounds, and even animal noises. This model delivers realistic visuals, natural lighting, and physics, with accurate lip syncing and sound that matches on-screen action.

JULY 16, 2025 / Cloud

Stanford’s Marin foundation model: The first fully open model developed using JAX

The Marin project aims to expand the definition of 'open' in AI to include the entire scientific process, not just the model itself, by making the complete development journey accessible and reproducible. This effort, powered by the JAX framework and its Levanter tool, allows for deep scrutiny, trust in, and building upon foundation models, fostering a more transparent future for AI research.

JUNE 24, 2025 / Gemini

Supercharge your notebooks: The new AI-first Google Colab is now available to everyone

The new AI-first Google Colab enhances productivity with improvements powered by features like iterative querying for conversational coding, a next-generation Data Science Agent for autonomous workflows, and effortless code transformation. Early adopters report a dramatic productivity boost, accelerating ML projects, debugging code faster, and effortlessly creating high-quality visualizations.

JUNE 24, 2025 / Gemini

Imagen 4 is now available in the Gemini API and Google AI Studio

Imagen 4, Google's advanced text-to-image model, is now available in paid preview via the Gemini API and Google AI Studio, offering significant quality improvements, especially for text generation within images. The Imagen 4 family includes Imagen 4 for general tasks and Imagen 4 Ultra for high-precision prompt adherence, with all generated images featuring a non-visible SynthID watermark.

Imagen 4 is now available on Gemini API and Google AI Studio

JUNE 24, 2025 / Kaggle

Using KerasHub for easy end-to-end machine learning workflows with Hugging Face

KerasHub enables users to mix and match model architectures and weights across different machine learning frameworks, allowing checkpoints from sources like Hugging Face Hub (including those created with PyTorch) to be loaded into Keras models for use with JAX, PyTorch, or TensorFlow. This flexibility means you can leverage a vast array of community fine-tuned models while maintaining full control over your chosen backend framework.

How to load model weights from SafeTensors into KerasHub for multi-framework machine learning

JUNE 23, 2025 / Kaggle

Multilingual innovation in LLMs: How open models help unlock global communication

Developers adapt LLMs like Gemma for diverse languages and cultural contexts, demonstrating AI's potential to bridge global communication gaps by addressing challenges like translating ancient texts, localizing mathematical understanding, and enhancing cultural sensitivity in lyric translation.

JUNE 17, 2025 / Gemini

Gemini 2.5: Updates to our family of thinking models

Google is releasing updates to its Gemini 2.5 model family, including the generally available and stable Gemini 2.5 Pro and Flash, and the new Gemini 2.5 Flash-Lite "thinking models" in preview, offering enhanced performance and accuracy, with Flash-Lite providing a lower-cost option.

Search

Content Type

Product

Technology