Posts by Alisa Fortin

13 results

Clear filters
  • OCT. 15, 2025 / AI

    Introducing Veo 3.1 and new creative capabilities in the Gemini API

    Google is releasing Veo 3.1 and Veo 3.1 Fast, an updated video generation model, in paid preview via the Gemini API. This version offers richer native audio, greater narrative control, and enhanced image-to-video capabilities. New features include guiding generation with reference images, extending existing Veo videos, and generating transitions between frames. Companies like Promise Studios, Latitude, and Whering are already using Veo 3.1 for various applications.

    Veo3.1_16x9_meta
  • OCT. 2, 2025 / AI

    Gemini 2.5 Flash Image now ready for production with new aspect ratios

    Our state-of-the-art image generation and editing model which has captured the imagination of the wo...

    image2
  • SEPT. 8, 2025 / AI

    Veo 3 and Veo 3 Fast – new pricing, new configurations and better resolution

    Today's Veo updates include support for vertical format (9:16) and 1080p HD outputs, along with new, lower pricing for Veo 3 ($0.40/second) and Veo 3 Fast ($0.15/second). These models are now stable for production use in the Gemini API. The MediaSim demo app showcases how Gemini's multimodal capabilities combine with Veo 3 for media simulations.

    veo3-generally-available-social
  • AUG. 28, 2025 / AI

    How to prompt Gemini 2.5 Flash Image Generation for the best results

    Detailed prompting techniques and best practices for various applications, including photorealistic scenes, stylized illustrations, product mockups, and more using Google's newly released Gemini 2.5 Flash Image; a natively multimodal model capable of generating, editing, and composing images using text, supporting capabilities like text-to-image, image editing, style transfer, and multi-image composition.

    Gemini 2.5 Flash Image
  • AUG. 26, 2025 / Gemini

    Introducing Gemini 2.5 Flash Image, our state-of-the-art image model

    Gemini 2.5 Flash Image is a new state-of-the-art image generation and editing model that allows for blending multiple images, maintaining character consistency, and targeted transformations using natural language, leveraging Gemini's world knowledge, now available through the Gemini API, Google AI Studio, and Vertex AI.

    Introducing Gemini 2.5 Flash Image
  • AUG. 18, 2025 / Gemini

    URL context tool for Gemini API now generally available

    The Gemini API's URL Context tool is now generally available, allowing developers to ground prompts using web content instead of manual uploads. This release expands support to PDFs and images.

    URL context tool for Gemini API now generally available
  • AUG. 15, 2025 / Google AI Studio

    Announcing Imagen 4 Fast and the general availability of the Imagen 4 family in the Gemini API

    Google announces the general availability of Imagen 4, its advanced text-to-image model, in the Gemini API and Google AI Studio, featuring significant improvements in text rendering. The new Imagen 4 Fast model, designed for speed and rapid image generation, is now available alongside Imagen 4 and Imagen 4 Ultra, with Imagen 4 and Imagen 4 Ultra also supporting up to 2K resolution image generation.

    Imagen 4 Fast and the generally availability of the Imagen 4 family in the Gemini API
  • JULY 31, 2025 / AI

    Veo 3 Fast and new image-to-video capabilities

    Google introduces Veo 3 Fast, an optimized model for speed and price, along with new image-to-video capabilities for both Veo 3 and Veo 3 Fast, enabling developers to efficiently create high-quality video content from text or still images, with varying pricing based on the model and audio inclusion, now available in the Gemini API.

    Build with Veo 3 Fast and new image-to-video capabilities, now available in the Gemini API
  • JULY 17, 2025 / Gemini

    Build with Veo 3, now available in the Gemini API

    Veo 3, Google’s latest AI video generation model, is now available in paid preview via the Gemini API and Google AI Studio. Unveiled at Google I/O 2025, Veo 3 can generate both video and synchronized audio, including dialogue, background sounds, and even animal noises. This model delivers realistic visuals, natural lighting, and physics, with accurate lip syncing and sound that matches on-screen action.

    Build with Veo 3, now available in the Gemini API and Google AI Studio
  • JUNE 24, 2025 / Gemini

    Imagen 4 is now available in the Gemini API and Google AI Studio

    Imagen 4, Google's advanced text-to-image model, is now available in paid preview via the Gemini API and Google AI Studio, offering significant quality improvements, especially for text generation within images. The Imagen 4 family includes Imagen 4 for general tasks and Imagen 4 Ultra for high-precision prompt adherence, with all generated images featuring a non-visible SynthID watermark.

    Imagen 4 is now available on Gemini API and Google AI Studio