Posts by Alisa Fortin

11 results

Clear filters
  • SEPT. 8, 2025 / AI

    Veo 3 and Veo 3 Fast – new pricing, new configurations and better resolution

    Today, we’re launching three big Veo updates: support for vertical format outputs (9:16 aspect ratio...

    veo3-generally-available-social
  • AUG. 28, 2025 / AI

    How to prompt Gemini 2.5 Flash Image Generation for the best results

    Detailed prompting techniques and best practices for various applications, including photorealistic scenes, stylized illustrations, product mockups, and more using Google's newly released Gemini 2.5 Flash Image; a natively multimodal model capable of generating, editing, and composing images using text, supporting capabilities like text-to-image, image editing, style transfer, and multi-image composition.

    Gemini 2.5 Flash Image
  • AUG. 26, 2025 / Gemini

    Introducing Gemini 2.5 Flash Image, our state-of-the-art image model

    Gemini 2.5 Flash Image is a new state-of-the-art image generation and editing model that allows for blending multiple images, maintaining character consistency, and targeted transformations using natural language, leveraging Gemini's world knowledge, now available through the Gemini API, Google AI Studio, and Vertex AI.

    Introducing Gemini 2.5 Flash Image
  • AUG. 18, 2025 / Gemini

    URL context tool for Gemini API now generally available

    The Gemini API's URL Context tool is now generally available, allowing developers to ground prompts using web content instead of manual uploads. This release expands support to PDFs and images.

    URL context tool for Gemini API now generally available
  • AUG. 15, 2025 / Google AI Studio

    Announcing Imagen 4 Fast and the general availability of the Imagen 4 family in the Gemini API

    Google announces the general availability of Imagen 4, its advanced text-to-image model, in the Gemini API and Google AI Studio, featuring significant improvements in text rendering. The new Imagen 4 Fast model, designed for speed and rapid image generation, is now available alongside Imagen 4 and Imagen 4 Ultra, with Imagen 4 and Imagen 4 Ultra also supporting up to 2K resolution image generation.

    Imagen 4 Fast and the generally availability of the Imagen 4 family in the Gemini API
  • JULY 31, 2025 / AI

    Veo 3 Fast and new image-to-video capabilities

    Google introduces Veo 3 Fast, an optimized model for speed and price, along with new image-to-video capabilities for both Veo 3 and Veo 3 Fast, enabling developers to efficiently create high-quality video content from text or still images, with varying pricing based on the model and audio inclusion, now available in the Gemini API.

    Build with Veo 3 Fast and new image-to-video capabilities, now available in the Gemini API
  • JULY 17, 2025 / Gemini

    Build with Veo 3, now available in the Gemini API

    Veo 3, Google’s latest AI video generation model, is now available in paid preview via the Gemini API and Google AI Studio. Unveiled at Google I/O 2025, Veo 3 can generate both video and synchronized audio, including dialogue, background sounds, and even animal noises. This model delivers realistic visuals, natural lighting, and physics, with accurate lip syncing and sound that matches on-screen action.

    Build with Veo 3, now available in the Gemini API and Google AI Studio
  • JUNE 24, 2025 / Gemini

    Imagen 4 is now available in the Gemini API and Google AI Studio

    Imagen 4, Google's advanced text-to-image model, is now available in paid preview via the Gemini API and Google AI Studio, offering significant quality improvements, especially for text generation within images. The Imagen 4 family includes Imagen 4 for general tasks and Imagen 4 Ultra for high-precision prompt adherence, with all generated images featuring a non-visible SynthID watermark.

    Imagen 4 is now available on Gemini API and Google AI Studio
  • MAY 23, 2025 / Gemini

    Gemini API I/O updates

    Announcing new features and models for the Gemini API, with the introduction of Gemini 2.5 Flash Preview with improved reasoning and efficiency, Gemini 2.5 Pro and Flash text-to-speech supporting multiple languages and speakers, and Gemini 2.5 Flash native audio dialog for conversational AI.

    Gemini_API_metadata
  • APRIL 15, 2025 / Gemini

    Bring your ideas to life: Veo 2 video generation available for developers

    Generate high-quality videos from text and image prompts with Veo 2, a video generation model, now generally available in the Gemini API and Google AI Studio to enhance your content creation and marketing efforts.

    Veo 2 now generally available in the Gemini API and Google AI Studio