Search

22 results

Clear filters
  • AUG. 26, 2025 / Gemini

    Introducing Gemini 2.5 Flash Image, our state-of-the-art image model

    Gemini 2.5 Flash Image is a new state-of-the-art image generation and editing model that allows for blending multiple images, maintaining character consistency, and targeted transformations using natural language, leveraging Gemini's world knowledge, now available through the Gemini API, Google AI Studio, and Vertex AI.

    Introducing Gemini 2.5 Flash Image
  • AUG. 15, 2025 / Google AI Studio

    Announcing Imagen 4 Fast and the general availability of the Imagen 4 family in the Gemini API

    Google announces the general availability of Imagen 4, its advanced text-to-image model, in the Gemini API and Google AI Studio, featuring significant improvements in text rendering. The new Imagen 4 Fast model, designed for speed and rapid image generation, is now available alongside Imagen 4 and Imagen 4 Ultra, with Imagen 4 and Imagen 4 Ultra also supporting up to 2K resolution image generation.

    Imagen 4 Fast and the generally availability of the Imagen 4 family in the Gemini API
  • JULY 17, 2025 / Gemini

    Build with Veo 3, now available in the Gemini API

    Veo 3, Google’s latest AI video generation model, is now available in paid preview via the Gemini API and Google AI Studio. Unveiled at Google I/O 2025, Veo 3 can generate both video and synchronized audio, including dialogue, background sounds, and even animal noises. This model delivers realistic visuals, natural lighting, and physics, with accurate lip syncing and sound that matches on-screen action.

    Build with Veo 3, now available in the Gemini API and Google AI Studio
  • JULY 14, 2025 / Gemini

    Gemini Embedding now generally available in the Gemini API

    The Gemini Embedding text model is now generally available in the Gemini API and Vertex AI. This versatile model has consistently ranked #1 on the MTEB Multilingual leaderboard since its experimental launch in March, supports over 100 languages, has a 2048 maximum input token length, and is priced at $0.15 per 1M input tokens.

    Gemini Embedding now generally available in the Gemini API
  • JUNE 24, 2025 / Gemini

    Gemini 2.5 for robotics and embodied intelligence

    Gemini 2.5 Pro and Flash are transforming robotics by enhancing coding, reasoning, and multimodal capabilities, including spatial understanding. These models are used for semantic scene understanding, code generation for robot control, and building interactive applications with the Live API, with a strong emphasis on safety improvements and community applications.

    Gemini 2.5 for robotics and embodied intelligence
  • JUNE 24, 2025 / Gemini

    Imagen 4 is now available in the Gemini API and Google AI Studio

    Imagen 4, Google's advanced text-to-image model, is now available in paid preview via the Gemini API and Google AI Studio, offering significant quality improvements, especially for text generation within images. The Imagen 4 family includes Imagen 4 for general tasks and Imagen 4 Ultra for high-precision prompt adherence, with all generated images featuring a non-visible SynthID watermark.

    Imagen 4 is now available on Gemini API and Google AI Studio
  • MAY 23, 2025 / Gemini

    Gemini API I/O updates

    Announcing new features and models for the Gemini API, with the introduction of Gemini 2.5 Flash Preview with improved reasoning and efficiency, Gemini 2.5 Pro and Flash text-to-speech supporting multiple languages and speakers, and Gemini 2.5 Flash native audio dialog for conversational AI.

    Gemini_API_metadata
  • MAY 21, 2025 / Google AI Studio

    An upgraded dev experience in Google AI Studio

    Google AI Studio has been upgraded to enhance the developer experience, featuring native code generation with Gemini 2.5 Pro, agentic tools, and enhanced multimodal generation capabilities, plus new features like the Build tab, Live API, and improved tools for building sophisticated AI applications.

    google-io-event-meta
  • MAY 20, 2025 / Android

    What you should know from the Google I/O 2025 Developer keynote

    Top announcements from Google I/O 2025 focus on building across Google platforms and innovating with AI models from Google DeepMind, with key focus on new tools, APIs, and features designed to enhance developer productivity and create AI-powered experiences using Gemini, Android, Firebase, and web.

    What you should know from the Google I/O 2025 Developer keynote
  • MAY 9, 2025 / Cloud

    Google AI for game developers

    Revisit announcements from this year's Games Developer Conference (GDC). Explore how Gemma and Gemini models can aid in building AI experiences in games with the launch of Gemma 3, the Unity plugin, its application in a sample game, and scaling games with generative AI in Google Cloud.

    Google AI for Game Developers