Search

17 results

Clear filters
  • JULY 10, 2025 / Gemini

    Announcing GenAI Processors: Build powerful and flexible Gemini applications

    GenAI Processors is a new open-source Python library from Google DeepMind designed to simplify the development of AI applications, especially those handling multimodal input and requiring real-time responsiveness, by providing a consistent "Processor" interface for all steps from input handling to model calls and output processing, for seamless chaining and concurrent execution.

    Announcing GenAI Processors: Streamline your Gemini application development
  • JULY 7, 2025 / Gemini

    Batch Mode in the Gemini API: Process more for less

    The new batch mode in the Gemini API is designed for high-throughput, non-latency-critical AI workloads, simplifying large jobs by handling scheduling and processing, and making tasks like data analysis, bulk content creation, and model evaluation more cost-effective and scalable, so developers can process large volumes of data efficiently.

    Scale your AI workloads with batch mode in the Gemini API
  • JUNE 24, 2025 / Gemini

    Imagen 4 is now available in the Gemini API and Google AI Studio

    Imagen 4, Google's advanced text-to-image model, is now available in paid preview via the Gemini API and Google AI Studio, offering significant quality improvements, especially for text generation within images. The Imagen 4 family includes Imagen 4 for general tasks and Imagen 4 Ultra for high-precision prompt adherence, with all generated images featuring a non-visible SynthID watermark.

    Imagen 4 is now available on Gemini API and Google AI Studio
  • JUNE 24, 2025 / Gemini

    Gemini 2.5 for robotics and embodied intelligence

    Gemini 2.5 Pro and Flash are transforming robotics by enhancing coding, reasoning, and multimodal capabilities, including spatial understanding. These models are used for semantic scene understanding, code generation for robot control, and building interactive applications with the Live API, with a strong emphasis on safety improvements and community applications.

    Gemini 2.5 for robotics and embodied intelligence
  • MAY 28, 2025 / Gemini

    Exploring the Magic Mirror: an interactive experience powered by the Gemini models

    The Magic Mirror project utilizes the Gemini API, including the Live API, Function Calling, and Grounding with Google Search, to create an interactive and dynamic experience, demonstrating the power of the Gemini models to generate visuals, tell stories, and provide real-time information through a familiar object.

    Exploring the Magic Mirror: an interactive experience powered by the Gemini models
  • MAY 9, 2025 / DeepMind

    Advancing the frontier of video understanding with Gemini 2.5

    Gemini 2.5 marks a major leap in video understanding, achieving state-of-the-art performance on key video understanding benchmarks and being able to seamlessly use audio-visual information with code and other data formats.

    2.5Pro_Metadata_VideoUnderstanding
  • MAY 8, 2025 / Gemini

    Gemini 2.5 Models now support implicit caching

    The rollout of implicit caching in the Gemini API expands on the existing explicit caching API, providing an "always on" caching system which offers automatic cost savings to developers using Gemini 2.5 models and continued availability of the explicit caching API for guaranteed savings.

    Gemini 2.5 Models now support Implicit Caching
  • APRIL 29, 2025 / Gemini

    How It’s Made: Little Language Lessons uses Gemini’s multilingual capabilities to personalize language learning

    Little Language Lessons, a project leveraging Gemini's API and Cloud services to generate content, translate, and provide text-to-speech functionalities, includes vocabulary lessons, slang practice, and object recognition for language learning.

    How it's made: Little Language Lessons
  • APRIL 23, 2025 / Mobile

    Get ready for Google I/O: Program lineup revealed

    Google I/O's agenda is live, with keynotes and sessions scheduled for May 20-21, focusing on AI advancements, Android development, and web technologies. Register now to explore the full program, join us during the event for livestreams, on-demand sessions, and codelabs.

    Google I/O 2025 program lineup
  • APRIL 17, 2025 / Gemini

    Start building with Gemini 2.5 Flash

    Gemini 2.5 Flash is in preview, offering improved reasoning capabilities through a "thinking" process that developers can control for cost and latency tradeoffs. This updated version aims to provide a cost-effective solution for complex tasks, balancing performance and price.

    Gemini 2.5 Flash ai.dev