Search

7 results

Clear filters
  • JUNE 26, 2025 / Gemma

    Introducing Gemma 3n: The developer guide

    The Gemma 3n model has been fully released, building on the success of previous Gemma models and bringing advanced on-device multimodal capabilities to edge devices with unprecedented performance. Explore Gemma 3n's innovations, including its mobile-first architecture, MatFormer technology, Per-Layer Embeddings, KV Cache Sharing, and new audio and MobileNet-V5 vision encoders, and how developers can start building with it today.

    Introducing Gemma 3n: The Developer Guide
  • JUNE 24, 2025 / Gemini

    Gemini 2.5 for robotics and embodied intelligence

    Gemini 2.5 Pro and Flash are transforming robotics by enhancing coding, reasoning, and multimodal capabilities, including spatial understanding. These models are used for semantic scene understanding, code generation for robot control, and building interactive applications with the Live API, with a strong emphasis on safety improvements and community applications.

    Gemini 2.5 for robotics and embodied intelligence
  • MAY 20, 2025 / Gemma

    Announcing Gemma 3n preview: powerful, efficient, mobile-first AI

    Gemma 3n is a cutting-edge open model designed for fast, multimodal AI on devices, featuring optimized performance, unique flexibility with a 2-in-1 model, and expanded multimodal understanding with audio, empowering developers to build live, interactive applications and sophisticated audio-centric experiences.

    Gemma 3n
  • MAY 9, 2025 / DeepMind

    Advancing the frontier of video understanding with Gemini 2.5

    Gemini 2.5 marks a major leap in video understanding, achieving state-of-the-art performance on key video understanding benchmarks and being able to seamlessly use audio-visual information with code and other data formats.

    2.5Pro_Metadata_VideoUnderstanding
  • APRIL 30, 2025 / Gemma

    Gemma explained: What’s new in Gemma 3

    Gemma 3's new features include vision-language capabilities and architectural changes for improved memory efficiency and longer context handling compared to previous Gemma models.

    What's new in Gemma-3
  • APRIL 23, 2025 / Gemini

    Achieve real-time interaction: Build with the Live API

    Explore real world applications for the Live API for Gemini models, now updated to include enhanced features for real-time audio, video, and text processing, improved session management, control over interactions, and richer output options.

    gemini-live-api-meta
  • NOV. 20, 2024 / Gemini

    OpusClip achieves 30% cost savings in visual description processing with Gemini Flash

    OpusClip utilizes Gemini 1.5 Flash's multimodal capabilities to enhance video understanding and streamline content creation, leading to cost savings and increased engagement.

    OpusClip_metadata