Search

5 results

Clear filters
  • MAY 20, 2025 / Gemma

    Announcing Gemma 3n preview: powerful, efficient, mobile-first AI

    Gemma 3n is a cutting-edge open model designed for fast, multimodal AI on devices, featuring optimized performance, unique flexibility with a 2-in-1 model, and expanded multimodal understanding with audio, empowering developers to build live, interactive applications and sophisticated audio-centric experiences.

    Gemma 3n
  • MAY 9, 2025 / DeepMind

    Advancing the frontier of video understanding with Gemini 2.5

    Gemini 2.5 marks a major leap in video understanding, achieving state-of-the-art performance on key video understanding benchmarks and being able to seamlessly use audio-visual information with code and other data formats.

    2.5Pro_Metadata_VideoUnderstanding
  • APRIL 30, 2025 / Gemma

    Gemma explained: What’s new in Gemma 3

    Gemma 3's new features include vision-language capabilities and architectural changes for improved memory efficiency and longer context handling compared to previous Gemma models.

    What's new in Gemma-3
  • APRIL 23, 2025 / Gemini

    Achieve real-time interaction: Build with the Live API

    Explore real world applications for the Live API for Gemini models, now updated to include enhanced features for real-time audio, video, and text processing, improved session management, control over interactions, and richer output options.

    gemini-live-api-meta
  • NOV. 20, 2024 / Gemini

    OpusClip achieves 30% cost savings in visual description processing with Gemini Flash

    OpusClip utilizes Gemini 1.5 Flash's multimodal capabilities to enhance video understanding and streamline content creation, leading to cost savings and increased engagement.

    OpusClip_metadata