  • JULY 16, 2025 / Cloud

    Stanford’s Marin foundation model: The first fully open model developed using JAX

    The Marin project aims to expand the definition of "open" in AI to cover the entire scientific process, not just the model itself, by making the complete development journey accessible and reproducible. Powered by the JAX framework and the Levanter tool built on it, the effort lets others scrutinize, trust, and build on foundation models, fostering a more transparent future for AI research.

  • JULY 16, 2025 / Gemini

    Simplify your Agent "vibe building" flow with ADK and Gemini CLI

    The updated Agent Development Kit (ADK) gives the Gemini CLI a deep, cost-effective understanding of the ADK framework, which simplifies and accelerates building AI agents. Developers can quickly ideate, generate, test, and improve functional agents through conversational prompts, removing friction and staying in a productive "flow" state.

  • JULY 10, 2025 / Gemini

    Announcing GenAI Processors: Build powerful and flexible Gemini applications

    GenAI Processors is a new open-source Python library from Google DeepMind designed to simplify the development of AI applications, especially those that handle multimodal input and require real-time responsiveness. It provides a consistent "Processor" interface for every step, from input handling to model calls and output processing, so that steps can be chained seamlessly and executed concurrently.

  • JUNE 26, 2025 / Gemma

    Introducing Gemma 3n: The developer guide

    Gemma 3n has been fully released, building on the success of previous Gemma models and bringing advanced multimodal capabilities to edge devices with unprecedented performance. Explore Gemma 3n's innovations, including its mobile-first architecture, MatFormer technology, Per-Layer Embeddings, KV Cache Sharing, and new audio and MobileNet-V5 vision encoders, and learn how developers can start building with it today.

  • JUNE 24, 2025 / Gemini

    Gemini 2.5 for robotics and embodied intelligence

    Gemini 2.5 Pro and Flash are transforming robotics by enhancing coding, reasoning, and multimodal capabilities, including spatial understanding. These models are used for semantic scene understanding, code generation for robot control, and building interactive applications with the Live API, with a strong emphasis on safety improvements and community applications.

  • JUNE 10, 2025 / Pay

    Google Pay inside sandboxed iframe for PCI DSS v4 compliance

    Use a sandboxed iframe to implement Google Pay on checkout pages, which helps comply with PCI DSS v4 requirements by isolating scripts. Shopify successfully implemented this method and passed the PCI DSS v4 audit.

  • MAY 20, 2025 / Gemma

    Announcing Gemma 3n preview: powerful, efficient, mobile-first AI

    Gemma 3n is a cutting-edge open model designed for fast, multimodal AI on devices. It features optimized performance, the flexibility of a 2-in-1 model, and expanded multimodal understanding with audio, so developers can build live, interactive applications and sophisticated audio-centric experiences.

  • MAY 20, 2025 / AI Edge

    LiteRT: Maximum performance, simplified

    LiteRT has been improved to boost AI model performance and efficiency on mobile devices by making effective use of GPUs and NPUs. It now requires significantly less code and simplifies hardware accelerator selection, among other changes aimed at optimal on-device performance.

  • MAY 9, 2025 / DeepMind

    Advancing the frontier of video understanding with Gemini 2.5

    Gemini 2.5 marks a major leap in video understanding: it achieves state-of-the-art performance on key video understanding benchmarks and can seamlessly use audio-visual information with code and other data formats.

  • APRIL 30, 2025 / Gemma

    Gemma explained: What’s new in Gemma 3

    Gemma 3's new features include vision-language capabilities and architectural changes for improved memory efficiency and longer context handling compared to previous Gemma models.
