Search

8 results

  • MAY 19, 2026 / Mobile

    Blazing fast on-device GenAI with LiteRT-LM

    Google AI Edge’s LiteRT-LM is a production-proven, optimized runtime for running Gemma 4 on mobile and edge platforms. It exposes the model's native multimodal and agentic features on-device through memory-efficient dynamic loading, Multi-Token Prediction for up to a 2.2x speedup, and orchestration tools such as Thinking Mode and Constrained Decoding. The engine is also expanding beyond Android, with new native Swift APIs for Apple platforms and WebGPU-accelerated JavaScript APIs for serverless, in-browser inference.

  • SEPT. 9, 2025 / Mobile

    Google AI Edge Gallery: Now with audio and on Google Play

    Google AI Edge has expanded the Gemma 3n preview to include audio support. Users can try it on their own phones using the Google AI Edge Gallery, which is now available in Open Beta on the Play Store.

  • AUG. 27, 2025 / Google Labs

    Stop “vibe testing” your LLMs. It's time for real evals.

    Stax, an experimental developer tool, replaces ad hoc "vibe testing" with a streamlined LLM evaluation lifecycle, letting users rigorously test their AI stack and make data-driven decisions through human labeling and scalable LLM-as-a-judge auto-raters.

  • MARCH 25, 2025 / Gemini

    Introducing TxGemma: Open models to improve therapeutics development

    Google DeepMind releases TxGemma, built on Gemma, which predicts therapeutic properties, and Agentic-Tx, powered by Gemini 2.0 Pro, which tackles complex research problem-solving with advanced tools.

  • MARCH 7, 2025 / Gemini

    State-of-the-art text embedding via the Gemini API

    A new experimental Gemini Embedding text model, now available in the Gemini API, achieves top rankings on the Massive Text Embedding Benchmark (MTEB) leaderboard and offers expanded language support and high-dimensional embeddings.

  • NOV. 11, 2024 / Chrome Web

    Web AI Summit 2024 Recap: Client-Side AI for Developers

    The first Web AI Summit, hosted by Google on October 18, 2024, brought together experts in machine learning models for web browsers.

  • OCT. 30, 2024 / Gemini

    Bringing AI Agents to production with Gemini API

    AgentOps uses the Gemini API to provide cost-effective and powerful LLM-powered agent observability for enterprises.

  • MAY 22, 2023 / AI

    Using Generative AI for Travel Inspiration and Discovery

    Google’s Partner Innovation team is developing a series of Generative AI templates showcasing the po...
