  • MAY 20, 2025 / Gemini

    Building agents with Google Gemini and open source frameworks

    Google Gemini models offer several advantages when building AI agents, such as advanced reasoning, function calling, multimodality, and large context window capabilities. Open-source frameworks like LangGraph, CrewAI, LlamaIndex, and Composio can be used with Gemini for agent development.

  • MAY 20, 2025 / Android

    What you should know from the Google I/O 2025 Developer keynote

    Top announcements from Google I/O 2025 center on building across Google platforms and innovating with AI models from Google DeepMind, with new tools, APIs, and features designed to enhance developer productivity and create AI-powered experiences using Gemini, Android, Firebase, and the web.

  • MAY 20, 2025 / Gemini

    From idea to app: Introducing Stitch, a new way to design UIs

    Stitch, a new Google Labs experiment, uses AI to generate UI designs and frontend code from text prompts and images. It aims to streamline the design and development workflow with features such as UI generation from natural language or images, rapid iteration, and a seamless transition to Figma (via paste) or frontend code.

  • MAY 9, 2025 / Cloud

    Google AI for game developers

    Revisit announcements from this year's Game Developers Conference (GDC). Explore how Gemma and Gemini models can help build AI experiences in games, covering the launch of Gemma 3, the Unity plugin, its application in a sample game, and scaling games with generative AI in Google Cloud.

  • MAY 9, 2025 / DeepMind

    Advancing the frontier of video understanding with Gemini 2.5

    Gemini 2.5 marks a major leap in video understanding, achieving state-of-the-art performance on key video understanding benchmarks and seamlessly combining audio-visual information with code and other data formats.

  • MAY 8, 2025 / Gemini

    Gemini 2.5 Models now support implicit caching

    Implicit caching is now rolling out in the Gemini API, expanding on the existing explicit caching API. It provides an "always on" caching system that gives automatic cost savings to developers using Gemini 2.5 models, while the explicit caching API remains available for guaranteed savings.

  • MAY 7, 2025 / Gemini

    Create and edit images with Gemini 2.0 in preview

    Gemini 2.0 Flash's image generation capabilities, now available in preview in Google AI Studio and Vertex AI, feature higher rate limits, enhanced visual quality, more precise text rendering, and more, allowing developers to create applications for product recontextualization, collaborative image editing, and dynamic SKU generation.

  • MAY 6, 2025 / Gemini

    Gemini 2.5 Pro Preview: even better coding performance

    An updated I/O edition preview of Gemini 2.5 Pro is being released for developers, featuring best-in-class front-end and UI development performance, ranking #1 on the WebDev Arena leaderboard, and showcasing applications like video to code and easier feature development through starter apps.

  • APRIL 29, 2025 / Gemini

    How It’s Made: Little Language Lessons uses Gemini’s multilingual capabilities to personalize language learning

    Little Language Lessons is a project that uses the Gemini API and Cloud services to generate content, translate, and provide text-to-speech. It includes vocabulary lessons, slang practice, and object recognition for language learning.

  • APRIL 23, 2025 / Gemini

    Achieve real-time interaction: Build with the Live API

    Explore real-world applications of the Live API for Gemini models, now updated with enhanced features for real-time audio, video, and text processing, improved session management, control over interactions, and richer output options.
