Search - Google Developers Blog

Posts by Andrew Zhang

1 result

NOV. 24, 2025 / Mobile

Unlocking Peak Performance on Qualcomm NPU with LiteRT

LiteRT's new Qualcomm AI Engine Direct (QNN) Accelerator unlocks dedicated NPU power for on-device GenAI on Android. It offers a unified mobile deployment workflow, state-of-the-art performance (up to 100x speedup over CPU), and full model delegation. Together, these enable smooth, real-time AI experiences, with FastVLM-0.5B achieving over 11,000 tokens/sec prefill on the Snapdragon 8 Elite Gen 5 NPU.
