- Google Developers Blog

MAY 28, 2026 / AI

How the community trained Gemma to "Think" with Tunix and TPUs

The Google Tunix Hackathon on Kaggle challenged developers to transform small, non-reasoning base models into general reasoning engines using Kaggle TPUs and a limited compute budget. The winning teams achieved this by implementing multi-stage post-training pipelines that combined Supervised Fine-Tuning (SFT) with advanced alignment techniques like GRPO and SimPO. Ultimately, the competition democratized AI development by proving that highly capable, structured reasoning models can be successfully trained by the community using accessible, open-source resources.

Posts by Tianshu Bao

Content Type

Product

Technology

How the community trained Gemma to "Think" with Tunix and TPUs

Introducing Tunix: A JAX-Native Library for LLM Post-Training

Content Type

Product

Technology