Posts by Tianshu Bao

1 result

  • SEPT. 30, 2025 / AI

    Introducing Tunix: A JAX-Native Library for LLM Post-Training

    Tunix is a new JAX-native, open-source library for LLM post-training. It offers comprehensive tools for aligning models at scale, including SFT, preference tuning (DPO), advanced RL methods (PPO, GRPO, GSPO), and knowledge distillation. Designed for TPUs and seamless JAX integration, Tunix emphasizes developer control and shows a 12% relative improvement in pass@1 accuracy on GSM8K.
