Posts by Weiren Yu

1 results

Clear filters
  • APRIL 16, 2026 / AI

    MaxText Expands Post-Training Capabilities: Introducing SFT and RL on Single-Host TPUs

    MaxText has introduced new support for Supervised Fine-Tuning (SFT) and Reinforcement Learning (RL) on single-host TPU configurations, leveraging JAX and the Tunix library for high-performance model refinement. These features enable developers to easily adapt pre-trained models for specialized tasks and complex reasoning using efficient algorithms like GRPO and GSPO. This update streamlines the post-training workflow, offering a scalable path from single-host setups to larger multi-host configurations.

    Building-1-banner