Posts by Danny McCormick

1 result

  • NOV 13, 2024 / Gemma

    Inference with Gemma using Dataflow and vLLM

    vLLM's continuous batching and Dataflow's model manager optimize LLM serving and simplify deployment, giving developers a powerful combination for building high-performance LLM inference pipelines more efficiently.
