v0.2.0

Closed Feb 17, 2025 · 98% complete

This release focuses on more advanced features:

  • Distributed and Disaggregated Inference
  • Distributed KV Cache
  • Cost-efficient Heterogeneous Placement and Routing
  • Cost-efficient Heterogeneous Serving
  • Quality improvements to v0.1.0 features