v0.2.0-rc.1

Pre-release
@Jeffwan Jeffwan released this 10 Dec 20:16
· 110 commits to main since this release
0d40fbd

What's Changed

  • Add envoy gateway streaming support by @varungup90 in #377
  • Add client traffic policy to increase per-connection buffer size from 32 KB to 256 KB by @varungup90 in #395
  • Misc: add support to metricsSources property of podautoscaler by @zhangjyr in #371
  • [Misc] Update runtime server startup command in v0.1.0 by @brosoul in #396
  • [CI] improve the ci efficiency by parallelizing the build tasks by @nwangfw in #398
  • Fix the ticker interval by removing unnecessary ms by @Jeffwan in #415
  • [Misc] Disable specific endpoints logs by @Jeffwan in #418
  • [CI] GitHub Actions trigger condition optimized for cost saving by @nwangfw in #411
  • [Misc] Fix the mocked app role permission issue by @Jeffwan in #416
  • [CI] Nightly tag removed for release branch by @nwangfw in #422
  • Enable setting PodAutoscaler configuration via YAML labels by @kr11 in #409
  • Update manifest to adopt v0.1.1 images by @Jeffwan in #429
  • [Bug]: duplicated http in rest metrics fetcher (#408) by @zhangjyr in #421
  • [MISC]: Improve Request Trace Granularity with Version Control by @zhangjyr in #431
  • Support histogram metrics from engine in cache by @Jeffwan in #424
  • Support fetching metrics from remote Prometheus server by @Jeffwan in #433
  • [CI] Add python wheel to release artifact by @Jeffwan in #434
  • Fix update cache pod issue and refactor updatePod handler by @Jeffwan in #439
  • Extract common metrics structure to types and utils by @Jeffwan in #438
  • Fix gateway startup issue due to missing prometheus config by @Jeffwan in #441
  • [feat]: GPU Optimizer and Simulator development app by @zhangjyr in #430
  • Add selectrandom fallback in routing and only scrape healthy pods by @Jeffwan in #445
  • AIBrix Workload Generator / Scenario Simulator by @happyandslow in #428
  • CrashLoopBackOff status detection in CI by @nwangfw in #444
  • Support installing individual controllers from giant controller-manager by @nwangfw in #442
  • Refactor Scaler: Resolve Issues with Metric Parameter Updates in Multiple KPAs by @kr11 in #437
  • Support metrics multi labels for different models by @brosoul in #450
  • Add health check api interface for runtime by @Jeffwan in #451
  • Fix the service name override issue in rolebindings by @Jeffwan in #453
  • Reorganize docs/development and docs/tutorial structure by @Jeffwan in #455
  • Move tools to separate folders and update mocked app README.md by @Jeffwan in #457
  • Fix multi models metric result in PromQL by @brosoul in #458
  • Support Azure LLM trace in workload generator by @happyandslow in #462
  • Fix autoscaler scalingstrategy switching logic by @nwangfw in #475
  • Fix missing handling of PromQL when scope is PodMetricScope by @brosoul in #479
  • [Misc] Consolidate app and simulator by @zhangjyr in #477
  • [Bug] Avoid including sensitive info in Dockerfile ENV by @zhangjyr in #487
  • Refactor generator to generate time-based traces by @happyandslow in #478
  • [CI] Update deploy workload script in installation test by @nwangfw in #499
  • [Bug] handle metricKey creation with MetricsSources by @nwangfw in #498
  • Adding Client for Workload Generator Workload File by @happyandslow in #501
  • [Feat] Integrate deployment configurations and fix autoscaler/gpu optimizer connectivity by @zhangjyr in #500
  • Fix some simulator format issue and add some TODOs by @Jeffwan in #505
  • [Bug] Fix how podautoscaler handles 0 pods by @zhangjyr in #508
  • [Misc] Improve GPU optimizer debugging on podautoscaler by @zhangjyr in #509
  • Optimize kustomize overlay for volcano engine deployment by @Jeffwan in #512
  • [perf] Refactor TOS downloader in Runtime by @brosoul in #510
  • Refactor metric source for customized protocol, port and path by @kr11 in #511
  • [Bug] Fix the YAML of deployments in heterogeneous GPU settings so KPA scaling works as expected by @zhangjyr in #513
  • [Misc] Heterogeneous GPU Optimizer Logging Clean Up by @nwangfw in #514
  • Fix KPA bug and add an elaborate KPA test case by @kr11 in #515
  • Cut v0.2.0-rc.1 release by @Jeffwan in #516

Full Changelog: v0.1.1...v0.2.0-rc.1