v0.2.0-rc.1
Pre-release
Pre-release
What's Changed
- Add envoy gateway streaming support by @varungup90 in #377
- Add client traffic policy to increase per connection buffer size from 32kb to 256kb by @varungup90 in #395
- Misc: add support to metricsSources property of podautoscaler by @zhangjyr in #371
- [Misc] Update runtime server startup command in v0.1.0 by @brosoul in #396
- [CI] improve the ci efficiency by parallelizing the build tasks by @nwangfw in #398
- Fix the ticker interval by removing unnecessary ms by @Jeffwan in #415
- [Misc] Disable specific endpoints logs by @Jeffwan in #418
- [CI] Github Action trigger condition optimized for cost saving by @nwangfw in #411
- [Misc] Fix the mocked app role permission issue by @Jeffwan in #416
- [CI] Nightly tag removed for release branch by @nwangfw in #422
- Enable setting PodAutoscaler configuration via YAML labels by @kr11 in #409
- Update manifest to adopt v0.1.1 images by @Jeffwan in #429
- [Bug]: duplicated http in rest metrics fetcher (#408) by @zhangjyr in #421
- [MISC]: Improve Request Trace Granularity with Version Control by @zhangjyr in #431
- Support histogram metrics from engine in cache by @Jeffwan in #424
- Support fetching metrics from remote Prometheus server by @Jeffwan in #433
- [CI] Add python wheel to release artifact by @Jeffwan in #434
- Fix update cache pod issue and refactor updatePod handler by @Jeffwan in #439
- Extract common metrics structure to types and utils by @Jeffwan in #438
- Fix gateway startup issue due to missing prometheus config by @Jeffwan in #441
- [feat]: GPU Optimizer and Simulator development app by @zhangjyr in #430
- Add selectrandom fallback in routing and only scraping healthy pods by @Jeffwan in #445
- AIBrix Workload Generator / Scenario Simulator by @happyandslow in #428
- CrashLoopBackOff status detection in CI by @nwangfw in #444
- Support installing individual controllers from giant controller-manager by @nwangfw in #442
- Refactor Scaler: Resolve Issues with Metric Parameter Updates in Multiple KPAs by @kr11 in #437
- Support metrics multi labels for different models by @brosoul in #450
- Add health check api interface for runtime by @Jeffwan in #451
- Fix the service name override issue in rolebindings by @Jeffwan in #453
- Reorganize docs/development and docs/tutorial structure by @Jeffwan in #455
- Move tools to separate folders and update mocked app README.md by @Jeffwan in #457
- Fix multi models metric result in PromQL by @brosoul in #458
- Support Azure LLM trace in workload generator by @happyandslow in #462
- Fix autoscaler scalingstrategy switching logic by @nwangfw in #475
- Fix missing handle of PromQL scope is PodMetricScope by @brosoul in #479
- [Misc] Consolidate app and simulator by @zhangjyr in #477
- [Bug] Avoid including sensitive info in Dockerfile ENV by @zhangjyr in #487
- Refactor generator to generate time-based traces by @happyandslow in #478
- [CI] Update deploy workload script in installation test by @nwangfw in #499
- [Bug] handle metricKey creation with MetricsSources by @nwangfw in #498
- Adding Client for Workload Generator Workload File by @happyandslow in #501
- [Feat] Integrate deployment configurations and fix autoscaler/gpu optimizer connectivity by @zhangjyr in #500
- Fix some simulator format issue and add some TODOs by @Jeffwan in #505
- [Bug] Fix the way how podautoscaler handle 0 pods. by @zhangjyr in #508
- [Misc] Improve gpu optimizer debugging on podautoscaler. by @zhangjyr in #509
- Optimize kustomize overlay for volcano engine deployment by @Jeffwan in #512
- [perf] Refact tos downloader in Runtime by @brosoul in #510
- Refactor metric source for customized protocol, port and path by @kr11 in #511
- [Bug] Fixed the yaml of deployments in heterogenous GPU settings to make KPA scaling work as expected. by @zhangjyr in #513
- [Misc] Heterogeneous GPU Optimizer Logging Clean Up by @nwangfw in #514
- Fix KPA bug, and an elaborate KPA test case by @kr11 in #515
- Cut v0.2.0-rc.1 release by @Jeffwan in #516
Full Changelog: v0.1.1...v0.2.0-rc.1