Release v0.2.0-rc.1 · vllm-project/aibrix

What's Changed

Add envoy gateway streaming support by @varungup90 in #377
Add client traffic policy to increase per connection buffer size from 32 KB to 256 KB by @varungup90 in #395
Misc: add support to metricsSources property of podautoscaler by @zhangjyr in #371
[Misc] Update runtime server startup command in v0.1.0 by @brosoul in #396
[CI] Improve CI efficiency by parallelizing the build tasks by @nwangfw in #398
Fix the ticker interval by removing unnecessary ms by @Jeffwan in #415
[Misc] Disable specific endpoints logs by @Jeffwan in #418
[CI] GitHub Actions trigger condition optimized for cost saving by @nwangfw in #411
[Misc] Fix the mocked app role permission issue by @Jeffwan in #416
[CI] Nightly tag removed for release branch by @nwangfw in #422
Enable setting PodAutoscaler configuration via YAML labels by @kr11 in #409
Update manifest to adopt v0.1.1 images by @Jeffwan in #429
[Bug]: duplicated http in rest metrics fetcher (#408) by @zhangjyr in #421
[MISC]: Improve Request Trace Granularity with Version Control by @zhangjyr in #431
Support histogram metrics from engine in cache by @Jeffwan in #424
Support fetching metrics from remote Prometheus server by @Jeffwan in #433
[CI] Add python wheel to release artifact by @Jeffwan in #434
Fix update cache pod issue and refactor updatePod handler by @Jeffwan in #439
Extract common metrics structure to types and utils by @Jeffwan in #438
Fix gateway startup issue due to missing prometheus config by @Jeffwan in #441
[feat]: GPU Optimizer and Simulator development app by @zhangjyr in #430
Add selectrandom fallback in routing and only scraping healthy pods by @Jeffwan in #445
AIBrix Workload Generator / Scenario Simulator by @happyandslow in #428
CrashLoopBackOff status detection in CI by @nwangfw in #444
Support installing individual controllers from giant controller-manager by @nwangfw in #442
Refactor Scaler: Resolve Issues with Metric Parameter Updates in Multiple KPAs by @kr11 in #437
Support metrics multi labels for different models by @brosoul in #450
Add health check api interface for runtime by @Jeffwan in #451
Fix the service name override issue in rolebindings by @Jeffwan in #453
Reorganize docs/development and docs/tutorial structure by @Jeffwan in #455
Move tools to separate folders and update mocked app README.md by @Jeffwan in #457
Fix multi models metric result in PromQL by @brosoul in #458
Support Azure LLM trace in workload generator by @happyandslow in #462
Fix autoscaler scalingstrategy switching logic by @nwangfw in #475
Fix missing handling when PromQL scope is PodMetricScope by @brosoul in #479
[Misc] Consolidate app and simulator by @zhangjyr in #477
[Bug] Avoid including sensitive info in Dockerfile ENV by @zhangjyr in #487
Refactor generator to generate time-based traces by @happyandslow in #478
[CI] Update deploy workload script in installation test by @nwangfw in #499
[Bug] handle metricKey creation with MetricsSources by @nwangfw in #498
Adding Client for Workload Generator Workload File by @happyandslow in #501
[Feat] Integrate deployment configurations and fix autoscaler/gpu optimizer connectivity by @zhangjyr in #500
Fix some simulator format issue and add some TODOs by @Jeffwan in #505
[Bug] Fix how podautoscaler handles 0 pods. by @zhangjyr in #508
[Misc] Improve gpu optimizer debugging on podautoscaler. by @zhangjyr in #509
Optimize kustomize overlay for volcano engine deployment by @Jeffwan in #512
[perf] Refactor TOS downloader in Runtime by @brosoul in #510
Refactor metric source for customized protocol, port and path by @kr11 in #511
[Bug] Fixed the YAML of deployments in heterogeneous GPU settings to make KPA scaling work as expected. by @zhangjyr in #513
[Misc] Heterogeneous GPU Optimizer Logging Clean Up by @nwangfw in #514
Fix KPA bug and add an elaborate KPA test case by @kr11 in #515
Cut v0.2.0-rc.1 release by @Jeffwan in #516

Full Changelog: v0.1.1...v0.2.0-rc.1
