Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Supporting varying number of pods in radix-tree cache data structure. #681

Open
gangmuk opened this issue Feb 15, 2025 · 0 comments
Open
Assignees
Labels
area/gateway kind/enhancement New feature or request priority/important-longterm Important over the long term, but may not be staffed and/or may need multiple releases to complete.
Milestone

Comments

@gangmuk
Copy link
Collaborator

gangmuk commented Feb 15, 2025

🚀 Feature Description and Motivation

Currently, the radix tree cache does not support varying number of GPUs (pods). The corresponding tree nodes in RadixTree should be updated accordingly in real time with varying number of pods.

WARNING. I am worried about potential race condition between reflecting running pods in radixtree data structure and router makes decision based on outdated radixtree. It is more than performance. let's say a previously running pod is being terminated and the radix tree has been updated late. And the routing policy returns terminating pod as best prefix matching pod. I wonder how it is done in the current hash based prefix aware routing.

Use Case

Prefix and load aware routing

Proposed Solution

No response

@Jeffwan Jeffwan added this to the v0.3.0 milestone Feb 15, 2025
@Jeffwan Jeffwan added kind/enhancement New feature or request area/gateway priority/important-longterm Important over the long term, but may not be staffed and/or may need multiple releases to complete. labels Feb 15, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
area/gateway kind/enhancement New feature or request priority/important-longterm Important over the long term, but may not be staffed and/or may need multiple releases to complete.
Projects
None yet
Development

No branches or pull requests

2 participants