-
Notifications
You must be signed in to change notification settings - Fork 14.6k
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Merge pull request #15337 from kubernetes/dev-1.16
Official 1.16 Release Docs
- Loading branch information
Showing
220 changed files
with
74,093 additions
and
3,472 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,62 @@ | ||
--- | ||
reviewers: | ||
- dchen1107 | ||
- egernst | ||
- tallclair | ||
title: Pod Overhead | ||
content_template: templates/concept | ||
weight: 20 | ||
--- | ||
|
||
{{% capture overview %}} | ||
|
||
{{< feature-state for_k8s_version="v1.16" state="alpha" >}} | ||
|
||
|
||
When you run a Pod on a Node, the Pod itself takes an amount of system resources. These | ||
resources are additional to the resources needed to run the container(s) inside the Pod. | ||
_Pod Overhead_ is a feature for accounting for the resources consumed by the pod infrastructure | ||
on top of the container requests & limits. | ||
|
||
|
||
{{% /capture %}} | ||
|
||
|
||
{{% capture body %}} | ||
|
||
## Pod Overhead | ||
|
||
In Kubernetes, the pod's overhead is set at | ||
[admission](https://kubernetes.io/docs/reference/access-authn-authz/extensible-admission-controllers/#what-are-admission-webhooks) | ||
time according to the overhead associated with the pod's | ||
[RuntimeClass](https://kubernetes.io/docs/concepts/containers/runtime-class/). | ||
|
||
When Pod Overhead is enabled, the overhead is considered in addition to the sum of container | ||
resource requests when scheduling a pod. Similarly, Kubelet will include the pod overhead when sizing | ||
the pod cgroup, and when carrying out pod eviction ranking. | ||
|
||
### Set Up | ||
|
||
You need to make sure that the `PodOverhead` | ||
[feature gate](/docs/reference/command-line-tools-reference/feature-gates/) is enabled (it is off by default) | ||
across your cluster. This means: | ||
|
||
- in {{< glossary_tooltip text="kube-scheduler" term_id="kube-scheduler" >}} | ||
- in {{< glossary_tooltip text="kube-apiserver" term_id="kube-apiserver" >}} | ||
- in the {{< glossary_tooltip text="kubelet" term_id="kubelet" >}} on each Node | ||
- in any custom API servers that use feature gates | ||
|
||
{{< note >}} | ||
Users who can write to RuntimeClass resources are able to have cluster-wide impact on | ||
workload performance. You can limit access to this ability using Kubernetes access controls. | ||
See [Authorization Overview](/docs/reference/access-authn-authz/authorization/) for more details. | ||
{{< /note >}} | ||
|
||
{{% /capture %}} | ||
|
||
{{% capture whatsnext %}} | ||
|
||
* [RuntimeClass](/docs/concepts/containers/runtime-class/) | ||
* [PodOverhead Design](https://github.com/kubernetes/enhancements/blob/master/keps/sig-node/20190226-pod-overhead.md) | ||
|
||
{{% /capture %}} |
197 changes: 197 additions & 0 deletions
197
content/en/docs/concepts/configuration/resource-bin-packing.md
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,197 @@ | ||
--- | ||
reviewers: | ||
- bsalamat | ||
- k82cn | ||
- ahg-g | ||
title: Resource Bin Packing for Extended Resources | ||
content_template: templates/concept | ||
weight: 10 | ||
--- | ||
|
||
{{% capture overview %}} | ||
|
||
{{< feature-state for_k8s_version="1.16" state="alpha" >}} | ||
|
||
The kube-scheduler can be configured to enable bin packing of resources along with extended resources using `RequestedToCapacityRatioResourceAllocation` priority function. Priority functions can be used to fine-tune the kube-scheduler as per custom needs. | ||
|
||
{{% /capture %}} | ||
|
||
{{% capture body %}} | ||
|
||
## Enabling Bin Packing using RequestedToCapacityRatioResourceAllocation | ||
|
||
Before Kubernetes 1.15, Kube-scheduler used to allow scoring nodes based on the request to capacity ratio of primary resources like CPU and Memory. Kubernetes 1.16 added a new parameter to the priority function that allows the users to specify the resources along with weights for each resource to score nodes based on the request to capacity ratio. This allows users to bin pack extended resources by using appropriate parameters improves the utilization of scarce resources in large clusters. The behavior of the `RequestedToCapacityRatioResourceAllocation` priority function can be controlled by a configuration option called `requestedToCapacityRatioArguments`. This argument consists of two parameters `shape` and `resources`. Shape allows the user to tune the function as least requested or most requested based on `utilization` and `score` values. Resources | ||
consists of `name` which specifies the resource to be considered during scoring and `weight` specify the weight of each resource. | ||
|
||
Below is an example configuration that sets `requestedToCapacityRatioArguments` to bin packing behavior for extended resources `intel.com/foo` and `intel.com/bar` | ||
|
||
```json | ||
{ | ||
"kind" : "Policy", | ||
"apiVersion" : "v1", | ||
|
||
... | ||
|
||
"priorities" : [ | ||
|
||
... | ||
|
||
{ | ||
"name": "RequestedToCapacityRatioPriority", | ||
"weight": 2, | ||
"argument": { | ||
"requestedToCapacityRatioArguments": { | ||
"shape": [ | ||
{"utilization": 0, "score": 0}, | ||
{"utilization": 100, "score": 10} | ||
], | ||
"resources": [ | ||
{"name": "intel.com/foo", "weight": 3}, | ||
{"name": "intel.com/bar", "weight": 5} | ||
] | ||
} | ||
} | ||
} | ||
], | ||
} | ||
``` | ||
|
||
**This feature is disabled by default** | ||
|
||
### Tuning RequestedToCapacityRatioResourceAllocation Priority Function | ||
|
||
`shape` is used to specify the behavior of the `RequestedToCapacityRatioPriority` function. | ||
|
||
```yaml | ||
{"utilization": 0, "score": 0}, | ||
{"utilization": 100, "score": 10} | ||
``` | ||
|
||
The above arguments give the node a score of 0 if utilization is 0% and 10 for utilization 100%, thus enabling bin packing behavior. To enable least requested the score value must be reversed as follows. | ||
|
||
```yaml | ||
{"utilization": 0, "score": 100}, | ||
{"utilization": 100, "score": 0} | ||
``` | ||
|
||
`resources` is an optional parameter which by defaults is set to: | ||
|
||
``` yaml | ||
"resources": [ | ||
{"name": "CPU", "weight": 1}, | ||
{"name": "Memory", "weight": 1} | ||
] | ||
``` | ||
|
||
It can be used to add extended resources as follows: | ||
|
||
```yaml | ||
"resources": [ | ||
{"name": "intel.com/foo", "weight": 5}, | ||
{"name": "CPU", "weight": 3}, | ||
{"name": "Memory", "weight": 1} | ||
] | ||
``` | ||
|
||
The weight parameter is optional and is set to 1 if not specified. Also, the weight cannot be set to a negative value. | ||
|
||
### How the RequestedToCapacityRatioResourceAllocation Priority Function Scores Nodes | ||
|
||
This section is intended for those who want to understand the internal details | ||
of this feature. | ||
Below is an example of how the node score is calculated for a given set of values. | ||
|
||
``` | ||
Requested Resources | ||
intel.com/foo : 2 | ||
Memory: 256MB | ||
CPU: 2 | ||
Resource Weights | ||
intel.com/foo : 5 | ||
Memory: 1 | ||
CPU: 3 | ||
FunctionShapePoint {{0, 0}, {100, 10}} | ||
Node 1 Spec | ||
Available: | ||
intel.com/foo : 4 | ||
Memory : 1 GB | ||
CPU: 8 | ||
Used: | ||
intel.com/foo: 1 | ||
Memory: 256MB | ||
CPU: 1 | ||
Node Score: | ||
intel.com/foo = resourceScoringFunction((2+1),4) | ||
= (100 - ((4-3)*100/4) | ||
= (100 - 25) | ||
= 75 | ||
= rawScoringFunction(75) | ||
= 7 | ||
Memory = resourceScoringFunction((256+256),1024) | ||
= (100 -((1024-512)*100/1024)) | ||
= 50 | ||
= rawScoringFunction(50) | ||
= 5 | ||
CPU = resourceScoringFunction((2+1),8) | ||
= (100 -((8-3)*100/8)) | ||
= 37.5 | ||
= rawScoringFunction(37.5) | ||
= 3 | ||
NodeScore = (7 * 5) + (5 * 1) + (3 * 3) / (5 + 1 + 3) | ||
= 5 | ||
Node 2 Spec | ||
Available: | ||
intel.com/foo: 8 | ||
Memory: 1GB | ||
CPU: 8 | ||
Used: | ||
intel.com/foo: 2 | ||
Memory: 512MB | ||
CPU: 6 | ||
Node Score: | ||
intel.com/foo = resourceScoringFunction((2+2),8) | ||
= (100 - ((8-4)*100/8) | ||
= (100 - 25) | ||
= 50 | ||
= rawScoringFunction(50) | ||
= 5 | ||
Memory = resourceScoringFunction((256+512),1024) | ||
= (100 -((1024-768)*100/1024)) | ||
= 75 | ||
= rawScoringFunction(75) | ||
= 7 | ||
CPU = resourceScoringFunction((2+6),8) | ||
= (100 -((8-8)*100/8)) | ||
= 100 | ||
= rawScoringFunction(100) | ||
= 10 | ||
NodeScore = (5 * 5) + (7 * 1) + (10 * 3) / (5 + 1 + 3) | ||
= 7 | ||
``` | ||
|
||
{{% /capture %}} |
Oops, something went wrong.