Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Bump sigs.k8s.io/jobset from 0.5.2 to 0.6.0 #2898

Closed

Conversation

dependabot[bot]
Copy link
Contributor

@dependabot dependabot bot commented on behalf of github Aug 26, 2024

Bumps sigs.k8s.io/jobset from 0.5.2 to 0.6.0.

Release notes

Sourced from sigs.k8s.io/jobset's releases.

v0.6.0

Highlights

  • New JobSet Failure Policy API - allows users to configure different behavior for different types of errors, enabling them to use compute resources more efficiently and improve ML training goodput.
  • Add Coordinator field to JobSet spec, enabling user to define a global coordinator pod for distributed ML/HPC workloads. The stable network endpoint for this pod will be added as a label and annotation to every Job and Pod in the JobSet for easy use in application code. A common use case for this is TPU Multislice training with multiple different Job templates. See linked issue for details.
  • Add global Job index label/annotation to every Job and Pod, which is needed to support TPU Multislice training with multiple different Job templates. See linked issue for details.
  • Added new metrics
  • Improved test coverage
  • Bug fixes
  • New examples and documentation

What's Changed

... (truncated)

Commits
  • d66f1d5 update to k8s 0.30.4 (#654)
  • 226e14c Bump github.com/prometheus/client_golang from 1.19.1 to 1.20.0 (#653)
  • 7886910 Add new job-id annotation to assign globally unique job index to each job (#650)
  • ec39730 Merge pull request #647 from kubernetes-sigs/dependabot/go_modules/github.com...
  • 71194ca Merge pull request #642 from kannon92/fully-label-name
  • a7eba1c add the unparam linter to golangci and fix those issues flagged (#643)
  • e1523db Bump github.com/onsi/ginkgo/v2 from 2.19.1 to 2.20.0
  • f7f3b28 refactor jobset webhook (#646)
  • 8bade1e Merge pull request #644 from mimowo/fix-suspend
  • c89e112 Review remarks
  • Additional commits viewable in compare view

Dependabot compatibility score

Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.


Dependabot commands and options

You can trigger Dependabot actions by commenting on this PR:

  • @dependabot rebase will rebase this PR
  • @dependabot recreate will recreate this PR, overwriting any edits that have been made to it
  • @dependabot merge will merge this PR after your CI passes on it
  • @dependabot squash and merge will squash and merge this PR after your CI passes on it
  • @dependabot cancel merge will cancel a previously requested merge and block automerging
  • @dependabot reopen will reopen this PR if it is closed
  • @dependabot close will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually
  • @dependabot show <dependency name> ignore conditions will show all of the ignore conditions of the specified dependency
  • @dependabot ignore this major version will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)
  • @dependabot ignore this minor version will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)
  • @dependabot ignore this dependency will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)

Bumps [sigs.k8s.io/jobset](https://github.com/kubernetes-sigs/jobset) from 0.5.2 to 0.6.0.
- [Release notes](https://github.com/kubernetes-sigs/jobset/releases)
- [Changelog](https://github.com/kubernetes-sigs/jobset/blob/main/RELEASE.md)
- [Commits](kubernetes-sigs/jobset@v0.5.2...v0.6.0)

---
updated-dependencies:
- dependency-name: sigs.k8s.io/jobset
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <[email protected]>
@dependabot dependabot bot added ok-to-test Indicates a non-member PR verified by an org member that is safe to test. release-note-none Denotes a PR that doesn't merit a release note. labels Aug 26, 2024
@k8s-ci-robot k8s-ci-robot added the cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. label Aug 26, 2024
@k8s-ci-robot
Copy link
Contributor

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: dependabot[bot]
Once this PR has been reviewed and has the lgtm label, please assign mimowo for approval. For more information see the Kubernetes Code Review Process.

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@k8s-ci-robot
Copy link
Contributor

Hi @dependabot[bot]. Thanks for your PR.

I'm waiting for a kubernetes-sigs member to verify that this patch is reasonable to test. If it is, they should reply with /ok-to-test on its own line. Until that is done, I will not automatically test new commits in this PR, but the usual testing commands by org members will still work. Regular contributors should join the org to skip this step.

Once the patch is verified, the new status will be reflected by the ok-to-test label.

I understand the commands that are listed here.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

@k8s-ci-robot k8s-ci-robot added the size/XXL Denotes a PR that changes 1000+ lines, ignoring generated files. label Aug 26, 2024
Copy link

netlify bot commented Aug 26, 2024

Deploy Preview for kubernetes-sigs-kueue ready!

Name Link
🔨 Latest commit afddc79
🔍 Latest deploy log https://app.netlify.com/sites/kubernetes-sigs-kueue/deploys/66cc52010ec0150008f89ad4
😎 Deploy Preview https://deploy-preview-2898--kubernetes-sigs-kueue.netlify.app
📱 Preview on mobile
Toggle QR Code...

QR Code

Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify site configuration.

@k8s-ci-robot
Copy link
Contributor

@dependabot[bot]: The following tests failed, say /retest to rerun all failed tests or /retest-required to rerun all mandatory failed tests:

Test name Commit Details Required Rerun command
pull-kueue-test-scheduling-perf-main afddc79 link true /test pull-kueue-test-scheduling-perf-main
pull-kueue-test-integration-main afddc79 link true /test pull-kueue-test-integration-main
pull-kueue-test-e2e-main-1-28 afddc79 link true /test pull-kueue-test-e2e-main-1-28
pull-kueue-build-image-main afddc79 link true /test pull-kueue-build-image-main
pull-kueue-test-e2e-main-1-31 afddc79 link true /test pull-kueue-test-e2e-main-1-31
pull-kueue-test-multikueue-e2e-main afddc79 link true /test pull-kueue-test-multikueue-e2e-main
pull-kueue-test-e2e-main-1-29 afddc79 link true /test pull-kueue-test-e2e-main-1-29
pull-kueue-test-e2e-main-1-30 afddc79 link true /test pull-kueue-test-e2e-main-1-30
pull-kueue-verify-main afddc79 link true /test pull-kueue-verify-main
pull-kueue-test-unit-main afddc79 link true /test pull-kueue-test-unit-main

Full PR test history. Your PR dashboard. Please help us cut down on flakes by linking to an open issue when you hit one in your PR.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. I understand the commands that are listed here.

@kannon92
Copy link
Contributor

/hold

We are waiting for a K8s bump to merge this.

cc @alculquicondor @tenzen-y

Jobset was updated to use 0.30.

@k8s-ci-robot k8s-ci-robot added the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label Aug 26, 2024
@alculquicondor
Copy link
Contributor

why is that a problem? Kueue is already on k8s 0.30 libraries.

@mbobrovskyi
Copy link
Contributor

why is that a problem? Kueue is already on k8s 0.30 libraries.

No, we still have v0.29.8 (https://github.com/kubernetes-sigs/kueue/blob/main/go.mod#L24).

@alculquicondor
Copy link
Contributor

oh, interestingly, we upgraded apimachinery. Can you upgrade before we upgrade to v1.31?

@mbobrovskyi
Copy link
Contributor

mbobrovskyi commented Aug 26, 2024

oh, interestingly, we upgraded apimachinery. Can you upgrade before we upgrade to v1.31?

Unfortunately, no. We already had an issue that was fixed on v1.31.0 (kubernetes/kubernetes#125499). So that's why was skipped v1.30.x

@kannon92
Copy link
Contributor

kannon92 commented Aug 26, 2024

@mimowo was pushing pretty hard for jobset 0.6.0 for Kueue integration (#2700).

I'm not sure how important this is but we would need this PR for this to work and we depend on jobset 0.6.0 which requires 0.30 APIs for K8s.

I see the issue now. If the Kueue + JobSet bug fix is important, than maybe we consider merging the 0.31 API update before the first patch?

@mimowo
Copy link
Contributor

mimowo commented Sep 2, 2024

k8s 1.31.0 is released, so could we use it in Kueue to unblock bumping JobSet?

@mbobrovskyi
Copy link
Contributor

k8s 1.31.0 is released, so could we use it in Kueue to unblock bumping JobSet?

No, due to #2402 (comment).

@k8s-ci-robot k8s-ci-robot added the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Sep 3, 2024
@k8s-ci-robot
Copy link
Contributor

PR needs rebase.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

@mimowo
Copy link
Contributor

mimowo commented Sep 12, 2024

This should be unblocked after #2402 is merged.

Still, I would suggest to open a dedicated PR to say that JobSet 0.6 is supported in release notes (not sure we can update dependabot's PR with that).

@kannon92
Copy link
Contributor

I opened up a separate PR because this seems to have issues. I can't do the normal dependabot commands.

@kannon92
Copy link
Contributor

/close

@k8s-ci-robot
Copy link
Contributor

@kannon92: Closed this PR.

In response to this:

/close

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

Copy link
Contributor Author

dependabot bot commented on behalf of github Sep 12, 2024

OK, I won't notify you again about this release, but will get in touch when a new version is available. If you'd rather skip all updates until the next major or minor version, let me know by commenting @dependabot ignore this major version or @dependabot ignore this minor version. You can also ignore all major, minor, or patch releases for a dependency by adding an ignore condition with the desired update_types to your config file.

If you change your mind, just re-open this PR and I'll resolve any conflicts on it.

@dependabot dependabot bot deleted the dependabot/go_modules/sigs.k8s.io/jobset-0.6.0 branch September 12, 2024 14:41
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. ok-to-test Indicates a non-member PR verified by an org member that is safe to test. release-note-none Denotes a PR that doesn't merit a release note. size/XXL Denotes a PR that changes 1000+ lines, ignoring generated files.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants