Releases · kubernetes-sigs/lws

30 Mar 13:13

kerthcet

v0.6.0

23d2e4c

v0.6.0 Latest

Latest

What's Changed:

🚀 Major Features:

Introduce new component config flag by @ardaguclu in #325
Creating LWS site by @Edwinhr716 in #426
KEP 257: LeaderExcluded SubGroup support by @Edwinhr716 in #428

✨ Features:

Make lws image multi-platform by @yankay in #321
Inject environment variable LWS_WORKER_INDEX by @singeleaf in #416

🐛 Bugs:

Add controllerrevisions rbac to helm chart. by @wqlparallel in #319
Fix for odd numbers of arguments passed as key-value pairs for logging issue by @KeerthanaAP in #360
Fix infinite StatefulSet creation loops by validate leader annotations by @yankay in #394
Change annotation leaderworkerset.gke.io/subgroup-size to leaderworkerset.sigs.k8s.io/subgroup-size by @yankay in #434
ignore 0 replica check by @congcongke in #439

♻️ Cleanups:

Remove pods create permission by @wqlparallel in #323
add junit and artifacts directory for e2e tests so testgrid will disp… by @kannon92 in #335
Pass ARTIFACTS env variable when building e2e tests by @Edwinhr716 in #345
Add adopters, integrations, talks by @kerthcet in #367
Deprecate flags as the recommended path is to pass configuration file by @ardaguclu in #361
doc: Add terraform based EKS Blueprints pattern for multi-node inference by @askulkarni2 in #373
Add missing external types to apply configurations by @astefanutti in #374
Ignore LeaderElectionNamespace in cmd unit tests as it is dynamic by @ardaguclu in #383
Allow pass through of additional params to "ray start" in ray_init.sh by @dims in #378
use staging images instead of gcr by @kannon92 in #398
Add files to publish helm chart by @Edwinhr716 in #411
fix image build toolchain errors by @kannon92 in #424
Run controller-gen only for api and pkg directories by @ardaguclu in #417
Add Label leaderworkerset.sigs.k8s.io/name to the headlessService by @yankay in #433
Use create jobs command instead of run pods in e2e testing by @ardaguclu in #445
Export kind and cluster related logs under artifacts directory for e2e by @ardaguclu in #451
Set readOnlyRootFilesystem explicitly to true by @ardaguclu in #457
Check controller_runtime_webhook_requests_total that always exists in both pods by @ardaguclu in #456
Explicitly bind metrics 8443 as containerPort in kustomize and helm by @ardaguclu in #458

📃 Docs:

Adding User Guide for TensorRT-LLM by @Edwinhr716 in #356
Adding User Guide for SGLang by @yankay in #377
[Docs] Use the vllm image instead of building by the user self in the sample by @yankay in #453
[Doc] Add troubleshooting page in site by @yankay in #468

New Contributors

@yankay made their first contribution in #321
@ardaguclu made their first contribution in #325
@DiegoLelis made their first contribution in #354
@KeerthanaAP made their first contribution in #360
@askulkarni2 made their first contribution in #373
@astefanutti made their first contribution in #374
@dims made their first contribution in #378
@singeleaf made their first contribution in #416
@congcongke made their first contribution in #439

Full Changelog: v0.5.0...v0.6.0

Contributors

dims, astefanutti, and 11 other contributors

Assets 4

03 Feb 16:54

Edwinhr716

v0.5.1

3ee2d9a

v0.5.1

What's Changed

Automated cherry pick of #347: Removed update operation from pod webhook by @Edwinhr716 in #348
Automated cherry pick of #342: Added handling of nil revision in pod controller by @Edwinhr716 in #349
Automated cherry pick of #341: Removed restarting group log in pod controller by @Edwinhr716 in #350

Full Changelog: v0.5.0...v0.5.1

Contributors

Edwinhr716

Assets 3

08 Jan 22:36

ahg-g

v0.5.0

2f73269

Release v0.5.0

What's Changed

Update installation guide by @liurupeng in #217
Bump the kubernetes group with 5 updates by @dependabot in #224
Add support to install lws at an arbitrary namespace by @kerthcet in #226
feat: add build arg to docker buildx by @vie-serendipity in #227
Injecting TPU_NAME environment variable by @Edwinhr716 in #209
Creating new workerStatefulSet instead of patching when updating LWS template by @Edwinhr716 in #229
Update KEP #173 to reflect implementation by @Edwinhr716 in #233
Add cherry-pick script by @liurupeng in #235
Add vLLM TPU Multihost example by @Edwinhr716 in #234
Bump the kubernetes group with 5 updates by @dependabot in #241
Bump sigs.k8s.io/controller-runtime from 0.19.0 to 0.19.1 by @dependabot in #242
Changed log level to 2 by @Edwinhr716 in #244
Bump github.com/onsi/ginkgo/v2 from 2.20.2 to 2.21.0 by @dependabot in #245
Bump github.com/onsi/gomega from 1.34.2 to 1.35.1 by @dependabot in #246
change to not use NetworkConfig in hash calculation if it has default value by @Edwinhr716 in #250
Added validation for subdomainPolicy values by @Edwinhr716 in #251
changed cherry pick script to be executable by @Edwinhr716 in #253
add pods finalizer update rbac to allow lws to run in openshift by @kannon92 in #249
Update installation instructions for release 0.4.2 by @ahg-g in #255
Bump github.com/open-policy-agent/cert-controller from 0.11.0 to 0.12.0 by @dependabot in #256
Bump sigs.k8s.io/structured-merge-diff/v4 from 4.4.1 to 4.4.3 by @dependabot in #258
Unify Leader Election Namespace with Deployment Namespace by @wqlparallel in #263
Bump github.com/onsi/gomega from 1.35.1 to 1.36.0 by @dependabot in #271
Added validation for LWS Name by @Edwinhr716 in #272
Updating OWNERS file by @Edwinhr716 in #273
Bump github.com/onsi/ginkgo/v2 from 2.21.0 to 2.22.0 by @dependabot in #269
Bump sigs.k8s.io/controller-runtime from 0.19.1 to 0.19.2 by @dependabot in #270
Bump the kubernetes group with 5 updates by @dependabot in #268
Bump sigs.k8s.io/controller-runtime from 0.19.2 to 0.19.3 by @dependabot in #278
update golang to 1.23 and fix docker warnings by @kannon92 in #279
Use the namespace of the service account by @kannon92 in #282
Bump the kubernetes group with 5 updates by @dependabot in #286
Bump sigs.k8s.io/structured-merge-diff/v4 from 4.4.3 to 4.5.0 by @dependabot in #287
Bump github.com/onsi/gomega from 1.36.0 to 1.36.1 by @dependabot in #288
add doc to say how to install in a different namespace by @kannon92 in #285
Add helm charts for lws. by @wqlparallel in #266
add KEP 238, to add controller revision by @Edwinhr716 in #261
address non linear parsing cve by upgrading golang net by @kannon92 in #290
Bump github.com/onsi/ginkgo/v2 from 2.22.0 to 2.22.1 by @dependabot in #292
Remove namespace arg from manager command by @chiragjn in #293
Add Controller Revision (Implementation of KEP #238) by @Edwinhr716 in #277
Bump github.com/onsi/gomega from 1.36.1 to 1.36.2 by @dependabot in #295
Tests for Controller Revision by @Edwinhr716 in #297
Minimize number of unnecessary logs by @Edwinhr716 in #298
Update Reference Doc by @Edwinhr716 in #300
Updating owners by @Edwinhr716 in #301
Add Events by @Edwinhr716 in #299
Bump github.com/onsi/ginkgo/v2 from 2.22.1 to 2.22.2 by @dependabot in #303
Replaced Kueue with LWS on chart/lws ReadMe by @Edwinhr716 in #305
Ignore not found error when handling ReacreateGroupOnPodRestart by @Edwinhr716 in #304
Replace kube-rbac-proxy by @anshuman-agarwala in #284
Fixed flaky ControllerRevision test and add CR events by @Edwinhr716 in #306
Fixed metrics service port, added log level flag back by @Edwinhr716 in #307
Removed kube-rbac-proxy container from helm charts by @Edwinhr716 in #308

New Contributors

@vie-serendipity made their first contribution in #227
@wqlparallel made their first contribution in #263
@chiragjn made their first contribution in #293
@anshuman-agarwala made their first contribution in #284

Full Changelog: v0.4.0...v0.5.0

Contributors

kannon92, liurupeng, and 8 other contributors

Assets 3

09 Nov 03:30

ahg-g

v0.4.2

d620512

0.4.2

What's Changed

Automated cherry pick of #250: change to not use NetworkConfig in hash calculation if it has default value by @Edwinhr716 in #252

Full Changelog: v0.4.1...v0.4.2

Contributors

Edwinhr716

Assets 3

18 Oct 20:23

liurupeng

v0.4.1

58c0ac6

LeaderWorkerSet Patch release 0.4.1

This is a patch release for v0.4.0

Assets 3

14 Sep 00:09

liurupeng

v0.4.0

edc9eac

Leaderworkerset v0.4.0

Features:

Support specify NetworkConfig
Support leader elect for lws controller
Add group size as an environment variable
Add an example for using llama.cpp to deploy a distributed inference service
Update the GPU multi-node inference with vLLM example to serve Llama3.1-405b model