Releases: kubernetes-sigs/lws

v0.6.0

30 Mar 13:13
v0.6.0
23d2e4c

What's Changed:

🚀 Major Features:

Features:

🐛 Bugs:

  • Add controllerrevisions rbac to helm chart. by @wqlparallel in #319
  • Fix logging issue caused by an odd number of arguments passed as key-value pairs by @KeerthanaAP in #360
  • Fix infinite StatefulSet creation loops by validating leader annotations by @yankay in #394
  • Change annotation leaderworkerset.gke.io/subgroup-size to leaderworkerset.sigs.k8s.io/subgroup-size by @yankay in #434
  • Ignore 0 replica check by @congcongke in #439
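
Because #434 renames the subgroup-size annotation, tooling that still reads or writes the old `leaderworkerset.gke.io/subgroup-size` key may need updating. A minimal sketch of such a migration, assuming object metadata is held as a plain Python dict; the helper name `migrate_subgroup_annotation` is hypothetical and not part of lws:

```python
OLD_KEY = "leaderworkerset.gke.io/subgroup-size"
NEW_KEY = "leaderworkerset.sigs.k8s.io/subgroup-size"


def migrate_subgroup_annotation(metadata: dict) -> dict:
    """Return a copy of `metadata` with the old annotation key renamed.

    Hypothetical helper for illustration only. If the new key is already
    present, its value is kept and the old key is simply dropped.
    """
    annotations = dict(metadata.get("annotations") or {})
    if OLD_KEY in annotations:
        old_value = annotations.pop(OLD_KEY)
        annotations.setdefault(NEW_KEY, old_value)
    # Build a new dict so the caller's metadata is left untouched.
    return {**metadata, "annotations": annotations}
```

The function copies rather than mutates, so it can be applied to cached objects without side effects.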

♻️ Cleanups:

  • Remove pods create permission by @wqlparallel in #323
  • add junit and artifacts directory for e2e tests so testgrid will disp… by @kannon92 in #335
  • Pass ARTIFACTS env variable when building e2e tests by @Edwinhr716 in #345
  • Add adopters, integrations, talks by @kerthcet in #367
  • Deprecate flags, as the recommended path is to pass a configuration file by @ardaguclu in #361
  • doc: Add Terraform-based EKS Blueprints pattern for multi-node inference by @askulkarni2 in #373
  • Add missing external types to apply configurations by @astefanutti in #374
  • Ignore LeaderElectionNamespace in cmd unit tests as it is dynamic by @ardaguclu in #383
  • Allow pass through of additional params to "ray start" in ray_init.sh by @dims in #378
  • Use staging images instead of gcr by @kannon92 in #398
  • Add files to publish helm chart by @Edwinhr716 in #411
  • Fix image build toolchain errors by @kannon92 in #424
  • Run controller-gen only for api and pkg directories by @ardaguclu in #417
  • Add Label leaderworkerset.sigs.k8s.io/name to the headlessService by @yankay in #433
  • Use create jobs command instead of run pods in e2e testing by @ardaguclu in #445
  • Export kind and cluster related logs under artifacts directory for e2e by @ardaguclu in #451
  • Set readOnlyRootFilesystem explicitly to true by @ardaguclu in #457
  • Check controller_runtime_webhook_requests_total, which always exists in both pods by @ardaguclu in #456
  • Explicitly bind metrics 8443 as containerPort in kustomize and helm by @ardaguclu in #458

📃 Docs:

  • Adding User Guide for TensorRT-LLM by @Edwinhr716 in #356
  • Adding User Guide for SGLang by @yankay in #377
  • [Docs] Use the vLLM image in the sample instead of having users build it themselves by @yankay in #453
  • [Doc] Add troubleshooting page in site by @yankay in #468

New Contributors

Full Changelog: v0.5.0...v0.6.0

v0.5.1

03 Feb 16:54
v0.5.1
3ee2d9a

What's Changed

  • Automated cherry pick of #347: Removed update operation from pod webhook by @Edwinhr716 in #348
  • Automated cherry pick of #342: Added handling of nil revision in pod controller by @Edwinhr716 in #349
  • Automated cherry pick of #341: Removed restarting group log in pod controller by @Edwinhr716 in #350

Full Changelog: v0.5.0...v0.5.1

Release v0.5.0

08 Jan 22:36
v0.5.0

What's Changed

New Contributors

Full Changelog: v0.4.0...v0.5.0

0.4.2

09 Nov 03:30
v0.4.2
d620512

What's Changed

  • Automated cherry pick of #250: Do not use NetworkConfig in hash calculation if it has the default value by @Edwinhr716 in #252

Full Changelog: v0.4.1...v0.4.2

LeaderWorkerSet Patch release 0.4.1

18 Oct 20:23
v0.4.1

This is a patch release for v0.4.0

Leaderworkerset v0.4.0

14 Sep 00:09
v0.4.0
edc9eac

Features:

  • Support specifying NetworkConfig
  • Support leader election for the lws controller
  • Add group size as an environment variable
  • Add an example for using llama.cpp to deploy a distributed inference service
  • Update the GPU multi-node inference with vLLM example to serve the Llama3.1-405b model
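
With the group size exposed as an environment variable, a container can size its distributed runtime without hard-coding the value. A minimal sketch of reading it together with the leader address, assuming the variables are named `LWS_GROUP_SIZE` and `LWS_LEADER_ADDRESS` (names to be checked against the lws documentation); the helper `read_lws_env` is hypothetical:

```python
import os
from typing import Mapping


def read_lws_env(env: Mapping[str, str] = os.environ) -> dict:
    """Collect the group settings a pod in a LeaderWorkerSet might need.

    The variable names are assumptions for illustration; a KeyError is
    raised if either one is missing from `env`.
    """
    return {
        "leader_address": env["LWS_LEADER_ADDRESS"],
        "group_size": int(env["LWS_GROUP_SIZE"]),
    }
```

Passing a mapping explicitly, as in `read_lws_env({"LWS_LEADER_ADDRESS": "leader.example", "LWS_GROUP_SIZE": "4"})`, makes the helper easy to exercise outside a cluster.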

What's Changed

New Contributors

Full Changelog: v0.3.0...v0.4.0

Leaderworkerset v0.3.0

04 Jun 20:42
v0.3.0
f55ce01

Features:

  • RollingUpdate with MaxSurge support
  • Subgroup support for disaggregated serving
  • Example for multi-node serving of llama 70B on GPUs with vLLM
  • Add a new start policy API
  • Inject leader address environment variable to every container
  • Spec.rolloutStrategy should be a non-required field
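
To illustrate the rolling update feature, here is a minimal sketch that builds a `rolloutStrategy` fragment as a Python dict. The field names (`type`, `rollingUpdateConfiguration`, `maxUnavailable`, `maxSurge`) reflect my understanding of the LeaderWorkerSet API and should be verified against the API reference; the helper `rolling_update_strategy` is hypothetical:

```python
def rolling_update_strategy(max_unavailable, max_surge) -> dict:
    """Build a rolloutStrategy fragment for a LeaderWorkerSet spec.

    Values are passed through unchanged; whether percent strings such as
    "25%" are accepted alongside integers is an assumption to verify.
    """
    return {
        "type": "RollingUpdate",
        "rollingUpdateConfiguration": {
            "maxUnavailable": max_unavailable,
            "maxSurge": max_surge,
        },
    }
```

The result would be placed under `spec.rolloutStrategy` in a manifest, for example `{"spec": {"rolloutStrategy": rolling_update_strategy(1, 1)}}`.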

Acknowledgments

Thanks to our contributors in this release, in alphabetical order:
@ahg-g @Edwinhr716 @googs1025 @gujingit @jjk-g @kerthcet @liurupeng @nayihz

Leaderworkerset v0.2.0

19 Apr 18:47
78268be

Features:

  • Support RollingUpdate with MaxUnavailable
  • Allow Prometheus to scrape metrics collected by controller-runtime
  • Fix TPU env var assignment when leader pod doesn't request TPU
  • User guide to deploy multi-host inference with Saxml
  • Increase qps limit for pod scheduling
  • Setup E2E test and improve test coverage

Acknowledgments

Thanks to our contributors in this release, in alphabetical order:
@ahg-g @Bslabe123 @Edwinhr716 @googs1025 @kannon92 @kerthcet @liurupeng @nayihz @Zeel-Patel

Leaderworkerset v0.1.0

13 Mar 02:51
v0.1.0
652405d

Features:

  • Support creating groups of pods as a unit
  • Support dual-template, one for leader and one for the workers
  • Support autoscaling through HPA
  • Support topology-aware placement
  • Support all-or-nothing restart for failure handling
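
The dual-template and group-as-a-unit features can be pictured with a small manifest. A minimal sketch that assembles one as a Python dict, assuming the `leaderworkerset.x-k8s.io/v1` API with `leaderWorkerTemplate.size`, `leaderTemplate`, `workerTemplate`, and the `RecreateGroupOnPodRestart` restart policy (verify these against the API reference for the release in use); the helper names and image names are hypothetical:

```python
def pod_template(image: str) -> dict:
    """Return a bare pod template with a single container."""
    return {"spec": {"containers": [{"name": "main", "image": image}]}}


def leader_worker_set(name: str, replicas: int, size: int,
                      leader_image: str, worker_image: str) -> dict:
    """Build a LeaderWorkerSet manifest with separate leader and worker templates.

    `replicas` is the number of groups; `size` is the number of pods per
    group, counting the leader. Field names are assumptions to verify.
    """
    return {
        "apiVersion": "leaderworkerset.x-k8s.io/v1",
        "kind": "LeaderWorkerSet",
        "metadata": {"name": name},
        "spec": {
            "replicas": replicas,
            "leaderWorkerTemplate": {
                "size": size,
                # Assumed policy name for all-or-nothing group restarts.
                "restartPolicy": "RecreateGroupOnPodRestart",
                "leaderTemplate": pod_template(leader_image),
                "workerTemplate": pod_template(worker_image),
            },
        },
    }
```

Serializing the result with `json.dumps` or a YAML library would give a manifest that could be reviewed before applying it to a cluster.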

Acknowledgments

Thanks to our contributors in this release, in no particular order:
@liurupeng @Edwinhr716 @kerthcet @ahg-g