feat(injector): Set probe timeouts based on pod deployment spec #4149

davinci26 · 2021-09-21T18:35:13Z

Fixes #4137

Overall, we aim to maintain the following invariant for all health probes:

Every Envoy timeout (implicit or explicit) >= the timeout specified in
the pod deployment spec.

Changes:

* Remove connect timeouts from all clusters.
* For health probe clusters, set the route timeout equal to the timeout
  set on the Kubernetes pod. This applies only to HTTP routes, since
  plain TCP routes have an infinite timeout.

Signed-off-by: Sotiris Nanopoulos [email protected]

Description:

Testing done:

Affected area:

Functional Area
New Functionality	[ ]
CI System	[ ]
CLI Tool	[ ]
Certificate Management	[ ]
Control Plane	[ ]
Demo	[ ]
Documentation	[ ]
Egress	[ ]
Ingress	[ ]
Install	[ ]
Networking	[ ]
Observability	[ ]
Performance	[ ]
SMI Policy	[ ]
Security	[ ]
Sidecar Injection	[ ]
Tests	[ ]
Upgrade	[ ]
Other	[ ]

Please answer the following questions with yes/no.

Does this change contain code from or inspired by another project?
- Did you notify the maintainers and provide attribution?
Is this a breaking change?

davinci26 · 2021-09-21T18:35:34Z

Creating as a draft to make sure that lint passes, because it times out on my devbox.

codecov-commenter · 2021-09-21T18:47:21Z

Codecov Report

Merging #4149 (1397512) into main (dcb2629) will increase coverage by 0.34%.
The diff coverage is 100.00%.

@@            Coverage Diff             @@
##             main    #4149      +/-   ##
==========================================
+ Coverage   69.47%   69.81%   +0.34%     
==========================================
  Files         210      212       +2     
  Lines       11423    11579     +156     
==========================================
+ Hits         7936     8084     +148     
- Misses       3434     3442       +8     
  Partials       53       53

Flag	Coverage Δ
unittests	`69.81% <100.00%> (+0.34%)`	⬆️

Flags with carried forward coverage won't be shown.

Impacted Files	Coverage Δ
pkg/envoy/cds/cluster.go	`93.20% <100.00%> (-0.20%)`	⬇️
pkg/envoy/cds/tracing.go	`100.00% <100.00%> (ø)`
pkg/injector/envoy_config_health_probes.go	`94.14% <100.00%> (+0.04%)`	⬆️
pkg/injector/health_probes.go	`100.00% <100.00%> (ø)`
pkg/reconciler/mutating_webhook_handler.go	`88.57% <0.00%> (-6.03%)`	⬇️
pkg/reconciler/crd_handler.go	`85.00% <0.00%> (-5.25%)`	⬇️
pkg/crdconversion/crdconversion.go	`72.44% <0.00%> (-3.07%)`	⬇️
cmd/cli/util.go	`71.42% <0.00%> (-1.24%)`	⬇️
pkg/validator/patch.go	`95.40% <0.00%> (-0.60%)`	⬇️
pkg/envoy/ads/stream.go	`10.55% <0.00%> (-0.17%)`	⬇️
... and 20 more

Legend:
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Last update dcb2629...1397512.

shashankram

@davinci26 Thanks for addressing the issue based on our discussion. This change looks good, except that the commit message and PR description are misleading (they reference idle_timeout). Could you please fix that? Looks good otherwise.

Fixes openservicemesh#4137

Overall we aim to maintain the following invariance:

For all health probes we want the following to be true:

All Envoy (implicit or explicit) timeouts >= the timeout specified in
the pod deployment spec.

Changes:

* Remove connect timeouts from all clusters
* For health probe clusters we set the route timeout to be equal to the
  timeout set on the kubernetes pod. This is only applied on HTTP routes
  since plain tcp routes have infinite timeout.

Signed-off-by: Sotiris Nanopoulos <[email protected]>

davinci26 · 2021-09-27T19:09:47Z

@shashankram fixed. Thanks for the feedback on this PR offline, highly appreciate it!

…servicemesh#4149) Fixes openservicemesh#4137 Overall we aim to maintain the following invariance: For all health probes we want the following to be true: All Envoy (implicit or explicit) timeouts >= the timeout specified in the pod deployment spec. Changes: * Remove connect timeouts from all clusters * For health probe clusters we set the route timeout to be equal to the timeout set on the kubernetes pod. This is only applied on HTTP routes since plain tcp routes have infinite timeout. Signed-off-by: Sotiris Nanopoulos <[email protected]> Signed-off-by: Sneha Chhabria <[email protected]>

davinci26 force-pushed the probeTimeouts branch 3 times, most recently from 1397512 to 5376e77 Compare September 27, 2021 16:48

davinci26 marked this pull request as ready for review September 27, 2021 17:15

davinci26 requested a review from a team as a code owner September 27, 2021 17:15

shashankram reviewed Sep 27, 2021

davinci26 force-pushed the probeTimeouts branch from 5376e77 to 4406dc5 Compare September 27, 2021 19:09

shashankram approved these changes Sep 27, 2021

ksubrmnn approved these changes Sep 27, 2021

ksubrmnn merged commit 3e727ed into openservicemesh:main Sep 27, 2021
