Skip to content

[ycabled] add notification for gRPC connection state transitions to IDLE/TRANSIENT_FAILURE #295

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 3 commits into from
Sep 21, 2022

Conversation

vdahiya12
Copy link
Contributor

@vdahiya12 vdahiya12 commented Sep 20, 2022

Signed-off-by: vaibhav-dahiya [email protected]
For the cases where gRPC connectivity to server becomes IDLE/TRANSIENT_FAILURE, the ToR should periodically query/retry establish admin state and revert the Forwarding state back to active-active for both the T0's
This PR tries to attain that via adding a transient failure message to APP DB when connectivity is lost, so each time connectivity state changes back to not desired state(IDLE/TRANSIENT_FAILURE) we log a message, and linkmgr will query again to get gRPC in sync

Description

Motivation and Context

How Has This Been Tested?

Unit-tests and deploying changes to testbed

Additional Information (Optional)

@vdahiya12 vdahiya12 requested a review from zjswhhh September 20, 2022 18:20
zjswhhh
zjswhhh previously approved these changes Sep 20, 2022
Copy link

@zjswhhh zjswhhh left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lgtm

@vdahiya12 vdahiya12 changed the title [ycabled] add notification if gRPC connection goes IDLE/TRANSIENT_FAILURE [ycabled] add notification for gRPC connection state transitions to IDLE/TRANSIENT_FAILURE Sep 20, 2022
@lgtm-com
Copy link

lgtm-com bot commented Sep 20, 2022

This pull request introduces 2 alerts when merging 4f00c33 into 8ff5f37 - view on LGTM.com

new alerts:

  • 2 for Variable defined multiple times

Signed-off-by: vaibhav-dahiya <[email protected]>
Signed-off-by: vaibhav-dahiya <[email protected]>
@vdahiya12 vdahiya12 merged commit 6522c46 into sonic-net:master Sep 21, 2022
yxieca pushed a commit that referenced this pull request Sep 21, 2022
…DLE/TRANSIENT_FAILURE (#295)

Signed-off-by: vaibhav-dahiya [email protected]
For the cases where gRPC connectivity to server becomes IDLE/TRANSIENT_FAILURE, the ToR should periodically query/retry establish admin state and revert the Forwarding state back to active-active for both the T0's
This PR tries to attain that via adding a transient failure message to APP DB when connectivity is lost, so each time connectivity state changes back to not desired state(IDLE/TRANSIENT_FAILURE) we log a message, and linkmgr will query again to get gRPC in sync

Description
Motivation and Context
How Has This Been Tested?
Unit-tests and deploying changes to testbed

Additional Information (Optional)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants