Skip to content
New issue

Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? # to your account

[ycabled] add notification for gRPC connection state transitions to IDLE/TRANSIENT_FAILURE #295

Merged
merged 3 commits into from
Sep 21, 2022

Conversation

vdahiya12
Copy link
Contributor

@vdahiya12 vdahiya12 commented Sep 20, 2022

Signed-off-by: vaibhav-dahiya vdahiya@microsoft.com
For the cases where gRPC connectivity to server becomes IDLE/TRANSIENT_FAILURE, the ToR should periodically query/retry establish admin state and revert the Forwarding state back to active-active for both the T0's
This PR tries to attain that via adding a transient failure message to APP DB when connectivity is lost, so each time connectivity state changes back to not desired state(IDLE/TRANSIENT_FAILURE) we log a message, and linkmgr will query again to get gRPC in sync

Description

Motivation and Context

How Has This Been Tested?

Unit-tests and deploying changes to testbed

Additional Information (Optional)

Signed-off-by: vaibhav-dahiya <vdahiya@microsoft.com>
@vdahiya12 vdahiya12 requested a review from zjswhhh September 20, 2022 18:20
zjswhhh
zjswhhh previously approved these changes Sep 20, 2022
Copy link

@zjswhhh zjswhhh left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lgtm

@vdahiya12 vdahiya12 changed the title [ycabled] add notification if gRPC connection goes IDLE/TRANSIENT_FAILURE [ycabled] add notification for gRPC connection state transitions to IDLE/TRANSIENT_FAILURE Sep 20, 2022
@lgtm-com
Copy link

lgtm-com bot commented Sep 20, 2022

This pull request introduces 2 alerts when merging 4f00c33 into 8ff5f37 - view on LGTM.com

new alerts:

  • 2 for Variable defined multiple times

Signed-off-by: vaibhav-dahiya <vdahiya@microsoft.com>
Signed-off-by: vaibhav-dahiya <vdahiya@microsoft.com>
@vdahiya12 vdahiya12 merged commit 6522c46 into sonic-net:master Sep 21, 2022
yxieca pushed a commit that referenced this pull request Sep 21, 2022
…DLE/TRANSIENT_FAILURE (#295)

Signed-off-by: vaibhav-dahiya vdahiya@microsoft.com
For the cases where gRPC connectivity to server becomes IDLE/TRANSIENT_FAILURE, the ToR should periodically query/retry establish admin state and revert the Forwarding state back to active-active for both the T0's
This PR tries to attain that via adding a transient failure message to APP DB when connectivity is lost, so each time connectivity state changes back to not desired state(IDLE/TRANSIENT_FAILURE) we log a message, and linkmgr will query again to get gRPC in sync

Description
Motivation and Context
How Has This Been Tested?
Unit-tests and deploying changes to testbed

Additional Information (Optional)
# for free to join this conversation on GitHub. Already have an account? # to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants