-
Notifications
You must be signed in to change notification settings - Fork 812
[syslog] test_syslog_rate_limit.py fail when test on syncd container #11181
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
Hi, I don't have system that runs on Cisco platform. There is no acms container running in Nvidia platform. And syncd container for different vendor might have different docker build file. Could you please check:
|
Hi @Junchao-Mellanox , I took one in 2700 platform.
|
Comparing to eventd, I can see containercfgd runnning in it
|
Thanks. But I don't see acms container on my side. Could you please point me to the docker folder in sonic-buildimage? I don't see it in https://github.com/sonic-net/sonic-buildimage/tree/master/dockers Maybe it is a private container on your side? |
Hi Junchao, I found that acms container was for internal use only. That's why you cannot see it.
Do you mean syncd was built differently in each vendor with the same source code? If so, maybe we should bypass syncd syslog rate-limit test. |
Thanks for the confirmation. For syncd, do you see issue on Nvidia/Mellanox platform? We don't find it in our local regression. To my understanding, each vendor should maintain the FEATURE table for their platforms. In case a FEATURE does not support syslog rate limit, they should set FEATURE.support_syslog_rate_limit to false. For example, cisco does not support syslog rate limit for syncd, they should have following in FEATURE table: {
"FEATULRE": {
"syncd": {
"support_syslog_rate_limit": "false"
}
}
} The test case will ignore such service. |
I don't see issue on Nvdia/Mellanox platform. Thanks. I will close this issue and add check for other platform. |
Hi @Junchao-Mellanox , found one issue in mellanox and also other platform. |
What is the output of |
I saw the issue being widely happen in 20230531.14 which doesn't have your sonic-mgmt PR included. |
Thanks. There was a recent change related to syslog rate limit. The feature is disabled by default in that change. So, we need explicitly enable it in sonic-mgmt before doing the test. |
Close it after nightly testcase pass. |
Description
The test will fail when randomly pick acms or syncd container to test rate limiter. Related code: https://github.com/sonic-net/sonic-buildimage/blob/master/src/sonic-containercfgd/containercfgd/containercfgd.py#L158
The root cause is that there is no containercfgd to restart when config syslog rate-limit-container on these two container.
Steps to reproduce the issue:
acms
orsyncd
From syslog, it does send the command to udpate the rate-limits. But there is no rsyslog restart.
Describe the results you received:
acms or syncd container didn't restart thus test fail. Because it wait forever for pid referesh.
Describe the results you expected:
Need mellanox team to confirm if the behavior is expected.
acms
orsyncd
is expected to not start after config rate limit, we should improve the test or bypassacms
andsyncd
rate-limit test.Below is the pass case of
teamd
. The container restart and test pass.From syslog, it does config the rate limit and the container restarts
Additional information you deem important:
The text was updated successfully, but these errors were encountered: