Skip to content

[action] [PR:9075] [chassis] accomodate test_memory_exhaustion.py if random dut is chassis sup #9295

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 1 commit into from
Aug 4, 2023

Conversation

mssonicbld
Copy link
Collaborator

Description of PR

There are 2 fixes in this PR to enhance for chassis:

  1. Right now in processes_utils.py when check_critical_services, we rely on what is pre-defined in inventory file, otherwise it is 300 sec, but for SUP, we should give 900sec as most SUPs have 10+ dockers that need to come back up, it should differ from linecards or any single-dut. magic 900 sec comes from experientment, where I had run twice the test, 1st time need 800+sec, 2nd time needs ~702sec
  2. During memory exhaustion test, if the random selected dut is chassis SUP, we also need to check linecards bootup and critical services are up.

Summary:
Fixes # (issue)
#9087

Type of change

  • Bug fix
  • Testbed and Framework(new/improvement)
  • Test case(new/improvement)

Back port request

  • 201911
  • 202012
  • 202205

Approach

What is the motivation for this PR?

How did you do it?

How did you verify/test it?

Any platform specific information?

Supported testbed topology if it's a new test case?

Documentation

@mssonicbld
Copy link
Collaborator Author

Original PR: #9075

@mssonicbld mssonicbld merged commit cd3b341 into sonic-net:202205 Aug 4, 2023
@mlok-nokia
Copy link
Contributor

mlok-nokia commented Aug 7, 2023

@wenyiz2021 @arlakshm This PR cause an issue. wait_lc_healthy_if_sup() has been defined with 3 arguments. But the cause only given 3.

@@ -41,6 +49,7 @@ def tearDown(self, duthosts, enum_rand_one_per_hwsku_hostname, localhost, pdu_co
'Recover {} by PDU reboot failed'.format(hostname))
# Wait until all critical processes are healthy.
wait_critical_processes(duthost)
self.wait_lc_healthy_if_sup(duthost, duthosts)
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

wait_lc_healthy_if_sup() missing localhost as an argument

@@ -68,6 +77,7 @@ def test_memory_exhaustion(self, duthosts, enum_rand_one_per_hwsku_hostname, loc
'DUT {} did not startup'.format(hostname))
# Wait until all critical processes are healthy.
wait_critical_processes(duthost)
self.wait_lc_healthy_if_sup(duthost, duthosts)
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

wait_lc_healthy_if_sup missing localhost as an argument.

@wenyiz2021
Copy link
Contributor

@wenyiz2021 @arlakshm This PR cause an issue. wait_lc_healthy_if_sup() has been defined with 3 arguments. But the cause only given 3.

thanks @mlok-nokia for catching this! I raised a fix PR #9320

@mssonicbld mssonicbld deleted the cherry/202205/9075 branch February 4, 2024 08:31
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants