Skip to content

[Smartwitch][reboot-cause] Log error generated when DPU is powered off and the switch is rebooted #22072

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Closed
gpunathilell opened this issue Mar 18, 2025 · 2 comments
Assignees
Labels
Bug 🐛 smartswitch Triaged this issue has been triaged

Comments

@gpunathilell
Copy link
Contributor

Issue Summary

When the DPU is powered off (Transition from online to offline for DPU. We create a file prev_reboot_time.txt in the history folder for each of the DPU modules: Example: /host/reboot-cause/module/dpu0/history/prev_reboot_time.txt
Due to this file, if we perform switch reboot the following log error is seen:
2025 Mar 18 20:50:26.274641 sonic ERR process-reboot-cause[16633]: Unable to process reload cause file /host/reboot-cause/module/dpu0/history/prev_reboot_time.txt: Extra data: line 1 column 5 (char 4) on the switch.

Method to Reproduce

Power off one of the DPUs during runtime : config chassis modules shutdown DPU0
Wait for DPU oper status to transition to offline : show chassis modules status DPU0 -> Check that Oper-status is Offline
Confirm that the file prev_reboot_time.txt exists:
cat /host/reboot-cause/module/dpu0/history/prev_reboot_time.txt
Reboot the switch: reboot
Wait for switch to finish rebooting.
Confirm that the following log error is seen:
2025 Mar 18 20:50:26.274641 sonic ERR process-reboot-cause[16633]: Unable to process reload cause file /host/reboot-cause/module/dpu0/history/prev_reboot_time.txt: Extra data: line 1 column 5 (char 4)

@vvolam
Copy link
Contributor

vvolam commented Apr 2, 2025

@vvolam vvolam added the Bug 🐛 label Apr 3, 2025
@gpunathilell
Copy link
Contributor Author

Fixed by sonic-net/sonic-host-services#231

mssonicbld added a commit that referenced this issue Apr 26, 2025
…D automatically (#22415)

#### Why I did it
src/sonic-platform-daemons
```
* a86dd89 - (HEAD -> master, origin/master, origin/HEAD) Store reboot_cause history files with json extension. Fix for #22072 (#603) (3 days ago) [rameshraghupathy]
* 6dfb552 - Refactor ledd daemon & fix high CPU usage due to unexpected socket close (#548) (3 days ago) [Prince George]
* d36e830 - Change PSU key to use get_name API instead of index (#446) (4 days ago) [Gregory Boudreau]
```
#### How I did it
#### How to verify it
#### Description for the changelog
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Bug 🐛 smartswitch Triaged this issue has been triaged
Projects
None yet
Development

No branches or pull requests

4 participants