[voq][orchagent][fabric]"Failed to get port object for portid" error during line card boot up #9033

Open
vganesan-nokia opened this issue Oct 21, 2021 · 5 comments
Labels
Chassis 🤖 Modular chassis support NOKIA Triaged this issue has been triaged

Comments

@vganesan-nokia
Contributor

Description

On a VOQ chassis, the linecard syslog shows a series of the following error logs for all ASIC instances. These errors appear during linecard boot-up.

ERR swss0#orchagent: :- doTask: Failed to get port object for port id 0x10000000000a9
ERR swss0#orchagent: :- doTask: Failed to get port object for port id 0x10000000000ac
ERR swss0#orchagent: :- doTask: Failed to get port object for port id 0x1000000000130
.
.
.
ERR swss1#orchagent: :- doTask: Failed to get port object for port id 0x101000000000118
ERR swss1#orchagent: :- doTask: Failed to get port object for port id 0x101000000000105
ERR swss1#orchagent: :- doTask: Failed to get port object for port id 0x101000000000115
.
.
.

Steps to reproduce the issue:

  1. Have a VOQ chassis with a supervisor card and a linecard (single ASIC or multiple ASICs).
  2. Reboot the chassis with an appropriate minimum configuration, such as switch_type = "voq".
  3. Check the syslog. For all ASIC instances of the linecard, the above error will be seen during initialization.

Describe the results you received:

As given in the description section above

Describe the results you expected:

The reported errors should not be seen.

Output of show version:

N/A

Output of show techsupport:

N/A

Additional information you deem important (e.g. issue happens only occasionally):

  • The issue happens all the time.
  • These errors occur when port state change notifications (from down to up) for fabric ports arrive from syncd. Since orchagent on the linecard only knows about the front panel ports and the recycle port (as supplied in port_config.ini and confirmed by configuration), it does not have corresponding port OIDs for the fabric ports named in the notifications, so it logs these errors. We need to either mask these error messages for fabric ports, or have the linecard orchagent retrieve the fabric port OIDs from SAI and match them against the port OIDs given in port state notifications.
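
The second suggestion above (matching notified port OIDs against fabric port OIDs retrieved from SAI) can be sketched as a toy model. This is only an illustration of the lookup order, not the sonic-swss implementation: the class name, method names, return values, and the `Ethernet0` OID are all hypothetical, and the fabric OID set is assumed to have been fetched from SAI beforehand.

```python
import logging

logger = logging.getLogger("orchagent_sketch")


class PortNotificationHandler:
    """Toy model of the proposed lookup; all names here are hypothetical."""

    def __init__(self, configured_ports, fabric_port_oids):
        # configured_ports: port OID -> port name, i.e. the front panel and
        # recycle ports that orchagent learns from port_config.ini.
        self.configured_ports = dict(configured_ports)
        # fabric_port_oids: OIDs assumed to have been retrieved from SAI.
        self.fabric_port_oids = set(fabric_port_oids)

    def handle_state_change(self, port_oid, oper_status):
        """Classify one port state change notification."""
        name = self.configured_ports.get(port_oid)
        if name is not None:
            logger.info("Port %s oper status is now %s", name, oper_status)
            return "updated"
        if port_oid in self.fabric_port_oids:
            # Known fabric port: nothing to update here, and no error log.
            logger.debug("Ignoring fabric port id 0x%x", port_oid)
            return "fabric_ignored"
        # Only genuinely unknown OIDs still produce the error.
        logger.error("Failed to get port object for port id 0x%x", port_oid)
        return "unknown"


handler = PortNotificationHandler(
    configured_ports={0x1000000000001: "Ethernet0"},  # hypothetical OID
    fabric_port_oids={0x10000000000A9, 0x10000000000AC},
)
for oid in (0x1000000000001, 0x10000000000A9, 0x1000000000130):
    print(hex(oid), handler.handle_state_change(oid, "up"))
```

With this ordering, the error is still logged for an OID that is neither configured nor a known fabric port, so a real lookup failure would not be masked.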
@vganesan-nokia vganesan-nokia changed the title [voq][orchagent]"Failed to get port object for portid" error during line card boot up [voq][orchagent][fabric]"Failed to get port object for portid" error during line card boot up Oct 21, 2021
@zhangyanzhao zhangyanzhao added the Triaged this issue has been triaged label Dec 8, 2021
@rlhui rlhui added Chassis 🤖 Modular chassis support NOKIA labels May 24, 2022
@rlhui
Contributor

rlhui commented May 25, 2022

@sanmalho-git will check if Nokia could help, thanks.

@skeesara-nokia

The VID (Virtual OID) encoding of a port only includes the object type (port) and does not indicate what kind of port it is (a fabric port or a regular Ethernet port, for example). Also, the VID structure is private to syncd, where it is allocated.

It is possible to do a VID to RID translation, and the RID (Real OID) may have the port type encoded in it. But the structure of the RID depends on the SAI implementation and cannot be relied upon in common code.

At this point the options are -

  1. Implement fabric port configuration so that SWSS can find the object when the notification is received from syncd for fabric port state transitions, OR
  2. Make changes to syncd to encode the PORT_TYPE in the VID structure, OR
  3. Make changes to the sonic-mgmt tests to ignore these log messages on the LC when executing tests that cause fabric links to bounce.
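
As a rough illustration of option 3, the log lines quoted in the description can be matched with a regular expression and dropped before a test inspects the syslog. This is a standalone sketch: the pattern is derived only from the sample lines in this issue, and `FABRIC_PORT_ERR` and `filter_syslog` are hypothetical names, not part of sonic-mgmt.

```python
import re

# Pattern built from the sample syslog lines in this issue (an assumption:
# real syslog entries carry timestamp and host prefixes, so search() is used).
FABRIC_PORT_ERR = re.compile(
    r"ERR swss\d+#orchagent: :- doTask: "
    r"Failed to get port object for port id 0x[0-9a-fA-F]+"
)


def filter_syslog(lines):
    """Return only the lines that do not match the known error pattern."""
    return [line for line in lines if not FABRIC_PORT_ERR.search(line)]


sample = [
    "ERR swss0#orchagent: :- doTask: Failed to get port object for port id 0x10000000000a9",
    "ERR swss1#orchagent: :- doTask: Failed to get port object for port id 0x101000000000118",
    "ERR swss0#orchagent: :- doTask: some other failure",
]
for line in filter_syslog(sample):
    print(line)
```

Note that the pattern cannot tell fabric ports from other ports, so it would also hide the same error for a front panel port. That is the trade-off of this option compared with options 1 and 2.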

@abdosi
Contributor

abdosi commented Aug 17, 2022

@mlorrillere can you confirm whether your PR sonic-net/sonic-swss#1944 will fix this?

@abdosi
Contributor

abdosi commented Aug 17, 2022

Sandeep from Nokia, please add the link to the test PR that ignores this error here.

@sanmalho-git

The test PR to ignore these messages in reload/reboot scenarios of the supervisor is sonic-net/sonic-mgmt#6056
