[Mellanox] Update SAI to v2205.22.1.19 and SDK/FW to v4.5.3168/v2010.3170 #12205

Merged

@volodymyrsamotiy volodymyrsamotiy commented Sep 28, 2022

Signed-off-by: Volodymyr Samotiy [email protected]

Why I did it

To include latest fixes and new functionality

SAI fixes and new features

  • Fix #3205239: incorrect object type returned for SG child list
  • Fix VRF-VNI map entries remove issue
  • ECC health event and logging
  • [Port Buffers] restore default queue and pg configuration when all user pools are deleted
  • Fix EVPN type3 error on removal of uc/bc flood group
  • Fix EVPN type2 MAC move from local to remote results in SAI failure
  • Fix Disable learning on VXLAN tunnel
  • Fix error on VXLAN v6 tunnel removal
  • Fix port being unable to apply a schedule group when it is a LAG member
  • Fix BFD: add a more detailed message for BFD packets not related to any existing session
  • gcc10 compilation fixes
  • Disable learning on VXLAN tunnel
  • Support BFD remote-disc exchange in negotiation stage
  • Tunnel Loopback packet action attribute implementation (for Dual TOR)
  • Add KVD resources MIN/MAX functionality (pending CRM issue with MIN only)
  • Support for CRC2 hash algorithm
  • Bulk counter support for PGs, queues
  • Support mirror sample rate attribute (SPC2+)
  • [Functional] [QoS] | Unable to remove SCHEDULE profile table even if there is no object referencing it
  • Next hop group optimized bulk API
  • Reduce verbosity of shared database already exists print
  • Span mirror policer (SPC2+), optimize pipeline for acl mirror action with policer on SPC2+
  • Use the same size descriptor pool for RX/TX
  • Fix BFD: notify SONiC of admin-down event
  • 2201 - empty list for supported FEC for RJ45 ports
  • Fix: do not disable tunnel underlay interfaces that are in use

SDK fixes

  • 100GbE FCI DAC (10137628-4050LF/HPE PN: 845408-B21) was recognized by mistake as supporting "cable burning", which caused the switch firmware to read page 0x9f (which is unsupported in the cable) and to report this cable as having "bad eeprom".
  • Added remote peer UDP port information in BFD packet event.
  • After editing an ECMP, the resilient ECMP next-hop counter may not count correctly.
  • Fixed potential memory leaks in some APIs related to LPM
  • If TTL_CMD_COPY is used in Encap direction for a packet with no TTL, then the value passed in the ttl data structure will be used if non-zero (default 255 if zero).
  • In SN2201: when configuring Force mode, the user should configure Speed and FEC on both sides.
  • In Flex Tunnel encapsulation flow, if the encapsulation is with an IPv6 header, the flow label field may not be updated as expected.
  • In some cases, when changing speed to 400GbE over 8 lanes, the first few packets would be dropped.
  • In some traffic patterns involving small packets, the PortRcvErrors counter may mistakenly count events of local physical errors due to an internal flow in the hardware that involves link packets.
  • On Spectrum systems, sometimes during link failure, not all previous firmware indications cleared properly, potentially affecting the next link up attempt.
  • On the NVIDIA Spectrum-2 switch, when receiving a packet with Symbol Errors on ports that are configured to cut-through mode, a pipeline might get stuck.
  • PCI calibration changes from a static to a dynamic mechanism.
  • SDK debug dump shows "Unknown" Counter in RFC3635 Counter Group.
  • SDK debug dump shows "Unknown" Counter in the PPCNT Traffic Class Counter Group.
  • Missing column headers in some GC tables in the SDK dump may make the dump difficult to understand.
  • SLL configuration is missing in SDK dump.
  • Spectrum-2 systems do not support 1GbE on supported 40GbE modules.
  • When binding a UDP port which is already in use for BFD TX session, the error message appears incorrectly.
  • When Flex Tunnel was used, Flex Modifier sometimes experienced a brief mis-configuration during ISSU.
  • When many ports are active (e.g. 70 ports up), and the configuration of shared buffer is applied on the fly, occasionally, the firmware might get stuck.
  • When running 1GbE speeds on SN4600 system, the port remained active while peer side was closed.
  • When toggling many ports of the Spectrum devices while raising 10GbE link up and link maintenance is enabled, the switch may get stuck and may need to be rebooted.
  • When trying to reconfigure the Flex Parser header and Flex transition parameters after ISSU, the switch returned an error even if the configuration was identical to that done before performing the ISSU.
  • While toggling the cable with low power mode set to ON, an unexpected PMPE event error is received.
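
For reviewers, the TTL_CMD_COPY item in the list above can be read as a small fallback rule. The sketch below is only an illustration of that rule in Python, not SDK code: the function name, parameter names, and the assumption that a packet which does carry a TTL simply has it copied are all hypothetical. Only the "use the passed value if non-zero, else 255" behavior comes from the item itself.

```python
from typing import Optional

# Default stated in the fix description for the zero case.
DEFAULT_ENCAP_TTL = 255


def resolve_encap_ttl(packet_ttl: Optional[int], configured_ttl: int) -> int:
    """Pick the TTL for the encapsulation header under a copy command.

    packet_ttl     -- TTL carried by the packet, or None if it has no TTL
                      (hypothetical representation, for illustration only)
    configured_ttl -- value passed in the ttl data structure
    """
    if packet_ttl is not None:
        # Assumption: with a copy command, an existing TTL is copied as-is.
        return packet_ttl
    if configured_ttl != 0:
        # Packet has no TTL: fall back to the non-zero configured value.
        return configured_ttl
    # Packet has no TTL and the configured value is zero: use the default.
    return DEFAULT_ENCAP_TTL
```

Under these assumptions, a packet without a TTL and a configured value of 64 is encapsulated with TTL 64, and the same packet with a configured value of 0 is encapsulated with TTL 255.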

How I did it

Updated SDK/SAI submodule and relevant makefiles with the required versions.

How to verify it

Build an image and run tests from "sonic-mgmt".

Which release branch to backport (provide reason below if selected)

  • 201811
  • 201911
  • 202006
  • 202012
  • 202106

Description for the changelog

[Mellanox] Update SAI to v2205.22.1.19 and SDK/FW to v4.5.3168/v2010.3170

A picture of a cute animal (not mandatory but encouraged)

@liat-grozovik liat-grozovik merged commit 92bd6da into sonic-net:master Sep 30, 2022