Add new T1 Mellanox-SN4280-C8C40 SKU #21967

Closed
wants to merge 346 commits into from

Conversation

sschlafman
Contributor

Why I did it

Add new T1 Mellanox-SN4280-C8C40 SKU
8x100G, 40x100G

Work item tracking
  • Microsoft ADO (number only):

How I did it

How to verify it

Which release branch to backport (provide reason below if selected)

  • 201811
  • 201911
  • 202006
  • 202012
  • 202106
  • 202111
  • 202205
  • 202211
  • 202305

Tested branch (Please provide the tested image version)

Description for the changelog

Link to config_db schema for YANG module changes

A picture of a cute animal (not mandatory but encouraged)

zhixzhu and others added 30 commits January 8, 2025 10:31
Exclude pie ports from buffer and qos config

Signed-off-by: Zhixin Zhu <[email protected]>
Disable vstest job because sonictest agent pool is failing.
To unblock PRs.
…onic-net#21299)

In SAI 11.x the SAI integrity counters are not fully supported. To detect packet drops caused by packet corruption or credit watchdog timeouts, we enable these interrupts and set the verbose level to error. This generates syslogs that can be used to detect such drops.

* add sai_postinit_cmd.soc with interrupt IDs for alerting

* update config.bcm with the path to sai_postinit_cmd.soc

* update config_bcm for j2 card
---------

Signed-off-by: Arvindsrinivasan Lakshmi Narasimhan <[email protected]>
…sonic-net#21245)

Issue to be fixed: Currently the operational status of the mgmt interface is missing or incorrect on multi-asic devices.
Root cause: The operational status of the mgmt interface is updated by portsyncd in the swss docker. On a multi-asic platform, the swss service is started only in the asic namespace context. Because portsyncd runs in a specific network namespace context, it is not aware of the mgmt interface in the host namespace. Therefore portsyncd has no way to find the operational status of the mgmt interface and update it in STATE_DB MGMT_PORT_TABLE.
Use case: The SNMP interface MIB periodically reads MGMT_PORT_TABLE in STATE_DB to retrieve the oper status of the mgmt interface. On a multi-asic platform, this currently returns the oper status of the 'eth0' interface, a virtual interface inside the asic namespace that is created as part of the database docker and is not the actual management interface.

---------

Signed-off-by: Suvarna Meenakshi <[email protected]>
…utomatically (sonic-net#21369)

#### Why I did it
src/sonic-host-services
```
* 744c673 - (HEAD -> master, origin/master, origin/HEAD) Fix no info log in syslog for caclmgrd (sonic-net#200) (10 minutes ago) [Zhaohui Sun]
```
#### How I did it
#### How to verify it
#### Description for the changelog
…omatically (sonic-net#21359)

#### Why I did it
src/sonic-swss-common
```
* 12c428e - (HEAD -> master, origin/master, origin/HEAD) [schema] add SRv6 config db tables (sonic-net#962) (21 hours ago) [Yakiv Huryk]
```
#### How I did it
#### How to verify it
#### Description for the changelog
…tically (sonic-net#21376)

#### Why I did it
src/sonic-sairedis
```
* 9697baf0 - (HEAD -> master, origin/master, origin/HEAD) Fix to not miss the entire set of counters to be added in addObject for CounterContext::updateSupportedCount (sonic-net#1493) (9 hours ago) [judyjoseph]
```
#### How I did it
#### How to verify it
#### Description for the changelog
…lly (sonic-net#21002)

[submodule] Update submodule sonic-swss to the latest HEAD
Change DBConnector to connect redis with unix socket.

Why I did it
Improve SONiC Redis security by switching to a Unix socket connection.

How I did it
Change DBConnector parameter to connect redis with unix socket:
DBConnector(const std::string &dbName, unsigned int timeout_ms, bool isTcpConn = false);

How to verify it
Pass all UT.

Description for the changelog
Change DBConnector to connect redis with unix socket.
Add disk full event to sonic-events-host yang model.

Why I did it
The following PR needs to publish a disk full event:
sonic-net/sonic-utilities#3700

Work item tracking
Microsoft ADO: 30608100
How I did it
Add disk full event to yang model.

How to verify it
Pass all UT.

Description for the changelog
Add disk full event to sonic-events-host yang model.
…lly (sonic-net#21382)

#### Why I did it
src/sonic-swss
```
* 54a499bf - (HEAD -> master, origin/master, origin/HEAD) Fix VRF update handling for loopback interfaces in IntfsOrch (sonic-net#3461) (6 hours ago) [Sai Rama Mohan Reddy S]
* 6f30dddb - Remove update of mgmt oper status in swss (sonic-net#3439) (24 hours ago) [SuvarnaMeenakshi]
```
#### How I did it
#### How to verify it
#### Description for the changelog
…21362)

Why I did it
Arista-7060X6-64PE-C256S2 is missing flex counters configs.

Work item tracking
Microsoft ADO (number only):
How I did it
Update flex counters config for Arista-7060X6-64PE-C256S2

How to verify it
Test it locally on Arista-7060X6-64PE-C256S2.
…lly (sonic-net#21387)

#### Why I did it
src/sonic-swss
```
* 4ff8153a - (HEAD -> master, origin/master, origin/HEAD) Add a delay between killing teamd processes (sonic-net#3325) (7 hours ago) [Saikrishna Arcot]
```
#### How I did it
#### How to verify it
#### Description for the changelog
Why I did it
There is a motivation to add capabilities to SONiC that allow static configuration of an SRv6 network.

Work item tracking
Microsoft ADO (number only): 30251795

How I did it
I added an SRv6 manager in Bgpcfgd that subscribes to SRV6_MY_LOCATORS and SRV6_MY_SIDS in CONFIG_DB and programs the changes into FRR's configuration.

Note: this change depends on the availability and implementation details of the following FRR patch: FRR SRv6 Static SID CLI

How to verify it
- Run unit tests
- Build an image that contains this change and the relevant FRR CLI support.
- Test the image on a virtual or physical device
Adding FRR CLI to support SRv6 static. The HLD for the feature is available at sonic-net/SONiC#1860

Signed-off-by: Carmine Scarpitta <[email protected]>
Why I did it
To support the addition of two new tables in CONFIG_DB, i.e. SRV6_MY_SIDS and SRV6_MY_LOCATORS, in order to allow configuration for SRv6 in SONiC.

Work item tracking
Microsoft ADO (number only): 30513277

How I did it
I defined the YANG model based on the SRv6 HLD.

How to verify it
Run the unit tests and build the image.
Why I did it
If a critical process crashes or is killed, the bmp docker container is not auto-restarted.

How I did it
/usr/bin/supervisor-proc-exit-listener is in charge of monitoring critical processes and publishing events, so it should be auto-restarted in every case. Otherwise problems can arise if supervisor-proc-exit-listener crashes, or in test cases such as
"docker exec bmp kill -SIGKILL -1", where critical process handling may not work correctly under some race conditions (depending on whether supervisor-proc-exit-listener is the last one to be killed).

When a container receives the SIGKILL signal to terminate its processes, the order in which the processes are actually terminated can depend on the scheduling and resource availability within the container.

If supervisor-proc-exit-listener is killed before the critical process, the container auto-restart is not triggered as expected.
…t#21366)

Why I did it
Use a Debian mirror snapshot instead of Debian version pinning,
because Debian version pinning can't handle the package uninstallation scenario.
…atically (sonic-net#21420)

#### Why I did it
src/sonic-snmpagent
```
* 9e2c50a - (HEAD -> master, origin/master, origin/HEAD) Fix snmp agent not-responding issue when high CPU utilization (sonic-net#345) (2 hours ago) [Jianquan Ye]
```
#### How I did it
#### How to verify it
#### Description for the changelog
…tomatically (sonic-net#21416)

#### Why I did it
src/sonic-linux-kernel
```
* 416e7a4 - (HEAD -> master, origin/master, origin/HEAD) Fix optoe's write_max when using native i2c driver (sonic-net#407) (6 hours ago) [Prince George]
```
#### How I did it
#### How to verify it
#### Description for the changelog
…lly (sonic-net#21422)

#### Why I did it
src/sonic-swss
```
* 4eb74f00 - (HEAD -> master, origin/master, origin/HEAD) [orchagent] Fix: ERR swss#orchagent: :- setPortPvid: pvid setting for tunnel Port_EVPN_XXX is not allowed (sonic-net#3402) (9 hours ago) [Brad House]
```
#### How I did it
#### How to verify it
#### Description for the changelog
sonic-net#21355)

Why I did it
It's one part of the fixes of sonic-net#21314
SNMP walker requests always time out when CPU utilization is at 100%.

Work item tracking
Microsoft ADO: 30112399

How I did it
Enable SNMP dynamic frequency on packet chassis.

How to verify it
snmp/test_snmp_cpu.py (https://github.com/sonic-net/sonic-mgmt/blob/master/tests/snmp/test_snmp_cpu.py) tests the scenario.
Why I did it
After docker-syncd-brcm-dnx-rpc is moved to bookworm in master, the libthrift*.so is not installed inside the syncd docker and the syncd process fails to come up.

Work item tracking
Microsoft ADO (number only):
How I did it
Installed libthrift-0.17.0

How to verify it
Verified that the syncd and swss dockers stay up and that QoS tests can run.
Why I did it
Improve the t1 config to align with YANG validation

How I did it
Add missing leafref and mandatory field to the config

How to verify it
YANG validation check on generated config
libthrift did not get installed in the Broadcom syncd RPC container. However, syncd-rpc requires it.
…utomatically (sonic-net#21437)

#### Why I did it
src/sonic-host-services
```
* 0430ada - (HEAD -> master, origin/master, origin/HEAD) Add implementation for DockerService.List (sonic-net#199) (16 hours ago) [Dawei Huang]
```
#### How I did it
#### How to verify it
#### Description for the changelog
Why I did it
Add an additional platform to the SONiC support list

Work item tracking
Microsoft ADO (number only):
How I did it
Added necessary platform configurations and identification logic.
Some iterations are still necessary on those.

How to verify it
An image containing this PR and the necessary driver changes should end up with links up.

Which release branch to backport (provide reason below if selected)
 202411
 msft-202412
Description for the changelog
Add initial support for Moby platform
Why I did it
Fix front panel LEDs for Quicksilver
Fix fan LEDs for Quicksilver
Add Moby platform
Work item tracking
Microsoft ADO (number only):
How I did it
Updated Arista platform submodules
- Why I did it
Update SAI Version SAIBuild245.3..13

- How I did it
Upload SAI artifact and update mlnx-sai.mk file

- How to verify it
Run sonic-mgmt tests
LARLSN and others added 8 commits March 13, 2025 15:16
feat(frrcfgd): update for bgp locator config
Use software_bfd instead of switch_type
Why I did it
Before pushing the newly built docker image to the registry, we add a verification step to ensure the image is valid.

Work item tracking
Microsoft ADO (number only):

How I did it
Before pushing the newly built Docker image to the registry, we add a verification step to ensure the image is valid.

How to verify it
For ref https://dev.azure.com/mssonic/build/_build/results?buildId=794495&view=results
If the validate step fails, it won't push the image to registry.
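The validate-then-push behaviour described above could be sketched roughly as follows. This is only an illustration: `validate_image`, `push_image` and `publish` are hypothetical helper names, and treating "Docker can inspect the image locally" as the validity check is an assumption, since the pipeline's actual validation step is not shown here.

```shell
# Hypothetical helpers for illustration; not the pipeline's real steps.

# Assumption: an image counts as valid if Docker can inspect it locally.
validate_image() {
    docker image inspect "$1" >/dev/null 2>&1
}

# Push the image to whatever registry its tag points at.
push_image() {
    docker push "$1"
}

# Push only when validation succeeds; otherwise report and return non-zero.
publish() {
    if validate_image "$1"; then
        push_image "$1"
    else
        echo "validation failed, not pushing $1" >&2
        return 1
    fi
}
```

Calling `publish <image>` then pushes only images that pass the check, and a non-zero return lets the calling job fail without touching the registry.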
Why I did it
The mount for smartswitch does not work in the pmon container because of an extra space in the generated bash script (pmon.sh). This PR fixes it.

How I did it
Remove the space from .j2 file which generates pmon.sh script

How to verify it
Build the image and test it again.
#### Why I did it  
Currently, the log displayed during curl failure does not have enough information:  
`Try 5: /usr/bin/curl --retry-connrefused --retry 5 failed to get: -L`  
The array expansion is not done, and only the first parameter is printed (since the `log_err` function only prints `$1`).  
After the change:  
`Try 1: /usr/bin/curl --retry-connrefused --retry 5 failed to get: -L -f -o target/debs/bookworm/libgrpc-dev-dbgsym_1.39.0-1_arm64.deb https://github.com/user/a/b/c/d/test1.deb. Retry..`  
This would help in debugging.  
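
The truncated log line is consistent with how the shell expands a list inside a quoted string. The sketch below reproduces the effect under stated assumptions: `log_first` is a hypothetical stand-in for a logger that prints only `$1`, the curl options and URL are made up, and positional parameters are used in place of a bash array (`"$@"` splits the same way as `"${array[@]}"`).

```shell
# Hypothetical stand-in for a logger that prints only its first argument.
log_first() {
    echo "$1"
}

# Hypothetical curl options, stored as positional parameters.
set -- -L -f -o out.deb https://example.com/test1.deb

# "$@" inside a larger quoted string still yields one word per element,
# so the logger receives several arguments and prints only the first.
before=$(log_first "failed to get: $@")

# "$*" joins all elements into a single word, so the whole list is printed.
after=$(log_first "failed to get: $*")

echo "$before"
echo "$after"
```

The first `echo` shows the message cut off after `-L`; the second shows the full option list. Having the logger print `$*` instead of `$1` is another way to get the same result.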

#### Description for the changelog  
Write a short (one-line) summary that describes the changes in this pull request for inclusion in the changelog:  

#### Link to config_db schema for YANG module changes  
Provide a link to `config_db` schema for the table for which the YANG model is defined.  
Link should point to the correct section on [Configuration.md](https://github.com/Azure/sonic-buildimage/blob/master/src/sonic-yang-models/doc/Configuration.md).
…utomatically (sonic-net#22041)

#### Why I did it
src/sonic-host-services
```
* 085db0f - (HEAD -> master, origin/master, origin/HEAD) Periodically check the status of the pmon container during HALT and return early if it has stopped (sonic-net#229) (3 days ago) [Vasundhara Volam]
```
#### How I did it
#### How to verify it
#### Description for the changelog
[installer] Add CSTATE configuration for the AMD CPU.
@mssonicbld
Collaborator

/azp run Azure.sonic-buildimage

Azure Pipelines successfully started running 1 pipeline(s).

vvolam and others added 2 commits March 18, 2025 11:19
… to the latest HEAD (sonic-net#21941)

The update to the Translib Bulk API made sonic-gnmi and sonic-mgmt-common depend on each other for compilation.
@mssonicbld
Collaborator

/azp run Azure.sonic-buildimage

Azure Pipelines successfully started running 1 pipeline(s).

opcoder0 and others added 5 commits March 19, 2025 14:24
…c-net#22064)

Why I did it
Create a structured issue template for bug, enhancement, regression, and testgap.

How I did it
Create new issue templates

How to verify it
Similar to sonic-mgmt repository
…3B platform) (sonic-net#21710)

Why I did it
Nokia adds X3B platform support:
Platform: x86_64-nokia_ixr7250_x3b-r0 Hwsku: Nokia-IXR7250-X3B

How I did it
Add new platform Nokia-IXR7250-X3B. platform string: x86_64-nokia_ixr7250_x3b-r0. Hwsku: Nokia-IXR7250-X3B.
Nokia-IXR7250-X3B supports the VoQ T2 feature. Following the existing multi-asic chassis design, database-chassis must be created and supported on the PizzaBox. This means that chassisdb.conf will be defined and used on the platform. To support database-chassis, docker0 (240.127.1.1) will be used for the redis-chassis.server.
The key file platform_env.conf will be defined without the "supervisor=1" definition to differentiate the X3B from a Chassis Supervisor.
This Platform support requires PR sonic-net#21709
This PR requires PR sonic-net#21737 to pass the image build

How to verify it
Install and run the image on X3B platform and verify all features

Signed-off-by: mlok <[email protected]>
[YANG] sonic-neigh.yang need support portchannel
@mssonicbld
Collaborator

/azp run Azure.sonic-buildimage

Azure Pipelines successfully started running 1 pipeline(s).

@sschlafman sschlafman changed the base branch from master to 202411 March 24, 2025 20:10
@sschlafman
Contributor Author

Closing this PR.
New PR : #22136

@sschlafman sschlafman closed this Mar 25, 2025
@sschlafman sschlafman deleted the Mellanox-SN4280-C8C40 branch April 23, 2025 18:30