Description
Spinning this out of #3104 since the existing machines are currently back online temporarily.
In February 2021 some of our systems were migrated to from Joyent's data centers to Equinix using an account managed by Joyent team members which was separate from our existing one at Equinix.
Recently it became apparent that some of those were hosted on the Equinix data centers which were due to be shut down at the end of November. After a call today between myself, @richardlau and @bahamat we now have a good understanding of where we area and how to move forward.
To summarise where we currently are: There are two systems hosted on the account managed by Joyent. Both are SmartOS hosts with virtual images inside them. One of these hosts is called nc-backup-01
and only contains one VM - the backup server which is SmartOS 15.4 and is in the DFW2 data center.
The second host is called nc-compute-01
and it contains all the other systems referenced in #3104 - some are KVM instances, some are SmartOS zones and one is an lx-branded zone. The details and breakdown are as follows:
[root@nc-compute-01 ~]# vmadm list
UUID TYPE RAM STATE ALIAS
0f85685d-0150-4f8f-e211-9ecee63e8b61 KVM 3840 running test-joyent-ubuntu1604_arm_cross-x64-1
1cf77dcc-8a17-6c35-9132-83f55a8e058f KVM 3840 running test-joyent-ubuntu1804-x64-1
49f0a164-4e86-4fda-de73-abcf257587a0 KVM 3840 running release-joyent-ubuntu1604_arm_cross-x64-1
356655a2-12e6-e1d7-ac7b-b5188ad37cb0 OS 4096 running test-joyent-smartos20-x64-3
49089cfe-915f-c226-c697-a9faca6041f2 OS 4096 running release-joyent-smartos20-x64-2
94f76b46-6d20-612c-84e1-92c0dc3bae69 OS 4096 running release-joyent-smartos18-x64-2
c6e3d47a-1421-ee11-c52d-c3c80c198e95 OS 4096 running test-joyent-smartos20-x64-4
d894f3c6-d09a-c9df-d7ae-b6f613d9b413 OS 4096 running test-joyent-smartos18-x64-3
db3664d7-dd31-c233-cafb-df79efb9d069 OS 4096 running test-joyent-smartos18-x64-4
d357fd3c-a929-cd9c-da35-ad53b53e2875 KVM 7936 running release-joyent-ubuntu1804_docker-x64-1
feb21098-8101-66f6-f410-bd092952f84e KVM 16128 running infra-joyent-debian10-x64-1
12fa9eea-ba7a-4d55-abd9-d32c64ae1965 LX 32768 running infra-joyent-ubuntu1604-x64-1-new
The infrastructure team's public ssh key has now been put onto both SmartOS hosts so that those team members can access the systems. @richardlau and @sxa have also been invited to co-administer the Equnix instance hosting these two in case any recovery of the hosts is required, and to hopefully set up to receive notifications.
We explored a few potential options:
- Provision a new machine for the SmartOS systems and migrate the others to our existing Equinix account
- Provision a new machine and reprovision all the servers from scratch
- provision a new machine and migrate the existing instances across
Given that option 3 was feasible and solved the immediate problem with Equinix attempting to shut down their data centeras we have chosen that one and we intend to start migrating the system tomorrow (evening UTC). @bahamat will handle provisioning the replacement SmartOS host on Equinix and migrating the images across. This will result in an outage on these systems while the migration takes place. The new server will need to be Intel rather than AMD to support SmartOS' KVM implementation.
We will also aim to rename these so they do not have joyent
in the name since they are now hosted at equinix (Likely using a new equinix_mnx
provider name to indicate that it's hosted separately from our other equinix systems)
FYI @nodejs/build @nodejs/build-infra @vielmetti