-
-
Notifications
You must be signed in to change notification settings - Fork 15.4k
builds started failing on Hydra's new hash-named x86 machines #64126
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
Eh, another problem, likely related and much worse – those machines quite often end an x86_64-linux build with:
Again, I could never reproduce these. /cc @NixOS/rfc-steering-committee I don't really have an idea about whom to ping, but some of them certainly should know about those new Hydra machines. |
Apparently this currently blocks larger rebuilds, even with multiple restart attempts, e.g. see this build. /cc @FRidh who deals with staging-next a lot (to know I've posted a thread for this). |
@grahamc have you seen this? |
No, I haven't seen this. |
These hash-named x86 machines have Intel Scalable Gold 5120 cpus, and are transient -- so it is a bit lucky that this exact build's machine still exists. However, unlucky because I can't log in to it:
Evidently something very strange happened to it. I've since destroyed that server. I picked up another one of the machines (b5b77143) which is alive, and boy did something stick out to me! Look at this selection from
|
How much would you bet These machines are spot and transient, so they never fully "install". In the x86 case I accidentally left the |
@grahamc: thanks for the quick reaction. I should've tried to mention you directly; now I'll know who knows best about these builders, too. The aarch64-linux ones also suffer from this, apparently: this build (step 6). |
Yes, indeed. I have terminated those now as well. Same problem with unionfs. I'm traveling this week, which makes it a bit trickier to fix and re-launch these instances, but I'll give it a go! Thank you for the heads up. |
I have updated the filesystem layout, and |
I think the issue has been resolved so closing. |
A couple days ago, i686 nixos tests started failing consistently, e.g. https://hydra.nixos.org/build/95616203 I can't reproduce the problem locally, and apparently there's something different about those hash-named build machines (which might have been added or at least changed around that time, too).
i686 tests aren't too important nowadays, I suppose, but we could at least do something simple, e.g. remove the i686 platform tag from these machines.
The text was updated successfully, but these errors were encountered: