Skip to content

Fedora Rawhide Storage test failures #21947

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
jelly opened this issue Apr 24, 2025 · 4 comments
Open

Fedora Rawhide Storage test failures #21947

jelly opened this issue Apr 24, 2025 · 4 comments
Labels

Comments

@jelly
Copy link
Member

jelly commented Apr 24, 2025

Explain what happens

TestStorageRaid1.testMetadataAtEnd fails on rawhide:

[    3.630490] EXT4-fs (vda3): mounted filesystem 0fdd1a25-7a62-4739-a0b4-dfbe221b2cd3 r/w with ordered data mode. Quota mode: none.
[   13.720305] loop0: detected capacity change from 0 to 97656
[   14.004566] loop1: detected capacity change from 0 to 97656
[   17.234154] md/raid1:md127: not clean -- starting background reconstruction
[   17.234511] md/raid1:md127: active with 2 out of 2 mirrors
[   17.234792] md127: detected capacity change from 0 to 97536
[   17.235389] md: resync of RAID array md127
[   17.366810] md: md127: resync done.
[   18.152986] md127: detected capacity change from 97536 to 0
[   18.153270] md: md127 stopped.
[   18.168695] GPT:Primary header thinks Alt. header is not at the end of the disk.
[   18.169399] GPT:97535 != 97655
[   18.169684] GPT:Alternate GPT header not at the end of the disk.
[   18.171088] GPT:97535 != 97655
[   18.171418] GPT: Use GNU Parted to correct GPT errors.
[   18.171886]  loop1:
[   18.176426] GPT:Primary header thinks Alt. header is not at the end of the disk.
[   18.176709] GPT:97535 != 97655
[   18.176816] GPT:Alternate GPT header not at the end of the disk.
[   18.179000] GPT:97535 != 97655
[   18.179126] GPT: Use GNU Parted to correct GPT errors.
[   18.179311]  loop0:

This is reproduced when upgrading to 6.15rc3 by sitting in the test:

 # Delete the mdraid device.  Both disks should go back to "Unformatted data"
 testlib.sit()

Then via ssh stopping the mdraid array and calling systemctl daemon-reload which then hangs the whole virtual machine. An different existing ssh session ps hangs, stracing gives:

openat(AT_FDCWD, "/proc/1016/cmdline", O_RDONLY) = 4
read(4, "/sbin/agetty\0-o\0-- \\u\0--noreset\0"..., 131072) = 86
read(4, "", 130986)                     = 0
close(4)                                = 0
openat(AT_FDCWD, "/proc/1016/ctty", O_RDONLY) = -1 ENOENT (No such file or directory)
newfstatat(AT_FDCWD, "/dev/ttyS64",
@jelly jelly added the bug label Apr 24, 2025
@jelly
Copy link
Member Author

jelly commented Apr 24, 2025

Cannot reproduce it with:

sudo dnf update --exclude=kernel*

So this is a kernel regression

@jelly
Copy link
Member Author

jelly commented Apr 25, 2025

Reproducer:

truncate --size=50MB /var/tmp/member1; losetup -P --show --find /var/tmp/member1
truncate --size=50MB /var/tmp/member2; losetup -P --show --find /var/tmp/member2
mdadm --create SOMERAID --run --level=1 --metadata=1.0 --raid-devices=2 /dev/loop0 /dev/loop1
mdadm --stop /dev/md/SOMERAID
systemctl daemon-reload

@jelly
Copy link
Member Author

jelly commented Apr 25, 2025

Reported here https://bugzilla.redhat.com/show_bug.cgi?id=2362273

So for bisect purposes last good commit is fc96b232f8e7c0a6c282f47726b2ff6a5fb341d2

git log v6.15-rc3...fc96b232f8e7c0a6c282f47726b2ff6a5fb341d2

@jelly
Copy link
Member Author

jelly commented Apr 28, 2025

Should be fixed in https://lkml.org/lkml/2025/4/23/188 or rc4

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

1 participant