Flatcar systemd units fail at first boot after disk is resized (resize done because the storage was full) #1279

ader1990 · 2023-12-06T12:21:12Z

Description

When the Flatcar storage is full, the instance is powered off and the disk is then resized.
After the instance is started again, the partition (vda9) is resized automatically, but some of the systemd units fail to start.

Impact

Low. After one more reboot, the systemd units start and run fine.

Environment and steps to reproduce

How to reproduce:

1. Using https://alpha.release.flatcar-linux.net/amd64-usr/current/flatcar_production_qemu.sh, create a Flatcar qemu-kvm instance.
2. In the VM, fill the disk by creating an oversized file with dd if=/dev/zero of=toobig.file bs=1G count=1000. The command ends when it runs out of space.
3. Shut down the VM.
4. Start the VM. A number of systemd services are in the failed state (expected).
5. Shut down the VM.
6. Grow the image to a bigger size with qemu-img resize.
7. Start the VM. The same systemd services are in the failed state (unexpected), although vda9 has been resized.
8. Reboot the VM. The systemd services are no longer in the failed state.

The image used was https://alpha.release.flatcar-linux.net/amd64-usr/current/flatcar_production_qemu_image.img
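
The qemu-img resize step above could be sketched on the host roughly as follows. This is only an illustration: the helper name grow_image, the image file name, and the +10G increment are assumptions and are not taken from the report.

```shell
#!/bin/sh
# Hypothetical helper: grow a VM disk image by a relative amount.
# $1 = image file, $2 = relative size such as +10G (both chosen by the caller).
grow_image() {
    # Only accept a relative increase, so the image can never be shrunk by accident.
    case "$2" in
        +[0-9]*) ;;
        *) echo "grow_image: size must look like +10G" >&2; return 2 ;;
    esac
    # Refuse to continue if the image file is missing.
    [ -f "$1" ] || { echo "grow_image: no such image: $1" >&2; return 1; }
    # The VM must be shut down before this runs.
    qemu-img resize "$1" "$2"
}

# Example (assumed file name, run with the VM powered off):
#   grow_image flatcar_production_qemu_image.img +10G
```

After the image has grown, starting the VM again should show the larger vda9 together with the failed units described in this report.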

The issue is that there are systemd units starting before the resize has been performed.

Failed units:

Failed Units: 3
  systemd-hwdb-update.service
  systemd-journal-catalog-update.service
  systemd-update-done.service
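
To compare the state across the boots described above, something like the following could be run inside the VM. It is a rough sketch: the helper names are made up for illustration, and RESIZE_UNIT is a placeholder for the unit that performs the resize, whose name is not given in this issue.

```shell
#!/bin/sh
# Hypothetical helper: print the unit name (first column) of every line on stdin.
# Intended input: systemctl list-units --state=failed --plain --no-legend
failed_unit_names() {
    awk 'NF { print $1 }'
}

# Hypothetical helper: succeed if unit $1 appears in the space-separated list on stdin.
# Intended input: systemctl show -p After --value <unit>
ordered_after() {
    tr ' ' '\n' | grep -Fxq -- "$1"
}

# Usage inside the VM (RESIZE_UNIT is an assumption, set it to the real unit name):
#   systemctl list-units --state=failed --plain --no-legend | failed_unit_names
#   systemctl show -p After --value systemd-hwdb-update.service | ordered_after "$RESIZE_UNIT"
#   df -h /    # confirm that the root filesystem actually grew
```

If ordered_after fails for a unit from the list, that unit has no After= ordering on the resize unit, which would be consistent with it starting before the resize has been performed.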


pothos · 2023-12-06T12:56:59Z

Thanks. For systemd-hwdb-update.service we should pre-build the DB at image generation with systemd-hwdb update --usr --root=/build/amd64-usr/. When /usr/lib/udev/hwdb.bin exists, it will be used. In the update postinst action we can delete the file in /etc from the upperdir, or delete it at boot.

For the rest, we should do the resize from the initrd, before initrd-setup-root, to make sure that we always use the available space. (Maybe systemd-repart could do that.)
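
A minimal sketch of the two hwdb steps proposed here, not the actual Flatcar build code. The function names are hypothetical, and /etc/udev/hwdb.bin is assumed to be "the file in /etc" because it is the default output location of systemd-hwdb update when --usr is not given.

```shell
#!/bin/sh
# Hypothetical helper: build the hardware database into <root>/usr/lib/udev/hwdb.bin.
# $1 = image root directory, for example /build/amd64-usr/
prebuild_hwdb() {
    [ -d "$1" ] || { echo "prebuild_hwdb: not a directory: $1" >&2; return 1; }
    # --usr selects the /usr/lib/udev location, --root makes all paths relative to $1.
    systemd-hwdb update --usr --root="$1"
}

# Hypothetical helper: remove a previously generated copy under <root>/etc.
# Assumption: the stale file is etc/udev/hwdb.bin.
remove_etc_hwdb() {
    rm -f -- "${1%/}/etc/udev/hwdb.bin"
}

# Example:
#   prebuild_hwdb /build/amd64-usr/    # at image generation
#   remove_etc_hwdb /                  # in the update postinst action or at boot
```

Removing the copy under /etc matters because udev looks there before /usr/lib/udev, so an old file would shadow the pre-built one.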

ader1990 added the kind/bug label on Dec 6, 2023


pothos moved this from 📝 Needs Triage to 🪵Backlog in Flatcar tactical, release planning, and roadmap Dec 6, 2023

