meaning ping does not work as an unprivileged user:
weasel@loghost01:~$ ping localhostping: socket: Operation not permittede2:weasel@loghost01:~$
This could likely be fixed with re-installing iputils-ping or setting cap_net_raw+ep manually, but we should figure out why the instance-debbotstrap installed machines are broken this way.
To upload designs, you'll need to enable LFS and have an admin enable hashed storage. More information
Child items 0
Show closed items
No child items are currently assigned. Use child items to break down this issue into smaller parts.
Linked items 0
Link issues together to show that they're related.
Learn more.
I'm not sure how this happened on loghost, but it didn't happen on bacula-director-01, as created in #31786 (moved) (which has the detailed commandline used in the creation).
It seems to me we don't have a clear way of reproducing this. From here on, maybe we need VM creators to explicitly document exactly which steps were taken to create the box? Maybe there's a slight change in the way partitions are created or something...
i confirm the problem occurs when the filesystem is cached. filed the bug in Debian in https://bugs.debian.org/942114 and wrote a patch that fixes the problem, deployed on both nodes.
Trac: Owner: tpa to anarcat Status: needs_information to assigned
i tested this and it seems to work. next step is to have another person confirm that, to get this merged in the debian package, have an upload to fix it in unstable and then stable, and then deploy this debian package on the nodes.
in the meantime, we should at least add a note in the new-machine docs before this ticket is marked as resolved.
Particularly interesting are hetzner-hel1-01.torproject.org and hetzner-nbg1-01.torproject.org because they were setup using the Hetzner cloud stuff, so our install procedure is broken there as well.
uploaded the hotfix to our own debian repo so that changes get deployed on new boxes automatically. hot-fixed two new machines (tbb-nightlies-master and onionoo-backend-01) because they were deployed from fsn-node-03, which didn't have the fix then.
fix the hetzner-cloud installer or at least add a node to check it could not figure out what is wrong with that installer, so I just added a note in the install procedure
check the other installers the "robot", "linaro" and "ganeti procedures have been confirmed as checked and fixed. the "cloud" procedure is still unclear but has been documented.
i think this ticket can be closed. mitigations are in place and workarounds will either expire with the bullseye release or show up in the next install but are flagged in the procedures.
Trac: Status: needs_review to closed Resolution: N/Ato fixed