Failures in chroot #135

dbnicholson · 2017-01-04T17:57:42Z

In our Endless image builder, we chroot into the ostree deployment to install apps with flatpak. The triggers always fail for 2 reasons:

The slave mounting of / fails because the deployment directory is not actually a mountpoint. This is easily fixed by doing a bind mount before hand, but I think this can be done in bubblewrap, too. Systemd does this - https://github.com/systemd/systemd/blob/master/src/core/namespace.c#L910.
pivot_root fails with EINVAL for reasons I can't quite grok. See https://github.com/torvalds/linux/blob/master/fs/namespace.c#L3035. FWIW, I can't really see why the pivot_root is needed. It seems that you could just build up the newroot, then move the mount over /. This is also what systemd does. It used to use pivot_root, but changed that in systemd/systemd@ac0930c.

The text was updated successfully, but these errors were encountered:

jlebon · 2017-01-04T18:10:28Z

This same issue also happens when running bubblewrap inside mock. I wrote a little compat script to make it work: https://gist.github.com/jlebon/fb6e7c6dcc3ce17d3e2a86f5938ec033

Of relevance to your (2):

# The parent of mount in which we'll chroot can't be shared
# or pivot_root will barf. So we just remount onto itself,
# but make sure to make the first parent mount private.

dbnicholson · 2017-01-04T18:21:34Z

@jlebon Thank you! I could not figure out the right private/bind magic to make that happen. Let me try that out.

cgwalters · 2017-01-04T19:38:06Z

See some discussion on this in opencontainers/runc#41

cgwalters · 2017-01-05T14:05:45Z

That said, I think you should bubblewrap instead of chroot - we should support nested containerization if you're root, or the host has unprivileged userns enabled.

alexlarsson · 2017-01-11T17:43:57Z

The nice thing about pivot_root is that we can completely clean out any references to any mounts we didn't create from the sandbox. MS_MOVE would just cover them. This seems like a safer option.

alexlarsson · 2017-01-11T17:44:40Z

auto-creating a mountpoint for the root seems nice though

dbnicholson · 2017-01-11T17:52:50Z

Yeah, I noticed that later searching around about pivot_root. I had a try hacking in the root mount, but it didn't quite work out.

dbnicholson · 2017-10-04T10:55:25Z

In case anyone ever feels like picking this up, https://gist.github.com/dbnicholson/da8aa72ea3bd7ee8731c9da2792fd5a3 is what I played with before but didn't get working.

Bubblewrap uses pivot_root to provide a clean enviroment for its sandbox. Unfortunately, pivot_root requires that current root mount and its parent mount are not shared mounts, which they are by default when making new mounts. To accomplish that, make the chroot root mount private and then bind mount the chroot on top of itself. This will guarantee that both conditions are satisfied. See containers/bubblewrap#135 for details and the workaround suggested in https://gist.github.com/jlebon/fb6e7c6dcc3ce17d3e2a86f5938ec033. https://phabricator.endlessm.com/T14860

safinaskar · 2023-04-28T17:46:39Z

I understand how to fix this. We need to bind mount / on /. Just doing C analog of mount --rbind / / will not work, because root directory of our process will still point to "old" root. In more precise terms: root directory (RTD) of our process (i. e. task_struct::fs.root ( https://elixir.bootlin.com/linux/v6.3/source/include/linux/fs_struct.h#L15 )) will still point to path ( https://elixir.bootlin.com/linux/v6.3/source/include/linux/path.h#L8 ) of "old" root, not "new" one. See also: https://elixir.bootlin.com/busybox/1.36.0/source/util-linux/switch_root.c#L356 .

So we need to do C analog of this: mount --rbind / /foo; cd /foo; mount --move . /; chroot ..

But this gives another problem: all filesystems mounted in original namespace will remain mounted inside bubblewrap, even after two pivot_roots. They will be inaccessible and hidden, but still mounted.

What is wrong with such situation? Consider this: we insert USB flash drive and mount it. Then start bubblewrap. Then we umount flash drive (in our host system). Actually it remained to be mounted in bubblewrap's namespace. So when we remove flash drive, we get data loss. What to do? We have two choices how to fix this situation:

Escape chroot. This can be done :)
Unmount as lot as possible

I like second solution more (but I can implement both). Second solution will look like this:

mount --rbind / /foo
Iterate over all mounts (using getmntent) (except for everything below /foo) and unmount
cd /foo; mount --move . /; chroot .

I can write patch if you want.

Also I can describe workaround for users of bubblewrap

smcv · 2023-05-01T11:07:21Z

I can write patch if you want

If you think you know how to solve this, please do: reviewing a pull request and checking for things that can go wrong there will be a lot easier than reviewing a text description that is less precise than code.

safinaskar · 2023-05-04T16:18:54Z

@smcv , I spent some more time thinking about bubblewrap. Now I think bubblewrap don't need to work in chroot. Let me tell why. As well as I understand bubblewrap needs root privileges. And it acquires them in one of 3 ways:

Bubblewrap already started as root
Bubblewrap is setuid
Bubblewrap creates new user namespace and thus becomes root

In 3rd way it is absolutely impossible to make bubblewrap to work in chroot, because this is prohibited in the kernel ( https://elixir.bootlin.com/linux/v6.2/source/kernel/user_namespace.c#L105 ). You can easily verify this by running this command (as root, not in chroot): chroot --userspec=1000:1000 /somedir unshare -r bash. The command will fail, because of the mentioned line in kernel sources.

So I think in 1st way and in 2nd way bubblewrap in chroot should not work, too. For consistency purposes. (But keep in mind that sometimes it still works.) So, I will not write any patch, I'm sorry about this. (Of course, you can try to convince me.)

Also, this will be very cool to disable setuid mode. I. e. simply to exit if we run as setuid binary. Because, as well as I understand, in modern distros user namespaces are enabled by default anyway

smcv · 2023-05-04T18:31:07Z

As well as I understand bubblewrap needs root privileges

Not exactly, it needs CAP_SYS_ADMIN and various other capabilities(7) in its current namespace. "root" is specifically uid 0, but in the most common use-cases for bubblewrap, uid 1000 without capabilities (in the initial namespace) becomes uid 1000 with capabilities (in the new namespace), with no "root" involved.

If you're saying "root" but you really mean "CAP_SYS_ADMIN" (and other capabilities), it's probably easier to understand the constraints if we're as precise as possible about what is happening.

(Just to make this extra-confusing, there is a concept called a capabilities-based security system, but the capabilities(7) feature is using a different meaning for that word.)

Bubblewrap creates new user namespace and thus becomes root

Again, this would be more accurately stated as: bubblewrap creates a new user namespace, and thus gains all capabilities(7) in that user namespace. It doesn't matter whether it's uid 0 in the new userns or not.

In 3rd way it is absolutely impossible to make bubblewrap to work in chroot, because this is prohibited in the kernel ( https://elixir.bootlin.com/linux/v6.2/source/kernel/user_namespace.c#L105 )

Right, yes.

So I think in 1st way and in 2nd way bubblewrap in chroot should not work, too. For consistency purposes.

For what you're calling the 1st way, where bubblewrap is started such that it already has elevated capabilities (most commonly by being root, for example sudo bwrap ...), it's not obvious to me that it shouldn't work. I am mostly only interested in running bubblewrap as an unprivileged user (for use-cases like Flatpak), but some of the other bubblewrap maintainers seem to value the ability to have elevated privileges (which is why issues like #518 and #551 stay open, instead of being closed as out-of-scope). But if you're doing that, then you could probably equally well use something else that is less limited than bwrap, like unshare and newuidmap; so I can see an argument that having bwrap provide that functionality is unnecessary.

For what you're calling the 2nd way, where bubblewrap is setuid root, the design principle is that bubblewrap shouldn't allow anything that an unprivileged user on a suitable kernel wouldn't be allowed to do. So I agree with your assertion that a setuid bubblewrap shouldn't work when run inside a chroot.

Also, this will be very cool to disable setuid mode. I. e. simply to exit if we run as setuid binary. Because, as well as I understand, in modern distros user namespaces are enabled by default anyway

Sorry, I don't understand "this will be very cool to".

Are you requesting a new feature: the ability to configure bubblewrap at build-time so that if it is run while setuid (as detected via AT_SECURE or by comparing real uid with effective uid), it will simply refuse to run and exit with an error? I have thought about that myself. If you or someone else can propose a pull request implementing that feature, I'll try to review it.

Or are you saying that something related to chroots would be a useful mechanism to use to provide that feature? If that, I don't understand: please be more specific.

dbnicholson · 2023-05-04T20:13:06Z

I don't see any reason why case 1 (bubblewrap started by a privileged user inside a chroot) couldn't work. It's why I opened the issue and started working on fixes to make it comply with pivot_root, after all. Even to this day our image builder, which runs very privileged because it needs to do things like setup loop devices, goes through the mount dance before chrooting just so that bwrap won't fail when we try to install flatpaks within it.

This could certainly be deemed wontfix, but I think it's a legitimate use case.

safinaskar · 2023-05-05T22:35:24Z

Are you requesting a new feature: the ability to configure bubblewrap at build-time so that if it is run while setuid (as detected via AT_SECURE or by comparing real uid with effective uid), it will simply refuse to run and exit with an error?

I want bubblewrap to always refuse to run and exit with an error if it detects it runs as a setuid

cgwalters added the question label Jan 17, 2017

jlebon mentioned this issue Oct 17, 2017

flatpak fails with "pivot_root: Invalid argument" #214

Open

smcv mentioned this issue Jun 4, 2021

Proton don't run in chroot ValveSoftware/steam-runtime#415

Open

smcv mentioned this issue Aug 5, 2021

[Question] bwrap in LXC #362

Closed

smcv mentioned this issue Sep 10, 2021

Question: support on non-metal EC2 instances? #450

Closed

This was referenced Nov 24, 2022

[Feature request]: Ability to read package install lists from stdin flatpak/flatpak#5186

Open

Installing openh264 fails in containers and chroot flatpak/flatpak#3238

Open

smcv mentioned this issue Aug 21, 2023

Error: While trying to apply extra data: apply_extra script failed, exit status 256 Updates comple flatpak/flatpak#4107

Closed

smcv mentioned this issue Sep 11, 2023

"pivot_root: Invalid argument" when running on a SLURM cluster node from NFS #594

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Failures in chroot #135

Failures in chroot #135

dbnicholson commented Jan 4, 2017

jlebon commented Jan 4, 2017

dbnicholson commented Jan 4, 2017

cgwalters commented Jan 4, 2017

cgwalters commented Jan 5, 2017

alexlarsson commented Jan 11, 2017

alexlarsson commented Jan 11, 2017

dbnicholson commented Jan 11, 2017

dbnicholson commented Oct 4, 2017

safinaskar commented Apr 28, 2023 •

edited

Loading

smcv commented May 1, 2023

safinaskar commented May 4, 2023

smcv commented May 4, 2023

dbnicholson commented May 4, 2023

safinaskar commented May 5, 2023

Failures in chroot #135

Failures in chroot #135

Comments

dbnicholson commented Jan 4, 2017

jlebon commented Jan 4, 2017

dbnicholson commented Jan 4, 2017

cgwalters commented Jan 4, 2017

cgwalters commented Jan 5, 2017

alexlarsson commented Jan 11, 2017

alexlarsson commented Jan 11, 2017

dbnicholson commented Jan 11, 2017

dbnicholson commented Oct 4, 2017

safinaskar commented Apr 28, 2023 • edited Loading

smcv commented May 1, 2023

safinaskar commented May 4, 2023

smcv commented May 4, 2023

dbnicholson commented May 4, 2023

safinaskar commented May 5, 2023

safinaskar commented Apr 28, 2023 •

edited

Loading