Block non-.onion subresources on .onion websites?

added TorBrowserTeam202006 component::applications/tor browser owner::tbb-team points::2 priority::medium severity::normal sponsor::27-can status::needs-information type::defect labels

I think there are two constituents here: The onion server, and the Browser user.

Our primary goal should be to serve the browser user.

Where it's easy and simple, we can serve the onion server. But these suggestions are not comprehensive, and Tor Browser will never be a comprehensive onion audit tool. I would instead advocate for improving the onionscan tool (https://onionscan.org/) where possible (although that, too, cannot be comprehensive).

Focusing on the browser user, I think it's fair to treat any non-onion resource as Mixed Content on an onion, regardless of HTTP/HTTPS status. There are three levels of Mixed Content Blocking:

None
Active (blocks scripts, allows images)
Full (blocks scripts and images)

There's also the security slider. I would suggest that when the security slider is at High, we perform Full blocking. It provides a smaller attack surface for the browser user.

When the slider is not at High, I would advocate for either Active or Full blocking. Probably Active.

I personally would ignore the case of an HTTPS onion including resources from an HTTP onion and give it no special treatment (that is to say, it's fine, and it loads fine).
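
A minimal sketch of the policy proposed above, written in Python for readability; it is not Tor Browser code. The names `Blocking`, `is_onion`, `blocking_for_slider` and `should_block`, the resource-kind strings, and the slider level strings are all hypothetical, and the non-High mapping assumes the "probably Active" option.

```python
from enum import Enum
from urllib.parse import urlsplit


class Blocking(Enum):
    NONE = "none"      # load everything
    ACTIVE = "active"  # block scripts, allow images
    FULL = "full"      # block scripts and images


# Hypothetical resource kinds; the thread only names scripts and images.
ACTIVE_KINDS = {"script"}
PASSIVE_KINDS = {"image"}


def is_onion(url: str) -> bool:
    """True if the URL's host is under .onion (scheme is ignored)."""
    host = urlsplit(url).hostname or ""
    return host.endswith(".onion")


def blocking_for_slider(slider: str) -> Blocking:
    """Assumed mapping: High -> Full, any other level -> Active."""
    return Blocking.FULL if slider == "high" else Blocking.ACTIVE


def should_block(page_url: str, resource_url: str, kind: str, level: Blocking) -> bool:
    # The proposal only concerns pages served from .onion.
    if not is_onion(page_url):
        return False
    # Onion-to-onion loads get no special treatment, HTTP or HTTPS.
    if is_onion(resource_url):
        return False
    # Any non-onion resource counts as mixed content, HTTP or HTTPS.
    if level is Blocking.FULL:
        return kind in ACTIVE_KINDS | PASSIVE_KINDS
    if level is Blocking.ACTIVE:
        return kind in ACTIVE_KINDS
    return False
```

For example, `should_block("http://foo.onion/", "https://example.com/a.png", "image", blocking_for_slider("high"))` evaluates to `True`, while the same call with `Blocking.ACTIVE` evaluates to `False`.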

Replying to tom:

> I think there are two constituents here: The onion server, and the Browser user.
>
> Our primary goal should be to serve the browser user.
>
> Where it's easy and simple, we can serve the onion server. But these suggestions are not comprehensive, and Tor Browser will never be a comprehensive onion audit tool. I would instead advocate for improving the onionscan tool (https://onionscan.org/) where possible (although that, too, cannot be comprehensive).
>
> Focusing on the browser user, I think it's fair to treat any non-onion resource as Mixed Content on an onion, regardless of HTTP/HTTPS status. There are three levels of Mixed Content Blocking:
>
> None
> Active (blocks scripts, allows images)
> Full (blocks scripts and images)
>
> There's also the security slider. I would suggest that when the security slider is at High, we perform Full blocking. It provides a smaller attack surface for the browser user.

I'd like to understand that point better. What attacks are you talking about here? We block features based on code execution vulnerabilities in the past, not based on transport, as a general rule of thumb. So, this means that on the highest slider level scripts are already blocked irrespective of transport, mixed content situation, or anything else. Images are not blocked so far, because the ratio of security benefit to usability penalty is not that good. That, again, does not depend on the underlying transport or the mixed content situation.

If I understand it right then what you want is to defend against the privacy risks Arthur outlined by using the security slider. If that's the case then I am not convinced by that idea yet as we don't want to mix security and privacy related settings in the slider.

Replying to gk:

> If I understand it right then what you want is to defend against the privacy risks Arthur outlined by using the security slider. If that's the case then I am not convinced by that idea yet as we don't want to mix security and privacy related settings in the slider.

Nooo, I keep the delineation in mind. I said "when the security slider is at High, perform Full blocking" specifically for security reasons.

An attacker wants to compromise a user who visits foo.onion. foo.onion includes an image from example.com. (HTTP or HTTPS, doesn't matter.) Instead of compromising foo.onion, the attacker compromises either example.com or the connection from the exit node to example.com and serves an exploit on a passive piece of content (like an image.)

Performing full blocking removes this attack surface.

Now you said

> We block features based on code execution vulnerabilities in the past, not based on transport

I hadn't heard the bit about transport before. Perhaps you disagree with me based on that. But I'm confused then: At Medium, why is JS disabled on HTTP sites? Isn't that blocking a feature based on transport?

Replying to tom:

> Replying to gk:
>
> > If I understand it right then what you want is to defend against the privacy risks Arthur outlined by using the security slider. If that's the case then I am not convinced by that idea yet as we don't want to mix security and privacy related settings in the slider.
>
> Nooo, I keep the delineation in mind. I said "when the security slider is at High, perform Full blocking" specifically for security reasons.
>
> An attacker wants to compromise a user who visits foo.onion. foo.onion includes an image from example.com. (HTTP or HTTPS, doesn't matter.) Instead of compromising foo.onion, the attacker compromises either example.com or the connection from the exit node to example.com and serves an exploit on a passive piece of content (like an image.)
>
> Performing full blocking removes this attack surface.

Okay, sounds good.

> Now you said
>
> > We block features based on code execution vulnerabilities in the past, not based on transport
>
> I hadn't heard the bit about transport before. Perhaps you disagree with me based on that. But I'm confused then: At Medium, why is JS disabled on HTTP sites? Isn't that blocking a feature based on transport?

Well, back then we wanted to block JavaScript because of the high number of vulnerabilities found in it in the past. But we settled on enabling it for HTTPS on the more or less medium level to strike some balance between usability and security, as otherwise that level would have been too unattractive to use.

So, it's not exactly like blocking something based on transport (the difference might sound subtle here but I still think it is worth mentioning). That said, in general we could think about taking the transport into account for putting something onto the slider. My worry is, though, that it makes analysis of which features to block where much more complicated and we end up picking less security where we should not.

But that said I think we should definitely do the mixed content blocking (we might even already get some due to treating .onion as secure context) and we have #13747 (moved) for that which I just put on our work radar. And we should go with (full) mixed content blocking regardless of the slider level. Ideally, we would align it with the mixed content policy we see wrt non-onion mixed content but I guess that could be up for debate.

I think I'd like to see #13747 (moved) fixed first, see what fallout we have from that change, and get a better understanding of whether the remaining parts should be tackled on the browser side or not.

Arthur: if you feel that this ticket is essentially about implementing full mixed content blocking, then feel free to dupe it over to #13747 (moved). If not, let's revisit it after #13747 (moved) got done.

Trac:
Sponsor: N/A to Sponsor27

#32464 (moved) is a duplicate

Trac:
Cc: N/A to simonfrey

Putting this on the roadmap for early next year.

Trac:
Keywords: N/A deleted, TorBrowser202001 added

Correct keyword.

Trac:
Keywords: TorBrowser202001 deleted, TorBrowserTeam202001 added

Trac:
Points: N/A to 2

Replying to arthuredelstein:

> Right now, .onion sites can load HTTP or HTTPS subresources (scripts, images, etc.).
>
> But is this safe? Loading non-.onion subresources means we are potentially leaking information including:
>
> the .onion domain
> the full top-level .onion URL
> other information about the content of the page
> the list of subresources requested by a .onion page
>
> Leaks might happen by referer, fetch request, query string, etc. (I haven't tested these yet and I'm not sure what leaks happen in practice.) Such leaks would be particularly bad for "stealth" onion sites.
>
> Even worse, some of the non-.onion subresources may leak the onion site's IP address. For example, an improperly configured .onion website may accidentally include URLs pointing to its own server's non-.onion IP address. Loading those subresources leaks the IP address not just to the user but to anyone watching connections outside the Tor network.

I'm not sure I understand the goal of this. In the simple case, a web developer has complete control over which subresources are used on the web site. As such, they accept any risks associated with using non-onion subresources. Maybe we should provide more training/support for explaining these risks, but I do not see the browser as a place where these restrictions should be imposed.

I begin seeing the benefit of blocking resources from clearnet addresses on more complicated websites, such as those sites where user-generated content is published. However, in this case, it seems like the website/server should implement sanitization or filtering in their software, instead of expecting this functionality in the browser.
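
As a rough illustration of what such server-side filtering could look like, here is a minimal Python sketch that scans a page (or a piece of user-generated HTML) for subresource URLs that leave .onion, and separately flags raw IP addresses. Everything here is an assumption for illustration: the `SUBRESOURCE_ATTRS` list is deliberately incomplete, and `find_non_onion_subresources` is a hypothetical helper, not part of any existing tool.

```python
import ipaddress
from html.parser import HTMLParser
from urllib.parse import urljoin, urlsplit

# Assumed, incomplete list of (tag, attribute) pairs that trigger loads.
SUBRESOURCE_ATTRS = {
    ("script", "src"), ("img", "src"), ("iframe", "src"),
    ("link", "href"), ("video", "src"), ("audio", "src"), ("source", "src"),
}


def _is_ip(host: str) -> bool:
    """True if host is a literal IPv4 or IPv6 address."""
    try:
        ipaddress.ip_address(host)
        return True
    except ValueError:
        return False


class SubresourceAudit(HTMLParser):
    """Collects (url, reason) pairs for subresources outside .onion."""

    def __init__(self, base_url: str):
        super().__init__()
        self.base_url = base_url
        self.findings = []

    def handle_starttag(self, tag, attrs):
        for name, value in attrs:
            if value is None or (tag, name) not in SUBRESOURCE_ATTRS:
                continue
            # Resolve relative URLs against the page's own address.
            url = urljoin(self.base_url, value)
            parts = urlsplit(url)
            if parts.scheme not in ("http", "https"):
                continue  # e.g. data: URLs never hit the network
            host = parts.hostname or ""
            if host.endswith(".onion"):
                continue
            reason = "ip-address" if _is_ip(host) else "non-onion"
            self.findings.append((url, reason))


def find_non_onion_subresources(html: str, base_url: str):
    audit = SubresourceAudit(base_url)
    audit.feed(html)
    audit.close()
    return audit.findings
```

A site could run a check like this on submitted content and reject or rewrite anything it reports; relative URLs resolve to the site's own .onion address and are not flagged.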

As a user, it is possible I may only want to load resources from .onion addresses. This wouldn't be related to leaking onion addresses. There is a torrc option (OnionTrafficOnly) which accomplishes this, and we could expose a UI preference for this - but as gk mentioned, this sounds like #13747 (moved).
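
For reference, OnionTrafficOnly is written as a flag on a SocksPort line in torrc. A minimal fragment, assuming Tor Browser's usual SOCKS port of 9150 (the port number is an assumption and may differ in a given setup):

```
# torrc fragment; the port number is an assumption
SocksPort 9150 OnionTrafficOnly
```

With this flag, SOCKS requests on that port for non-.onion destinations are refused, so it restricts all traffic on the port rather than only subresource loads.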

Trac:
Status: new to needs_information

Trac:
Keywords: TorBrowserTeam202001 deleted, TorBrowserTeam202002 added
Sponsor: Sponsor27 to Sponsor27-can

Deferring until June 2020

Trac:
Keywords: TorBrowserTeam202002 deleted, TorBrowserTeam202006 added

changed time estimate to 16h

mentioned in issue #32464 (moved)

moved to tpo/applications/tor-browser#28174 (closed)
