circuit_is_acceptable is slow due to IP and fingerprint parsing

changed milestone to %Tor: 0.4.1.x-final

added 032-unreached component::core tor/tor milestone::Tor: 0.4.1.x-final owner::neel parent::30221 points::1 priority::medium resolution::fixed reviewer::mikeperry severity::normal status::closed tor-hs type::defect version::tor 0.2.8.12 labels

Replying to teor:

2.10 min 100.0% 6.26 s circuit_is_acceptable 58.93 s 46.7% 1.05 s tor_addr_parse

Parsing IP addresses from strings out of node descriptors

To be precise, it looks like this one is from

    const int family = tor_addr_parse(&addr, conn->socks_request->address);

which is actually parsing IP addresses from strings in the socks request, not the node descriptor.

For how that code came to be that way, see commit 2b22c0ae from #7582 (moved).

Replying to arma:

{{{ const int family = tor_addr_parse(&addr, conn->socks_request->address); }}}

It looks like we could pretty easily only do that tor_addr_parse call in the two places below it where we would actually use the value of family -- that is,

(a) for a one-hop circuit where we've set chosen_exit_name and we don't have a digest for the exit, which should be rare if it ever happens, and
(b) when origin_circ->prepend_policy is set, meaning we've already received a surprising exit policy failure on that circuit and called adjust_exit_policy_from_exitpolicy_failure() on it.

Replying to teor:

43.76 s 34.7% 1.30 s connection_ap_can_use_exit 39.51 s 31.3% 574.00 ms node_get_by_nickname 38.93 s 30.9% 764.00 ms node_get_by_hex_id

This case happens because they're hidden service descriptor lookups, so they're general (exit) streams yet they have conn->chosen_exit_name set.

That's a surprising amount of time in hex_digest_nickname_decode(), but I guess with all the strlcpy's, etc, that it does, it shouldn't be that surprising.

You're right that it's inefficient here, since the logic is "consider a given circuit, and then reparse what exit node this conn says it's for, and then see if that circuit ends at that exit node", and in theory we don't need to keep reparsing. I would be tempted to try to take a step farther back though and figure out if there's a way to call the broader "how's it going, are any streams ready to be attached to circuits yet" function more sparingly.

I added a hs-intro chutney network in #22599 (moved) that might be a test for this. It currently doesn't verify (see #22600 (moved)), so it might be a chutney test for this issue.

Mark a large number of tickets that I do not think we will do for 0.3.2.

Trac:
Keywords: N/A deleted, 032-unreached added
Milestone: Tor: 0.3.2.x-final to Tor: unspecified

This probably blocks #15251 (moved)

Trac:
Parent: N/A to #15251 (moved)

Trac:
Parent: #15251 (moved) to #30221 (moved)

Trac:
Owner: N/A to neel
Cc: N/A to neel
Status: new to assigned

PR is here: https://github.com/torproject/tor/pull/1004

The only changes I have done is calling tor_addr_parse() only where it is needed as arma said in Comment 2.

Keep in mind that AppVeyor failed, but it is not a build error as much as it is that AppVeyor 64-bit was unable to fetch the Git repo:

Build started
git clone -q --depth=50 https://github.com/torproject/tor.git C:\projects\tor
fatal: unable to access 'https://github.com/torproject/tor.git/': Failed to connect to github.com port 443: Timed out
Command exited with code 128
C:\Python27\python.exe %APPVEYOR_BUILD_FOLDER%\scripts\test\appveyor-irc-notify.py irc.oftc.net:6697 tor-ci failure
C:\Python27\python.exe: can't open file 'C:\projects\tor\scripts\test\appveyor-irc-notify.py': [Errno 2] No such file or directory
Command exited with code 2

Source: https://ci.appveyor.com/project/torproject/tor/builds/24367946/job/xu0uiwp0h847t6nu

Trac:
Status: assigned to needs_review

Trac:
Reviewer: N/A to mikeperry

Simple code relocation. Looks good to me. Thanks, neel!

Trac:
Status: needs_review to merge_ready

Trac:
Milestone: Tor: unspecified to Tor: 0.4.1.x-final

Looks okay to me too. Merging this.

Closing it too, since we have other tickets for profile-driven improvements to onion service performance.

Trac:
Resolution: N/A to fixed
Status: merge_ready to closed

closed

changed time estimate to 8h

mentioned in issue #22600 (moved)

mentioned in issue #30221 (moved)

mentioned in issue tpo/core/chutney#22600 (closed)

circuit_is_acceptable is slow due to IP and fingerprint parsing

Child items 0

Activity