Unfortunately, this logic (in connection_or_set_canonical()) is kind of a mess. Relays and clients are treated the same, and client connections are also kept alive for an additional hour by predictive circuit building even when otherwise idle, whereas relay connections are not.
If we treat relays and clients separately for this timeout, we could reduce total client keep-alive time significantly (down to 30 minutes or so) while significantly increasing relay connection lifetime. This should reduce total connection counts on relays and improve defenses against Torscan.
This is also needed to put reasonable bounds on padding overhead in #16861 for mobile clients. Furthermore, aside from the operators running middle relays behind junky home routers who will complain about connection overflow, a better-connected Tor network is a good idea for many reasons (not just Torscan).
Ok, I think I want to combine CircuitIdleTimeout and PredictedPortsRelevanceTime into a single option (call it CircuitsAvailableTimeout?) and also randomize the value by some range when it is used.
Where CircuitIdleTimeout is currently used, I would sample a random timeout value on circuit creation and store it in origin_circuit_t. Where PredictedPortsRelevanceTime is used, I think the right thing to do is to sample a new value whenever the list of predicted ports is empty.
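Roughly, the per-circuit sampling could look something like the following sketch. The helper name, the ±50% range, and the use of a CircuitsAvailableTimeout field are assumptions for illustration, not the actual branch.

```c
/* Sketch only: sample a randomized timeout, in seconds, so that idle
 * circuits don't all expire in lockstep. The [base/2, 3*base/2] range
 * is illustrative. The sampled value would be stored on the
 * origin_circuit_t at creation time. */
static int
sample_circuits_available_timeout(const or_options_t *options)
{
  const int base = options->CircuitsAvailableTimeout;
  /* crypto_rand_int(n) returns a uniform value in [0, n). */
  return base/2 + crypto_rand_int(base + 1);
}
```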
For the TLS connection timeout, I want to explicitly separate canonical relay connections from client connections and non-canonical relay connections, and make the relay connection timeout be on the order of an hour (randomized +/- 25%) and be controlled by a consensus parameter. The client TLS connection timeout can be much shorter, since client TLS lifespan will be governed primarily by circuit activity (which will be controlled via CircuitsAvailableTimeout).
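As a rough illustration of that split (the consensus parameter name, the constants, and the helper are assumptions here, not the actual patch):

```c
/* Sketch only: pick a TLS connection timeout, in seconds, depending on
 * whether this is a canonical relay-to-relay connection. */
static int
get_conn_timeout(int is_canonical_relay_conn)
{
  if (is_canonical_relay_conn) {
    /* Canonical relay connections: ~1 hour, taken from a consensus
     * parameter and randomized +/- 25% so relays don't churn in sync. */
    const int32_t base =
      networkstatus_get_param(NULL, "nf_conntimeout_relays",
                              60*60 /* default */,
                              60 /* min */, 7*24*60*60 /* max */);
    return 3*base/4 + crypto_rand_int(base/2 + 1);
  } else {
    /* Clients and non-canonical connections: much shorter, since their
     * lifetime is governed mainly by circuit activity anyway. */
    const int base = 30*60; /* 30 minutes, illustrative */
    return base/2 + crypto_rand_int(base + 1);
  }
}
```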
With these two sets of changes, it will be much easier to control how long TLS connections live (and thus more easily control network activity and padding), for both relays and clients.
I've tested it a bit in chutney and it has unit tests. I'm posting it now for a quick review of any major refactoring/structural issues before I do more involved chutney/live network testing.
I like the change from port 80 as the default predicted port to port 443.
In options_validate:
Should we put minimum and maximum values on CircuitsAvailableTimeout, like the previous code for PredictedPortsRelevanceTime?
I think the only reason we had minimum and maximum values here was concern about a discrepancy between PredictedPortsRelevanceTime and CircuitIdleTimeout. CircuitIdleTimeout never had any ranges or limits. At least, that was my read of https://trac.torproject.org/projects/tor/ticket/9176#comment:7.
A nitpick:
In channelpadding_get_channel_idle_timeout:
I think server_mode() is the standard way of checking options->BridgeRelay || !options->ORPort_set. It does a few more checks, as far as I recall.
Ok, I fixed this and another instance in fixup commits atop netflow_padding-v4.
The documentation for channelpadding_get_channel_idle_timeout should describe units. It's seconds, right?
The client-or-noncanonical case uses magic numbers. Should it use a network parameter instead, like the other cases?
Let's document what a "circuit idle timeout" is.
How come this one doesn't look at any options like channelpadding_get_circuits_available_timeout does? Should it be looking at CircuitsAvailableTimeout?
channelpadding_get_circuits_available_timeout():
Let's also document in the function comment what exactly this timeout controls.
predicted_ports_prediction_time_remaining():
time_t differences are not guaranteed to fit into an int.
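For what it's worth, a clamped conversion is one way to handle that (sketch only; the helper name is made up):

```c
#include <limits.h>

/* Sketch only: clamp a time_t difference into an int number of seconds
 * instead of silently truncating it. */
static int
time_remaining_as_int(time_t expires, time_t now)
{
  if (expires <= now)
    return 0;
  if (expires - now > INT_MAX)
    return INT_MAX;
  return (int)(expires - now);
}
```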
General stuff:
Should we be looking at monotonic time for any of this?
These issues should all be fixed in e1c57775ed8a09a8767bf4d74db94d04c9fe1659 in mikeperry/netflow_padding-v4_squashed+rebased. I opted to explicitly check for time_t overflow/underflow and associated clock jumps rather than use monotonic time. I don't like the idea of waiting for the clock to catch up with itself here, either. I'd rather reset the timeout in that case.
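The clock-jump handling described there would be something in this spirit (sketch only; the field and helper names are invented for illustration):

```c
/* Sketch only: if the wall clock has jumped backwards (or the stored
 * timestamp is somehow in the future), restart the timeout rather than
 * waiting for the clock to catch up with itself. */
static int
padding_timeout_elapsed(time_t *last_activity, time_t timeout, time_t now)
{
  if (now < *last_activity) {
    /* Clock jumped backwards: reset and start the timeout over. */
    *last_activity = now;
    return 0;
  }
  return (now - *last_activity) >= timeout;
}
```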
I will not be getting these revised and reviewed this week. I hold out hope for May. Sorry, Mike. Please let me know whether you want to revise them wrt my handles/timing patches, or whether I should. I'm happy either way.
My current understanding here is that Mike means to revise this branch based on other merges we're doing. Moving these to needs_revision in the meantime. Please let me know if I'm incorrect.
Alright. I switched the code over to using the new handle, monotonic timer, and timer wheel abstractions. All unit tests pass without leaks from this code (though the unit tests have grown new memory leaks of their own).
The branch is mikeperry/netflow_padding-v6; the commit specific to this bug is 0e709402d9fe5cb48188bf65e335dcdedadb268b.
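For context, the timer-wheel abstraction that branch switches to is used roughly like this (a sketch under my reading of the timers API, not the code from that commit; the channel handling is elided):

```c
#include "compat_time.h" /* monotime_t */
#include "timers.h"      /* tor_timer_t, timer_new, timer_schedule */

/* Sketch only: a one-shot padding timer scheduled on Tor's timer wheel. */
static void
padding_cb(tor_timer_t *timer, void *arg, const monotime_t *now)
{
  (void)timer; (void)now;
  /* ... send a padding cell on the channel passed in as arg ... */
  (void)arg;
}

static tor_timer_t *
schedule_padding(void *chan, int delay_msec)
{
  struct timeval tv = { delay_msec / 1000, (delay_msec % 1000) * 1000 };
  tor_timer_t *timer = timer_new(padding_cb, chan);
  timer_schedule(timer, &tv);
  return timer;
}
```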
Trac: Milestone: Tor: 0.2.??? to Tor: 0.2.9.x-final; Status: needs_revision to needs_review
Okay, review done. I have probably misunderstood a few important things. Please feel free to focus on correcting my misunderstandings here, so that I can say smarter things about the code.