Make bootstrapping clients wait before trying an authority

changed milestone to %Tor: 0.3.2.x-final

added actualpoints::0.3 component::core tor/tor milestone::Tor: 0.3.2.x-final owner::teor points::2 priority::medium resolution::fixed severity::normal status::closed tor-bootstrap type::enhancement version::tor 0.2.8.1-alpha labels

Trac:
Points: N/A to small/medium

No need for this in 028.

Trac:
Milestone: Tor: 0.2.8.x-final to Tor: 0.2.9.x-final

small/medium => 2.

Trac:
Points: small/medium to 2

Trac:
Keywords: N/A deleted, isaremoved added
Milestone: Tor: 0.2.9.x-final to Tor: 0.2.???

Milestone renamed

Trac:
Milestone: Tor: 0.2.??? to Tor: 0.3.???

Finally admitting that 0.3.??? was a euphemism for Tor: unspecified all along.

Trac:
Milestone: Tor: 0.3.??? to Tor: unspecified
Keywords: N/A deleted, tor-03-unspecified-201612 added

Remove an old triaging keyword.

Trac:
Keywords: tor-03-unspecified-201612 deleted, N/A added

Trac:
Keywords: isaremoved deleted, N/A added

This bug means that bootstrapping clients try an authority and a fallback immediately, then try another fallback 1-3 seconds later, and another 4-9 seconds after that. The intention was to try an authority only after we'd tried the first 2-3 fallbacks.
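To make the intended ordering concrete, here is a minimal sketch that counts how many attempts each schedule has released at a given time. The delay values and the `sketch_` names are assumptions chosen for illustration; they are not taken from tor's source or its configured defaults.

```c
#include <assert.h>
#include <stddef.h>

/* Illustrative offsets in seconds since bootstrap started.
 * These values are assumptions for the sketch, not tor's defaults. */
static const int sketch_fallback_delays[] = { 0, 1, 4 };
static const int sketch_authority_delays[] = { 10 };

/* Count how many scheduled attempts are due at or before `elapsed`
 * seconds after bootstrap started. */
static int
sketch_attempts_due(const int *delays, size_t n, int elapsed)
{
  assert(delays != NULL);
  int due = 0;
  for (size_t i = 0; i < n; i++) {
    if (delays[i] <= elapsed)
      due++;
  }
  return due;
}
```

With these assumed values, one fallback attempt and no authority attempt is due at time 0, and the authority attempt only becomes due after all three fallback attempts have been released.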

(arma discovered this was happening in #22400 (moved).)

Please see my branch bug17750_029 for a general fix for this. It could go all the way back to 0.2.9 if we wanted it to, but we should definitely test it in master first.

This bug could only ever have affected ClientBootstrapConsensusAuthorityDownloadSchedule and TestingBridgeDownloadSchedule, because every other schedule starts with 0 (the default).

And TestingBridgeDownloadSchedule is already initialised correctly.

I opened #22403 (moved) as a follow up for those cases where we directly access a download_status_t's fields rather than using an accessor function.
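As a rough illustration of why only schedules with a non-zero first entry are affected, the sketch below uses a toy status struct whose zeroed state looks "ready now". The `toy_` types and functions are hypothetical stand-ins written for this example; they are not tor's download_status_t or its real accessors, and the lazy reset shown is just one possible way to guard against the problem.

```c
#include <assert.h>
#include <stddef.h>
#include <time.h>

/* Toy stand-in for a download status; not tor's download_status_t. */
typedef struct {
  time_t next_attempt_at; /* 0 means "never reset" in this sketch */
  const int *schedule;    /* delays in seconds */
  size_t schedule_len;
} toy_dl_status_t;

/* Apply the first schedule entry, as an explicit reset would. */
static void
toy_dl_reset(toy_dl_status_t *dls, time_t now)
{
  assert(dls != NULL && dls->schedule_len > 0);
  dls->next_attempt_at = now + dls->schedule[0];
}

/* Unchecked read: a zeroed status looks ready whatever the schedule says. */
static int
toy_dl_is_ready_unchecked(const toy_dl_status_t *dls, time_t now)
{
  return dls->next_attempt_at <= now;
}

/* Accessor that lazily resets a status that was never initialised,
 * so the first schedule entry is always honoured. */
static int
toy_dl_is_ready(toy_dl_status_t *dls, time_t now)
{
  if (dls->next_attempt_at == 0)
    toy_dl_reset(dls, now);
  return dls->next_attempt_at <= now;
}
```

For a schedule whose first entry is 0, both functions agree, which is consistent with the observation that other schedules were unaffected. For a schedule that starts at, say, 10 seconds, the unchecked read reports "ready" immediately while the accessor waits.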

Trac:
Summary: "A download_status_t can be used before calling download_status_reset on it" to "Make bootstrapping clients wait before trying an authority"
Reviewer: N/A to N/A
Milestone: Tor: unspecified to Tor: 0.3.1.x-final
Keywords: review, easy deleted, tor-bootstrap added
Actualpoints: N/A to 0.3
Status: new to needs_review
Version: N/A to Tor: 0.2.8.1-alpha

I'm not so good with this code, but I can confirm that the patch seems to have the behavior that teor describes. (The code used to launch two consensus fetches in parallel, one to an authority and one to a guard or fallback. Now it launches only one, not to an authority, and 1 second later if needed it launches another, also not to an authority. I didn't try triggering stuff after that, but hey, what could go wrong. :)

Agreed it is smart to put this patch into 0.3.1. In fact, we might not need/want to backport it before that -- since it's the sort of thing where it will be a long time until we realize our mistake, if there is one.

merged to master; marking for possible backport.

Trac:
Milestone: Tor: 0.3.1.x-final to Tor: 0.3.0.x-final
Keywords: tor-bootstrap deleted, tor-bootstrap 030-backport 029-backport added

I spoke too soon; this patch makes the unit tests fail with:

dir/download_status_increment: 
  FAIL src/test/test_dir.c:3809: assert(mock_get_options_calls == 0)
  [download_status_increment FAILED]

Not merging yet.

Trac:
Milestone: Tor: 0.3.0.x-final to Tor: 0.3.1.x-final
Status: needs_review to needs_revision

Trac:
Milestone: Tor: 0.3.1.x-final to Tor: 0.3.2.x-final

Please see my branch bug17750_029, which also adds some regression tests for this bug and #20534 (moved).

I marked this for 0.3.1 backport. If we do backport, we only need to backport the squashed commit "Make clients try fallbacks before authorities".

And we might not want to backport at all, given the risk of bugs in older versions. The only thing this bug causes is more load on the directory authorities.

Trac:
Keywords: tor-bootstrap 030-backport 029-backport deleted, tor-bootstrap 031-backport 030-backport 029-backport added
Status: needs_revision to needs_review

Creating review-group-20

Trac:
Keywords: tor-bootstrap 031-backport 030-backport 029-backport deleted, 031-backport, tor-bootstrap, 029-backport, 030-backport, review-group-20 added

setting owner

Trac:
Owner: N/A to teor
Status: needs_review to assigned

Trac:
Status: assigned to needs_review

Squashed as "bug17750_029_squashed" and merged to master. Let's let it cook there for a little while before we decide about a backport; it's a bit large for stable or post-freeze.

Trac:
Milestone: Tor: 0.3.2.x-final to Tor: 0.3.1.x-final
Keywords: review-group-20 deleted, N/A added
Status: needs_review to merge_ready
