Opened 8 years ago

Closed 14 months ago

#1854 closed task (wontfix)

Investigate raising the minimum bandwidth for getting the Fast flag

Reported by: arma
Owned by: arma
Priority: Medium
Milestone:
Component: Metrics/Analysis
Version:
Severity:
Keywords: performance loadbalancing
Cc: karsten, gsathya, asn, robgjansen, aaron.m.johnson@…, iang, mo, adrelanos@…, zen@…
Actual Points:
Parent ID:
Points:
Reviewer:
Sponsor:

Description (last modified by arma)

Mike's performance work has shown that the smaller relays -- for example, the ones that set BandwidthRate and BandwidthBurst to 20KB/s -- are never good news to have in your circuit.

Damon McCoy's HotPETs 2010 paper showed in more detail how you could improve performance by dumping the bottom X% of the relays.

Of course, there's a network effect issue here: clearly you get better performance if you're the only one ignoring the slower relays.

But I think there's something to this even when everybody is doing it. Our load balancing makes a 500KB/s relay 10x more likely to be used than a 50KB/s relay, but given a whole lot of users building paths, the 50KB/s relay will get overloaded more often and show worse characteristics when overloaded than the 500KB/s relay -- in large part because we're load balancing by circuit rather than by byte.
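To make the load-balancing point concrete, here is a minimal sketch of weight-proportional selection (not Tor's actual path-selection code): with weights 500 and 50, the faster relay is chosen about 10x as often, yet each selection assigns it a whole circuit regardless of how many bytes that circuit will carry.

{{{
#!python
import random

# Toy consensus: (nickname, weight). A relay is picked with
# probability proportional to its weight, so "fast" is chosen
# roughly 10x as often as "slow".
relays = [("fast", 500), ("slow", 50)]

def pick_relay(relays):
    names, weights = zip(*relays)
    return random.choices(names, weights=weights, k=1)[0]

counts = {"fast": 0, "slow": 0}
for _ in range(10000):
    counts[pick_relay(relays)] += 1
print(counts)  # roughly {'fast': 9090, 'slow': 910}
}}}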

So I'd like to do a series of performance experiments where the directory authorities take away the Fast flag from everybody whose consensus bandwidth is under X.

Ideally we'd do it while the network is under a variety of load conditions (removing capacity from the network when there's a lot of load seems like it would hurt us more, but then, using overloaded relays when there's a lot of load could hurt us a lot too).

This could even be a research task that we try to give to a research group that wants to work on simulated Tor network performance. But I think that's a separate project.

Along with the performance simulations we need to consider the anonymity implications of reducing the diversity of relays. How much anonymity do we lose if we treat anonymity as entropy? How much do we lose if we consider the location-based anonymity metrics of Feamster or Edman? Ideally we'd figure out some way to compare performance and anonymity so we can decide if we like various points in the tradeoff space. Really, we should be working on this piece already to analyze whether Mike's bwauth algorithm is worth it.
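For reference, "anonymity as entropy" here presumably means the standard degree-of-anonymity formulation (an assumption; the ticket doesn't pin down the exact metric):

{{{
H = -sum_{i=1..n} p_i * log2(p_i)    # entropy of the relay-selection distribution
d = H / H_max = H / log2(n)          # degree of anonymity; 1.0 means uniform selection
}}}

where p_i is the probability of a client selecting relay i.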

Finally, should we consider keeping slow relays in the network if they have nice exit policies?

Relays that are too slow should be encouraged to become bridges. Even better, we should help people recognize when they ought to start out as a bridge rather than trying to be a relay.

Child Tickets

Attachments (10)

degree-of-anonymity-min-cw-2012-09-18.png (59.0 KB) - added by karsten 6 years ago.
entropy-min-cw-2012-09-18.png (65.2 KB) - added by karsten 6 years ago.
0001-store-all-server-descs-in-memory.patch (2.9 KB) - added by gsathya 6 years ago.
linf-min-adv-bw-2012-11-26.pdf (135.1 KB) - added by karsten 6 years ago.
linf-min-adv-bw-2012-11-26-a.pdf (143.7 KB) - added by karsten 6 years ago.
linf-min-adv-bw-2012-11-26-b.pdf (158.2 KB) - added by karsten 6 years ago.
linf-min-adv-bw-2012-11-26-c.pdf (199.9 KB) - added by karsten 6 years ago.
linf-min-adv-bw-2012-11-26-c.2.pdf (199.9 KB) - added by karsten 6 years ago.
linf-min-adv-bw-2012-11-27.pdf (342.6 KB) - added by karsten 6 years ago.
linf-min-adv-bw-2012-11-27-a.pdf (357.3 KB) - added by karsten 6 years ago.


Change History (70)

comment:1 Changed 8 years ago by Sebastian

What about still using them for directory information? Seems like once we have microdescriptors out, they could cache these and distribute the new descriptors to clients (though bootstrapping off them might be a bit painful).

comment:2 Changed 8 years ago by arma

Description: modified (diff)

comment:3 in reply to:  description ; Changed 8 years ago by arma

Component: Tor Relay → Metrics
Milestone: Deliverable-Mar2011
Owner: set to karsten

Replying to arma:

So I'd like to do a series of performance experiments where the directory authorities take away the Running flag from everybody whose consensus bandwidth is under X.

A cleaner way to run the experiment would be to take away their Fast flag. I could imagine putting in a consensus param that authorities look at (off by default), so we can easily modify the values over time and see how the torperf output changes.

We should also ponder if we really mean consensus bandwidth here, or if we mean relay descriptor bandwidth. Currently the Fast flag is assigned based on relay descriptor bandwidth.
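For reference, such a parameter would be set in the directory authorities' torrc via ConsensusParams; the parameter name below is the one #3946 later introduced, and the value and units are illustrative assumptions:

{{{
# In each directory authority's torrc (hypothetical value; units
# assumed to be bytes/second):
ConsensusParams FastFlagMinThreshold=102400
}}}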

comment:4 Changed 8 years ago by arma

Component: Metrics → Analysis

comment:5 Changed 7 years ago by karsten

Summary: Project: Raise the minimum bandwidth for being a relay? → Investigate raising the minimum bandwidth for being a relay
Type: enhancement → task

This sounds like an analysis task among many others we should work on. Removing the "Project: " part from the summary. If this is really a project, please change ticket type to "project."

comment:6 in reply to:  3 Changed 7 years ago by arma

Replying to arma:

A cleaner way to run the experiment would be to take away their Fast flag. I could imagine putting in a consensus param that authorities look at (off by default), so we can easily modify the values over time and see how the torperf output changes.

Added as #3946.

comment:7 Changed 7 years ago by arma

Description: modified (diff)
Keywords: performance added
Summary: Investigate raising the minimum bandwidth for being a relay → Investigate raising the minimum bandwidth for getting the Fast flag

Modified this task to be more clearly about analyzing the effects of taking the Fast flag away.

comment:8 Changed 7 years ago by arma

Keywords: loadbalancing added

comment:9 Changed 7 years ago by karsten

Owner: changed from karsten to arma
Status: new → assigned

Why is this ticket assigned to me? Am I supposed to do something here? If so, please tell me and re-assign to me. Assigning to ticket reporter for now.

comment:10 Changed 6 years ago by arma

Cc: karsten gsathya asn robgjansen added

Based on Rob's CSET paper, I am now less optimistic that we can answer this question with simulations: messing with what relays make up a test network is among the least solved pieces of simulating Tor networks.

So I think we should proceed in two directions:

A) We should get gsathya or asn or whoever to confirm that dropping relays with bandwidth less than X doesn't change any of the diversity metrics much (because they're never picked often enough to matter). What's the largest X for which you can reasonably say that?

B) Then we should do an actual performance experiment on the live Tor network, using the FastFlagMinThreshold consensus param added in #3946, and see what we see on torperf.

Judging the performance experiment on the live Tor network will be especially messy because there are so many variables, but I think despite that it may still be the best route.

comment:11 in reply to:  10 Changed 6 years ago by robgjansen

Replying to arma:

Based on Rob's CSET paper, I am now less optimistic that we can answer this question with simulations: messing with what relays make up a test network is among the least solved pieces of simulating Tor networks.

Unless we use rpw's machine to simulate all of the existing relays. Then we don't have to worry about downsampling problems. ;)

comment:12 in reply to:  10 ; Changed 6 years ago by karsten

Replying to arma:

A) We should get gsathya or asn or whoever to confirm that dropping relays with bandwidth less than X doesn't change any of the diversity metrics much (because they're never picked often enough to matter).

So, is this ticket about dropping relays from the consensus, or taking away their Fast flag? I can see how we can graph the former, but I'm not sure about the latter.

What's the largest X for which you can reasonably say that?

Sounds like we want #6232 graphs with the minimum bandwidth to keep relays in the consensus on the X axis. For example, a graph similar to https://trac.torproject.org/projects/tor/attachment/ticket/6232/entropy-august.png would have its blue lines decreasing steadily, because we're taking away relays, but the red lines would stay on the same level and only drop in the last third or so, because we start taking away relays from the slowest ones.

Is that what you have in mind here?

gsathya, asn, is this something you want to look into?

comment:13 in reply to:  12 ; Changed 6 years ago by gsathya

Replying to karsten:

gsathya, asn, is this something you want to look into?

Yep. What do you want the output of the script to look like?

comment:14 in reply to:  12 ; Changed 6 years ago by arma

Replying to karsten:

So, is this ticket about dropping relays from the consensus, or taking away their Fast flag? I can see how we can graph the former, but I'm not sure about the latter.

Shouldn't matter much. I guess that leads to: do your consensus diversity analysis tools consider the Fast flag? They probably should, since clients do.

comment:15 in reply to:  12 ; Changed 6 years ago by arma

Replying to karsten:

Sounds like we want #6232 graphs with the minimum bandwidth to keep relays in the consensus on the X axis. For example, a graph similar to https://trac.torproject.org/projects/tor/attachment/ticket/6232/entropy-august.png would have its blue lines decreasing steadily, because we're taking away relays, but the red lines would stay on the same level and only drop in the last third or so, because we start taking away relays from the slowest ones.

Is that what you have in mind here?

Sounds plausible. One nice way of looking at it might be: what's the highest bandwidth cutoff such that the red lines in your graph lose 1% or less? Then the same question for 2%, 3%, 4%, 5%.

Of course, that needs a definition of what it means for two lines to differ. We might try defining the difference as the point x where f1(x) and f2(x) differ the most. If there's noise, we might define it as the 10th percentile of these points x, which would let us say "90% of the time there was at most a 1% difference."
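A minimal sketch of both definitions, assuming f1 and f2 are sampled on a common grid of x values; the "10th percentile of these points" is interpreted here as the 90th percentile of the pointwise differences (discarding the noisiest 10%):

{{{
#!python
import numpy as np

def max_difference(f1, f2):
    # Worst-case pointwise gap between the two lines.
    return np.max(np.abs(f1 - f2))

def robust_difference(f1, f2):
    # "90% of the time there was at most this much difference":
    # the 90th percentile of the pointwise gaps.
    return np.percentile(np.abs(f1 - f2), 90)
}}}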

comment:16 Changed 6 years ago by arma

s/as the point x/at the point x/ and s/as the 10th/at the 10th/

comment:17 Changed 6 years ago by arma

Or you could bust out real stats and use that, if you prefer. :)

comment:18 in reply to:  13 ; Changed 6 years ago by karsten

Replying to gsathya:

Replying to karsten:

gsathya, asn, is this something you want to look into?

Yep. What do you want the output of the script to look like?

Cool! How about a format similar to #6232?

validafter,min_cw,relays,all,max_all,exit,max_exit,guard,max_guard,country,max_country,as,max_as
2012-09-10 01:00:00,1,3040,7.44,11.26,5.79,9.73,6.12,8.99,3.23,6.26,5.44,9.57
2012-09-10 01:00:00,2,[...]

In that output, min_cw is the minimum consensus weight of relays that we keep in the consensus. That value would start at the smallest consensus weight in the network, and we'd calculate entropy values for all relays in the consensus. Then we'd raise the minimum to the second-smallest value in the network, throw out all relays below that value, and compute new entropy values. Continue until we're at the relay with highest consensus weight.

The first column, validafter, is the consensus valid-after time. The third column, relays, contains the number of relays left. The other columns (all, max_all, etc.) are defined similar to #6232.

Roger, please note that I assumed you want to cut out relays based on consensus weight, not advertised bandwidth. Please correct me if that assumption is wrong. (Writing the analysis script for consensus weights is probably easier, so we could later extend it to advertised bandwidth if required.)
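A minimal sketch of that sweep, covering only the validafter, min_cw, relays, and all columns (the exit/guard/country/AS columns would follow the #6232 definitions; this is not the pyentropy.py code itself):

{{{
#!python
import math

def entropy(weights):
    total = sum(weights)
    return -sum((w / total) * math.log(w / total, 2)
                for w in weights if w > 0)

def min_cw_sweep(validafter, relays):
    # relays: {fingerprint: consensus_weight}
    rows = []
    for min_cw in sorted(set(relays.values())):
        kept = [w for w in relays.values() if w >= min_cw]
        rows.append("%s,%d,%d,%.2f" %
                    (validafter, min_cw, len(kept), entropy(kept)))
    return rows
}}}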

comment:19 in reply to:  14 Changed 6 years ago by karsten

Replying to arma:

Replying to karsten:

So, is this ticket about dropping relays from the consensus, or taking away their Fast flag? I can see how we can graph the former, but I'm not sure about the latter.

Shouldn't matter much.

Really?

I guess that leads to: do your consensus diversity analysis tools consider the Fast flag? They probably should, since clients do.

Our tools don't consider the Fast flag. They're only based on relays' consensus weights, their Exit and Guard flags, and the bandwidth-weights line.

Simulating what clients would do, including considering the Fast flag, is almost impossible. Which relays a client picks depends on too many variables, including what other relays are already in the circuit, family settings, and same /16 networks, for us to model reasonably. If we want results this precise, we'll have to run simulations with the actual Tor code.

comment:20 in reply to:  15 Changed 6 years ago by karsten

Replying to arma:

Sounds plausible. One nice way of looking at it might be: what's the highest bandwidth cutoff such that the red lines in your graph lose 1% or less? Then the same question for 2%, 3%, 4%, 5%.

Sure, that's something that the CDF I suggested above should show. We could put percent values on the y axis and start with current diversity at 100%. Then you could read what x value corresponds to 99% (98%, ...).

Of course, that needs a definition of what it means for two lines to differ. We might try defining the difference as the point x where f1(x) and f2(x) differ the most. If there's noise, we might define it as the 10th percentile of these points x, which would let us say "90% of the time there was at most a 1% difference."

Ah, my idea was to start with a single consensus. Combining multiple consensuses would be step 2. (The data format I suggested above should support the graphs you suggest here.)

comment:21 in reply to:  18 Changed 6 years ago by gsathya

Cc: aaron.m.johnson@… added
Status: assigned → needs_review

Replying to karsten:

Cool! How about a format similar to #6232?

validafter,min_cw,relays,all,max_all,exit,max_exit,guard,max_guard,country,max_country,as,max_as
2012-09-10 01:00:00,1,3040,7.44,11.26,5.79,9.73,6.12,8.99,3.23,6.26,5.44,9.57
2012-09-10 01:00:00,2,[...]

In that output, min_cw is the minimum consensus weight of relays that we keep in the consensus. That value would start at the smallest consensus weight in the network, and we'd calculate entropy values for all relays in the consensus. Then we'd raise the minimum to the second-smallest value in the network, throw out all relays below that value, and compute new entropy values. Continue until we're at the relay with highest consensus weight.

The first column, validafter, is the consensus valid-after time. The third column, relays, contains the number of relays left. The other columns (all, max_all, etc.) are defined similar to #6232.

Please review my bug_1854 branch! Thanks!

There seem to be quite a few relays with "None" bandwidth. Should we consider such relays? (They count when calculating the number of relays but don't contribute any bandwidth.)

comment:22 in reply to:  18 ; Changed 6 years ago by arma

Replying to karsten:

Roger, please note that I assumed you want to cut out relays based on consensus weight, not advertised bandwidth. Please correct me if that assumption is wrong. (Writing the analysis script for consensus weights is probably easier, so we could later extend it to advertised bandwidth if required.)

The Fast and Guard flags look at descriptor bandwidth, not consensus bandwidth. So yes, eventually we should do a version of this analysis that looks at descriptor bandwidth.

comment:23 Changed 6 years ago by arma

Cc: iang added

Changed 6 years ago by karsten

Changed 6 years ago by karsten

comment:24 Changed 6 years ago by karsten

Replying to gsathya:

Please review my bug_1854 branch! Thanks!

Fixed two bugs, but otherwise looks good. Merged.

Please see the output graphs here and here.

There seem to be quite a few relays with "None" bandwidth. Should we consider such relays? (They count when calculating the number of relays but don't contribute any bandwidth.)

I think this was caused by your code looking at position-dependent consensus weights, not raw consensus weights. Should be fixed.

comment:25 in reply to:  22 ; Changed 6 years ago by karsten

Status: needs_review → needs_revision

Replying to arma:

Replying to karsten:

Roger, please note that I assumed you want to cut out relays based on consensus weight, not advertised bandwidth. Please correct me if that assumption is wrong. (Writing the analysis script for consensus weights is probably easier, so we could later extend it to advertised bandwidth if required.)

The Fast and Guard flags look at descriptor bandwidth, not consensus bandwidth. So yes, eventually we should do a version of this analysis that looks at descriptor bandwidth.

A version of this analysis that looks at descriptor bandwidth would sort relays by advertised bandwidth and cut off the slowest relays based on that. In the graphs, the x axis would say "Minimum advertised bandwidth" instead of "Minimum consensus weight", and of course the lines might be slightly different. But everything else would remain the same, including how we calculate guard entropy for the "All guards" sub graph.

We'll mostly have to change router.bandwidth to router.advertised_bw a few times in the code. Shouldn't be too hard.

Sathya, do you want to look into this, or shall I?

comment:26 in reply to:  25 ; Changed 6 years ago by gsathya

Status: needs_revision → needs_review

Replying to karsten:

Sathya, do you want to look into this, or shall I?

Done. I just monkey-patched router.bandwidth to router.advertised_bw; I think this is fine for now.

comment:27 Changed 6 years ago by arma

I talked to Ian and Aaron a bit more about this analysis. What we'd like to see, for a given consensus, is a graph with bandwidth cutoff on the x axis and L_\inf on the y axis. L_\inf is the largest distance between the two probability distributions -- one being the probability distribution of which relay you'd pick from the pristine consensus, and the other the distribution in the modified consensus. "largest distance" means the element (i.e. relay) with the largest difference.

Then we should consider time: looking at C consensuses over the past year or something, for a given cutoff, we should graph the cdf of these C data points where each data point is the L_\inf of that consensus for that cutoff. The hope is that for some cutoffs, the cdf has very high area-under-the-curve.

comment:28 in reply to:  27 ; Changed 6 years ago by karsten

Status: needs_review → needs_revision

Replying to arma:

I talked to Ian and Aaron a bit more about this analysis. What we'd like to see, for a given consensus, is a graph with bandwidth cutoff on the x axis and L_\inf on the y axis. L_\inf is the largest distance between the two probability distributions -- one being the probability distribution of which relay you'd pick from the pristine consensus, and the other the distribution in the modified consensus. "largest distance" means the element (i.e. relay) with the largest difference.

Sounds doable. I'd say let's start with plain consensus weight fractions and postpone exit, guard, country, and AS probabilities until we have a better handle on this type of analysis.

A possible output file could look like this:

validafter,min_advbw,relays,linf
2012-09-10 01:00:00,1,3040,0.03553
2012-09-10 01:00:00,2,[...]

Here, validafter is the consensus valid-after time, min_advbw is the minimum advertised bandwidth of relays kept in the modified consensus, relays is the number of those relays, and linf is the largest difference between consensus weight fractions of all relays. The probability in the pristine consensus is always the consensus weight fraction. The probability in the modified consensus is 0 if the relay was excluded, or the consensus weight fraction relative to the new consensus weight sum (which is lower than the original consensus weight sum, because we cut out some relays). We'll want to compare probabilities of all relays, including those that we excluded, because they have non-zero probability in the modified consensus.
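A minimal sketch of that computation (not the pylinf.py code that was eventually merged); each relay is represented as an (advertised bandwidth, consensus weight) pair:

{{{
#!python
def linf(relays, min_advbw):
    # Pristine probabilities use the full consensus weight sum;
    # modified probabilities renormalize over the kept relays and
    # are 0 for excluded relays.
    total = float(sum(cw for _, cw in relays))
    kept = float(sum(cw for advbw, cw in relays if advbw >= min_advbw))
    worst = 0.0
    for advbw, cw in relays:
        pristine = cw / total
        modified = cw / kept if advbw >= min_advbw else 0.0
        worst = max(worst, abs(modified - pristine))
    return worst
}}}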

Then we should consider time: looking at C consensuses over the past year or something, for a given cutoff, we should graph the cdf of these C data points where each data point is the L_\inf of that consensus for that cutoff. The hope is that for some cutoffs, the cdf has very high area-under-the-curve.

Sure, we should be able to plot those graphs from the file format above.

Sathya, want to look into modifying pyentropy.py for the linf stuff?

comment:29 in reply to:  26 Changed 6 years ago by karsten

Replying to gsathya:

Done. I just monkey-patched router.bandwidth to router.advertised_bw; I think this is fine for now.

Hmm. I didn't look very closely, but I think this doesn't work. We'll want to exclude relays based on descriptor bandwidth but calculate the various metrics based on consensus weight. With your patch we're using descriptor bandwidth for everything.

This specific patch is probably moot, now that we're going to change the analysis from entropy values to L_\inf. But we'll want to have a similar patch for the L_\inf stuff, too.

comment:30 in reply to:  28 ; Changed 6 years ago by gsathya

Status: needs_revision → needs_review

Replying to karsten:

Here, validafter is the consensus valid-after time, min_advbw is the minimum advertised bandwidth of relays kept in the modified consensus, relays is the number of those relays, and linf is the largest difference between consensus weight fractions of all relays. The probability in the pristine consensus is always the consensus weight fraction. The probability in the modified consensus is 0 if the relay was excluded, or the consensus weight fraction relative to the new consensus weight sum (which is lower than the original consensus weight sum, because we cut out some relays). We'll want to compare probabilities of all relays, including those that we excluded, because they have non-zero probability in the modified consensus.

"The probability in the modified consensus is 0 if the relay was excluded," and "including those that we excluded, because they have non-zero probability in the modified consensus" seem to be contradicting?

Sathya, want to look into modifying pyentropy.py for the linf stuff?

I'm ignoring the probabilities of relays that we excluded because they have 0 probability. Please check my bug_1854_v2 branch. Thanks!

comment:31 in reply to:  30 ; Changed 6 years ago by karsten

Status: needs_review → needs_revision

Replying to gsathya:

"The probability in the modified consensus is 0 if the relay was excluded," and "including those that we excluded, because they have non-zero probability in the modified consensus" seem to be contradicting?

What I meant is "including those that we excluded, because they have non-zero probability in the pristine consensus". Sorry.

Sathya, want to look into modifying pyentropy.py for the linf stuff?

I'm ignoring the probabilities of relays that we excluded because they have 0 probability. Please check my bug_1854_v2 branch. Thanks!

Can you change the above? I just had a quick look, but I'd want to look closer once it's doing the thing that I think arma et al. had in mind.

And can you either remove pyentropy.py, or move your changes to pyentropy.py and remove pylinf.py, so that there's only the code file that we're actually using in the repository?

Thanks!

comment:32 in reply to:  31 Changed 6 years ago by gsathya

Status: needs_revision → needs_review

Replying to karsten:

Replying to gsathya:

"The probability in the modified consensus is 0 if the relay was excluded," and "including those that we excluded, because they have non-zero probability in the modified consensus" seem to be contradicting?

What I meant is "including those that we excluded, because they have non-zero probability in the pristine consensus". Sorry.

Sathya, want to look into modifying pyentropy.py for the linf stuff?

I'm ignoring the probabilities of relays that we excluded because they have 0 probability. Please check my bug_1854_v2 branch. Thanks!

Can you change the above? I just had a quick look, but I'd want to look closer once it's doing the thing that I think arma et al. had in mind.

Done

And can you either remove pyentropy.py, or move your changes to pyentropy.py and remove pylinf.py, so that there's only the code file that we're actually using in the repository?

Done

Please check my bug_1854_v2 branch. Thanks!

comment:33 Changed 6 years ago by karsten

Status: needs_review → new

Merged with a minor tweak that otherwise would bite us when evaluating the data.

That's part one of the analysis. Next steps are:

  • rewrite plot-entropy.R to visualize a single consensus,
  • run pylinf.py on, say, 1 year of consensuses (and 13 months of server descriptors, to be sure we have all server descriptors referenced from the consensuses),
  • generate graph data for L_\inf for a given min_adv_bw over time, and
  • visualize previously generated graph data, probably using R.

Want to look into one of these next steps?

comment:34 Changed 6 years ago by arma

Over the past few days, the minimum bandwidth for the Fast flag looks like it moved from 32KB/s up to 50KB/s and then back down. So maybe there is data to analyze even without explicitly doing the experiment. :)

Changed 6 years ago by gsathya

comment:35 Changed 6 years ago by gsathya

Status: new → needs_review

Some background from Karsten's email -

I usually take another approach for combining network statuses and server descriptors in an analysis: parse *all* server descriptors, extract the relevant parts, keep them in memory stored under their descriptor digest, parse consensuses, use server descriptor parts from memory. This is faster, because we only have to parse a server descriptor once, not every time it's referenced from a consensus, which can be 12 times or more. There's also the option to store intermediate results from parsing server descriptors in a temp file and only read that when re-running the analysis, which typically happens quite often. This approach is also more efficient, because we can parse server descriptors contained in tarballs without extracting them.

I've changed pylinf to be able to read a single tar file or a bunch of server descriptor tar files and store them in memory. I haven't had the chance to test it much, let me know if you find any bugs.
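A sketch of that parse-once approach using stem's DescriptorReader (attribute names follow stem's server descriptor API, but treat the exact fields as an assumption):

{{{
#!python
from stem.descriptor.reader import DescriptorReader

# Parse every server descriptor in the tarball once and index the
# advertised bandwidth by descriptor digest; consensus entries can
# then look descriptors up by digest instead of re-parsing them.
advertised_bw = {}
with DescriptorReader(["server-descriptors-2012-09.tar"]) as reader:
    for desc in reader:
        advertised_bw[desc.digest()] = min(desc.average_bandwidth,
                                           desc.observed_bandwidth)
}}}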

comment:36 Changed 6 years ago by karsten

Sounds good. Did the code produce meaningful output? I won't be able to review the code today, but I could try tomorrow or Friday. Knowing that the code probably works as expected would be good though. Thanks!

comment:37 in reply to:  36 ; Changed 6 years ago by gsathya

Replying to karsten:

Sounds good. Did the code produce meaningful output? I won't be able to review the code today, but I could try tomorrow or Friday. Knowing that the code probably works as expected would be good though. Thanks!

Made more changes here - https://github.com/gsathya/metrics-tasks/compare/bug_1854_v2 It's been running on my tiny VPS for more than 4 hours, processing 2 months of server descriptors and 1 month of consensus data. I'm going to run this on lemmonni now.

comment:38 in reply to:  37 Changed 6 years ago by gsathya

Replying to gsathya:

Replying to karsten:

Sounds good. Did the code produce meaningful output? I won't be able to review the code today, but I could try tomorrow or Friday. Knowing that the code probably works as expected would be good though. Thanks!

Made more changes here - https://github.com/gsathya/metrics-tasks/compare/bug_1854_v2 It's been running on my tiny VPS for more than 4 hours, processing 2 months of server descriptors and 1 month of consensus data. I'm going to run this on lemmonni now.

1 month of consensus data with 3 months of server descriptor data: http://codesurfers.net/~gsathya/entropy.csv

comment:39 Changed 6 years ago by karsten

Status: needs_review → new

Code looks good, merged. I also graphed your 1 month of data. I'm currently running your script in an EC2 instance on 1 year of data. Will let you know once I have results.

Changed 6 years ago by karsten

comment:40 Changed 6 years ago by karsten

Status: new → needs_information

Running this code on an EC2 m1.large instance took 15 minutes to set up (download and uncompress tarballs) and 9 hours to run.

Here are some results. gsathya, does this look plausible? arma and iang, is this what you had expected?

comment:41 Changed 6 years ago by iang

What's going on at the right end of the linf graph there?

Other than that, the plot shows that setting the cutoff to 1 MB/s (using only the top 400 relays or so) would affect the choice of relays by a tiny amount that I can't read from the graph. (Can you make that graph log/log?)

What is the linf comparison to? A cutoff of 20 KB/s? No cutoff? There are relays appearing in the upper plot with speeds < 20 KB/s.

comment:42 Changed 6 years ago by iang

Can you also plot total advertised bandwidth with the same x-axis? A rough eyeballing of the top figure suggests that the ~2100 relays with bandwidths below 1 MB/s contribute a total of ~500 MB/s, but it seems to me that that should produce more than a negligible change in the probability distribution.

Changed 6 years ago by karsten

comment:43 Changed 6 years ago by karsten

Replying to iang:

What's going on at the right end of the linf graph there?

You mean why is it skyrocketing and then dropping to almost zero? I think when there's only 1 relay left, the probability of it being picked grows to 100%, so linf is 100% minus its previous probability of being picked. And when no relay is left, linf goes down to the maximum probability of a relay being picked in the pristine consensus that now cannot be picked anymore.
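A toy example to make the endpoint behavior concrete: suppose the pristine probabilities are 0.4, 0.35, and 0.25, and the 0.25 relay has the highest advertised bandwidth. Once it is the only relay left, its probability becomes 1.0 and linf = 1.0 - 0.25 = 0.75 (the spike). Once it too is cut, every modified probability is 0 and linf = 0.4, the largest pristine probability (the drop).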

Other than that, the plot shows that setting the cutoff to 1 MB/s (using only the top 400 relays or so) would affect the choice of relays in a tiny amount I can't read from the graph. (Can you make that graph log/log?)

Attached. (I left the original graph in and added another graph for log/log, because number of relays looks funny on a log scale and there's no easy way to use different scales for both sub graphs.)

What is the linf comparison to? A cutoff of 20 KB/s? No cutoff? There are relays appearing in the upper plot with speeds < 20 KB/s.

No cutoff, that is, comparing to the pristine consensus where any relay could be picked. That's how linf is defined in the script right now. We can change that, but new results would be at least 9+ hours away.

Changed 6 years ago by karsten

comment:44 Changed 6 years ago by karsten

Replying to iang:

Can you also plot total advertised bandwidth with the same x-axis? A rough eyeing of the top figure suggests that the ~2100 relays with bandwidths below 1 MB/s contribute a total of ~500 MB/s, but it seems to me that that should produce more than a negligible change in probability distribution.

Attached. Please note that the first graphs were previously wrongly labeled. They showed data from 2011-11-19 23:00:00, not 2012-10-31 23:00:00. The new PDF shows the correct data, including total excluded advertised bandwidth.

comment:45 Changed 6 years ago by iang

OK, so my eyeballing of ~500 MB/s excluded at a 1 MB/s cutoff turned out to be pretty darned close. ;-)

So at that cutoff, about 15% of the network bandwidth disappears. But that 15% was spread (highly unevenly) over 2100 relays. Each of those relays, according to the linf figure, contributed a maximum of about 0.5% of the bandwidth, and in turn, the remaining relays see at most 0.5% extra users. (NOTE: that's 0.5% of *all* the users, not 0.5% of what it had before.)

OK, here's the plot I'm interested in now: x-axis: bandwidth of relay (log scale). y-axis: one line showing the probability distribution of relay selection with a 20 KB/s cutoff, and one with a 1 MB/s cutoff. Feel free to throw other intermediate values in there as well. We'll probably need a version with a linear y-axis and one with a log y-axis.

Is that easy to do?

Changed 6 years ago by karsten

Changed 6 years ago by karsten

comment:46 Changed 6 years ago by karsten

Replying to iang:

OK, here's the plot I'm interested in now: x-axis: bandwidth of relay (log scale). y-axis: one line showing the probability distribution of relay selection with a 20 KB/s cutoff, and one with a 1 MB/s cutoff. Feel free to throw other intermediate values in there as well. We'll probably need a version with a linear y-axis and one with a log y-axis.

This PDF (accidentally uploaded twice, *-c.2.pdf is the same file) now contains two new graphs of cumulative probability distributions, along with the existing graphs.

comment:47 Changed 6 years ago by iang

Can you make it a pdf (a density plot) instead of a cdf? Just scatter-plot (bandwidth, probability) for each relay and join up the points with lines.

Changed 6 years ago by karsten

comment:48 Changed 6 years ago by karsten

Attached. It's not an actual probability distribution function though, because multiple relays can have exactly the same advertised bandwidth, and I figured you don't want a graph with probabilities of those relays being summed up. (Unless you actually wanted such a graph, in which case I could easily make one.)

comment:49 Changed 6 years ago by iang

That's indeed just what I was looking for.

One more? Same as above, but the y axis is the ratio (prob with 1 MB cutoff / prob with 20 KB cutoff).

I expect 0 below 1 MB/s and a fairly constant value (~ 1.15 I think?) above 1 MB/s. Is that what we see?

If so, then the question is: since many relays are capped, what happens if 15% more users try to use them?

Changed 6 years ago by karsten

comment:50 Changed 6 years ago by karsten

Replying to iang:

That's indeed just what I was looking for.

Great!

One more?

Sure, this is fun! :)

Same as above, but the y axis is the ratio (prob with 1 MB cutoff / prob with 20 KB cutoff).

I expect 0 below 1 MB/s and a fairly constant value (~ 1.15 I think?) above 1 MB/s. Is that what we see?

Attached. I cut out relays below 1 MiB/s, because we set them to exactly 0.0, so we'd run into div/0 there. 1.087879 is the exact constant value for all relays above 1 MiB/s.
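Assuming probabilities are plain consensus weight fractions as defined earlier, that constant follows directly: for every kept relay the ratio is (cw/W') / (cw/W) = W/W' = 1 / (1 - f), where f is the excluded fraction of total consensus weight. So 1.087879 implies f = 1 - 1/1.087879, roughly 8.1% of consensus weight excluded -- lower than the ~15% of advertised bandwidth eyeballed above, since the two measures differ.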

comment:51 Changed 6 years ago by iang

Wouldn't it be 0/something, not something/0? In any event, yes, the >1 MB/s ones are the interesting ones.

So if we put the cutoff even as high as 1 MB/s, the remaining ~400 routers see a 9% increase in usage, and no one's on a crappy router.

Hmm.

comment:52 in reply to:  51 Changed 6 years ago by karsten

Replying to iang:

Wouldn't it be 0/something, not something/0?

Erm, yes, you're right. ;)

comment:53 Changed 6 years ago by mo

Cc: mo added

comment:54 Changed 6 years ago by cypherpunks

Post by Paul, who still doesn't have a proper account:
Been talking to Ian about this today here at Dagstuhl. I don't think all the effects of significantly shrinking the set of nodes that are ever chosen has been considered. If the network shrinks to c. 1/4 it's current size, this has the potential for tremendous psychological impact on users, relay volunteers, some adversaries, funders, etc. There is thus a big difference between switching a lot of nodes to be never chosen vs. changing the distributions to make them more rarely chosen. Instead of changing the fast flag, it would then make more sense to alter the bandwidth weighting. And the more gradual the change in probability of being chosen, the less any nodes will naturally count as the group that has been simply excluded. If performance is best served by more of a step function, then perhaps something in between will still significantly improve performance statistics without, e.g., resulting in graphs showing a 75% drop in the number of nodes with the fast flag when the change is rolled out.

comment:55 Changed 5 years ago by keb

In Canada at least, the assumption that home relay users can provide no more than 500KB/s of bandwidth may be obsolete by next year. Cable and DSL carriers are pushing connections with either 2 Mbit/s or 10 Mbit/s upload as their "standard" package, albeit with restrictions on total bytes transferred that result in hibernation.
http://www.rogers.com/web/link/hispeedBrowseFlowDefaultPlans
http://www.bell.ca/Bell_Internet/Internet_access
However, that might mean users' expectations of speed from Tor will also be higher, at least in Canada.

comment:56 Changed 5 years ago by proper

Cc: adrelanos@… added

Please consider the psychological effects.

  • Please never let users who have run Vidalia for years on slow connections find out from the press that their connection was considered too slow and that they wasted a year of electricity running Vidalia with the volunteer option without actually volunteering anything.
  • If someone's relay won't be used anymore, please show them a big fat warning so they don't waste their uptime.
  • As Paul suggested: please keep the numbers. Try to talk them into becoming bridges or just select them very rarely.
  • And finally, please don't tell people that residential connections aren't much help; keep the community spirit alive!

comment:57 Changed 5 years ago by iang

Encouraging users on slow connections to be bridges seems to make much more sense than encouraging them to be relays, no? Even "all (stable?) clients are automatically bridges" makes plausible sense, whereas "all (stable?) clients are automatically relays" will make the consensus melt.

comment:58 in reply to:  56 Changed 5 years ago by zenaan

Cc: zen@… added

Replying to proper:

Please consider the psychological effects.

  • Please never let users who have run Vidalia for years on slow connections find out from the press that their connection was considered too slow and that they wasted a year of electricity running Vidalia with the volunteer option without actually volunteering anything.
  • If someone's relay won't be used anymore, please show them a big fat warning so they don't waste their uptime.

We ought to be mindful of the future - it may bring exciting possibilities for even low-bandwidth relays:

  • parallel (torrent-like, i2p-like) pathways/cells
  • distributed data store possibilities
  • every relay a small encrypted (opt-in or opt-out) data store provider perhaps - I guess a la Freenet, but different
  • distributed redundant hash table(s) - greater redundancy is usually not a bad thing for a censorship-resistant distributed data store, I thought...
  • build a network-of-trust (GPG sort of style) on the Tor network - certain data models may depend on a greater number of nodes in the future, and would be weakened by a reduction in node count

The point is: Tor is not finished! We have a _long_ way to go to fully decentralise communications authority amongst the broader community. Let's definitely _not_ pre-empt our future by reducing possibilities.

  • As Paul suggested: please keep the numbers. Try to talk them into becoming bridges or just select them very rarely.

Those who have a spirit of contribution, will be grateful they can contribute, even if only a little. In the future, they may be able to make bigger contributions.

  • And finally, please don't tell people that residential connections aren't much help; keep the community spirit alive!

A big Ack!

A small help is a big help, is what we ought to say!

In addition: Building community, and building the future "more significant contributors" - which might be in many terms - financial, bandwidth, useful hidden services, brilliant ideas for future development, or even development itself. Every contributor starts somewhere!

  • Emphasize the genuine contribution directions, as said above, such as bridges; I believe the website appears to do this pretty well now - anecdotal, but I recently became an exit relay, and the website's encouragement steered me there with "this is needed, especially full exits" (the website wording can still be improved here - less fear, more reference to established legal precedents regarding 'carriers' in various jurisdictions) to steer people in the most useful directions.
  • Parallel paths may make those slow relays useful in the future. So discouraging people to do something towards their freedom and others' freedom is counter-productive to the long term health of the broader community.
  • Contributor graduation - building the community of those who wish to contribute; in time, some will graduate to be bigger contributors.

Again, future thinking.

A small help is a big help.

Everyone doing a little bit makes the future jobs easier.

We have not yet solved all the problems, so we definitely benefit from more people putting attention, action, and in many cases future intention, towards our broader goals.

Optimising bandwidth for current tor facilities is good.

Maximising our community base from which future development and technology can build is very good.

Last edited 5 years ago by zenaan

comment:59 Changed 4 years ago by arma

I appear to have raised what is turning into a similar topic on #13822.

comment:60 Changed 14 months ago by karsten

Resolution: wontfix
Status: needs_informationclosed

Closing tickets in Metrics/Analysis that were created 5+ years ago and have not seen progress recently, except for the ones that "nickm-cares" about.
