Once we know how many relays a bandwidth authority controls (#21992 (moved)), we might want to know how much it can change their figures, or how their measurements are distributed.
I'm not sure if we would use this, but I am writing it down so others can decide if they want it, and which option they want.
Quartile Option
This option doesn't tell us the exact range for each relay, but it's easy to calculate and understand.
For each bandwidth authority, and for the measured bandwidth medians:
I think the range option is very confusing, and we probably just want quartiles.
Trac: Description: Once we know how many relays a bandwidth authority controls (#21992 (moved)), we might want to know how much it can change their figures, or how their measurements are distributed.
I'm not sure if we would use this, but I am writing it down so others can decide if they want it, and which option they want.
Quartile Option
This option doesn't tell us the exact range for each relay, but it's easy to calculate and understand.
This option is more specific, but might be harder to understand and use.
For each bandwidth authority:
the median of the measurements it controls
the median of the next highest measurement
the median of the next lowest measurement
We should probably use medians for the average, because we really don't care about extremes.
Maybe this could look something like:
Bandwidth Authority Variance
lower
median
higher
longclaw
27887
58578
89090
gabelmoo
34585
69344
84323
Or maybe it would be better to express them as percentages of the median.
to
Once we know how many relays a bandwidth authority controls (#21992 (moved)), we might want to know how much it can change their figures, or how their measurements are distributed.
I'm not sure if we would use this, but I am writing it down so others can decide if they want it, and which option they want.
Quartile Option
This option doesn't tell us the exact range for each relay, but it's easy to calculate and understand.
Trac: Summary: Consensus Health: what is the distrubution of a bandwidth authority's measurements? to Consensus Health: what is the distribution of a bandwidth authority's measurements?
It might also be helpful to have quartiles for the overall measurements/medians.
Trac: Description: Once we know how many relays a bandwidth authority controls (#21992 (moved)), we might want to know how much it can change their figures, or how their measurements are distributed.
I'm not sure if we would use this, but I am writing it down so others can decide if they want it, and which option they want.
Quartile Option
This option doesn't tell us the exact range for each relay, but it's easy to calculate and understand.
This option is more specific, but might be harder to understand and use.
For each bandwidth authority:
the median of the measurements it controls (the relays for which it is the median)
the median of the next highest measurement where it is the next highest measurement
the median of the next lowest measurement where it is the next lowest measurement
We should probably use medians for the average, because we really don't care about extremes.
Maybe this could look something like:
Bandwidth Authority Variance
lower
median
higher
longclaw
27887
58578
89090
gabelmoo
34585
69344
84323
Or maybe it would be better to express them as percentages of the median.
to
Once we know how many relays a bandwidth authority controls (#21992 (moved)), we might want to know how much it can change their figures, or how their measurements are distributed.
I'm not sure if we would use this, but I am writing it down so others can decide if they want it, and which option they want.
Quartile Option
This option doesn't tell us the exact range for each relay, but it's easy to calculate and understand.
For each bandwidth authority, and for the measured bandwidth medians:
So I'm not sure exactly ewhat this is asking for, or how to implement it. But as far as bwauth debugging information, what I have wanted/envisioned are the following:
A graph on Atlas, per-relay, that shows each bwauth's votes for that relay over time
Something (maybe a graph) on Atlas, per-relay, that shows which scanner the bwauth made the measurement on
A series of graphs that shows below/median/above bwauth buckets for relays where each graph only applies to one country's relays.
A graph or series of graphs that shows bwauth variance on relays per-country.
(1) is for relay operators to understand why their bandwidth usage may have changed without requiring them to do some ugly consensus/vote grepping. But it requires changes to OnionOO and Atlas.
(2) is also for relay operators, but also bwauth operators, to confirm that sometimes relays slip between scanner cracks. To perform the analysis at all, bwauth's need to apply this patch: https://gitweb.torproject.org/torflow.git/commit/?id=7e4ef735858acf5d2fbb183b6f8418b7fc2b364a To get it into Atlas, we need the data in OnionOO, and to get the data into OnionOO we need it in Collector, (#21378 (moved)) and to get it into Collector we need it exposed by the bwauths (#21377 (moved)).
(3) Should confirm (or reject) the hypothesis that some bwauths make more high or low measurements because their geographic location hurts or helps them measure a disproportionate amount of the network.
(4) Should confirm (or reject) the hypothesis that maybe, just maybe, our bwauths tend to agree with similarly-located bwauths for similarly-located relays.
I'm not sure if any of those is what you said though.
So I'm not sure exactly ewhat this is asking for, or how to implement it. But as far as bwauth debugging information, what I have wanted/envisioned are the following:
A graph on Atlas, per-relay, that shows each bwauth's votes for that relay over time
Something (maybe a graph) on Atlas, per-relay, that shows which scanner the bwauth made the measurement on
A series of graphs that shows below/median/above bwauth buckets for relays where each graph only applies to one country's relays.
A graph or series of graphs that shows bwauth variance on relays per-country.
(1) is for relay operators to understand why their bandwidth usage may have changed without requiring them to do some ugly consensus/vote grepping. But it requires changes to OnionOO and Atlas.
(2) is also for relay operators, but also bwauth operators, to confirm that sometimes relays slip between scanner cracks. To perform the analysis at all, bwauth's need to apply this patch: https://gitweb.torproject.org/torflow.git/commit/?id=7e4ef735858acf5d2fbb183b6f8418b7fc2b364a To get it into Atlas, we need the data in OnionOO, and to get the data into OnionOO we need it in Collector, (#21378 (moved)) and to get it into Collector we need it exposed by the bwauths (#21377 (moved)).
(3) Should confirm (or reject) the hypothesis that some bwauths make more high or low measurements because their geographic location hurts or helps them measure a disproportionate amount of the network.
(4) Should confirm (or reject) the hypothesis that maybe, just maybe, our bwauths tend to agree with similarly-located bwauths for similarly-located relays.
These seem like great ideas.
Which do you think we should do first?
I'm not sure if any of those is what you said though.
These seem like great ideas.
Which do you think we should do first?
Definitely 3 and 4. The first two have a ton of dependencies.
I think adding these graphs will tip the scales of what's feasible in the current design of the consensus-health graphs page though; and we need to rethink how it is organized, and make it interactive. Maybe it should not live on consensus-health at all, and instead on Metrics...