Adding further, SBWS bastet reports 300 relays not reported by SBWS longclaw which reports 400 relays not reported by SBWS bastet. Longclaw and bastet vote are from same consensus:
Looking at maatuska vote in the same 17:00 UTC consensus, SBWS bastet lists 1165 fewer relays. Filtering to the live consensus at the time, deficit is 723 relays comprising 2.5% by overall consensus weight.
We are aware of that, this is a kind of duplicate of #28355 (moved), though the ticket subject might not be clear.
I think the decision was to wait until #28547 (moved) is implemented, to confirm the reasons why sbws is reporting less relays.
It is aggressive to deploy more than one scanner in production before this issue is decisively resolved. Certainly the state of a second non-publishing scanner can be quality assured. . .without pushing results to the authorities.
Hi, we deployed sbws 1.1.0 to bastet and longclaw.
sbws 1.1.0 fixes some bugs, and adds extra error reporting to the bandwidth file.
(It generates file format version 1.4.0.)
Can you please re-do your analysis with the latest votes and bandwidth files from longclaw?
The last two makes the ~1000 relays less.
The scanner takes around 48h (i was wrong with my 24h estimation) to measure unique relays in the consensus, so it takes 4 days for each relay to have at least 2 measures (and not be excluded by few) and we're only considering 5.
There would be less relays excluded if we take only 1 measurement as valid or we consider more days of measurements.
I don't have an explanation yet for the relays near that get measured again in less than 24h.
A different thing for which i'll open a ticket as soon as i confirm, is that the number of the consensuses where the relay has been seen seems to be only 1, which doesn't make sense.
What i've being observing for some months and now is public:
{{{
recent_measurements_excluded_error_count=763
recent_measurements_excluded_few_count=733
recent_measurements_excluded_near_count=259
}}}
The last two makes the ~1000 relays less.
The scanner takes around 48h (i was wrong with my 24h estimation) to measure unique relays in the consensus, so it takes 4 days for each relay to have at least 2 measures (and not be excluded by few) and we're only considering 5.
There would be less relays excluded if we take only 1 measurement as valid or we consider more days of measurements.
Here's a ticket for the few: #30227 (moved).
I will write more on that ticket.
I don't have an explanation yet for the relays near that get measured again in less than 24h.
A different thing for which i'll open a ticket as soon as i confirm, is that the number of the consensuses where the relay has been seen seems to be only 1, which doesn't make sense.
Trac: Summary: Is it acceptable that SBWS consistently reports 6200 relays, 1000 fewer than Torflow's 7200 universe? to sbws reports 6200 relays, 1000 fewer than Torflow's 7200
Trac: Status: needs_information to new Description: Question should be answered. #28547 (moved) closest but this ticket does not directly address the issue.
to
Question should be answered. #28547 (moved) closest but this ticket does not directly address the issue.
Edit: sbws reports running relays, but torflow reports measured relays Summary: sbws reports 6200 relays, 1000 fewer than Torflow's 7200 to sbws reports running relays, but torflow reports measured relays
Torflow approach on this is correct. Relays may go up and down due to maintenance, network outages and other normal occurrences. Authorities detect the state of relays and include them in the consensus as appropriate. Bandwidth scanners should report what they know absent current run state. The negative consequence of omitting down relays is delay in proper rating when they come back online.
Torflow approach on this is correct. Relays may go up and down due to maintenance, network outages and other normal occurrences. Authorities detect the state of relays and include them in the consensus as appropriate. Bandwidth scanners should report what they know absent current run state. The negative consequence of omitting down relays is delay in proper rating when they come back online.
Yes, I agree. We'll fix this issue in #30727 (moved). We need to keep at least 3 torflow instances, until we do this fix.
A different thing for which i'll open a ticket as soon as i confirm, is that the number of the consensuses where the relay has been seen seems to be only 1, which doesn't make sense.
What's the ticket number?
i don't know what i was thinking on when i wrote this, but didn't open any ticket. Relays seen in only 1 consensus make sense, like relays that just joined the network or are up again after sbws lost their previous data.