Opened 2 years ago

Closed 23 months ago

#24716 closed task (wontfix)

Try cranking up cbttestfreq consensus param, to see if it helps the current overload

Reported by: arma Owned by:
Priority: Medium Milestone: Tor: 0.3.3.x-final
Component: Core Tor/DirAuth Version:
Severity: Normal Keywords:
Cc: mikeperry Actual Points:
Parent ID: Points:
Reviewer: Sponsor:

Description

In Tor 0.3.1.1-alpha, commit d5a151a, we switched:

-#define CBT_DEFAULT_TEST_FREQUENCY 60
+#define CBT_DEFAULT_TEST_FREQUENCY 10

And on May 20 2017 the dir auths set the cbttestfreq consensus param to 10 as well.

Right now the network is overloaded with create cells, from the millions of new clients that showed up in the past weeks.

Hypothesis 1: most of these clients are in learning mode much of the time, so 5 million clients * 10 seconds = 500k new create requests per second launched at the network, which contributes to the overload.

Hypothesis 2: some of these clients have learned quite low timeouts, causing them to generate many circuits which they then almost immediately cancel, but not enough of their circuits fail that they back away from their learned value.

Hypothesis 3: the clients are stuck in a sad loop where they learn a low cbt value, generate circuits for a while that mostly time out, eventually they give up on their cbt value, then they generate a circuit every 10s until they re-learn a low cbt value, and they cycle.

The experiment here (set cbttestfreq to 600 seconds temporarily) should help us test these hypotheses. For 1, we will immediately reduce the load of new circuits. For 2, this will help more slowly, because we'll have to wait for each client to hit a situation where 90%+ of its circuit attempts are being timed out, but in theory clients will slowly shift from having a too-aggressive cbt, back into learning mode. And for 3, we'll push most clients to the "learning, but very slowly" phase of their sad loop.

We can use the notice-level heartbeat messages in relay logs, to discover whether the total number of create cells goes down dramatically. If it does, win, we confirmed one or more of these hypotheses, and we can make a plan from there. If it doesn't, also win, we know we need to look elsewhere.

Child Tickets

TicketStatusOwnerSummaryComponent
#24770closedChange the circuit build time defaults to reduce network loadCore Tor/Tor

Change History (17)

comment:1 Changed 2 years ago by arma

I also just added cbtmintimeout=5000 to moria1's consensus params, on the theory that if clients really were learning super low timeouts and they weren't unlearning them, then we'll make them more likely to be willing to use the circuits they make.

This choice is not uniformly a win, since with this raised min, clients that had computed a super-fast cbt will now be less likely to abandon it (and move back to the slow learning phase, during which they use a timeout of 60s). But it could help us in the hypothesis 2 case, where due to bugs or luck clients are sticking to their super-fast cbt despite 80% circuit failure.

comment:2 Changed 2 years ago by arma

Here is a "before" picture from my freeBogatov relay:

Dec 22 00:38:00.372 [notice] Circuit handshake stats since last time: 495489/495557 TAP, 37413745/37414692 NTor.
Dec 22 06:38:00.401 [notice] Circuit handshake stats since last time: 393195/393196 TAP, 32050097/32050113 NTor.
Dec 22 12:38:00.404 [notice] Circuit handshake stats since last time: 445394/445854 TAP, 33484384/33565916 NTor.

https://atlas.torproject.org/#details/7B35DB92BA72BA0BBFD51B35B11A4967757E67B4

30-something million ntors per 6 hours translates to around 1500 create cells per second (or about 770Kbytes/s incoming traffic solely for creates)

comment:3 Changed 2 years ago by arma

Dec 22 18:38:00.404 [notice] Circuit handshake stats since last time: 388649/388650 TAP, 22170938/22170941 NTor.
Dec 23 00:38:00.404 [notice] Circuit handshake stats since last time: 381572/381572 TAP, 17693748/17693749 NTor.

Looking promising (but of course, my relay lost some consensus weight in the mean time because it couldn't handle yesterday's 4000 create cells per second load, so it's hard to isolate the reasons for the drop in create requests).

comment:4 Changed 2 years ago by arma

Dec 23 06:38:00.421 [notice] Circuit handshake stats since last time: 353664/353664 TAP, 20899286/20899290 NTor.
Dec 23 12:38:00.407 [notice] Circuit handshake stats since last time: 453990/453990 TAP, 22121243/22121289 NTor.
Dec 23 18:38:00.434 [notice] Circuit handshake stats since last time: 407439/407439 TAP, 31956327/31956370 NTor.
Dec 24 00:38:00.427 [notice] Circuit handshake stats since last time: 481036/481036 TAP, 31568498/31568526 NTor.
Dec 24 06:38:00.449 [notice] Circuit handshake stats since last time: 490006/490007 TAP, 27855214/27855241 NTor.
Dec 24 12:38:00.428 [notice] Circuit handshake stats since last time: 427430/427430 TAP, 23470101/23470101 NTor.
Dec 24 18:38:00.454 [notice] Circuit handshake stats since last time: 322234/322234 TAP, 27458616/27458619 NTor.
Dec 25 00:38:00.429 [notice] Circuit handshake stats since last time: 316974/316974 TAP, 23668352/23668352 NTor.
Dec 25 06:38:00.430 [notice] Circuit handshake stats since last time: 382918/382918 TAP, 22693583/22693584 NTor.
Dec 25 12:38:00.430 [notice] Circuit handshake stats since last time: 448714/448714 TAP, 21401231/21401231 NTor.
Dec 25 18:38:00.430 [notice] Circuit handshake stats since last time: 380642/380642 TAP, 27475076/27475076 NTor.
Dec 26 00:38:00.431 [notice] Circuit handshake stats since last time: 317830/317830 TAP, 21551889/21551892 NTor.
Dec 26 06:38:00.447 [notice] Circuit handshake stats since last time: 370076/370076 TAP, 22387253/22387275 NTor.
Dec 26 12:38:00.452 [notice] Circuit handshake stats since last time: 440029/440030 TAP, 24994906/24994919 NTor.

These numbers continue to be improvements over the previous ntor counts.

comment:5 Changed 2 years ago by arma

Hm. It's hard to tell from my tiny sample size, but it's less clear that these changes were a win. I think that might argue that the new circuit requests are actual circuits (for actual use), not preemptive circuit requests to help adaptively compute the timeout. That is, maybe the experiment is a success, and we learned that changing these consensus params didn't help as much as we hoped.

comment:6 Changed 2 years ago by arma

Dec 26 18:38:00.458 [notice] Circuit handshake stats since last time: 511249/511346 TAP, 51286261/51287331 NTor.
Dec 27 00:38:00.479 [notice] Circuit handshake stats since last time: 673506/678183 TAP, 55499364/55507489 NTor.
Dec 27 06:38:00.526 [notice] Circuit handshake stats since last time: 509775/509775 TAP, 39767730/39767788 NTor.
Dec 27 12:38:00.493 [notice] Circuit handshake stats since last time: 393278/393278 TAP, 29106206/29106210 NTor.
Dec 27 18:38:00.499 [notice] Circuit handshake stats since last time: 419196/419196 TAP, 28237407/28237409 NTor.
Dec 28 00:38:00.493 [notice] Circuit handshake stats since last time: 490560/490561 TAP, 26759514/26759520 NTor.
Dec 28 06:38:00.502 [notice] Circuit handshake stats since last time: 673176/673176 TAP, 34351081/34351107 NTor.
Dec 28 12:38:00.501 [notice] Circuit handshake stats since last time: 676032/676032 TAP, 29866259/29866287 NTor.
Dec 28 18:38:00.507 [notice] Circuit handshake stats since last time: 486744/486745 TAP, 35015849/35015854 NTor.
Dec 29 00:38:00.503 [notice] Circuit handshake stats since last time: 431075/431075 TAP, 33781037/33781042 NTor.
Dec 29 06:38:00.493 [notice] Circuit handshake stats since last time: 516184/516186 TAP, 28896392/28896474 NTor.
Dec 29 12:38:00.536 [notice] Circuit handshake stats since last time: 490503/490503 TAP, 31057725/31057731 NTor.
Dec 29 18:38:00.508 [notice] Circuit handshake stats since last time: 469543/469543 TAP, 33049227/33049247 NTor.
Dec 30 00:38:00.527 [notice] Circuit handshake stats since last time: 365496/365496 TAP, 34792350/34792383 NTor.
Dec 30 06:38:00.555 [notice] Circuit handshake stats since last time: 462773/462773 TAP, 31974335/31974346 NTor.
Dec 30 12:38:00.524 [notice] Circuit handshake stats since last time: 472061/472061 TAP, 25819195/25819198 NTor.

comment:7 Changed 2 years ago by teor

On 4/5 of my OVH relays in France (all guards), this appears to have halved the number of NTor circuits as of Dec 22/23 local time. The other one (radia0) was underloaded before the change, and maintained a similar load.

Note that these relays likely have very good connectivity to these new Tor clients, many of which are at OVH somewhere in France.

https://atlas.torproject.org/#search/radia

$ grep "Circuit handshake" /var/log/tor-instances/*/log | grep "Dec 2" | cut -d/ -f5- | cut -d" " -f1-3,11-
radia0/log:Dec 20 03:50:24.000 106059/107947 TAP, 4271236/4296375 NTor.
radia0/log:Dec 20 09:50:22.000 895688/947019 TAP, 3263236/3302609 NTor.
radia0/log:Dec 20 15:50:22.000 65908/65911 TAP, 3864864/3880543 NTor.
radia0/log:Dec 20 21:50:22.000 112842/122434 TAP, 3912074/4018827 NTor.
radia0/log:Dec 21 05:15:51.000 254465/269080 TAP, 3413052/3504274 NTor.
radia0/log:Dec 21 11:15:46.000 174919/187939 TAP, 2891613/2933743 NTor.
radia0/log:Dec 21 17:15:46.000 289248/305166 TAP, 4154550/4210506 NTor.
radia0/log:Dec 21 23:15:46.000 402087/467902 TAP, 4360287/4579211 NTor.
radia0/log:Dec 22 05:15:46.000 399203/420485 TAP, 4350336/4552848 NTor.
radia0/log:Dec 22 11:15:46.000 300043/304584 TAP, 5556295/5583442 NTor.
radia0/log:Dec 22 17:15:47.000 347702/360549 TAP, 4981746/4988214 NTor.
radia0/log:Dec 22 23:15:46.000 822845/866806 TAP, 3722148/3767271 NTor.
radia0/log:Dec 23 05:15:46.000 522436/547393 TAP, 2808131/2895566 NTor.
radia0/log:Dec 23 11:15:46.000 37783/37783 TAP, 2651670/2651671 NTor.
radia0/log:Dec 23 17:15:46.000 71538/71538 TAP, 3240451/3240457 NTor.
radia0/log:Dec 23 23:15:46.000 853435/853609 TAP, 3222007/3222209 NTor.
radia0/log:Dec 24 05:15:47.000 518963/541799 TAP, 3206635/3225575 NTor.
radia0/log:Dec 24 11:15:46.000 391894/392259 TAP, 3585873/3586082 NTor.
radia0/log:Dec 24 17:15:46.000 187107/187512 TAP, 3280086/3280249 NTor.
radia0/log:Dec 24 23:15:46.000 226741/227733 TAP, 3397785/3403033 NTor.
radia0/log:Dec 25 05:15:46.000 47519/47614 TAP, 2363708/2393817 NTor.
radia0/log:Dec 25 11:15:46.000 358658/359818 TAP, 2585496/2585503 NTor.
radia0/log:Dec 25 17:15:47.000 380189/380189 TAP, 3201905/3201911 NTor.
radia0/log:Dec 25 23:15:47.000 567819/569850 TAP, 3828259/3858067 NTor.
radia0/log:Dec 26 05:15:47.000 1811814/1917921 TAP, 3309816/3336049 NTor.
radia0/log:Dec 26 11:15:47.000 370948/370948 TAP, 3476749/3476752 NTor.
radia0/log:Dec 26 17:15:47.000 1423665/1474705 TAP, 3714381/3768300 NTor.
radia0/log:Dec 26 23:15:47.000 96816/97383 TAP, 4170982/4249271 NTor.
radia0/log:Dec 27 05:15:47.000 475032/481846 TAP, 3658655/3765814 NTor.
radia0/log:Dec 27 11:15:47.000 581453/584563 TAP, 4307006/4327653 NTor.
radia0/log:Dec 27 17:15:47.000 771062/818781 TAP, 4880358/4937947 NTor.
radia0/log:Dec 27 23:15:47.000 803408/832857 TAP, 4540603/4644602 NTor.
radia0/log:Dec 28 05:32:27.000 329760/342275 TAP, 2985083/3037971 NTor.
radia0/log:Dec 28 11:32:14.000 640057/647703 TAP, 3569188/3587123 NTor.
radia0/log:Dec 28 17:32:14.000 1074693/1082716 TAP, 4383746/4396249 NTor.
radia0/log:Dec 28 23:32:14.000 293635/294129 TAP, 4241604/4253026 NTor.
radia0/log:Dec 29 05:32:14.000 545425/546855 TAP, 4204893/4247948 NTor.
radia0/log:Dec 29 11:32:14.000 600737/600775 TAP, 4264769/4268766 NTor.
radia0/log:Dec 29 17:32:14.000 684621/695050 TAP, 4299874/4310820 NTor.
radia0/log:Dec 29 23:32:14.000 400624/412002 TAP, 5104122/5234707 NTor.
radia2/log:Dec 20 03:43:20.000 660033/660918 TAP, 9557927/9574297 NTor.
radia2/log:Dec 20 09:43:19.000 615512/621624 TAP, 7343769/7383568 NTor.
radia2/log:Dec 20 15:43:19.000 266080/266083 TAP, 8691142/8691223 NTor.
radia2/log:Dec 20 21:43:19.000 290020/290331 TAP, 9034322/9039971 NTor.
radia2/log:Dec 21 04:07:14.000 117711/117961 TAP, 8039416/8042975 NTor.
radia2/log:Dec 21 10:07:02.000 639445/645115 TAP, 7181337/7192024 NTor.
radia2/log:Dec 21 16:07:02.000 617447/630046 TAP, 8926146/8926598 NTor.
radia2/log:Dec 21 22:07:02.000 160608/160776 TAP, 8810094/8825140 NTor.
radia2/log:Dec 22 04:07:02.000 895347/910078 TAP, 8676374/8726419 NTor.
radia2/log:Dec 22 10:07:02.000 428128/428128 TAP, 9402189/9402190 NTor.
radia2/log:Dec 22 16:07:02.000 906339/906423 TAP, 9347120/9350573 NTor.
radia2/log:Dec 22 22:07:02.000 1107819/1108533 TAP, 7763052/7769410 NTor.
radia2/log:Dec 23 04:07:02.000 1476996/1523770 TAP, 6540293/6587865 NTor.
radia2/log:Dec 23 10:07:02.000 206684/206692 TAP, 6397089/6397944 NTor.
radia2/log:Dec 23 16:07:02.000 411155/411155 TAP, 5697543/5697543 NTor.
radia2/log:Dec 23 22:07:02.000 260237/260237 TAP, 5657338/5657338 NTor.
radia2/log:Dec 24 04:07:02.000 405525/405774 TAP, 5025989/5032176 NTor.
radia2/log:Dec 24 10:07:02.000 762578/762979 TAP, 6846202/6847462 NTor.
radia2/log:Dec 24 16:07:02.000 1227590/1227590 TAP, 6226857/6226857 NTor.
radia2/log:Dec 24 22:07:02.000 1152390/1152390 TAP, 5861620/5861620 NTor.
radia2/log:Dec 25 04:07:02.000 680941/685222 TAP, 4675610/4695253 NTor.
radia2/log:Dec 25 10:07:02.000 876958/876958 TAP, 4360172/4360172 NTor.
radia2/log:Dec 25 16:07:02.000 690567/690567 TAP, 4465554/4465554 NTor.
radia2/log:Dec 25 22:07:02.000 588578/590005 TAP, 4917537/4918307 NTor.
radia2/log:Dec 26 04:07:02.000 430968/432409 TAP, 5102578/5126839 NTor.
radia2/log:Dec 26 10:07:02.000 412771/412771 TAP, 5021488/5021488 NTor.
radia2/log:Dec 26 16:07:02.000 702828/702828 TAP, 9135196/9135196 NTor.
radia2/log:Dec 26 22:07:02.000 5237422/5243242 TAP, 9006314/9008648 NTor.
radia2/log:Dec 27 04:07:02.000 1400944/1402677 TAP, 8519342/8580996 NTor.
radia2/log:Dec 27 10:07:02.000 965480/965495 TAP, 10148290/10148915 NTor.
radia2/log:Dec 27 16:07:02.000 1200826/1200849 TAP, 9237722/9237722 NTor.
radia2/log:Dec 27 22:07:02.000 925678/926293 TAP, 8625067/8629924 NTor.
radia2/log:Dec 28 04:07:02.000 1068048/1121321 TAP, 8510027/8540205 NTor.
radia2/log:Dec 28 10:07:02.000 1016997/1020837 TAP, 7682840/7734723 NTor.
radia2/log:Dec 28 16:07:02.000 1411613/1411613 TAP, 7328790/7328792 NTor.
radia2/log:Dec 28 22:07:02.000 710325/710406 TAP, 6824685/6825359 NTor.
radia2/log:Dec 29 04:07:02.000 1236076/1296514 TAP, 7572770/7588432 NTor.
radia2/log:Dec 29 10:07:02.000 2087193/2087193 TAP, 8105463/8105463 NTor.
radia2/log:Dec 29 16:07:02.000 2420170/2420359 TAP, 12220365/12220367 NTor.
radia2/log:Dec 29 22:07:02.000 3373306/3379296 TAP, 13541421/13545804 NTor.
radia3/log:Dec 20 03:41:21.000 1683221/1690231 TAP, 5334317/5342813 NTor.
radia3/log:Dec 20 09:41:19.000 1683564/1736903 TAP, 7738350/7790037 NTor.
radia3/log:Dec 20 15:41:20.000 567387/573658 TAP, 9579935/9582605 NTor.
radia3/log:Dec 20 21:41:19.000 150709/151039 TAP, 9943191/10047560 NTor.
radia3/log:Dec 21 04:49:04.000 537754/563312 TAP, 8785610/8835759 NTor.
radia3/log:Dec 21 10:48:59.000 1079922/1103831 TAP, 7603757/7650126 NTor.
radia3/log:Dec 21 16:48:59.000 648571/668169 TAP, 17365706/17390223 NTor.
radia3/log:Dec 21 22:48:59.000 1543537/1640100 TAP, 13162461/13325330 NTor.
radia3/log:Dec 22 04:48:59.000 1044331/1130595 TAP, 10466355/10599278 NTor.
radia3/log:Dec 22 10:48:59.000 857781/857793 TAP, 11470969/11471029 NTor.
radia3/log:Dec 22 16:48:59.000 865998/866939 TAP, 11795200/11797239 NTor.
radia3/log:Dec 22 22:48:59.000 425098/425363 TAP, 8865905/8892455 NTor.
radia3/log:Dec 23 04:48:59.000 1279450/1291342 TAP, 6971472/7011294 NTor.
radia3/log:Dec 23 10:48:59.000 2000971/2000981 TAP, 6345228/6345928 NTor.
radia3/log:Dec 23 16:48:59.000 732162/732162 TAP, 6614799/6614799 NTor.
radia3/log:Dec 23 22:48:59.000 1805004/1805123 TAP, 6803505/6803571 NTor.
radia3/log:Dec 24 04:48:59.000 946949/947313 TAP, 6628231/6652590 NTor.
radia3/log:Dec 24 10:48:59.000 1102423/1102739 TAP, 7459116/7460967 NTor.
radia3/log:Dec 24 16:48:59.000 978599/979023 TAP, 6793693/6795690 NTor.
radia3/log:Dec 24 22:48:59.000 1384820/1435229 TAP, 6719261/6736078 NTor.
radia3/log:Dec 25 04:48:59.000 747395/758909 TAP, 5335033/5360782 NTor.
radia3/log:Dec 25 10:48:59.000 907700/907701 TAP, 4926214/4926217 NTor.
radia3/log:Dec 25 16:48:59.000 1744327/1744328 TAP, 6591431/6591469 NTor.
radia3/log:Dec 25 22:48:59.000 1681776/1691168 TAP, 6914224/6934125 NTor.
radia3/log:Dec 26 04:48:59.000 670844/695767 TAP, 6498931/6522652 NTor.
radia3/log:Dec 26 10:48:59.000 308417/308417 TAP, 6826446/6826446 NTor.
radia3/log:Dec 26 16:48:59.000 950003/963824 TAP, 8076661/8087140 NTor.
radia3/log:Dec 26 22:48:59.000 896842/905000 TAP, 8489901/8506339 NTor.
radia3/log:Dec 27 04:48:59.000 817936/843500 TAP, 8646337/8683423 NTor.
radia3/log:Dec 27 10:48:59.000 688773/696081 TAP, 9692435/9714631 NTor.
radia3/log:Dec 27 16:48:59.000 484858/490207 TAP, 8748992/8788716 NTor.
radia3/log:Dec 27 22:48:59.000 1326916/1347857 TAP, 5416307/5458339 NTor.
radia3/log:Dec 28 05:14:10.000 290733/292564 TAP, 4012069/4029032 NTor.
radia3/log:Dec 28 11:13:30.000 510247/514378 TAP, 4438692/4449727 NTor.
radia3/log:Dec 28 17:13:30.000 1421059/1421318 TAP, 5883695/5883813 NTor.
radia3/log:Dec 28 23:13:30.000 560099/560171 TAP, 5510310/5511832 NTor.
radia3/log:Dec 29 05:13:30.000 893870/897976 TAP, 5078319/5112083 NTor.
radia3/log:Dec 29 11:13:30.000 556389/556389 TAP, 4930086/4930089 NTor.
radia3/log:Dec 29 17:13:30.000 708288/710572 TAP, 5335291/5342281 NTor.
radia3/log:Dec 29 23:13:30.000 815907/834181 TAP, 6307191/6336015 NTor.
radia4/log:Dec 20 03:38:37.000 766863/777324 TAP, 11248102/11257141 NTor.
radia4/log:Dec 20 09:38:36.000 287820/288167 TAP, 8627977/8665574 NTor.
radia4/log:Dec 20 15:38:36.000 11425969/15712062 TAP, 10139774/10141172 NTor.
radia4/log:Dec 20 21:38:36.000 34120430/46465434 TAP, 8349899/8349954 NTor.
radia4/log:Dec 21 07:02:40.000 519222/519233 TAP, 7940428/7940877 NTor.
radia4/log:Dec 21 13:02:21.000 201719/201760 TAP, 6444553/6456991 NTor.
radia4/log:Dec 21 19:02:21.000 406162/489604 TAP, 9527839/9544297 NTor.
radia4/log:Dec 22 01:02:21.000 5116651/5342413 TAP, 10409141/10465488 NTor.
radia4/log:Dec 22 07:02:21.000 3596594/3640301 TAP, 9744049/9822217 NTor.
radia4/log:Dec 22 13:02:21.000 448180/448223 TAP, 10819817/10823676 NTor.
radia4/log:Dec 22 19:02:21.000 203723/204142 TAP, 13173213/13231840 NTor.
radia4/log:Dec 23 01:02:21.000 1037666/1085564 TAP, 11493356/11569557 NTor.
radia4/log:Dec 23 07:02:21.000 1734324/1782355 TAP, 10166732/10204764 NTor.
radia4/log:Dec 23 13:02:22.000 916657/916905 TAP, 10431145/10431350 NTor.
radia4/log:Dec 23 19:02:22.000 946250/946254 TAP, 9855674/9855679 NTor.
radia4/log:Dec 24 01:02:22.000 410858/410876 TAP, 9653145/9654128 NTor.
radia4/log:Dec 24 07:02:22.000 1557375/1565850 TAP, 8250949/8297403 NTor.
radia4/log:Dec 24 13:02:22.000 813499/815117 TAP, 7991608/7997885 NTor.
radia4/log:Dec 24 19:02:22.000 1315854/1323867 TAP, 6443006/6450238 NTor.
radia4/log:Dec 25 01:02:22.000 635393/673727 TAP, 4658596/4730211 NTor.
radia4/log:Dec 25 07:02:22.000 816487/846851 TAP, 4448908/4490480 NTor.
radia4/log:Dec 25 13:02:22.000 833626/833631 TAP, 6270905/6270936 NTor.
radia4/log:Dec 25 19:02:22.000 651185/651187 TAP, 6535715/6535751 NTor.
radia4/log:Dec 26 01:02:22.000 1059702/1095587 TAP, 5685195/5721423 NTor.
radia4/log:Dec 26 07:02:22.000 830573/832668 TAP, 4039191/4082144 NTor.
radia4/log:Dec 26 13:02:22.000 530167/531542 TAP, 6018797/6020610 NTor.
radia4/log:Dec 26 19:02:22.000 657411/669503 TAP, 6327927/6337680 NTor.
radia4/log:Dec 27 01:02:22.000 998128/1021016 TAP, 7846159/7875856 NTor.
radia4/log:Dec 27 07:02:22.000 689437/700717 TAP, 7708472/7760894 NTor.
radia4/log:Dec 27 13:02:22.000 319868/342063 TAP, 8516945/8556501 NTor.
radia4/log:Dec 27 19:02:22.000 1117930/1156242 TAP, 8489667/8572742 NTor.
radia4/log:Dec 28 01:02:22.000 946359/985999 TAP, 6851886/6938813 NTor.
radia4/log:Dec 28 07:02:22.000 75979/76853 TAP, 2225322/2252711 NTor.
radia4/log:Dec 28 13:02:22.000 1406352/1412449 TAP, 6753360/6759483 NTor.
radia4/log:Dec 28 19:02:22.000 1086648/1086661 TAP, 6547486/6547597 NTor.
radia4/log:Dec 29 01:02:22.000 169897/169930 TAP, 4013327/4015880 NTor.
radia4/log:Dec 29 07:02:22.000 703415/704352 TAP, 3848527/3895842 NTor.
radia4/log:Dec 29 13:02:22.000 1053456/1064367 TAP, 4011138/4013006 NTor.
radia4/log:Dec 29 19:02:22.000 571738/575304 TAP, 4259906/4262560 NTor.
radia5/log:Dec 20 04:42:17.000 625639/680168 TAP, 12499606/12601764 NTor.
radia5/log:Dec 20 10:42:09.000 639229/647316 TAP, 7913425/7935506 NTor.
radia5/log:Dec 20 16:42:09.000 485485/489836 TAP, 11272929/11275418 NTor.
radia5/log:Dec 20 22:42:09.000 150254/155766 TAP, 5329847/5405635 NTor.
radia5/log:Dec 21 04:47:40.000 135117/146336 TAP, 3640955/3690187 NTor.
radia5/log:Dec 21 10:47:00.000 409902/457502 TAP, 3449731/3488725 NTor.
radia5/log:Dec 21 16:47:00.000 55492/55532 TAP, 4525322/4554568 NTor.
radia5/log:Dec 21 22:47:00.000 58213/58448 TAP, 4781562/4866989 NTor.
radia5/log:Dec 22 04:47:00.000 983581/1024397 TAP, 5581354/5705240 NTor.
radia5/log:Dec 22 10:47:00.000 1684410/1685762 TAP, 6573154/6573832 NTor.
radia5/log:Dec 22 16:47:00.000 283610/321631 TAP, 6617820/6706928 NTor.
radia5/log:Dec 22 22:47:00.000 376292/387936 TAP, 5305446/5348443 NTor.
radia5/log:Dec 23 04:47:00.000 240197/250249 TAP, 4353838/4409459 NTor.
radia5/log:Dec 23 10:47:00.000 297299/304602 TAP, 3552063/3552285 NTor.
radia5/log:Dec 23 16:47:00.000 132848/132848 TAP, 4449669/4449677 NTor.
radia5/log:Dec 23 22:47:00.000 351543/353151 TAP, 5498848/5500878 NTor.
radia5/log:Dec 24 04:47:00.000 343480/344337 TAP, 4123111/4144623 NTor.
radia5/log:Dec 24 10:47:00.000 869304/869322 TAP, 4144639/4144941 NTor.
radia5/log:Dec 24 16:47:00.000 712096/714048 TAP, 3779441/3779537 NTor.
radia5/log:Dec 24 22:47:00.000 794430/797457 TAP, 3733587/3738892 NTor.
radia5/log:Dec 25 04:47:00.000 558567/566102 TAP, 3048521/3069759 NTor.
radia5/log:Dec 25 10:47:00.000 191905/191905 TAP, 2728532/2728532 NTor.
radia5/log:Dec 25 16:47:00.000 1374766/1376815 TAP, 3678290/3678303 NTor.
radia5/log:Dec 25 22:47:00.000 260028/263236 TAP, 4418891/4445845 NTor.
radia5/log:Dec 26 04:47:00.000 449802/473441 TAP, 3848575/3908236 NTor.
radia5/log:Dec 26 10:47:00.000 240544/240545 TAP, 3423936/3423939 NTor.
radia5/log:Dec 26 16:47:00.000 1246427/1275247 TAP, 4670074/4677581 NTor.
radia5/log:Dec 26 22:47:00.000 793753/801380 TAP, 5168537/5176938 NTor.
radia5/log:Dec 27 04:47:00.000 197994/199661 TAP, 4605005/4638393 NTor.
radia5/log:Dec 27 10:47:00.000 371832/374686 TAP, 4659866/4666084 NTor.
radia5/log:Dec 27 16:47:00.000 530631/554382 TAP, 6795980/6820455 NTor.
radia5/log:Dec 27 22:47:00.000 530296/540786 TAP, 6368716/6406793 NTor.
radia5/log:Dec 28 05:21:41.000 152554/154468 TAP, 4700944/4747448 NTor.
radia5/log:Dec 28 11:20:53.000 256750/258048 TAP, 12697675/12704008 NTor.
radia5/log:Dec 28 17:20:53.000 410466/410466 TAP, 17809179/17809179 NTor.
radia5/log:Dec 28 23:20:53.000 439896/439896 TAP, 28112581/28112581 NTor.
radia5/log:Dec 29 05:20:53.000 262653/262717 TAP, 19072085/19139791 NTor.
radia5/log:Dec 29 11:20:53.000 418189/418190 TAP, 21901636/21901636 NTor.
radia5/log:Dec 29 17:20:53.000 405224/405224 TAP, 23656096/23656098 NTor.
radia5/log:Dec 29 23:20:53.000 409176/409268 TAP, 26345574/26349328 NTor.

comment:8 Changed 2 years ago by teor

On 3/3 of my OVH relays in Montreal (liskov0 is an exit, the others are guards), this appears to have reduced the number of NTor circuits by by 30% as of Dec 22/23 local time (UTC-5). One of the guards (liskov1) had a load spike around the time of the change.

Note that these relays have decent connectivity but more latency to these new Tor clients, many of which are at OVH somewhere in France.

https://atlas.torproject.org/#search/liskov

$ grep "Circuit handshake" /var/log/tor-instances/*/log | grep "Dec 2" | cut -d/ -f5- | cut -d" " -f1-3,11-
liskov0/log:Dec 20 03:06:15.000 16294/16294 TAP, 403314/403314 NTor.
liskov0/log:Dec 20 09:06:15.000 20397/20397 TAP, 454041/454041 NTor.
liskov0/log:Dec 20 15:06:15.000 22958/22958 TAP, 519545/519545 NTor.
liskov0/log:Dec 20 21:06:15.000 18563/18563 TAP, 476568/476568 NTor.
liskov0/log:Dec 21 03:06:15.000 13820/13820 TAP, 370375/370375 NTor.
liskov0/log:Dec 21 09:06:15.000 17606/17606 TAP, 390082/390082 NTor.
liskov0/log:Dec 21 15:06:15.000 19405/19405 TAP, 423351/423351 NTor.
liskov0/log:Dec 21 21:06:15.000 15339/15339 TAP, 381211/381211 NTor.
liskov0/log:Dec 22 03:06:15.000 14661/14661 TAP, 401180/401180 NTor.
liskov0/log:Dec 22 09:06:15.000 20069/20069 TAP, 490016/490016 NTor.
liskov0/log:Dec 22 15:06:15.000 20923/20923 TAP, 500426/500426 NTor.
liskov0/log:Dec 22 21:06:15.000 13109/13109 TAP, 328728/328728 NTor.
liskov0/log:Dec 23 03:06:15.000 8636/8636 TAP, 258989/258989 NTor.
liskov0/log:Dec 23 09:06:15.000 12643/12643 TAP, 285919/285919 NTor.
liskov0/log:Dec 23 15:06:15.000 13791/13791 TAP, 294955/294955 NTor.
liskov0/log:Dec 23 21:06:15.000 13051/13051 TAP, 320555/320555 NTor.
liskov0/log:Dec 24 03:06:15.000 14007/14007 TAP, 362664/362664 NTor.
liskov0/log:Dec 24 09:06:15.000 16310/16310 TAP, 372172/372172 NTor.
liskov0/log:Dec 24 15:06:15.000 16156/16156 TAP, 374303/374303 NTor.
liskov0/log:Dec 24 21:06:15.000 12930/12930 TAP, 315003/315003 NTor.
liskov0/log:Dec 25 03:06:15.000 11217/11217 TAP, 299267/299267 NTor.
liskov0/log:Dec 25 09:06:15.000 13727/13727 TAP, 334649/334649 NTor.
liskov0/log:Dec 25 15:06:15.000 14221/14221 TAP, 364917/364917 NTor.
liskov0/log:Dec 25 21:06:15.000 12552/12552 TAP, 334950/334950 NTor.
liskov0/log:Dec 26 03:06:15.000 11219/11219 TAP, 320459/320459 NTor.
liskov0/log:Dec 26 09:06:15.000 13831/13831 TAP, 350023/350023 NTor.
liskov0/log:Dec 26 15:06:15.000 19181/19181 TAP, 475904/475904 NTor.
liskov0/log:Dec 26 21:06:15.000 15538/15538 TAP, 438239/438239 NTor.
liskov0/log:Dec 27 03:06:15.000 11690/11690 TAP, 374361/374361 NTor.
liskov0/log:Dec 27 09:06:15.000 17814/17814 TAP, 463937/463937 NTor.
liskov0/log:Dec 27 15:06:15.000 23088/23088 TAP, 528082/528082 NTor.
liskov0/log:Dec 27 21:06:15.000 15687/15687 TAP, 461411/461411 NTor.
liskov0/log:Dec 28 03:06:15.000 13162/13162 TAP, 419477/419477 NTor.
liskov0/log:Dec 28 09:06:15.000 16937/16937 TAP, 455789/455789 NTor.
liskov0/log:Dec 28 15:06:15.000 19813/19813 TAP, 506786/506786 NTor.
liskov0/log:Dec 28 21:06:15.000 15673/15673 TAP, 475216/475216 NTor.
liskov0/log:Dec 29 03:06:15.000 13696/13696 TAP, 454461/454461 NTor.
liskov0/log:Dec 29 09:06:15.000 17431/17431 TAP, 462030/462030 NTor.
liskov0/log:Dec 29 15:06:15.000 19668/19668 TAP, 514633/514633 NTor.
liskov0/log:Dec 29 21:06:15.000 18060/18060 TAP, 459767/459767 NTor.
liskov1/log:Dec 20 03:05:57.000 167329/167328 TAP, 1698982/1698848 NTor.
liskov1/log:Dec 20 09:05:57.000 155634/155634 TAP, 1647595/1647595 NTor.
liskov1/log:Dec 20 15:05:57.000 20021/20021 TAP, 1737221/1737221 NTor.
liskov1/log:Dec 20 21:05:57.000 26317/26317 TAP, 2209295/2209312 NTor.
liskov1/log:Dec 21 03:05:57.000 229269/229269 TAP, 2636508/2636508 NTor.
liskov1/log:Dec 21 09:05:57.000 241587/241587 TAP, 2414534/2414534 NTor.
liskov1/log:Dec 21 15:05:57.000 29006/29006 TAP, 2411853/2411853 NTor.
liskov1/log:Dec 21 21:05:57.000 613618/613618 TAP, 2494160/2494184 NTor.
liskov1/log:Dec 22 03:05:57.000 738018/738018 TAP, 2608335/2608335 NTor.
liskov1/log:Dec 22 09:05:57.000 68340/68340 TAP, 2662807/2662807 NTor.
liskov1/log:Dec 22 15:05:57.000 32915/32915 TAP, 2091445/2091445 NTor.
liskov1/log:Dec 22 21:05:57.000 35401/35401 TAP, 1541737/1541737 NTor.
liskov1/log:Dec 23 03:05:57.000 27732/27732 TAP, 5482727/5482727 NTor.
liskov1/log:Dec 23 09:05:57.000 794275/794275 TAP, 1321854/1321854 NTor.
liskov1/log:Dec 23 15:05:57.000 25915/25915 TAP, 1352584/1352584 NTor.
liskov1/log:Dec 23 21:05:57.000 77238/77238 TAP, 1248124/1248125 NTor.
liskov1/log:Dec 24 03:05:57.000 193632/193632 TAP, 1762061/1762061 NTor.
liskov1/log:Dec 24 09:05:57.000 50083/50083 TAP, 1814817/1814817 NTor.
liskov1/log:Dec 24 15:05:57.000 74907/74907 TAP, 1688924/1688924 NTor.
liskov1/log:Dec 24 21:05:57.000 303247/303247 TAP, 1479111/1479111 NTor.
liskov1/log:Dec 25 03:05:57.000 123391/123391 TAP, 1336242/1336242 NTor.
liskov1/log:Dec 25 09:05:57.000 103087/103087 TAP, 1358966/1358966 NTor.
liskov1/log:Dec 25 15:05:57.000 378236/378236 TAP, 1436780/1436780 NTor.
liskov1/log:Dec 25 21:05:57.000 219107/219143 TAP, 1659679/1659679 NTor.
liskov1/log:Dec 26 03:05:57.000 655228/655228 TAP, 1864584/1864584 NTor.
liskov1/log:Dec 26 09:05:57.000 382692/382692 TAP, 2271386/2271386 NTor.
liskov1/log:Dec 26 15:05:57.000 280941/280946 TAP, 8589614/8905253 NTor.
liskov1/log:Dec 26 21:06:08.000 107503/107517 TAP, 3127084/3127606 NTor.
liskov1/log:Dec 27 03:06:08.000 728020/728015 TAP, 3267462/3267078 NTor.
liskov1/log:Dec 27 09:06:08.000 111720/111720 TAP, 3349663/3349663 NTor.
liskov1/log:Dec 27 15:06:08.000 773125/773125 TAP, 3296155/3296155 NTor.
liskov1/log:Dec 27 21:06:08.000 733516/733518 TAP, 3414845/3414868 NTor.
liskov1/log:Dec 28 03:06:08.000 505359/505359 TAP, 3417172/3417172 NTor.
liskov1/log:Dec 28 09:06:08.000 419599/419599 TAP, 3584161/3584161 NTor.
liskov1/log:Dec 28 15:06:08.000 529407/529407 TAP, 3163304/3163304 NTor.
liskov1/log:Dec 28 21:06:08.000 307914/307921 TAP, 3323575/3323605 NTor.
liskov1/log:Dec 29 03:06:08.000 849330/849330 TAP, 3269660/3269660 NTor.
liskov1/log:Dec 29 09:06:08.000 1297178/1297178 TAP, 3810270/3810270 NTor.
liskov1/log:Dec 29 15:06:08.000 232687/232687 TAP, 4105295/4105295 NTor.
liskov1/log:Dec 29 21:06:08.000 114436/114437 TAP, 4873495/4873688 NTor.
liskov2/log:Dec 20 03:06:04.000 70074/70071 TAP, 3740593/3740499 NTor.
liskov2/log:Dec 20 09:06:04.000 75622/75622 TAP, 3208747/3208747 NTor.
liskov2/log:Dec 20 15:06:04.000 39118/39118 TAP, 3357602/3357602 NTor.
liskov2/log:Dec 20 21:06:04.000 484138/484139 TAP, 3382642/3382708 NTor.
liskov2/log:Dec 21 03:06:04.000 10119694/10123385 TAP, 3381509/3381509 NTor.
liskov2/log:Dec 21 09:06:04.000 2105177/2105177 TAP, 3114635/3114635 NTor.
liskov2/log:Dec 21 15:06:04.000 229920/229920 TAP, 3449980/3449980 NTor.
liskov2/log:Dec 21 21:06:07.000 56117/56139 TAP, 3428774/3429296 NTor.
liskov2/log:Dec 22 03:06:07.000 381576/381554 TAP, 3579634/3579237 NTor.
liskov2/log:Dec 22 09:06:07.000 73342/73342 TAP, 4813774/4813774 NTor.
liskov2/log:Dec 22 15:06:07.000 145771/145771 TAP, 4718373/4718373 NTor.
liskov2/log:Dec 22 21:06:07.000 285165/285165 TAP, 2367385/2367385 NTor.
liskov2/log:Dec 23 03:06:07.000 194570/194570 TAP, 2190537/2190537 NTor.
liskov2/log:Dec 23 09:06:07.000 1005400/1005400 TAP, 1920355/1920355 NTor.
liskov2/log:Dec 23 15:06:07.000 36635/36635 TAP, 1988409/1988409 NTor.
liskov2/log:Dec 23 21:06:07.000 104566/104566 TAP, 1853247/1853247 NTor.
liskov2/log:Dec 24 03:06:07.000 340538/340538 TAP, 2080668/2080668 NTor.
liskov2/log:Dec 24 09:06:07.000 63863/63863 TAP, 1914263/1914263 NTor.
liskov2/log:Dec 24 15:06:07.000 76969/76969 TAP, 1767802/1767802 NTor.
liskov2/log:Dec 24 21:06:07.000 171020/171312 TAP, 1624776/1624776 NTor.
liskov2/log:Dec 25 03:06:07.000 358170/358170 TAP, 1394736/1394736 NTor.
liskov2/log:Dec 25 09:06:07.000 43211/43211 TAP, 1411938/1411938 NTor.
liskov2/log:Dec 25 15:06:07.000 294397/294397 TAP, 1501031/1501031 NTor.
liskov2/log:Dec 25 21:06:07.000 36167/36167 TAP, 1744405/1744405 NTor.
liskov2/log:Dec 26 03:06:07.000 646087/646087 TAP, 1802457/1802457 NTor.
liskov2/log:Dec 26 09:06:07.000 192836/192836 TAP, 1755431/1755431 NTor.
liskov2/log:Dec 26 15:06:07.000 6870277/6870277 TAP, 1398903/1398903 NTor.
liskov2/log:Dec 26 21:06:07.000 272931/272931 TAP, 2377552/2377555 NTor.
liskov2/log:Dec 27 03:06:07.000 812723/812723 TAP, 2586087/2586087 NTor.
liskov2/log:Dec 27 09:06:07.000 56085/56085 TAP, 2623252/2623252 NTor.
liskov2/log:Dec 27 15:06:07.000 68687/68687 TAP, 2508305/2508305 NTor.
liskov2/log:Dec 27 21:06:07.000 116582/116582 TAP, 2553796/2553797 NTor.
liskov2/log:Dec 28 03:06:07.000 294486/294486 TAP, 2485709/2485709 NTor.
liskov2/log:Dec 28 09:06:07.000 313521/313521 TAP, 2504431/2504431 NTor.
liskov2/log:Dec 28 15:06:07.000 615334/615334 TAP, 1870973/1870973 NTor.
liskov2/log:Dec 28 21:06:07.000 90164/90164 TAP, 2262294/2262294 NTor.
liskov2/log:Dec 29 03:06:07.000 1079047/1079047 TAP, 1839090/1839090 NTor.
liskov2/log:Dec 29 09:06:07.000 349901/349901 TAP, 2342389/2342389 NTor.
liskov2/log:Dec 29 15:06:07.000 86387/86387 TAP, 2103361/2103361 NTor.
liskov2/log:Dec 29 21:06:07.000 378476/378476 TAP, 2368781/2368782 NTor.

comment:9 Changed 2 years ago by teor

Overall, these relays have been much more stable on Atlas and in OVH's monitoring since we made this change. I haven't made any other config changes over that time.

So I would call this a success - we should think about the defaults for these values.

We might also want to consider asking DirAuths to raising the circuits available timeout for clients, which is new in 0.3.1. I recommend nf_conntimeout_clients 18000. See #24768 for details.

I also recommend we reconsider the idle and network timeout changes in that commit, neither of which can be changed using consensus parameters. See #24769.

comment:10 Changed 2 years ago by nickm

Cc: mikeperry added
Milestone: Tor: 0.3.3.x-final

comment:11 Changed 2 years ago by dgoulet

My fast relay (Truie) got from Dec 28th to Jan 5th a pretty important load on NTor. From 500k to 1M depending on the time of day.

Jan 03 22:51:02.262 [notice] Circuit handshake stats since last time: 397262/397262 TAP, 910632/910632 NTor.

Everything started to go down on Jan 5th past 20:00:00 UTC. For now, the traffic going through it is almost double what it used to see but the connections are stable with:

Jan 08 15:21:02.670 [notice] Circuit handshake stats since last time: 23591/23597 TAP, 334149/334368 NTor.
Jan 08 15:51:02.709 [notice] Circuit handshake stats since last time: 12063/12063 TAP, 341922/341922 NTor.
Jan 08 16:21:02.718 [notice] Circuit handshake stats since last time: 23751/23751 TAP, 335790/335790 NTor.
Jan 08 16:51:02.478 [notice] Circuit handshake stats since last time: 21637/21637 TAP, 357263/357263 NTor.
Jan 08 17:21:02.557 [notice] Circuit handshake stats since last time: 57312/57312 TAP, 343390/343390 NTor.

Not only NTor connections went down but TAP also went down considerably.

comment:12 Changed 2 years ago by arma

My ntor load has varied but lately it is crazy high.

Here are the lines since the last ones I pasted:

Dec 30 18:38:00.502 [notice] Circuit handshake stats since last time: 455344/455344 TAP, 28975792/28975813 NTor.
Dec 31 00:38:00.536 [notice] Circuit handshake stats since last time: 699334/699334 TAP, 27973797/27973843 NTor.
Dec 31 06:38:00.542 [notice] Circuit handshake stats since last time: 853782/853786 TAP, 29181912/29181969 NTor.
Dec 31 12:38:00.536 [notice] Circuit handshake stats since last time: 826675/826677 TAP, 27429591/27429629 NTor.
Dec 31 18:38:00.573 [notice] Circuit handshake stats since last time: 691791/691799 TAP, 41866424/41866586 NTor.
Jan 01 00:38:00.549 [notice] Circuit handshake stats since last time: 755275/755293 TAP, 38457671/38457826 NTor.
Jan 01 06:38:00.569 [notice] Circuit handshake stats since last time: 622897/622923 TAP, 23214801/23214813 NTor.
Jan 01 12:38:00.552 [notice] Circuit handshake stats since last time: 552771/552771 TAP, 21665861/21665865 NTor.
Jan 01 18:38:00.554 [notice] Circuit handshake stats since last time: 627477/627604 TAP, 35923014/35923158 NTor.
Jan 02 00:38:00.555 [notice] Circuit handshake stats since last time: 604364/604365 TAP, 30375781/30375796 NTor.
Jan 02 06:38:00.555 [notice] Circuit handshake stats since last time: 691533/691547 TAP, 33685459/33685478 NTor.
Jan 02 12:38:00.553 [notice] Circuit handshake stats since last time: 738342/738351 TAP, 33851359/33851421 NTor.
Jan 02 18:38:00.567 [notice] Circuit handshake stats since last time: 455824/455825 TAP, 31640360/31640429 NTor.
Jan 03 00:38:00.568 [notice] Circuit handshake stats since last time: 680932/681281 TAP, 28213577/28215385 NTor.
Jan 03 06:38:00.577 [notice] Circuit handshake stats since last time: 684922/684922 TAP, 30468494/30468518 NTor.
Jan 03 12:38:00.584 [notice] Circuit handshake stats since last time: 706932/706936 TAP, 32047162/32047204 NTor.
Jan 03 18:38:00.586 [notice] Circuit handshake stats since last time: 717007/717007 TAP, 33859568/33859589 NTor.
Jan 04 00:38:00.586 [notice] Circuit handshake stats since last time: 254597/254597 TAP, 39652360/39652429 NTor.
Jan 04 12:35:31.607 [notice] Circuit handshake stats since last time: 370544/370544 TAP, 43227111/43227209 NTor.
Jan 04 18:35:31.651 [notice] Circuit handshake stats since last time: 419505/463538 TAP, 57708012/57718125 NTor.
Jan 05 00:35:31.667 [notice] Circuit handshake stats since last time: 323241/328550 TAP, 61154041/61154849 NTor.
Jan 05 06:35:31.677 [notice] Circuit handshake stats since last time: 320161/321177 TAP, 56381809/56382464 NTor.
Jan 05 12:35:31.703 [notice] Circuit handshake stats since last time: 461325/465544 TAP, 56769051/56769766 NTor.
Jan 05 18:35:31.706 [notice] Circuit handshake stats since last time: 511351/652657 TAP, 62709365/62749635 NTor.
Jan 06 10:24:49.143 [notice] Circuit handshake stats since last time: 261450/261450 TAP, 38935974/38937202 NTor.
Jan 06 23:14:22.057 [notice] Circuit handshake stats since last time: 473587/473589 TAP, 46681363/46681447 NTor.
Jan 07 05:14:22.102 [notice] Circuit handshake stats since last time: 486614/486615 TAP, 46743956/46744012 NTor.
Jan 07 11:14:22.144 [notice] Circuit handshake stats since last time: 496896/496896 TAP, 48832445/48832552 NTor.
Jan 07 17:14:22.176 [notice] Circuit handshake stats since last time: 477255/503942 TAP, 47588565/47592247 NTor.
Jan 07 23:14:22.201 [notice] Circuit handshake stats since last time: 492865/493968 TAP, 50934071/50936320 NTor.
Jan 08 05:14:22.231 [notice] Circuit handshake stats since last time: 426030/426035 TAP, 45163497/45163547 NTor.
Jan 08 11:14:22.204 [notice] Circuit handshake stats since last time: 452647/457412 TAP, 38736117/38754463 NTor.
Jan 08 17:14:22.232 [notice] Circuit handshake stats since last time: 510958/511346 TAP, 46547845/46548062 NTor.

So maybe the circuit overload is still here because of #24769, or maybe this experiment was overall a failure.

comment:13 Changed 2 years ago by arma

I think we should back out these changes, and see if there's any effect. Call it one last experiment.

Mike suggests backing out the cbtmintimeout change first, since he thinks it shouldn't be affecting anything.

comment:14 Changed 2 years ago by arma

I have changed moria1, and the dirauth-git, to no longer vote about cbtmintimeout. It will be some time (hours? days?) until enough dir auths update for the change to actually happen.

comment:15 Changed 2 years ago by teor

I think the changes had a clear effect on my relays, that varied by their proximity to the new clients at OVH. But it could have been a coincidence.

I've tried tuning a lot of things since then, here's my update:
https://trac.torproject.org/projects/tor/ticket/24782#comment:6

My next step is to DROP all traffic from these IP addresses so I can get decent data for our privacy preserving stats research. That means I won't be able to do comparisons.

comment:16 Changed 23 months ago by arma

I have backed out the change, so it goes back to cbttestfreq=10, on moria1 and in the dirauth-git.

I'm sufficiently convinced that the overload is from onion service circuits, not from cbt testing circuits, and while reducing the cbt test frequency will indeed reduce overall load on the network, it does so exactly by messing up everybody's cbt estimate, which isn't worth it.

comment:17 Changed 23 months ago by teor

Resolution: wontfix
Status: newclosed

This doesn't do what we want,

Note: See TracTickets for help on using tickets.