Opened 8 months ago

Last modified 6 months ago

#33508 accepted task

Write Ops Doc for check service

Reported by: irl Owned by: irl
Priority: Medium Milestone:
Component: Metrics/Exit Scanner Version:
Severity: Normal Keywords: metrics-team-roadmap-2020
Cc: metrics-team, gaba, anarcat Actual Points: 0.5
Parent ID: #33507 Points: 1.2
Reviewer: Sponsor:

Description

Mirroring the format of https://help.torproject.org/metrics/ops/onionoo-ops/ unless anarcat has requests for things to include here.

The sections on deployment and disaster recovery can already be written.

The sections on monitoring will have to wait for monitoring to exist.

Child Tickets

TicketStatusOwnerSummaryComponent
#33718newmetrics-teamCheck check.torproject.org in NagiosMetrics/Cloud
#33719newmetrics-teamCheck DNSEL in NagiosMetrics/Cloud
#33720newmetrics-teamCheck exit scanner in NagiosMetrics/Cloud

Change History (13)

comment:1 Changed 8 months ago by irl

Component: Internal Services/Tor Sysadmin TeamMetrics/Exit Scanner
Owner: changed from tpa to metrics-team

comment:2 Changed 8 months ago by irl

On disaster recovery, this comment is relevant from the other ticket: https://trac.torproject.org/projects/tor/ticket/33506#comment:7

After initial deployment, it seems necessary to run make start in check directory, ctrl+c when you see "Listening on port: 8000" then restart the check service.

comment:3 Changed 8 months ago by irl

Owner: changed from metrics-team to irl
Status: newaccepted

comment:4 Changed 8 months ago by gaba

Keywords: metrics-team-roadmap-2020Q1 added

comment:5 Changed 7 months ago by anarcat

re monitoring (#33718, #33719, #33720), could you expand on what exactly you want monitored? reachability? latency? if it's an HTTP endpoint, please provide the full url as well.

comment:6 in reply to:  5 Changed 7 months ago by irl

Replying to anarcat:

re monitoring (#33718, #33719, #33720), could you expand on what exactly you want monitored? reachability? latency? if it's an HTTP endpoint, please provide the full url as well.

I was going to put this into the Metrics Nagios.

comment:7 Changed 7 months ago by anarcat

oh, so i don't need to do anything about this?

comment:8 in reply to:  7 Changed 7 months ago by irl

Replying to anarcat:

oh, so i don't need to do anything about this?

Nope, we may discuss at a future dev meeting about how we can use some TPA service instead of our own Nagios in AWS but for now this has been a great unblocker and I'm not in a hurry to be blocking on TPA again for our monitoring.

comment:9 Changed 7 months ago by irl

Actual Points: 0.5

This will end up at https://help.torproject.org/metrics/ops/exit-ops/ and is still a work in progress.

comment:10 Changed 7 months ago by gaba

Keywords: metrics-team-roadmap-2020April added; metrics-team-roadmap-2020Q1 removed

Move some of the tickets from last metrics roadmap to the roadmap in April.

comment:11 Changed 7 months ago by irl

Points: 1.2

comment:12 Changed 7 months ago by irl

Keywords: irl-roadmap-2020April added

comment:13 Changed 6 months ago by gaba

Keywords: metrics-team-roadmap-2020 added; metrics-team-roadmap-2020April irl-roadmap-2020April removed

We need to review all this tickets for metrics roadmap.

Note: See TracTickets for help on using tickets.