Opened 2 months ago

Last modified 7 weeks ago

#29860 new defect

collect 404 errors on our websites

Reported by: emmapeel Owned by: hiro
Priority: Medium Milestone:
Component: Webpages/Webtools Version:
Severity: Normal Keywords:
Cc: Actual Points:
Parent ID: Points:
Reviewer: Sponsor:

Description

Maybe we could have a way of knowing the most popular 404s we have?

I want to have it for when the new website is launched, but also for the different lektor services it would be very interesting to have access to them.

This was done a while ago (8 years) with good results: #2072

Child Tickets

Change History (1)

comment:1 Changed 7 weeks ago by boklm

It looks like we can get apache logs from there:
https://metrics.torproject.org/collector/archive/webstats/

With description of the format here:
https://metrics.torproject.org/web-server-logs.html

Maybe there is some existing tools that we can use to parse those logs, and give us the list of URLs by HTTP status codes, and popularity.

I think if we make a list of all the URLs that were working in march 2019, we can make a script that check those URLs on the new website and give us the list of the ones that don't work anymore.

Note: See TracTickets for help on using tickets.