Opened 9 years ago

Closed 9 years ago

#2072 closed task (fixed)

Collect popular now-dead urls

Reported by: Sebastian Owned by: phobos
Priority: Medium Milestone:
Component: Webpages/Website Version:
Severity: Keywords:
Cc: Actual Points:
Parent ID: Points:
Reviewer: Sponsor:

Description

weasel made it possible to use .htaccess files for redirect. We should collect a list of old now-broken links that external sites or published documents use and put in redirects for those. All internal links should still be fixed to use the new version of the website. Just put links here in this format and I'll get to it:

https://www.torproject.org/faq => https://www.torproject.org/docs/faq

Child Tickets

Change History (15)

comment:1 Changed 9 years ago by mikeperry

We keep anonymized logs still right? Can we just grep through the error logs to see which urls are getting lots of 404s and go from there?

comment:2 Changed 9 years ago by Sebastian

Not sure if we keep them; if we do, sounds like a great first step.

comment:3 Changed 9 years ago by mikeperry

comment:4 in reply to:  3 Changed 9 years ago by Sebastian

comment:5 Changed 9 years ago by phobos

Here's the command line I ran on vescum:

grep "File does not exist" www.torproject.org-error.log | cut -d":" -f6 | sort -n | uniq -c | sort -n | tail -20

Here's the result:

129 www.torproject.org/docs/tor-doc-windows.html.fr

132 www.torproject.org/

145 www.torproject.org/projects/dist/download/download.html.en

157 www.torproject.org/download/download.html.en

160 www.torproject.org/torbrowser/dist/download/download/download/download.html.en

163 www.torproject.org/torbrowser/index.html.en

170 www.torproject.org/torbrowser/dist/download/download.html.en

177 www.torproject.org/vidalia/download/download/download/download.html.en

182 www.torproject.org/torbrowser/download/download/download.html.en

186 www.torproject.org/dist/vidalia-bundles/download/download.html.en

217 www.torproject.org/vidalia/index.html.en

244 www.torproject.org/projects/dist/tor-browser-1.3.10_en-US.exe

284 www.torproject.org/dist/torbrowser/tor-browser-1.3.10_en-US.tar.gz

383 www.torproject.org/torbrowser/download/download.html.en

391 www.torproject.org/vidalia/download/download/download.html.en

640 www.torproject.org/torbrowser/

729 www.torproject.org/projects/torbrowser.html.en

775 www.torproject.org/vidalia/download/download.html.en

1965 www.torproject.org/vidalia/

comment:6 Changed 9 years ago by phobos

Status: newaccepted

Update cli and result:

grep "File does not exist" www.torproject.org-error.log | cut -d" " -f13 | sort -n | uniq -c | sort -n | tail -20

87 /var/www/www.torproject.org/htdocs/index.html.de
89 /var/www/www.torproject.org/htdocs/index.html.fr
99 /var/www/www.torproject.org/htdocs/index.html.ru,

104 /var/www/www.torproject.org/htdocs/images/white-bullet.gif,
117 /var/www/www.torproject.org/htdocs/projects/torbrowser.html.en/,
121 /var/www/www.torproject.org/htdocs/easy-download.html.en,
124 /var/www/www.torproject.org/htdocs/torbutton/search
145 /var/www/www.torproject.org/htdocs/svn
147 /var/www/www.torproject.org/htdocs/docs/tor-doc-windows.html.fr,
162 /var/www/www.torproject.org/htdocs/index.html.de,
193 /var/www/www.torproject.org/htdocs/download.html.en
217 /var/www/www.torproject.org/htdocs/download.html.en,
266 /var/www/www.torproject.org/htdocs/dist/torbrowser/tor-browser-1.3.10_en-US.tar.gz,
444 /var/www/www.torproject.org/htdocs/favicon.ico,
481 /var/www/www.torproject.org/htdocs/img,
506 /var/www/www.torproject.org/htdocs/projects/dist,
541 /var/www/www.torproject.org/htdocs/index.html.fr,
648 /var/www/www.torproject.org/htdocs/images/tbb-close-button.png,
707 /var/www/www.torproject.org/htdocs/volunteer.html.en

7213 /var/www/www.torproject.org/htdocs/favicon.ico

comment:7 Changed 9 years ago by phobos

And today's update is looking better. The total number of missing files is down dramatically.

14 /var/www/www.torproject.org/htdocs/index.html.fr,

15 /var/www/www.torproject.org/htdocs/download.html
15 /var/www/www.torproject.org/htdocs/index.html.fr
16 /var/www/www.torproject.org/htdocs/download.html.ru,
16 /var/www/www.torproject.org/htdocs/index.html.de
20 /var/www/www.torproject.org/htdocs/easy-download.html.en
20 /var/www/www.torproject.org/htdocs/easy-download.html.it
23 /var/www/www.torproject.org/htdocs/projects/torbrowser.html.en/,
28 /var/www/www.torproject.org/htdocs/styles,
31 /var/www/www.torproject.org/htdocs/easy-download.html.en,
40 /var/www/www.torproject.org/htdocs/images/white-bullet.gif,
45 /var/www/www.torproject.org/htdocs/torbutton/search
48 /var/www/www.torproject.org/htdocs/index.html.ru,
66 /var/www/www.torproject.org/htdocs/dist/torbrowser/tor-browser-1.3.10_en-US.tar.gz,
83 /var/www/www.torproject.org/htdocs/svn

100 /var/www/www.torproject.org/htdocs/projects/dist,
122 /var/www/www.torproject.org/htdocs/download.html.en
164 /var/www/www.torproject.org/htdocs/img,
181 /var/www/www.torproject.org/htdocs/download.html.en,
184 /var/www/www.torproject.org/htdocs/images/tbb-close-button.png,

comment:8 Changed 9 years ago by Sebastian

I added a few more redirects to capture more of these issues. What remains is that someone needs to create a white-bullet.gif and put it into images/

comment:10 in reply to:  8 ; Changed 9 years ago by phobos

Replying to Sebastian:

I added a few more redirects to capture more of these issues. What remains is that someone needs to create a white-bullet.gif and put it into images/

I don't see where white-bullet.gif gets used in our html. I don't want to put things in there solely because other sites link to the wrong place.

comment:12 in reply to:  10 Changed 9 years ago by Sebastian

Replying to phobos:

I don't see where white-bullet.gif gets used in our html. I don't want to put things in there solely because other sites link to the wrong place.

It gets used in ie6.css

comment:13 in reply to:  9 Changed 9 years ago by arma

Replying to mo:

https://www.torproject.org/abuse => https://www.torproject.org/docs/abuse

Perhaps you meant faq-abuse here? There was no 'abuse' page before (and there isn't one now either).

comment:14 Changed 9 years ago by Sebastian

faq-abuse should already be correctly redirected... hm.

comment:15 Changed 9 years ago by phobos

Resolution: fixed
Status: acceptedclosed

I think we've redirected the most popular paths. Now we have logs filled with people linking to paths that were invalid before the change too.

Note: See TracTickets for help on using tickets.