Opened 5 years ago

Last modified 19 months ago

#12436 new defect

Mail archive lint

Reported by: grarpamp Owned by:
Priority: Medium Milestone:
Component: Internal Services/Service - lists Version:
Severity: Normal Keywords:
Cc: Actual Points:
Parent ID: Points:
Reviewer: Sponsor:

Description

Some messages in the gzip pipermail archives (lists.torproject.org)
lack the correct metadata and format for what would otherwise be
full use by MUA's.

If the full raw archives exist, it may be easier to see what
reimporting with current mailman tools looks like.

From a concatenation of the three main lists: dev, relays, talk
(the others were not checked and may suffer as well)

There is...

Child Tickets

Change History (3)

comment:1 Changed 5 years ago by grarpamp

There are about 140 legit messages with bad From lines:
+ address format doesn't match regex
+ header dates don't match body dates

egrep '^From ' | egrep -v '^From .* at .* ... ... .. ..:..:.. ....$' | egrep    ' [0-9]{4}$' | sort | uniq -c

- Correctable with an unpack, edit, repack. Except for the missing message-id's.

From 00abc97618060972e29c31b4ffa4a75b8ef087e8 Mon Sep 17 00:00:00 2001
From: Tails developers <amnesia at boum.org>
Date: Wed, 4 Jan 2012 14:59:54 +0100
Subject: [PATCH 3/7] Optionally skip insecure DNS MX autoconfiguration lookup.

   1 From 00abc97618060972e29c31b4ffa4a75b8ef087e8 Mon Sep 17 00:00:00 2001
   1 From 0651e1f6e2c4f76fc444969f7fc6600670b302da Mon Sep 17 00:00:00 2001
   1 From 1de630482dadcef4ae5c751e152abdc05660f0a8 Mon Sep 17 00:00:00 2001
   1 From 27068f326379702b703014bb4b1ecc917f40b604 Mon Sep 17 00:00:00 2001
   1 From 3ad31e8727b46ef46b73d11764ca5fec15c5b57c Mon Sep 17 00:00:00 2001
   1 From 8d0c3c29dbc2cbf4fbda8d2829737ada33fa6e70 Mon Sep 17 00:00:00 2001
   1 From 8e8ae6cc51055edad7b104c309869e2d1a56389f Mon Sep 17 00:00:00 2001
   1 From af6bf472d59162428a1d7f1d77e6e77bda827414 Mon Sep 17 00:00:00 2001
   1 From cdfe6a543094e172524c50f1766b8a08a22581df Mon Sep 17 00:00:00 2001

- Require the raw archives to fix

From bogus@does.not.exist.com  Mon Jun 20 21:04:01 2011
From: bogus@does.not.exist.com ()
Date: Mon, 20 Jun 2011 21:04:01 -0000
Subject: No subject
Message-ID: <mailman.0.1309766217.3581.tor-dev@lists.torproject.org>

   8 From bogus@does.not.exist.com  Mon Feb 21 22:58:58 2011
  13 From bogus@does.not.exist.com  Mon Jun 20 21:04:01 2011
  18 From bogus@does.not.exist.com  Tue Mar  1 03:41:44 2011
  93 From bogus@does.not.exist.com  Tue Mar  1 03:45:00 2011
   1 From bogus@does.not.exist.com  Tue Mar  1 03:56:07 2011
Last edited 5 years ago by grarpamp (previous) (diff)

comment:2 Changed 5 years ago by grarpamp

Some From's are not properly escaped:

egrep '^From ' | egrep -v '^From .* at .* ... ... .. ..:..:.. ....$' | egrep -v ' [0-9]{4}$' | wc -l
      91

If you need a correct message count:

egrep '^From ' | egrep    '^From .* at .* ... ... .. ..:..:.. ....$' | wc -l
egrep '^From ' | egrep -v '^From .* at .* ... ... .. ..:..:.. ....$' | egrep    ' [0-9]{4}$' | wc -l

comment:3 Changed 19 months ago by teor

Severity: Normal

Set all open tickets without a severity to "Normal"

Note: See TracTickets for help on using tickets.