Inconsistent addrmap events when resolving hostname (regression)

changed milestone to %Tor: 0.2.4.x-final

added component::core tor/tor controller dns milestone::Tor: 0.2.4.x-final priority::medium reporter::Desoxy resolution::fixed status::closed tor-client type::defect version::tor 0.2.4.11-alpha labels

Hm. My thought would be that an ADDRMAP event would be appropriate, perhaps with some new flag to indicate that it wasn't getting cached. (Or we could have an expiry set to some time in the past to indicate that it isn't cacheable.)

The description in control-spec seems unsuitable.

I wonder how hard this would be to fix in 0.2.4. If the fix isn't too hard, we could put it in there.

I like the idea of using a flag or setting the expiry time in the past. It really depends on the meaning of the expiry time: Does it mean "The remote DNS server specified this TTL for this A/AAAA/PTR record" or does it mean "Tor will cache this record until that time"? For the first, a flag would be better, for the latter I think that setting the expiry time to the first of January 1970 would be best. Since the cache is now per circuit, it would also be possible to add an CIRC_ID field that indicates for which circuit the hostname was cached.

I hope that this can go into 0.2.4. The fix shouldn't be too difficult once we have decided on how to change the control spec. Having ADDRMAP work in 0.2.3, then not always work in 0.2.4 and then work again in 0.2.5 would be bad.

Trac:
Username: Desoxy

Replying to Desoxy:

I like the idea of using a flag or setting the expiry time in the past. It really depends on the meaning of the expiry time: Does it mean "The remote DNS server specified this TTL for this A/AAAA/PTR record" or does it mean "Tor will cache this record until that time"? For the first, a flag would be better, for the latter I think that setting the expiry time to the first of January 1970 would be best. Since the cache is now per circuit, it would also be possible to add an CIRC_ID field that indicates for which circuit the hostname was cached.

I'm inclined to say "let's not lose info", and do the flag.

(The cache is not actually per-circuit. Mostly, we just don't cache at the client side at all. This is approximately equivalent to a per-circuit cache, since the exit node caches too. Is there a lingering documentation bug too?)

I hope that this can go into 0.2.4. The fix shouldn't be too difficult once we have decided on how to change the control spec. Having ADDRMAP work in 0.2.3, then not always work in 0.2.4 and then work again in 0.2.5 would be bad.

I agree, but we should consider the risks too:

If it turns out that Vidalia or controller can't cope with a new flag, it would be bad to have to delay 0.2.4 packages until that controller could catch up.
If the fix is tricky enough, it would be bad to delay 0.2.4 stability for it.

But with any luck, if the fix can come soon, and the relevant controllers don't choke on it, and it's nice and simple, I think it can be 0.2.4 material.

Trac:
Username: Desoxy

addrmap_on_resolv_command.patch

Trac:
Username: Desoxy

(The cache is not actually per-circuit. Mostly, we just don't cache at the client side at all. This is approximately equivalent to a per-circuit cache, since the exit node caches too. Is there a lingering documentation bug too?) No, I only skimmed the documentation and simply misunderstood the change.

If it turns out that Vidalia or controller can't cope with a new flag, it would be bad to have to delay 0.2.4 packages until that controller could catch up. I have not found Vidalia listening for ADDRMAP events. Stem ignores unknown arguments, but changes are required to access the value of the new flag.

If the fix is tricky enough, it would be bad to delay 0.2.4 stability for it. There is already code for sending an ADDRMAP event as response to RESOLVE requests, because failed resolves do not get added to the cache and would thus not generate such events. Changing this code to also generate an ADDRMAP event on success is all that is needed. The attached patch does this and also adds a new flag CACHE="YES"/"NO" to ADDRMAP events.

Example events:

(Generated with tor-resolve:)
        650 ADDRMAP example.com 192.0.43.10 "2013-04-03 11:00:52" EXPIRES="2013-04-03 09:00:52" CACHE="YES"
(Gerated with RESOLV command:)
        650 ADDRMAP example.com 192.0.43.10 "2013-04-03 08:28:51" EXPIRES="2013-04-03 06:28:51" CACHE="NO"
	650 ADDRMAP example.invalid <error> "2013-04-03 08:28:52" error=yes EXPIRES="2013-04-03 06:28:52" CACHE="NO"

If this patch is acceped, I will change the controller spec to include information about the CACHE flag.

Trac:
Username: Desoxy
Status: new to needs_review

This patch looks plausible to me. I'd consider naming the new flag "CACHED" instead of "CACHE" because it indicates that Tor has cached the result, not that the user should cache the result. Once it has an accompanying controller spec patch, it's mergeable. It would also be great to have a changes/ file for this.

I have changed the patch to use CACHED instead of CACHE[1]. You can pull the changes directly from the bug-8596 branch of [2]. The changes to the control spec[3] are in the bug-8596 branch, also on github.[4]

One thing to consider before putting this into 0.2.4: When a mapaddress command is issued or when a host is automapped, it will also include the "CACHED" flag. If you do not want that, you could omit the "CACHED" flag if expiry is "NEVER".

> MAPADDRESS map.test=192.0.2.1
< 650 ADDRMAP map.test 192.0.2.1 NEVER CACHED="YES"
> RESOLVE example.com
< 650 ADDRMAP example.com 192.0.43.10 "2013-04-03 22:29:11" EXPIRES="2013-04-03 20:29:11" CACHED="NO"
(external: tor-resolve example.com)
< 650 ADDRMAP example.com 192.0.43.10 "2013-04-03 22:31:22" EXPIRES="2013-04-03 20:31:22" CACHED="YES"

1: https://github.com/desoxy-tor/tor/compare/bug-8596 2: https://github.com/desoxy-tor/tor.git 3: https://github.com/desoxy-tor/torspec/compare/bug-8596 4: https://github.com/desoxy-tor/torspec.git

Trac:
Username: Desoxy

Hi Desoxy. Nick asked me to chime in, so just leaving a note that your spec change (25b0d43) looks great to me!

From the point of view of controller sanity only: I suggest EXPIRES=NEVER rather than just the bare word NEVER. In other places in the control protocol, we've suddenly decided to announce that argument order doesn't matter anymore, which breaks stuff that has been written to depend on it. Omitting the EXPIRES keyword for the special case "NEVER" will break more stuff that will inevitably get written to handle a bare "NEVER".

The NEVER thing is already part of the spec.

Thanks for the reviews!

I have split the change into two parts: One commit for genereating ADDRMAP events if they are the answer to a RESOLVE command, and the second commit adding the CACHE=YES/NO to ADDRMAP events. If the spec change breaks something, I suggest to only use the first commit in 0.2.4 and worry about the spec change in 0.2.5.

Trac:
Username: Desoxy

From the point of view of controller sanity only: I suggest EXPIRES=NEVER rather than just the bare word NEVER.

I agree the NEVER was a mistake, though for a different reason (the "maybe this is a quoted positional argument and maybe it's a bare word" made this event type quite a special snowflake). That said, as Nick mentioned that isn't what this ticket is about.

If the spec change breaks something, I suggest to only use the first commit in 0.2.4 and worry about the spec change in 0.2.5.

It really shouldn't. In terms of the spec this is a very simple addition, and should be perfectly ok for controllers. If it isn't then that's a bug with forward compatibility in the controller.

Looks okay; merging to 0.2.4 and hoping for the best.

Trac:
Status: needs_review to closed
Resolution: N/A to fixed

Sorry to reopen but I need a little clarification about the new CACHED arg. In an example above the YES/NO was quoted - is that how it ended up being implemented?

650 ADDRMAP example.com 192.0.43.10 "2013-04-03 22:31:22" EXPIRES="2013-04-03 20:31:22" CACHED="YES"

If so then the spec change should be changed from...

Cached = "YES" / "NO"

... which, contrary to the quotes, mean bare word arguments. I suspect the right specification would be something like...

Cached = DQUOTE "YES" DQUOTE / DQUOTE "NO" DQUOTE

Also, this is more of a stem-specific question but what would be the best behavior for when this flag isn't present? Should the event's cached attribute be defaulted to True or False? Or should the attribute take on the values of True / False / None (for undefined)?

Thanks! -Damian

Trac:
Resolution: fixed to N/A
Status: closed to reopened

Thanks for notifying me.

I got confused by the quotes that were already around it, but those are part of the ABNF notation, not of the actual protocol. (I'm not very good with ABNF.) I commited a fix at [1].

Also, this is more of a stem-specific question but what would be the best behavior for when this flag isn't present? Should the event's cached attribute be defaulted to True or False? Or should the attribute take on the values of True / False / None (for undefined)?

It's a bit complicated: If the DNS resolution worked and an addrmap event is generated without a cached flag, then it was added to the cache. If resolution failed (e.g. because the domain doesn't exist) and there is no cached flag, then it was not be cached (this only happens when using the RESOLVE command).

1: https://github.com/desoxy-tor/torspec/commit/ed7730dc1aa14910955b41c6650c66d70a04e03c

Trac:
Username: Desoxy
Status: reopened to needs_review

I got confused by the quotes that were already around it, but those are part of the ABNF notation, not of the actual protocol. (I'm not very good with ABNF.) I commited a fix at [1].

Thanks! Looks good to me. :)

It's a bit complicated: If the DNS resolution worked and an addrmap event is generated without a cached flag, then it was added to the cache. If resolution failed (e.g. because the domain doesn't exist) and there is no cached flag, then it was not be cached (this only happens when using the RESOLVE command).

Hmm, sounds like I should stick with True / False / None then. Thanks for the clarification.

Merged it; thanks!

Trac:
Status: needs_review to closed
Resolution: N/A to fixed

Inconsistent addrmap events when resolving hostname (regression)

Child items ...

Activity