add autonat v2 spec #538

sukunrt · 2023-04-11T07:43:17Z

First draft for autonat v2. #503

This protocol allows for testing reachability on exactly one address. This helps determine reachability at an address level. This also simplifies the protocol a lot.

I'll change the spec to reflect the discussion on dialing a different ip address from the nodes observed ip address: #536

Discussion for nonce in message is here: libp2p/go-libp2p#1480
and this comment in particular libp2p/go-libp2p#1480 (comment)

marten-seemann

Very nice, this is a solid starting point for the spec!

What's the plan for resolving #536? Would you open a new PR that targets this PR here?

autonat/README.md

autonat/autonat-v2.md

sukunrt · 2023-04-11T12:23:48Z

What's the plan for resolving #536? Would you open a new PR that targets this PR here?

Yes, I'll open a PR with the changes for #536.

autonat/autonat-v2.md

thomaseizinger

Exciting! Thanks for your work. Left some comments/questions :)

Sorry if they have already been answered somewhere.

autonat/autonat-v2.md

sukunrt · 2023-04-25T07:44:48Z

thanks for your review @thomaseizinger.
I'd like your opinion on these two issues

Proposal: use a list of addresses in priority order for autonat v2 dial requests #539
Proposal: allow AutoNAT to dial all IP addresses, without risking amplification attacks #536

mxinden

Great work @sukunrt. Thank you!

autonat/autonat-v2.md

thomaseizinger · 2023-04-27T16:35:40Z

thanks for your review @thomaseizinger. I'd like your opinion on these two issues

Proposal: use a list of addresses in priority order for autonat v2 dial requests #539
Proposal: allow AutoNAT to dial all IP addresses, without risking amplification attacks #536

I don't have anything to add to those at the moment :)

MarcoPolo

On a brief skim, this looks good! I'm curious if we'll want to relax the "implementations MUST NOT dial any multiaddress unless it is based on the IP address the requesting node is observed as". Would it be useful to do this, and we can mitigate the amplification attack some other way?

It seems like there's a healthy discussion already going on, so I'll step back here and let other folks stay involved. If there's anything I can help with, please don't hesitate to ping.

autonat/autonat-v2.md

sukunrt · 2023-04-27T19:22:51Z

Thanks for your review @MarcoPolo

It seems like there's a healthy discussion already going on, so I'll step back here and let other folks stay involved. If there's anything I can help with, please don't hesitate to ping.

The suggested strategy is discussed here: #536
Please check if we've made any errors there or overlooked something.

Here's the PR for those changes: #542
You can review it there, or here after I merge those changes.

marten-seemann · 2023-09-18T08:53:31Z

How is this putting a strain on servers? It’s the server that decides when to require this verification, and for every incoming request, the server has the option to reject it.

umgefahren · 2023-10-30T11:08:27Z

How is this putting a strain on servers? It’s the server that decides when to require this verification, and for every incoming request, the server has the option to reject it.

Yes that's true.

umgefahren · 2023-10-30T11:17:45Z

I need another clarification: In line 67 it is mentioned that every DialRequest will be sent on a stream with protocol ID /libp2p/autonat/2/dial. However in Line 219 it is also said that a DialRequest is a Message send on /libp2p/autonat/2/dial-request. This is confusing to me. Which one is it now?

Or is only the DialRequest a Message on /libp2p/autonat/2/dial and all other Message, like DialDataRequest, DialDataResponse are happening on /libp2p/autonat/2/dial-request. If that's correct, why use Message on /libp2p/autonat/2/dial?

sukunrt · 2023-10-30T11:25:56Z

Thanks umgefahren, I've fixed this. It should be /libp2p/autonat/2/dial-request

sukunrt · 2023-10-30T11:30:01Z

@umgefahren

Or is only the DialRequest a Message on /libp2p/autonat/2/dial and all other Message, like DialDataRequest, DialDataResponse are happening on /libp2p/autonat/2/dial-request. If that's correct, why use Message on /libp2p/autonat/2/dial

on the stream /libp2p/autonat/2/dial-request we exchange messages, DialRequest, DialDataRequest, DialDataResponse, DialResponse. Since we exchange multiple types of messages on the stream from both sides, we wrap them in a Message type protobuf so that we can determine which type of message we have received. So a client on receiving a Message from server can determine whether it is a DialDataRequest or is it a DialResponse.

umgefahren · 2023-10-30T13:21:53Z

Great, that's clear now.

Another question, just to make sure: There will always be several DialDataResponse, since the upper bound for the amount of data per message is capped at 4096 bytes and it is required to send between 30kB and 100kB. Did I understand that correctly? That means the amount of data received so far have to be tracked, per connection.

And an additional suggestion: What is the upper bound for the size of Message, i.e. how many Multiaddress can be send by the client?

sukunrt · 2023-10-30T20:11:38Z

Another question, just to make sure: There will always be several DialDataResponse, since the upper bound for the amount of data per message is capped at 4096 bytes and it is required to send between 30kB and 100kB. Did I understand that correctly? That means the amount of data received so far have to be tracked, per connection.

Yes your understanding is correct, there'll be multiple DialDataResponse till you get the requested amount of data. You will have to track the amount of data received so far per dial-request stream. A peer can open multiple streams in parallel. Of course an implementation MAY choose to not allow this, but I don't think it is necessary to make this a MUST.

And an additional suggestion: What is the upper bound for the size of Message, i.e. how many Multiaddress can be send by the client?

I will think about this. Currently this is only limited by the size of the Message which again is implementation dependent. I think most implementations have a limit lower than 8kB. I think it is fine to add a suggestion to limit the number of addresses inspected.

umgefahren · 2023-11-07T14:27:55Z

Another quick question, I probably missed something: When the server successfully dials the client and provides the nonce. The client closes the stream either way. How does the server know if the provided nonce was correct?

umgefahren · 2023-11-08T10:37:00Z

In the spec it says that all private IP address should be excluded, but it also says it's just for checking reachability on the public internet. That said, we should exclude:

For IPv4:

addresses on "this network" (i.e. IPv4 address starting with a 0)
private IP addresses
shared IP addresses
loopback IP addresses (could create interesting behavior though)
link local IP addresses
reserved for future protocols
documenting
benchmarking
reserved
broadcast

For IPv6:

Unspecified
Loopback
unicast link local
unique local
documentation
IPv4 mapped
IPv4-IPv6 translation
Discard only
IETF Protocol assignments

In the rust-libp2p implementation there was a PR discussing those globally reachable IP address: libp2p/rust-libp2p#3814

Also this list is probably not complete and not formal, it's also a small nitpick.

sukunrt · 2023-11-08T14:06:46Z

@umgefahren it'd be better if you can add the comments as a review, commenting on the specific section.

In the spec it says that all private IP address should be excluded, but it also says it's just for checking reachability on the public internet. That said, we should exclude:

It means non public. Happy to change the wording to non public.

Another quick question, I probably missed something: When the server successfully dials the client and provides the nonce. The client closes the stream either way. How does the server know if the provided nonce was correct?

A correct server will always provide the correct nonce, no? This issue should be easy enough to debug for implementors without signalling from the client. Is there any benefit to the server knowing that it provided an incorrect nonce?

umgefahren · 2023-11-08T14:12:33Z

@umgefahren it'd be better if you can add the comments as a review, commenting on the specific section.

I will do that the next time. I'm sorry.

In the spec it says that all private IP address should be excluded, but it also says it's just for checking reachability on the public internet. That said, we should exclude:

It means non public. Happy to change the wording to non public.

Thanks for clarification.

Another quick question, I probably missed something: When the server successfully dials the client and provides the nonce. The client closes the stream either way. How does the server know if the provided nonce was correct?

A correct server will always provide the correct nonce, no? This issue should be easy enough to debug for implementors without signalling from the client. Is there any benefit to the server knowing that it provided an incorrect nonce?

I think there is a benefit. There is a pathological example where the network configuration or a NAT forward traffic to the wrong libp2p node. Not the one that requested the dial back, but a different one. In that case the server would report reachability on that address, but it's actually not reaching the peer in question. I'm not an expert enough here to think of any case where that might occur apart from a bad config or a malicious actor.

sukunrt · 2023-11-08T14:41:45Z

In this case, the client sees that the server is reporting OK-Reachable but it has not received the nonce, so it should reject the response.

thomaseizinger · 2023-11-09T03:53:35Z

Another quick question, I probably missed something: When the server successfully dials the client and provides the nonce. The client closes the stream either way. How does the server know if the provided nonce was correct?

A correct server will always provide the correct nonce, no? This issue should be easy enough to debug for implementors without signalling from the client. Is there any benefit to the server knowing that it provided an incorrect nonce?

I think there is a benefit. There is a pathological example where the network configuration or a NAT forward traffic to the wrong libp2p node.

If we dial the node with /p2p, the connection will never be fully established if we end up at a different node so you can't send the nonce over.

umgefahren · 2023-11-09T10:24:11Z

So since that is not possible, the implementation doesn't needs to handle this case, right?

But thanks for the clarification and sorry for the dumb questions.

thomaseizinger · 2023-11-10T02:23:39Z

So since that is not possible, the implementation doesn't needs to handle this case, right?

Yep I think you are right! We can assume that this will never happen. Feel free to use debug_assert if you want to be sure!

But thanks for the clarification and sorry for the dumb questions.

No worries at all! I think your questions are pretty spot on actually :)

thomaseizinger · 2023-11-15T09:42:34Z

autonat/autonat-v2.plantuml

+
+Cli -> Srv: [dial] DialRequest:{nonce: 0xabcd, addrs: (addr1, addr2, addr3)}
+Srv -> Cli: [attempt]addr2 DialAttempt:{nonce: 0xabcd}
+Srv -> Cli: [dial] DialResponse:{status: OK, dialStatuses:(E_TRANSPORT_NOT_SUPPORTED, OK)} 


This mentions E_TRANSPORT_NOT_SUPPORTED but that is missing from the protobufs?

umgefahren · 2024-01-28T15:54:31Z

While doing the rust-libp2p implementation, we discovered a race condition, which we are now circumventing by a 100ms delay. You can read the finally comment by @thomaseizinger here: umgefahren/rust-libp2p#1 (comment)

It happens when the server successfully performs a dial back, thus sends the confirmation of the address back to the client. However the client hasn't progressed enough to be notified of that successful dial back when receiving the confirmation. In that case the client wrongly assumed an address was confirmed where no dial back occurred.

thomaseizinger · 2024-01-29T04:45:26Z

In that case the client wrongly assumed an address was confirmed where no dial back occurred.

Minor correction here: The behaviour is usually that the client discards the "successful" confirmation because it has not yet processed the dial-back so it thinks the server is sending it a confirmation without having actually done the dial.

I think the correct way to solve this would be to add an ACK message from the client back to the server for the dial-back where the client can say: "Yes I've processed your dial-back". The server can then proceed to respond on the other stream and thus guarantee that we don't have a race condition between the two streams.

sukunrt · 2024-01-29T04:55:11Z

You can read the closing of the stream as the ACK. See: https://github.com/libp2p/go-libp2p/blob/sukun/autonat-v2-2/p2p/protocol/autonatv2/server.go#L251-L257

The spec also dictates closing the stream: https://github.com/libp2p/specs/blame/autonat-v2/autonat/autonat-v2.md#L87

Do you think an explicit ACK is better?

thomaseizinger · 2024-01-29T04:58:38Z

You can read the closing of the stream as the ACK. See: libp2p/go-libp2p@sukun/autonat-v2-2/p2p/protocol/autonatv2/server.go#L251-L257

The spec also dictates closing the stream: autonat-v2/autonat/autonat-v2.md#L87 (blame)

Do you think an explicit ACK is better?

Yeah I think so. I associate closing a stream with "I have no more data to write". The client never writes data so why wouldn't it immediately close the stream? Also, reading a stream and waiting for that to fail because it has been closed it also somewhat odd 🤷‍♂️

sukunrt · 2024-01-29T05:17:35Z

The client never writes data so why wouldn't it immediately close the stream?

That's a fair point. I'll add an ACK.

sukunrt · 2024-02-05T16:33:58Z

Updated the specs with a DialBackResponse

thomaseizinger

Nice, thank you!

sukunrt force-pushed the autonat-v2 branch from b95fb58 to 9e086f8 Compare April 11, 2023 07:46

sukunrt requested a review from marten-seemann April 11, 2023 07:51

sukunrt marked this pull request as ready for review April 11, 2023 12:05

sukunrt requested review from mxinden and MarcoPolo April 11, 2023 12:05

marten-seemann reviewed Apr 11, 2023

View reviewed changes

sukunrt marked this pull request as draft April 11, 2023 16:51

sukunrt force-pushed the autonat-v2 branch from 9e086f8 to f973931 Compare April 12, 2023 08:28

sukunrt changed the base branch from master to autonat-rename April 12, 2023 08:29

sukunrt force-pushed the autonat-v2 branch from f973931 to b3bd5e0 Compare April 12, 2023 09:04

add autonat v2 spec

d663611

sukunrt force-pushed the autonat-v2 branch from b3bd5e0 to d663611 Compare April 12, 2023 09:04

sukunrt mentioned this pull request Apr 12, 2023

Proposal: use a list of addresses in priority order for autonat v2 dial requests #539

Open

sukunrt marked this pull request as ready for review April 12, 2023 10:11

sukunrt commented Apr 15, 2023

View reviewed changes

autonat/autonat-v2.md Outdated Show resolved Hide resolved

use priority ordered list in requests for autonat-v2

1db8613

sukunrt mentioned this pull request Apr 19, 2023

use a priority ordered list of addresses in autonat v2 #541

Merged

sukunrt added 2 commits April 21, 2023 20:08

only send index of the dialed address

0ff8ac6

accept a priority ordered list of addresses for dial requests

f2a431c

thomaseizinger reviewed Apr 23, 2023

View reviewed changes

Improve naming for messages

62123df

add interaction diagram

0771bab

mxinden reviewed Apr 26, 2023

View reviewed changes

autonat/autonat-v2.md Show resolved Hide resolved

autonat/autonat-v2.md Outdated Show resolved Hide resolved

autonat/autonat-v2.md Outdated Show resolved Hide resolved

address review comments

3e57202

sukunrt force-pushed the autonat-v2 branch from 2c82bd1 to 3e57202 Compare April 27, 2023 11:33

MarcoPolo reviewed Apr 27, 2023

View reviewed changes

autonat/autonat-v2.md Outdated Show resolved Hide resolved

autonat/autonat-v2.md Outdated Show resolved Hide resolved

fix dial-request protocol name

b4a856b

thomaseizinger reviewed Nov 15, 2023

View reviewed changes

mxinden mentioned this pull request Nov 16, 2023

Autonat doesn't support multiple addresses well libp2p/rust-libp2p#4873

Open

add a response to the dialback stream

1c76613

sukunrt requested a review from thomaseizinger February 5, 2024 16:29

thomaseizinger approved these changes Feb 14, 2024

View reviewed changes

MarcoPolo mentioned this pull request Jun 19, 2024

AutoNAT: Network ReachabilityPublic distinguishes between IPv6 and IPv4 #614

Closed

MarcoPolo linked an issue Jun 19, 2024 that may be closed by this pull request

AutoNAT: Network ReachabilityPublic distinguishes between IPv6 and IPv4 #614

Closed

allow the client to send slightly more dial data

03718ef

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add autonat v2 spec #538

add autonat v2 spec #538

sukunrt commented Apr 11, 2023 •

edited

Loading

marten-seemann left a comment

sukunrt commented Apr 11, 2023

thomaseizinger left a comment •

edited

Loading

sukunrt commented Apr 25, 2023

mxinden left a comment

thomaseizinger commented Apr 27, 2023

MarcoPolo left a comment

sukunrt commented Apr 27, 2023

marten-seemann commented Sep 18, 2023

umgefahren commented Oct 30, 2023

umgefahren commented Oct 30, 2023

sukunrt commented Oct 30, 2023

sukunrt commented Oct 30, 2023

umgefahren commented Oct 30, 2023 •

edited

Loading

sukunrt commented Oct 30, 2023

umgefahren commented Nov 7, 2023

umgefahren commented Nov 8, 2023

sukunrt commented Nov 8, 2023

umgefahren commented Nov 8, 2023

sukunrt commented Nov 8, 2023

thomaseizinger commented Nov 9, 2023

umgefahren commented Nov 9, 2023

thomaseizinger commented Nov 10, 2023

thomaseizinger Nov 15, 2023

umgefahren commented Jan 28, 2024

thomaseizinger commented Jan 29, 2024

sukunrt commented Jan 29, 2024

thomaseizinger commented Jan 29, 2024

sukunrt commented Jan 29, 2024

sukunrt commented Feb 5, 2024

thomaseizinger left a comment

add autonat v2 spec #538

Are you sure you want to change the base?

add autonat v2 spec #538

Conversation

sukunrt commented Apr 11, 2023 • edited Loading

marten-seemann left a comment

Choose a reason for hiding this comment

sukunrt commented Apr 11, 2023

thomaseizinger left a comment • edited Loading

Choose a reason for hiding this comment

sukunrt commented Apr 25, 2023

mxinden left a comment

Choose a reason for hiding this comment

thomaseizinger commented Apr 27, 2023

MarcoPolo left a comment

Choose a reason for hiding this comment

sukunrt commented Apr 27, 2023

marten-seemann commented Sep 18, 2023

umgefahren commented Oct 30, 2023

umgefahren commented Oct 30, 2023

sukunrt commented Oct 30, 2023

sukunrt commented Oct 30, 2023

umgefahren commented Oct 30, 2023 • edited Loading

sukunrt commented Oct 30, 2023

umgefahren commented Nov 7, 2023

umgefahren commented Nov 8, 2023

sukunrt commented Nov 8, 2023

umgefahren commented Nov 8, 2023

sukunrt commented Nov 8, 2023

thomaseizinger commented Nov 9, 2023

umgefahren commented Nov 9, 2023

thomaseizinger commented Nov 10, 2023

thomaseizinger Nov 15, 2023

Choose a reason for hiding this comment

umgefahren commented Jan 28, 2024

thomaseizinger commented Jan 29, 2024

sukunrt commented Jan 29, 2024

thomaseizinger commented Jan 29, 2024

sukunrt commented Jan 29, 2024

sukunrt commented Feb 5, 2024

thomaseizinger left a comment

Choose a reason for hiding this comment

sukunrt commented Apr 11, 2023 •

edited

Loading

thomaseizinger left a comment •

edited

Loading

umgefahren commented Oct 30, 2023 •

edited

Loading