Stream Migration Protocol #328

Stebalien · 2021-05-24T23:59:34Z

Libp2p should support protocol agnostic stream migration. This will make upgrading to better transports "seamless".

Requirement

Transport agnostic. Really, this means migrating at the stream level.
Minimal overhead. Overhead should be at most a small per-stream cost (no additional framing, etc.)
No interruption. Reading/writing should be continuous.
Transparent. Applications using migratable streams shouldn't notice anything.
Correct. There can't be any ambiguity (one side believing the migration happened, the other side disagreeing, etc.).

Protocol

Here we describe a protocol for migrating a "source" stream (stream A) to a "target" stream (stream B).

When opening a stream, if the remote peer supports the stream migration protocol (discovered through identify) and the stream is "long lived" (opt in or opt out?):

First, open a stream (stream A).
Negotiate the stream migration protocol.
Send a message indicating that we're opening a new stream, along with a unique ID for the stream.
Negotiate the actual protocol.

Because we "know" that the peer supports this stream migration protocol, we can pipeline these steps so as not to take any additional round-trips.

To migrate a stream:

The initiator of the stream migration will open a new stream (stream B), on a the "target" connection (the connection to which we're migrating the stream), to the remote peer. This is the "target" stream for the migration.
On stream B, the initiator will negotiate the stream migration protocol.
On stream B, the initiator will send a message indicating that we're migrating a stream, along with the ID of the stream we're migrating (the source stream, stream A).
The receiver will acknowledge the migration (on stream B) and, from this point onward, treat any "EOF" (close) on stream A as a migration to stream B.
When the initiator receives the acknowledgement on stream B, it will close stream A for writing, and start writing on stream B.
When the receiver receives the EOF on stream A, it will send an EOF on stream A, switching over to stream B.
When the initiator receives an EOF on stream A, it will start reading on stream B.

At this point, the stream is fully migrated.

Resets

If either stream is "reset" before both ends are closed, both streams must be reset and the stream as a whole should be considered "aborted" (reset).

Half-Closed

If stream A was half-closed (either for reading or writing), that state must be replicated on the new stream after the initial handshake. Importantly, there's an edge-case:

The receiver tries to stream A for writing.
The receiver receives a migration request on stream B, for stream A.
The receiver ACKs the migration request.
The initiator sees the ACK on stream B.
The initiator sees the EOF on stream A, and treats it as the migration EOF.

This is fine. The stream will be migrated and the EOF will be re-played on stream B, leaving stream B in the intended state.

Analysis

Transport agnostic: This protocol can migrate any stream from any transport to any other stream-based transport. It can even migrate unidirectional and half-closed streams, as long as the new transport supports opening bidirectional streams, and can subsequently half-close them.
Overhead: This protocol will have a small overhead due to the multistream header, and stream ID, but that shouldn't be much in the grand scheme of things (especially if multistream 2 lands at some point). Importantly, this protocol requires no message framing.
Interruption: Writing switches instantly to an already prepared stream with no delay.
Transparent: This protocol supports all the normal stream features (half-close, reset, etc.).
Correct: There are no "undecidable cases" (to be confirmed in a PoC implementation).

Stebalien · 2021-05-24T23:59:58Z

(cc @marten-seemann who helped design this)

Stebalien · 2021-05-25T00:03:07Z

Note: This protocol will not allow recovering a session if "lost" (i.e., the connection was cut). Doing so would require keeping large write buffers and tracking acknowledgement states in userspace. This protocol will primarily aid the connection manager combine duplicate connections into a single connection, or migrate streams from a worse connection (e.g., TCP) to a better connection (e.g., QUIC).

bertrandfalguiere · 2021-05-25T08:01:04Z

Will this be used for upgrade from relayed connections to direct connection ?

yusefnapora · 2021-05-25T14:25:42Z

When the receiver receives the EOF on stream A, it will send an EOF on stream B, switching over to stream A.

Is this reversed? It seems like the receiver should send EOF on stream A and switch to stream B.

Looks like a great proposal to me 👍

Stebalien · 2021-05-25T14:36:55Z

Will this be used for upgrade from relayed connections to direct connection ?

Yes.

Is this reversed? It seems like the receiver should send EOF on stream A and switch to stream B.

Yes...

SgtPooki · 2023-08-07T20:18:20Z

Overhead should be at most a small per-stream cost (no additional framing, etc.)

I'm not sure if this is accurate. Based on step 1, "open a new stream on a new connection" there is a new connection made.

Maybe "at most, a small per-stream cost + new connection overhead," but I may lack understanding here.

The initiator will open a new stream (stream B), on a new connection, to the receiver. This is the target stream for the migration.

This and other lines are quite confusing. By "target" stream for the migration, do we mean the resulting stream? or target stream to be migrated? Technically, there are two streams targeted by a stream migration.

It would be nice to clarify the terminology for the two streams in the migration. It seems like Stream B is the "final" stream, and Stream A is the to-be-migrated stream. It would be nice to clarify and make language consistent in the spec.

Potential legend:

Term	Definition
Leader	The Peer who begins/initiates the connection with the Participant peer
Participant	The Peer who receives/acknowledges the connection and streams with the Leader peer
Negotiation-stream	An initial stream created in an existing connection between Leader and Participant peers.
Goal-stream	A new stream, using an "upgraded" transport when compared to the Negotiation stream, created on a new connection between leader and participant peers.

Using this legend, the spec would change as follows:

Requirement

Transport agnostic. Really, this means migrating at the stream level.
Minimal overhead. Overhead should be at most a small per-stream cost (no additional framing, etc.)
No interruption. Reading/writing should be continuous.
Transparent. Applications using migratable streams shouldn't notice anything.
Correct. There can't be any ambiguity (one side believing the migration happened, the other side disagreeing, etc.).

Protocol

When opening a stream, if the target peer supports the stream migration protocol (discovered through identify) and the stream is "long lived" (opt in or opt out?):

First, open a stream (Negotiation-stream).
Negotiate the stream migration protocol.
Send a message indicating that we're opening a new stream (Goal-stream), along with a unique ID for the stream.
Negotiate the actual protocol.

Because we "know" that the peer supports this stream migration protocol, we can pipeline these steps so as not to take any additional round-trips.

To migrate a stream:

The Leader will open a new stream (Goal-stream), on a new connection, to the Participant.
On Goal-stream, the Leader will negotiate the stream migration protocol.
On Goal-Stream, the Leader will send a message indicating that we're migrating a stream, along with the ID of the stream we're migrating
The Participant will acknowledge the migration (on Goal-stream) and, from this point onward, treat any "EOF" (close) on Negotiation-stream as a migration to Goal-stream.
When the Leader receives the acknowledgement on Goal-stream, it will close Negotiation-stream for writing, and start writing on Goal-stream.
When the Participant receives the EOF on Negotiation-stream, it will send an EOF on Negotiation-stream, switching over to Goal-stream.
When the Leader receives an EOF on Negotiation-stream, it will start reading on Goal-stream.

At this point, the stream is fully migrated.

Resets

If either stream is "reset" before both ends are closed, both streams must be reset and the stream as a whole should be considered "aborted" (reset).

Half-Closed

If Negotiation-stream was half-closed (either for reading or writing), that state must be replicated on the new stream after the initial handshake. Importantly, there's an edge-case:

The Participant tries to use the Negotiation-stream for writing.
The Participant receives a migration request on Goal-stream, for Negotiation-stream.
The Participant ACKs the migration request.
The Leader sees the ACK on Goal-stream.
The Leader sees the EOF on Negotiation-stream, and treats it as the migration EOF.

This is fine. The stream will be migrated and the EOF will be re-played on Goal-stream, leaving Goal-stream in the intended state.

Analysis

Transport agnostic: This protocol can migrate any stream from any transport to any other stream-based transport. It can even migrate unidirectional and half-closed streams, as long as the new transport supports opening bidirectional streams, and can subsequently half-close them.
Overhead: This protocol will have a small overhead due to the multistream header, and stream ID, but that shouldn't be much in the grand scheme of things (especially if multistream 2 lands at some point). Importantly, this protocol requires no message framing.
Interruption: Writing switches instantly to an already prepared stream with no delay.
Transparent: This protocol supports all the normal stream features (half-close, reset, etc.).
Correct: There are no "undecidable cases" (to be confirmed in a PoC implementation).

The receiver tries to stream A for writing.

"The receiver tries to [use?] stream A for writing.

Stebalien · 2023-08-07T21:23:29Z

I'm not sure if this is accurate. Based on step 1, "open a new stream on a new connection" there is a new connection made.

Well, this is a stream migration protocol. The goal is to migrate a stream from connection A to connection B. In this case, that "new connection" is connection B and the "new stream" is the the stream we're migrating from connection A.

I think the confusion is "new connection". I'll rename them to "target" and "source".

This and other lines are quite confusing. By "target" stream for the migration, do we mean the resulting stream? or target stream to be migrated? Technically, there are two streams targeted by a stream migration.

It's the stream to which we're migrating. I'll try to clarify it a bit.

Stebalien · 2023-08-07T21:27:15Z

I've tried to make it a bit more explicit.

MarcoPolo · 2023-08-07T23:37:21Z

fyi, we have this as a spec proposal: #406

We haven't merged because there hasn't been a real implementation nor the demand for it.

Longer term, I'd prefer more effort focused on connection migration in QUIC rather than this effort because:

Connection migration is well defined.
It would be better to use connection migration for the QUIC transport rather than this.
QUIC is the majority of the network.

Stebalien · 2023-08-07T23:46:38Z

In this migration protocol, I'm primarily targeting migrating streams off a relay and/or "combining" connections when we happen to establish multiple.

MarcoPolo · 2023-08-07T23:51:55Z

I think focusing on the "migrating off relay" use case is good. However I'm not sure in practice what you would do that starts on a public relay and continues on a direct connection. Because public relays are so limited (128KB/2min on Kubo) they aren't useful for much besides trying to get a direct connection. You wouldn't start fetching a file on a relayed connection and then continue on a direct one. Maybe there's a use case I'm missing?

MarcoPolo · 2023-08-07T23:52:43Z

"combining" connections when we happen to establish multiple.

Hopefully this is less prevalent now with the smart dialing work: https://github.com/libp2p/go-libp2p/releases/tag/v0.29.0

Stebalien · 2023-08-08T00:04:34Z

You wouldn't start fetching a file on a relayed connection and then continue on a direct one.

I could see sending a wantlist (bitswap) over a relay. Technically we could just kill the stream and re-create it.

But yeah, QUIC stream migration is higher priority and likely better in most cases.

marten-seemann · 2023-08-08T01:41:37Z

If / when https://datatracker.ietf.org/doc/draft-seemann-quic-nat-traversal/ ever becomes a reality, you'll be able to migrate your relayed QUIC connection to a hole-punched connection. Just to set expectations, this is very likely not going to happen within the next 12 months.

SgtPooki · 2023-09-20T22:30:48Z

I think focusing on the "migrating off relay" use case is good. However I'm not sure in practice what you would do that starts on a public relay and continues on a direct connection.

One example is a browser js-libp2p node who ends up having only p2p-circuit dialable multiaddrs.

Couldn't any node who has limited transport capabilities, and relies on relays to talk to the network, benefit from this? or is DCUtR supposed to handle most of those use-cases?

DCUtR attempted to solve this for us in js-libp2p and Helia land. To my untrained eyes, it seems very similar, but instead of an up-front connection migration (transient -> direct), it would be a mid-flight migration.

If we did implement a stream-migration protocol, would that allow us to stop limiting relay throughput, and instead depend upon DCUtR + stream-migration(SM) in order to transition the relay-started-transfer to a stream on the direct connection? If the DCUtR+SM process failed, we could drop the connection.. but in that case, isn't it better to just attempt DCUtR and never start the transfer if it doesn't succeed?

(apologies for the dumb questions, just trying to get on all of your libp2p-experts'-brainwaves)

Stebalien added the enhancement label May 24, 2021

Stebalien mentioned this issue Aug 6, 2021

Libp2p does not recover from interface being temporally down libp2p/go-libp2p#374

Closed

mxinden mentioned this issue Aug 11, 2021

relay/DCUtR: Add Direct Connection Upgrade through Relay protocol #173

Merged

mxinden mentioned this issue Dec 16, 2021

protocols/relay: Implement circuit relay v2 protocol libp2p/rust-libp2p#2059

Merged

marten-seemann assigned MarcoPolo Apr 11, 2022

MarcoPolo mentioned this issue Apr 13, 2022

Add a Stream Migration spec #406

Open

mxinden mentioned this issue Apr 16, 2022

protocols/dcutr: Don't retry when incoming direct connection succeeded libp2p/rust-libp2p#2607

Open

achingbrain mentioned this issue Jun 1, 2022

Multiple connection pruning libp2p/go-libp2p#634

Open

achingbrain mentioned this issue May 25, 2023

Topology on onDisconnect difference from onConnect makes it hard to distinguish between connections libp2p/js-libp2p#1755

Closed

aschmahmann mentioned this issue Jul 7, 2023

swarm connect will use any available address not just the given multiaddress ipfs/kubo#9895

Open

3 tasks

achingbrain mentioned this issue Aug 4, 2023

test: add tests for non-unilateral DCUtR over TCP libp2p/js-libp2p#1932

Closed

SgtPooki mentioned this issue Aug 7, 2023

Create spec legend and terminology guideline #565

Open

5 tasks

achingbrain mentioned this issue Oct 4, 2023

webrtc(private-to-private): clarify interaction with DCUtR #583

Closed

Jorropo mentioned this issue Mar 8, 2024

feat: add connection selection logic libp2p/go-libp2p#2726

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Stream Migration Protocol #328

Stream Migration Protocol #328

Stebalien commented May 24, 2021 •

edited

Loading

Stebalien commented May 24, 2021

Stebalien commented May 25, 2021

bertrandfalguiere commented May 25, 2021

yusefnapora commented May 25, 2021

Stebalien commented May 25, 2021

SgtPooki commented Aug 7, 2023

Requirement

Protocol

Analysis

Stebalien commented Aug 7, 2023

Stebalien commented Aug 7, 2023

MarcoPolo commented Aug 7, 2023

Stebalien commented Aug 7, 2023

MarcoPolo commented Aug 7, 2023

MarcoPolo commented Aug 7, 2023

Stebalien commented Aug 8, 2023

marten-seemann commented Aug 8, 2023

SgtPooki commented Sep 20, 2023

Stream Migration Protocol #328

Stream Migration Protocol #328

Comments

Stebalien commented May 24, 2021 • edited Loading

Requirement

Protocol

Analysis

Stebalien commented May 24, 2021

Stebalien commented May 25, 2021

bertrandfalguiere commented May 25, 2021

yusefnapora commented May 25, 2021

Stebalien commented May 25, 2021

SgtPooki commented Aug 7, 2023

Requirement

Protocol

Analysis

Stebalien commented Aug 7, 2023

Stebalien commented Aug 7, 2023

MarcoPolo commented Aug 7, 2023

Stebalien commented Aug 7, 2023

MarcoPolo commented Aug 7, 2023

MarcoPolo commented Aug 7, 2023

Stebalien commented Aug 8, 2023

marten-seemann commented Aug 8, 2023

SgtPooki commented Sep 20, 2023

Stebalien commented May 24, 2021 •

edited

Loading