feat!: retrieval server and client by alanshaw · Pull Request #48 · storacha/go-ucanto

alanshaw · 2025-06-17T13:46:50Z

This PR implements the RFC here. It exposes a server and client implementation that allows UCAN authorized retrieval requests via invocations (and receipts) passed in HTTP headers. This leaves the HTTP response body available to be used for retrieved bytes.

A retrieval server is very similar to a normal Ucanto server, except it requires invocations to be sent using the headercar transport codec. The only other difference is that invocation handlers receive an extra argument - the HTTP request info, and can return and additional value - a HTTP response.

The retrieval client is also very similar to a Ucanto client, except that it has the ability to send an invocation in multiple parts, if it does not fit in HTTP headers. Essentially it'll send proofs one by one until the server has all the proofs required to execute the invocation. The server has an LRU cache allowing for this.

The PR also includes a transport codec that encodes agent messages into HTTP headers.

🎬 Demo: https://youtu.be/11np-cGTe48?si=kw88R1DAlMSq-b1T

resolves #59

codecov · 2025-06-17T13:47:44Z

Codecov Report

❌ Patch coverage is 18.16143% with 730 lines in your changes missing coverage. Please review.

Files with missing lines	Patch %	Lines
server/retrieval/server.go	0.00%	273 Missing ⚠️
client/retrieval/connnection.go	47.90%	62 Missing and 25 partials ⚠️
testing/helpers/printer/printer.go	0.00%	72 Missing ⚠️
server/retrieval/options.go	0.00%	58 Missing ⚠️
core/message/message.go	0.00%	49 Missing ⚠️
transport/http/channel.go	0.00%	36 Missing ⚠️
server/retrieval/error.go	0.00%	27 Missing ⚠️
transport/headercar/codec.go	0.00%	26 Missing ⚠️
transport/headercar/message/header.go	53.70%	17 Missing and 8 partials ⚠️
transport/headercar/request/request.go	0.00%	14 Missing ⚠️
... and 11 more

Files with missing lines	Coverage Δ
core/car/car.go	`58.02% <100.00%> (ø)`
core/delegation/delegation.go	`45.74% <100.00%> (+3.69%)`	⬆️
core/ipld/hash/sha256/sha256.go	`0.00% <ø> (ø)`
core/receipt/ran/ran.go	`0.00% <ø> (ø)`
server/server.go	`73.85% <100.00%> (ø)`
client/connection.go	`0.00% <0.00%> (ø)`
testing/helpers/helpers.go	`0.00% <0.00%> (ø)`
transport/car/codec.go	`0.00% <0.00%> (ø)`
server/retrieval/cache.go	`80.00% <80.00%> (ø)`
core/receipt/receipt.go	`56.77% <71.42%> (+0.14%)`	⬆️
... and 16 more

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

alanshaw · 2025-08-06T16:03:59Z

Note to self: need to set caching headers to expire when delegation expires + Vary on X-Agent-Message

…executed

frrist

Good stuff, there's a lot here!
Will give another look after some conversation in comments and related RFC.

frrist · 2025-08-27T17:41:21Z

+				return nil, nil, fmt.Errorf("decoding body: %w", err)
+			}
+			if len(model.Proofs) == 0 {
+				return nil, nil, fmt.Errorf("missing missing proofs: %w", err)


maybe something like: "Server did not include missing proofs in response"?

frrist · 2025-08-27T17:48:23Z

+	return c.hasher()
+}
+
+func Execute(ctx context.Context, inv invocation.Invocation, conn client.Connection) (client.ExecutionResponse, transport.HTTPResponse, error) {


nit: preference for this method to be broken down a bit more. Ideaally separate functions for "regular" requests and "multipart" requests at a minimum.

Additionally, I'd appreciate a comment on the top of this method describing the flow of things, something like:

Execute performs a UCAN invocation using the headercar transport, implementing a "probe and retry" pattern to handle HTTP header size limitations. The method first attempts to send the complete invocation (including all proofs) in HTTP headers. If this fails due to size constraints (4KB header limit), it falls back to a multipart negotiation protocol: 1. Send invocation with ALL proofs omitted 2. Server responds with 510 (Not Extended) listing missing proof CIDs 3. Send partial invocations with each missing proof attached one by one as requested (TODO I probably have this wrong) 4. Repeat until server has all required proofs (200/206 response) This approach optimizes for the common case (shallow delegation chains that fit in headers) while also handling deep proof chains that require multiple round trips. The server caches proofs between requests, so each proof only needs to be sent once per session. Note: The current implementation processes missing proofs sequentially rather than in batches, which means deep delegation chains will result in multiple HTTP round trips. This trade-off prioritizes implementation simplicity over network efficiency, which is acceptable given current delegation chain depths but may need optimization as authorization hierarchies grow deeper. Returns the execution response, the final HTTP response, and any error encountered.

Based on a read of the implementation, I think my comment here: https://github.com/storacha/specs/pull/139/files#r2304572067 is probably an incorrect understanding. Though this does motivate, to me at least, a separate endpoint on nodes where proofs can be Put once before a Get, though I haven't really thought this through completely, so I might be speaking non-sense 😅

👍 will do. Please check my response https://github.com/storacha/specs/pull/139/files#r2334017925

frrist · 2025-08-27T18:46:30Z

 	}.Sum(bytes)
 	return cidlink.Link{Cid: c}
 }
+


might want to replace this package with https://github.com/storacha/go-libstoracha/blob/main/testutil in the future.

That ends up being circular I think...

frrist · 2025-08-27T18:47:11Z

 		return nil, fmt.Errorf("decoding CAR: %w", err)
 	}
+	if len(roots) != 1 {
+		return nil, fmt.Errorf("unexpected number of roots: %d", len(roots))


nit: include the expected number of roots in the error message, i.e. 1

frrist · 2025-08-27T18:47:24Z

+	if len(roots) != 1 {
+		return nil, fmt.Errorf("unexpected number of roots: %d", len(roots))
+	}


frrist · 2025-08-27T18:56:17Z

Sign in to view

+	r, w := io.Pipe()
+	go func() {
+		gz := gzip.NewWriter(w)
+		_, err := io.Copy(gz, data)
+		gz.Close()
+		w.CloseWithError(err)
+	}()
+
+	var b bytes.Buffer
+	_, err := b.ReadFrom(r)
+	if err != nil {
+		return "", fmt.Errorf("reading encoded CAR: %w", err)
+	}


Can we remove this go routine, maybe something like this?

var b bytes.Buffer gz := gzip.NewWriter(&b) _, err := io.Copy(gz, data) if err != nil { gz.Close() return "", fmt.Errorf("compressing CAR data: %w", err) } if err := gz.Close(); err != nil { return "", fmt.Errorf("closing gzip writer: %w", err) }

Ahh yes much nicer.

Co-authored-by: Forrest <forrest@storacha.network>

frrist

I don't see anything in here that causes me to want to block a merge. I'd like the Execute method to be broken into smaller methods, but not worth blocking on that point alone.
sooo LGTM 🚢 🚂 good stuff!

(though, it would probably be good to have a second set of eyes/approval - your call ofc)

alanshaw · 2025-09-11T14:56:40Z

@frrist feedback addressed. I have also added an upper bound to the number of requests that will be made, if you have a delegation chain longer than 50 then you cannot use this code...

Let me know if you have any issues with that and I'll address in a new PR.

frrist · 2025-09-11T15:00:19Z

+	// if the header fields are too big, we need to split the delegation into
+	// multiple requests...
+	if multi {
+		response, err = sendPartialInvocations(ctx, inv, conn)
+		if err != nil {
+			return nil, nil, fmt.Errorf("sending partial invocations: %w", err)
+		}
+	}
+
+	output, err := conn.Codec().Decode(response)
+	if err != nil {
+		return nil, nil, fmt.Errorf("decoding message: %w", err)
+	}
+
+	return client.ExecutionResponse(output), response, nil


This is way better! Much easier to follow chain of execution logic 🫶

frrist · 2025-09-11T15:04:30Z

+			return nil, fmt.Errorf("unexpected status code: %d", res.Status())
+		}
+
+		body, err := io.ReadAll(res.Body())


do we need to close the res.Body()?

frrist · 2025-09-11T15:07:47Z

+	}
+
+	// now send the parts
+	for range MaxPartialInvocationReqs {


In a future version, we could do something like:

for { select { case ctx.Done(): return ctx.Err() default: // continue with existing logic } // existing logic }

This would allow the caller to decide how long they want to let this request run; MaxPartialInvocationReqs seems pragmatic enough for now.

frrist · 2025-09-11T15:12:41Z

I have also added an upper bound to the number of requests that will be made, if you have a delegation chain longer than 50 then you cannot use this code

SGTM, I suspect this case will be very unlikely, but proposed an alternative in the comments.

My only concern is this: #48 (comment) - it looks like we may have a resource leak by not closing the body, but unsure.

alanshaw force-pushed the feat/headercar-transport-codec branch from 044368c to d303891 Compare June 17, 2025 15:51

alanshaw marked this pull request as ready for review June 25, 2025 14:34

alanshaw requested review from a team, frrist and hannahhoward June 25, 2025 18:05

alanshaw requested a review from volmedo as a code owner July 4, 2025 10:36

alanshaw added 17 commits August 12, 2025 21:44

feat: header CAR transport codec

93a7bcd

refactor: constructor names

b5d25a7

test: a test

e6e29f6

feat: add logging

573f4d1

feat: add logging

4c27656

refactor: allow use without ucanto server

ba0adde

test: add roundtrip test

3167bb3

refactor: rename import

de7cd56

refactor: rename to body provider

ebff079

feat: allow nil header to be returned

962aa4f

fix: response status for failures

78d7023

fix: handle nil body

4fe924c

feat: wip retrieval server

a61e69f

feat: wip

51c3b8d

feat: wip try new server

2ce1c7d

fix: other files

59e18d0

feat: wip

af2d42d

alanshaw force-pushed the feat/headercar-transport-codec branch from 8544963 to af2d42d Compare August 12, 2025 19:45

alanshaw added 4 commits August 13, 2025 17:18

feat: server implementation

4ae4a6d

feat: send partial agent messages

9620caa

refactor: do not respond with a receipt when invocation has not been …

e323875

…executed

fix: decode in client

2a71726

alanshaw added 3 commits August 14, 2025 22:36

fix: add tests

ffdbddc

test: add tests

d3de300

chore: mod tidy

3661973

alanshaw changed the title ~~feat: header CAR transport codec~~ feat: retrieval server and client Aug 18, 2025

alanshaw changed the title ~~feat: retrieval server and client~~ feat!: retrieval server and client Aug 18, 2025

alanshaw added 2 commits August 18, 2025 18:18

feat: add Vary header to responses

9c6a99e

fix: set content-type header

c954677

frrist mentioned this pull request Aug 19, 2025

Single Process Piri Node storacha/piri#192

Merged

frrist reviewed Aug 27, 2025

View reviewed changes

Update server/retrieval/server.go

dc52000

Co-authored-by: Forrest <forrest@storacha.network>

frrist reviewed Sep 10, 2025

View reviewed changes

Comment thread server/retrieval/cache.go

frrist approved these changes Sep 10, 2025

View reviewed changes

refactor: address feedback

9910b43

frrist reviewed Sep 11, 2025

View reviewed changes

alanshaw mentioned this pull request Sep 11, 2025

Switch retrieval server delegation cache to ARC cache #61

Open

alanshaw merged commit 5cf60dc into main Sep 11, 2025
1 of 2 checks passed

alanshaw deleted the feat/headercar-transport-codec branch September 11, 2025 15:03

frrist reviewed Sep 11, 2025

View reviewed changes

Uh oh!

Conversation

alanshaw commented Jun 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

codecov Bot commented Jun 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

alanshaw commented Aug 6, 2025

Uh oh!

frrist left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

alanshaw Sep 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

This comment was marked as resolved.

Uh oh!

Uh oh!

frrist left a comment

Choose a reason for hiding this comment

Uh oh!

alanshaw commented Sep 11, 2025

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

frrist commented Sep 11, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

alanshaw commented Jun 17, 2025 •

edited

Loading

codecov Bot commented Jun 17, 2025 •

edited

Loading

alanshaw Sep 10, 2025 •

edited

Loading