feat(ocap-kernel): add resource limits for remote communications #714

sirtimid · 2025-12-18T14:38:56Z

Closes #660

Add connection limit (default 100 concurrent connections)
Add message size limit (default 1MB per message)
Add stale peer cleanup (removes data for peers disconnected >1 hour)
Make all limits configurable via RemoteCommsOptions
Add ResourceLimitError for limit violations
Add comprehensive tests for all resource limits

This prevents memory exhaustion and manages system resources by:

Rejecting new connections when limit is reached
Rejecting messages exceeding size limit
Periodically cleaning up stale peer data

Note

Introduces resource enforcement in remote comms and a dedicated error type, with robust reconnection/race handling and cleanup.

Enforces max concurrent connections (default 100) and max message size (default 1MB) in network.ts; rejects excess with ResourceLimitError; all limits configurable via RemoteCommsOptions (maxConcurrentConnections, maxMessageSizeBytes, cleanupIntervalMs, stalePeerTimeoutMs).
Adds periodic stale peer cleanup (default every 15m; peers idle >1h) clearing queues, hints, reconnection state; integrates wake handling and intentional-close logic.
Improves reconnection: ignores stale channel errors, reuses existing channels, rechecks limits post-dial, flushes queues on channel replacement.
Adds ConnectionFactory.closeChannel to explicitly close/abort underlying streams and uses it for rejected/replaced channels.
Adds ResourceLimitError (code RESOURCE_LIMIT_ERROR) with marshal/unmarshal validation and exports.
Expands tests across limits, reconnection races, inbound handling, channel replacement/closing; adjusts coverage thresholds for ocap-kernel.

^{Written by Cursor Bugbot for commit 71fe98b. This will update automatically on new commits. Configure here.}

packages/ocap-kernel/src/remotes/network.ts

packages/kernel-errors/src/errors/ResourceLimitError.ts

packages/ocap-kernel/src/remotes/network.ts

rekmarks · 2026-01-05T17:56:51Z

@cursor review

packages/ocap-kernel/src/remotes/network.ts

- Add connection limit (default 100 concurrent connections) - Add message size limit (default 1MB per message) - Add stale peer cleanup (removes data for peers disconnected >1 hour) - Make all limits configurable via RemoteCommsOptions - Add ResourceLimitError for limit violations - Add comprehensive tests for all resource limits This prevents memory exhaustion and manages system resources by: - Rejecting new connections when limit is reached - Rejecting messages exceeding size limit - Periodically cleaning up stale peer data

packages/ocap-kernel/src/remotes/network.ts

FUDCo

This looks fine for what it is.

One lingering question I have is that as we get more careful about detecting error conditions (or proclaiming them, in the case of configurable resource limits), are we potentially setting ourselves up for situations where an error bubbles up to user code at some location other than the location that is actually responsible for causing it? In other words, will a message transmission error always find its way back to the actual send operation that triggered it?

More generally, could user code find itself in an unrecoverable error state (by that I don't mean a state where there's an error that you can't get rid of -- things can always break unfixably, e.g., a remote host dies forever -- but rather a state where you don't actually know you're stuck). It's entirely plausible to me that everything is fine, but I can't tell from reading the tests whether our tests give us reason to believe we are ok on this score.

sirtimid · 2026-01-06T15:55:34Z

One lingering question I have is that as we get more careful about detecting error conditions (or proclaiming them, in the case of configurable resource limits), are we potentially setting ourselves up for situations where an error bubbles up to user code at some location other than the location that is actually responsible for causing it? In other words, will a message transmission error always find its way back to the actual send operation that triggered it?

More generally, could user code find itself in an unrecoverable error state (by that I don't mean a state where there's an error that you can't get rid of -- things can always break unfixably, e.g., a remote host dies forever -- but rather a state where you don't actually know you're stuck). It's entirely plausible to me that everything is fine, but I can't tell from reading the tests whether our tests give us reason to believe we are ok on this score.

@FUDCo Two parts to your question:

Error attribution: Yeah transmission errors are always caught within the specific sendRemoteMessage() call that triggered them. Each call is independent with its own try/catch, so errors won't bubble up to the wrong location. The only errors that surface to callers are synchronous validation errors (ResourceLimitError, intentional close), which also come from the correct call site.

Stuck without knowing: Currently sendRemoteMessage is fire-and-forget—messages queue during reconnection, but if retries are exhausted, they're silently lost. The caller never knows. This is exactly what the Message Acknowledgment work addresses: sendRemoteMessage will resolve on ACK or reject after retries, so user code will know definitively.

packages/ocap-kernel/src/remotes/network.ts

sirtimid requested a review from a team as a code owner December 18, 2025 14:38

cursor bot reviewed Dec 18, 2025

View reviewed changes

packages/ocap-kernel/src/remotes/network.ts Outdated Show resolved Hide resolved

packages/kernel-errors/src/errors/ResourceLimitError.ts Show resolved Hide resolved

packages/ocap-kernel/src/remotes/network.ts Outdated Show resolved Hide resolved

sirtimid force-pushed the sirtimid/remote-comms-resource-limits branch from a40d3eb to 96f26e2 Compare December 18, 2025 15:05

cursor bot reviewed Dec 18, 2025

View reviewed changes

packages/ocap-kernel/src/remotes/network.ts Show resolved Hide resolved

cursor bot reviewed Dec 19, 2025

View reviewed changes

packages/ocap-kernel/src/remotes/network.ts Show resolved Hide resolved

cursor bot reviewed Dec 19, 2025

View reviewed changes

packages/ocap-kernel/src/remotes/network.ts Show resolved Hide resolved

sirtimid force-pushed the sirtimid/remote-comms-resource-limits branch from b6c23e0 to 25e81ac Compare December 19, 2025 19:55

cursor bot reviewed Dec 19, 2025

View reviewed changes

packages/ocap-kernel/src/remotes/network.ts Show resolved Hide resolved

sirtimid force-pushed the sirtimid/remote-comms-resource-limits branch from 8e04986 to ce1e774 Compare January 5, 2026 15:35

cursor bot reviewed Jan 5, 2026

View reviewed changes

packages/ocap-kernel/src/remotes/network.ts Show resolved Hide resolved

packages/ocap-kernel/src/remotes/network.ts Outdated Show resolved Hide resolved

cursor bot reviewed Jan 5, 2026

View reviewed changes

packages/ocap-kernel/src/remotes/network.ts Show resolved Hide resolved

sirtimid added 11 commits January 5, 2026 21:38

fix bugs

ba2ab03

merge

9f83890

close channel to release network resources

b5d7b3c

small refactor

89c09f6

update last timestamp on inbound message receipt

5e4e59e

fix yet another bug

a44644a

Check connection limit for inbound connections

2d8437e

remove redundant check

ca35a69

thresholds

799b1d3

close channel after dials

8323837

sirtimid force-pushed the sirtimid/remote-comms-resource-limits branch from ce1e774 to 8323837 Compare January 5, 2026 20:51

sirtimid requested a review from FUDCo January 5, 2026 20:51

cursor bot reviewed Jan 5, 2026

View reviewed changes

packages/ocap-kernel/src/remotes/network.ts Outdated Show resolved Hide resolved

FUDCo reviewed Jan 5, 2026

View reviewed changes

fix bug

4b98dfe

cursor bot reviewed Jan 6, 2026

View reviewed changes

packages/ocap-kernel/src/remotes/network.ts Show resolved Hide resolved

packages/ocap-kernel/src/remotes/network.ts Outdated Show resolved Hide resolved

packages/ocap-kernel/src/remotes/network.ts Show resolved Hide resolved

sirtimid added 3 commits January 6, 2026 17:21

fix more bugs oof

79bae98

cleanup

972080b

moooore cleanup

33e187d

cursor bot reviewed Jan 6, 2026

View reviewed changes

packages/ocap-kernel/src/remotes/network.ts Outdated Show resolved Hide resolved

packages/ocap-kernel/src/remotes/network.ts Show resolved Hide resolved

fixing moooore bugs

a85f406

cursor bot reviewed Jan 6, 2026

View reviewed changes

packages/ocap-kernel/src/remotes/network.ts Show resolved Hide resolved

packages/ocap-kernel/src/remotes/network.ts Show resolved Hide resolved

fixing all bugs

4092b73

cursor bot reviewed Jan 6, 2026

View reviewed changes

packages/ocap-kernel/src/remotes/network.ts Show resolved Hide resolved

fixing race condition

2190328

cursor bot reviewed Jan 6, 2026

View reviewed changes

packages/ocap-kernel/src/remotes/network.ts Show resolved Hide resolved

packages/ocap-kernel/src/remotes/network.ts Show resolved Hide resolved

fix Messages stuck in queue when write fails on replaced channel

71fe98b

sirtimid enabled auto-merge (squash) January 6, 2026 19:48

feat(ocap-kernel): add resource limits for remote communications #714

Are you sure you want to change the base?

feat(ocap-kernel): add resource limits for remote communications #714

Uh oh!

Conversation

sirtimid commented Dec 18, 2025 • edited by cursor bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

rekmarks commented Jan 5, 2026

Uh oh!

Uh oh!

Uh oh!

FUDCo left a comment

Choose a reason for hiding this comment

Uh oh!

sirtimid commented Jan 6, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

sirtimid commented Dec 18, 2025 •

edited by cursor bot

Loading

sirtimid commented Jan 6, 2026 •

edited

Loading