
feat(sandbox): seccomp-notify DNS-pinned allowlist for Platform mode #17

Open
Ladas wants to merge 1 commit into feat/landlock-tcp-port from feat/seccomp-notify


Ladas commented Jun 12, 2026

Summary

Foundation for kernel-level connect() interception using seccomp-notify.
Adds DnsPinnedAllowlist module: resolves allowed domains to IPs at
sandbox creation, freezes them for the session (prevents DNS rebinding).

The notification event loop and on-behalf-of operations (pidfd_getfd)
will be wired once OPA policy integration is complete.

Depends on: #16 (Landlock TCP port restriction) → #15 (Platform mode base)

2 files, +135 lines. 820 tests pass, clippy clean.

What this PR adds

  • DnsPinnedAllowlist: resolve domains, pin IPs, check connect targets
  • Loopback always allowed (proxy address)
  • 4 unit tests
  • Full rustdoc (architecture, TOCTOU safety, requirements, references)
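
To make the resolve-then-pin idea above concrete, here is a minimal sketch. It is not the PR's actual `DnsPinnedAllowlist`: the type name `PinnedAllowlist`, its methods, and the choice to skip domains that fail to resolve are all assumptions made for illustration.

```rust
use std::collections::HashSet;
use std::net::{IpAddr, SocketAddr, ToSocketAddrs};

/// Hypothetical stand-in for the PR's allowlist; names are illustrative only.
#[derive(Debug, Default)]
pub struct PinnedAllowlist {
    /// IPs resolved once at creation time and never refreshed afterwards.
    pinned: HashSet<IpAddr>,
}

impl PinnedAllowlist {
    /// Resolve each domain once and freeze the resulting IPs.
    /// Assumption: unresolvable domains are skipped; a real implementation
    /// might prefer to fail sandbox creation instead.
    pub fn resolve(domains: &[&str]) -> Self {
        let mut pinned = HashSet::new();
        for domain in domains {
            // ToSocketAddrs needs a port; 0 is a placeholder, only the IP is kept.
            if let Ok(addrs) = (*domain, 0u16).to_socket_addrs() {
                pinned.extend(addrs.map(|a| a.ip()));
            }
        }
        Self { pinned }
    }

    /// Build from already-known IPs, which avoids DNS lookups in tests.
    pub fn from_ips(ips: Vec<IpAddr>) -> Self {
        Self { pinned: ips.into_iter().collect() }
    }

    /// A connect target passes if it is loopback (the proxy address)
    /// or one of the IPs pinned at creation.
    pub fn is_allowed(&self, target: &SocketAddr) -> bool {
        let ip = target.ip();
        ip.is_loopback() || self.pinned.contains(&ip)
    }
}
```

Because the set is filled once and never refreshed, a later change to a domain's DNS record cannot widen what the sandbox may reach, which is the rebinding property the summary describes.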

What's NOT in this PR (follow-up)

  • seccomp filter installation (SECCOMP_FILTER_FLAG_NEW_LISTENER)
  • Notification event loop (async read from notification fd)
  • On-behalf-of connect via pidfd_getfd()
  • Fork-based supervisor architecture in lib.rs
  • Integration with OPA network policies

Ref: NVIDIA#899

Assisted-By: Claude Code

Ladas force-pushed the feat/seccomp-notify branch 2 times, most recently from 408aa3b to 2446c42, June 12, 2026 16:14
Ladas force-pushed the feat/landlock-tcp-port branch from 9ec5718 to 179d108, June 12, 2026 16:26
Add kernel-level network syscall interception using SECCOMP_RET_USER_NOTIF
for Platform mode. Provides mandatory, syscall-level enforcement without
any capabilities.

DnsPinnedAllowlist: resolve domains to IPs at sandbox creation, freeze
for session lifetime (DNS rebinding prevention).

BPF filter intercepts: connect, sendto, sendmsg, recvfrom, recvmsg,
bind. Validates AUDIT_ARCH to prevent x32/compat ABI bypass.
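
As a rough illustration of the filter described above, the sketch below builds a classic-BPF program that checks the architecture first, rejects x32 syscall numbers, and returns SECCOMP_RET_USER_NOTIF for the six listed syscalls. It assumes x86_64 only (syscall numbers and AUDIT_ARCH_X86_64 are hard-coded), and `SockFilter`, `build_filter`, and the tiny `evaluate` interpreter are hypothetical helpers, not code from this PR. Killing the process on an architecture mismatch is also an assumption; the commit message does not say which action is used.

```rust
/// Mirrors the kernel's `struct sock_filter` (one classic-BPF instruction).
#[repr(C)]
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub struct SockFilter { pub code: u16, pub jt: u8, pub jf: u8, pub k: u32 }

// Classic-BPF opcodes (values from <linux/bpf_common.h>).
pub const BPF_LD_W_ABS: u16 = 0x20; // BPF_LD | BPF_W | BPF_ABS
pub const BPF_JEQ_K: u16 = 0x15;    // BPF_JMP | BPF_JEQ | BPF_K
pub const BPF_JGE_K: u16 = 0x35;    // BPF_JMP | BPF_JGE | BPF_K
pub const BPF_RET_K: u16 = 0x06;    // BPF_RET | BPF_K

// seccomp return actions and arch constants (from the Linux UAPI headers).
pub const SECCOMP_RET_KILL_PROCESS: u32 = 0x8000_0000;
pub const SECCOMP_RET_USER_NOTIF: u32 = 0x7fc0_0000;
pub const SECCOMP_RET_ALLOW: u32 = 0x7fff_0000;
pub const AUDIT_ARCH_X86_64: u32 = 0xC000_003E;
pub const X32_SYSCALL_BIT: u32 = 0x4000_0000;

// Field offsets inside `struct seccomp_data`.
pub const OFF_NR: u32 = 0;
pub const OFF_ARCH: u32 = 4;

/// x86_64 numbers for connect, sendto, sendmsg, recvfrom, recvmsg, bind.
pub const INTERCEPTED: [u32; 6] = [42, 44, 46, 45, 47, 49];

fn ins(code: u16, jt: u8, jf: u8, k: u32) -> SockFilter {
    SockFilter { code, jt, jf, k }
}

/// Build the filter: arch check, x32 check, then one JEQ per syscall.
pub fn build_filter(syscalls: &[u32]) -> Vec<SockFilter> {
    let n = syscalls.len();
    assert!(n <= 255, "jump offsets are u8");
    let mut prog = vec![
        ins(BPF_LD_W_ABS, 0, 0, OFF_ARCH),
        ins(BPF_JEQ_K, 1, 0, AUDIT_ARCH_X86_64), // match: skip the kill
        ins(BPF_RET_K, 0, 0, SECCOMP_RET_KILL_PROCESS),
        ins(BPF_LD_W_ABS, 0, 0, OFF_NR),
        ins(BPF_JGE_K, 0, 1, X32_SYSCALL_BIT),   // x32 bit set: fall into kill
        ins(BPF_RET_K, 0, 0, SECCOMP_RET_KILL_PROCESS),
    ];
    for (i, &nr) in syscalls.iter().enumerate() {
        // On a match, jump over the remaining checks and the ALLOW return.
        prog.push(ins(BPF_JEQ_K, (n - i) as u8, 0, nr));
    }
    prog.push(ins(BPF_RET_K, 0, 0, SECCOMP_RET_ALLOW));
    prog.push(ins(BPF_RET_K, 0, 0, SECCOMP_RET_USER_NOTIF));
    prog
}

/// Minimal interpreter for the opcodes used above, for testing only.
pub fn evaluate(prog: &[SockFilter], arch: u32, nr: u32) -> Option<u32> {
    let (mut acc, mut pc) = (0u32, 0usize);
    while let Some(i) = prog.get(pc) {
        pc += 1;
        match i.code {
            BPF_LD_W_ABS => acc = match i.k { OFF_NR => nr, OFF_ARCH => arch, _ => return None },
            BPF_JEQ_K => pc += usize::from(if acc == i.k { i.jt } else { i.jf }),
            BPF_JGE_K => pc += usize::from(if acc >= i.k { i.jt } else { i.jf }),
            BPF_RET_K => return Some(i.k),
            _ => return None,
        }
    }
    None
}
```

Checking the architecture before the syscall number matters because the same number means a different syscall under another ABI; without that check a compat or x32 caller could reach `connect` under a number the filter does not list.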

Linux syscall wrappers: notification fd ioctls, pidfd_open/pidfd_getfd
for on-behalf-of operations (TOCTOU-safe), read_process_memory with
read_exact (no short reads), sockaddr parser (correct endianness for
sa_family, port, flowinfo), verify_socket_fd (mitigates fd-swap race),
deny/allow_connect response helpers.
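
The endianness point for the sockaddr parser can be sketched as follows. This assumes the Linux layouts of `sockaddr_in` and `sockaddr_in6` and treats `sin6_flowinfo` as network byte order, as the commit message implies; `parse_sockaddr` is a hypothetical name, and the PR's parser may validate lengths differently.

```rust
use std::net::{Ipv4Addr, Ipv6Addr, SocketAddr, SocketAddrV4, SocketAddrV6};

const AF_INET: u16 = 2;   // Linux value
const AF_INET6: u16 = 10; // Linux value

/// Parse bytes copied out of the sandboxed process into a SocketAddr.
/// Returns None for short buffers or unsupported address families.
pub fn parse_sockaddr(buf: &[u8]) -> Option<SocketAddr> {
    // sa_family is stored in host (native) byte order.
    let fam = buf.get(0..2)?;
    let family = u16::from_ne_bytes([fam[0], fam[1]]);
    // The port is stored in network byte order (big-endian).
    let p = buf.get(2..4)?;
    let port = u16::from_be_bytes([p[0], p[1]]);

    match family {
        AF_INET => {
            let a = buf.get(4..8)?;
            let ip = Ipv4Addr::new(a[0], a[1], a[2], a[3]);
            Some(SocketAddr::V4(SocketAddrV4::new(ip, port)))
        }
        AF_INET6 => {
            // Assumption: flowinfo is network byte order, scope_id is native.
            let f = buf.get(4..8)?;
            let flowinfo = u32::from_be_bytes([f[0], f[1], f[2], f[3]]);
            let mut octets = [0u8; 16];
            octets.copy_from_slice(buf.get(8..24)?);
            let s = buf.get(24..28)?;
            let scope_id = u32::from_ne_bytes([s[0], s[1], s[2], s[3]]);
            let ip = Ipv6Addr::from(octets);
            Some(SocketAddr::V6(SocketAddrV6::new(ip, port, flowinfo, scope_id)))
        }
        _ => None,
    }
}
```

Every field is read through a bounds-checked `get`, so a truncated buffer yields `None` rather than a panic in the supervisor.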

Code review fixes applied across all PRs:
- PR #15: gateway propagates network_enforcement to DriverSandboxSpec
- PR #15: driver uses typed enum comparison (not magic integer)
- PR #16: saturating_sub prevents underflow in Landlock skipped count
- PR #16: warn!() on TCP port restriction failure (was debug)
- PR #17: BPF arch check, recvfrom/recvmsg/bind interception,
  verify_socket_fd, read_exact, allow_connect rename, flowinfo
  endianness, safety comments on all unsafe blocks
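
The `saturating_sub` fix listed for PR #16 follows a common pattern, sketched here with hypothetical names (`skipped_count`, `requested`, `applied`); how the Landlock code actually derives its skipped count is not shown in this PR.

```rust
/// Number of rules that were requested but not applied.
/// Plain `requested - applied` on unsigned integers panics in debug builds
/// and wraps in release builds when `applied > requested`;
/// `saturating_sub` clamps the result at zero instead.
pub fn skipped_count(requested: usize, applied: usize) -> usize {
    requested.saturating_sub(applied)
}
```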

8 tests. Compiles, 949 tests pass, clippy clean.

Ref: NVIDIA#899
Ladas force-pushed the feat/seccomp-notify branch from 2446c42 to 6078a8e, June 12, 2026 16:28