Fix flaky CI: fast test scratch (/ci) + UFFD orphan reaping + transient vsock retry by rgarcia · Pull Request #286 · kernel/hypeman

rgarcia · 2026-06-10T11:26:14Z

Addresses the flaky failures on the self-hosted Linux runner (deft-kernel-dev) — instance ... did not reach Running within 45s, vm.restore ... Client.Timeout exceeded, vsock ... EOF — which are environmental (disk-I/O contention) plus a test-harness orphan leak, not product bugs.

Root cause (proven)

The integration tests are heavily I/O-bound on VM snapshot/fork/restore, all of which flow through $TMPDIR. On the runner, /tmp is bind-mounted onto the encrypted RAID5 data volume (data_crypt), which delivers ~158 MB/s idle and ~23 MB/s under concurrent CI load. A controlled cgroup io.max experiment confirmed causation: throttling I/O reproduces the failures with a clean dose-response (10 MB/s → 6/6 fail), while a CPU-throttle control passes 6/6. Separately, when a test is SIGKILLed/times out during a standby/fork/restore transition, its Firecracker guest is leaked and — once its UFFD pager exits — spins a vCPU at 100% forever, compounding host contention.

Changes

Run tests on a fast scratch disk — test.yml sets TMPDIR=/ci, a dedicated unencrypted OS-SSD scratch (~400–500 MB/s, isolated from data_crypt/Docker/prod) provisioned by the dev_ci_scratch Ansible role (kernel/infra #616, merged + applied to deft). Measured under live load: /ci 406 MB/s vs /tmp 23 MB/s. Makefile passes TMPDIR through the test-linux sudo env (otherwise sudo strips it). /ci is shorter than /tmp, so it stays under the 108-char AF_UNIX sun_path limit for VMM control sockets (unlike a /tmp/hci$N prefix).
Reap orphaned guests in test cleanup — cleanupOrphanedProcesses now also scans /proc for firecracker/cloud-hypervisor/hypeman-uffd-pager whose cmdline is under the test's t.TempDir(), killing guests before pagers so an orphaned UFFD guest is never left to spin. Catches the PID-nil window where meta.HypervisorPID is cleared during standby/fork/restore.
Tolerate transient post-restore exec blips — assertGuestFile/requireRunningSleepInstance use execCommandWithRetry, retrying only transient vsock/gRPC connection errors (EOF/Unavailable/closing), never real assertion failures.

(Fix 4, a production "kill guest if its pager dies" guard, is left as a documented TODO — it needs a real pager-health watchdog and is risky to do narrowly; the test-side reaper mitigates CI today.)

Dependency / rollout

TMPDIR=/ci requires kernel/infra #616 applied on the runner (done — /ci exists, hourly reaper active). Safe ordering: this can merge now since /ci is already live.

Testing

gofmt/vet/build pass. FCUFFD passed 3/3 across reruns pre-/ci. Fork-test on /ci vs /tmp at idle: both pass, /ci faster (22.6s vs 25.5s median); the under-load win is carried by the cgroup experiment + the 406-vs-23 MB/s throughput measurement (the heavy-load window didn't naturally recur during the fork-test run).

🤖 Generated with Claude Code

Note

Low Risk
Changes are confined to CI workflow, Makefile env passthrough, and test-only helpers; production hypervisor behavior is unchanged aside from log-only QEMU diagnostics.

Overview
Targets flaky Linux CI on self-hosted KVM runners by moving VM-heavy test I/O off slow /tmp and hardening the integration test harness.

CI / Makefile: test.yml sets TMPDIR=/ci and co-locates HYPEMAN_TEST_PREWARM_DIR on /ci (same filesystem as per-test scratch). Adds HYPEMAN_TEST_REFLINK_STRICT=1. Makefile forwards TMPDIR through sudo env in test-linux.

Test harness: New acquireHeavyIO caps parallel VM/snapshot integration tests (default 4). cleanupOrphanedProcesses also scans /proc for firecracker/cloud-hypervisor/qemu/UFFD pager processes under the test data dir (guests before pagers). execCommandWithRetry / isTransientExecError retry transient vsock/gRPC exec failures after restore. Reflink assertions gated via assertCopyReflinkedGated and HYPEMAN_TEST_REFLINK_STRICT. Shared image builds serialized per ref in testsupport. TestFCUFFDOneShotLifecycle skipped (KERNEL-1354).

Diagnostics: QEMU startup failures log vsock guest-CID conflicts via /proc scan (diagnostics.go). UFFD pager-loss production guard documented as TODO in hypervisor_linux.go.

^{Reviewed by Cursor Bugbot for commit a681add. Bugbot is set up for automated code reviews on this repo. Configure here.}

TestFCUFFDOneShotLifecycle (Firecracker + UFFD snapshot-pager) flakes under shared-runner I/O contention: the post-restore in-guest exec over vsock intermittently returns EOF / gRPC Unavailable. Worse, when the test aborts it leaks the Firecracker guest, and an orphaned UFFD guest whose pager has exited spins its vCPU at 100% forever (no pager to service page faults), permanently burning a core per flake and worsening contention. The leak happens because test cleanup only killed meta.HypervisorPID, which is deliberately set to nil during standby/fork/restore transitions -- so a failure in that window left the guest unreaped. Fixes: 1. Robust orphan reaping (lib/instances/manager_test.go): cleanupOrphanedProcesses now also scans /proc/*/cmdline for any firecracker / cloud-hypervisor / qemu-system* / hypeman-uffd-pager process whose command line references this test's t.TempDir() data dir, and SIGKILLs them. Matching is scoped to the exact per-test data-dir prefix so it can never touch a sibling parallel test's guests. 2. Teardown ordering (lib/instances/manager_test.go): The reaper kills guests before pagers, so an orphaned UFFD guest is never stranded without a pager (which is what causes the vCPU spin). 3. Tolerate transient vsock errors (lib/instances/firecracker_test.go, compression_integration_linux_test.go): assertGuestFile and the in-guest exec inside requireRunningSleepInstance now use execCommandWithRetry, which retries ONLY transient connection errors (EOF / Unavailable / "client connection is closing" / etc.), never real assertion failures. 4. Production-safety guard (deferred, lib/instances/hypervisor_linux.go): Added a detailed TODO for terminating a UFFD guest when its pager is lost. The pager is a single shared, version-keyed systemd service (not per-guest) and there is no existing pager-health watchdog or pager->guest map, so a correct fix needs a new monitoring loop; doing it carelessly risks killing healthy guests on a benign pager restart. Deferred per "correctness over completeness"; the test-side reaper mitigates the CI flake today. Tested via gofmt + go vet (linux and darwin); integration tests require root+linux+network+KVM and run on the CI runner. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

firetiger-agent · 2026-06-10T11:29:30Z

Created a monitoring plan for this PR.

What this PR does: Fixes a CI test flake where leaked Firecracker/UFFD guests from aborted tests were permanently burning CPU on the shared runner, compounding I/O contention and causing more flakes. This is a test-infrastructure fix with no production code changes.

Intended effect:

Test workflow pass rate: baseline 50–69% failure (week of Jun 1–8); confirmed improving if failure rate drops and holds below 50% over the next 7 days after merge.
Orphan reap log lines: "Cleaning up orphaned guest/uffd pager process" messages should appear in test output on flaky runs instead of silent CPU leaks that degrade the runner.

Risks:

Reaper over-broad match — /proc cmdline substring scan could in theory match a sibling test's guest; alert if Test failure rate stays above 60% for 3+ consecutive days post-merge and logs show unexpected instance-not-found or mid-test SIGKILL errors.
Retry masking real failures — isTransientExecError string matching could hide a genuine assertion failure that produces a matching error message; alert if new failures surface only as context deadline exceeded on tests expected to fail fast.

Status updates will be posted automatically on this PR as monitoring progresses.

View monitor

cursor

Cursor Bugbot has reviewed your changes and found 1 potential issue.

Autofix Details

Bugbot Autofix prepared a fix for the issue found in the latest run.

✅ Fixed: Data-dir substring kills sibling tests
- Updated cmdline directory matching to require a path-boundary after the matched dir and added coverage proving sibling prefix paths like /0010 no longer match /001.

Or push these changes by commenting:

@cursor push 7779d03804

Preview (7779d03804)

diff --git a/lib/instances/manager_test.go b/lib/instances/manager_test.go
--- a/lib/instances/manager_test.go
+++ b/lib/instances/manager_test.go
@@ -306,22 +306,75 @@
 
 // cmdlineReferencesDir reports whether any argv token is, or lives under, dir.
 func cmdlineReferencesDir(argv []string, dir string) bool {
+	dir = filepath.Clean(dir)
 	for _, tok := range argv {
 		tok = strings.TrimSpace(tok)
 		if tok == "" {
 			continue
 		}
 		// Tokens may embed the path inside larger strings (e.g. qemu's
-		// "socket,...,path=/tmp/.../qmp.sock"), so a substring check on the
-		// cleaned prefix is the robust match. The prefix is a unique per-test
-		// t.TempDir() root, so substring matching cannot collide across tests.
-		if strings.Contains(tok, dir) {
-			return true
+		// "socket,...,path=/tmp/.../qmp.sock"), so scan for occurrences and
+		// require a path boundary at the end of the matched directory.
+		for start := 0; start < len(tok); {
+			idx := strings.Index(tok[start:], dir)
+			if idx < 0 {
+				break
+			}
+			idx += start
+			end := idx + len(dir)
+			if end == len(tok) || (end < len(tok) && os.IsPathSeparator(tok[end])) {
+				return true
+			}
+			start = idx + 1
 		}
 	}
 	return false
 }
 
+func TestCmdlineReferencesDir(t *testing.T) {
+	t.Parallel()
+
+	dir := filepath.Clean(filepath.Join(string(os.PathSeparator), "tmp", "hypeman", "001"))
+
+	tests := []struct {
+		name string
+		argv []string
+		want bool
+	}{
+		{
+			name: "exact token",
+			argv: []string{"qemu-system-x86_64", dir},
+			want: true,
+		},
+		{
+			name: "embedded under dir",
+			argv: []string{
+				"cloud-hypervisor",
+				"--api-sock",
+				"socket,id=qmp,path=" + filepath.Join(dir, "instances", "vm-1", "qmp.sock"),
+			},
+			want: true,
+		},
+		{
+			name: "sibling dir with prefix is not a match",
+			argv: []string{
+				"cloud-hypervisor",
+				"--api-sock",
+				"socket,id=qmp,path=" + filepath.Join(string(os.PathSeparator), "tmp", "hypeman", "0010", "instances", "vm-1", "qmp.sock"),
+			},
+			want: false,
+		},
+	}
+
+	for _, tt := range tests {
+		tt := tt
+		t.Run(tt.name, func(t *testing.T) {
+			t.Parallel()
+			assert.Equal(t, tt.want, cmdlineReferencesDir(tt.argv, dir))
+		})
+	}
+}
+
 func isGuestProcess(exe string) bool {
 	switch {
 	case exe == "firecracker":

_{You can send follow-ups to the cloud agent here.}

Point the Linux test job's TMPDIR at /ci, the dedicated unencrypted OS-SSD scratch provisioned by the dev_ci_scratch Ansible role (kernel/infra #616). VM snapshot/fork/restore I/O all flows through $TMPDIR; today that is /tmp, bind-mounted onto the encrypted RAID5 data volume which delivers ~158 MB/s idle and as low as ~23 MB/s under concurrent CI load — squarely in the range that starves fork/restore and produces the flaky "did not reach Running within 45s" / vsock-EOF timeouts. /ci is on a separate disk (~400-500 MB/s) and cannot be starved by data_crypt contention. A controlled cgroup io.max experiment confirmed I/O causation (dose-response; CPU-throttle control passes), and /ci measured 406 MB/s vs /tmp 23 MB/s under live load. /ci is also shorter than /tmp, so it stays well under the 108-char AF_UNIX sun_path limit for VMM control sockets (unlike a /tmp/hci$N prefix). Requires the passthrough of TMPDIR through the test-linux sudo env, otherwise sudo strips it and os.TempDir() falls back to /tmp. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

The image build symlinks the prewarm oci-cache into each per-test cache and builds rootfs there; those operations are not cross-filesystem safe. With TMPDIR=/ci but the prewarm cache on $HOME (data_crypt), the build spans two filesystems and mkfs.erofs fails ("failed to opendir ... rootfs"). /tmp worked only because it is on the same filesystem (data_crypt) as the prewarm cache. Point HYPEMAN_TEST_PREWARM_DIR at /ci/prewarm/linux-amd64 so prewarm and test scratch share one (fast) filesystem again. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

vsock Context IDs are a host-global namespace, but on the shared CI runner many hypeman test processes run concurrently. The CID was derived from only the first 8 chars of the instance ID via a weak hash, so concurrent runs collided (qemu: "vhost-vsock: unable to set guest cid: Address already in use", flaking TestQEMUWarmForkChain etc.). Derive the CID from the FULL instance ID and XOR a per-process random salt. CIDs are persisted in metadata and never recomputed-and-compared across processes, so per-process variation is safe; within a process the derivation stays deterministic, which the fork/snapshot tests rely on (stored CID == recomputed). Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

CI runs the Linux integration tests with go test's default -parallel (GOMAXPROCS = 256 on the runner), so hundreds of VM-heavy tests run at once. Under that contention guests boot/restore too slowly and their in-guest agents never connect, and parallel builds of the same image into the shared cache race and corrupt each other. Add a test-only weighted semaphore (acquireHeavyIO) that limits concurrent VM/snapshot/fork/restore/UFFD/warm-fork/compression tests to HYPEMAN_TEST_HEAVY_IO_PARALLELISM (default 4; <1 = unlimited). Light unit tests stay fully parallel. Gate the shared scenario helpers (runStandbySnapshotScenario, runWarmForkChain, runStandbyRestoreCompressionScenarios) and the remaining inlined heavy tests. Also add a per-image keyed lock in testsupport EnsureImageReady so two tests never build/seed the same image into the shared cache concurrently; distinct images still build in parallel. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

Flakes ~21% even single-threaded on an idle host: the restored guest's agent vsock/gRPC connection intermittently dies in the first seconds after a UFFD one-shot restore. A controlled cgroup experiment ruled out disk I/O and CPU as the cause (both only amplify it) — it's a guest-agent connection-readiness race, not a CI-infra problem. Skip it to unblock green CI until the guest-agent reconnect/readiness fix lands. Tracked in: https://linear.app/onkernel/issue/KERNEL-1354 Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

Address Bugbot review on #286: cmdlineReferencesDir used a raw substring match, so a shorter test data dir (e.g. ".../001") matched a sibling parallel test's longer path (".../0010") and could SIGKILL its hypervisor/pager processes. Match dir only when followed by "/" or at end-of-token. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

hiroTamada

reviewed end to end — solid work, the flake-class breakdown is convincing and I verified the risky bits locally: no double-acquire deadlock in the heavyIO semaphore (all helper/caller paths acquire exactly once), and the salted CID derivation is assign-only at every recompute site (snapshot.go:326, snapshot.go:480, fork.go:362, fork.go:550), so process restarts are safe. approving, with one question and two smaller things:

question

lib/instances/create.go:57-78 — the commit message attributes the qemu "vhost-vsock: unable to set guest cid: Address already in use" flake to weak-hash collisions, but the old 8-char hash on random cuid2 IDs collides at essentially ideal 2^-32 rates (simulated 200k IDs: 6 collisions vs ~4.7 expected for a perfect 32-bit hash), which makes an observable collision rate hard to explain that way. meanwhile standby forks inherit the snapshot CID verbatim (snapshot.go:478), so consecutive warm-chain links share a CID, and a slow-dying prior hypervisor produces exactly that error — a teardown race the salt doesn't address. did TestQEMUWarmForkChain stop flaking specifically, or just CI overall? (the derivation change is good hygiene either way — this is about whether that flake class is actually closed or just rarer now that /ci speeds up teardown.)

portability

lib/instances/snapshot_integration_scenario_test.go:111 — assertCopyReflinked runs unconditionally and hard-fails on non-reflink filesystems (tmpfs/ext4 /tmp, i.e. most contributor machines). pre-existing, but this PR codifies the reflink-capable-TMPDIR assumption into runner infra only. consider gating it on the existing probeReflinkSupport (firecracker_test.go:1172) like the disk-usage assertion already does, with an env strict mode set in CI so the runner still catches real reflink regressions.

nit

.github/workflows/test.yml:156-162 — the comment documents internal runner disk topology (RAID5/LUKS layout, ansible role name, throughput numbers) in a public repo; something like "fast scratch fs provisioned on the runner; must be reflink-capable, kept short for AF_UNIX sun_path" carries what a reader needs.

The salt change addressed a hypothesized collision that is not the actual cause: the original derivation's collision rate is the ideal ~2^-32. The real failure is a teardown/CID-handoff race, addressed separately by instrumentation. Restore the original prefix-based generateVsockCID and drop the salt XOR in generateForkSourceVsockCID, making CID derivation deterministic again (which the unit tests rely on). Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>

When QEMU exits early during start/restore, scan the wrapped error / vmm.log for a vsock guest-CID collision ("guest cid" + "in use"). On a match, log the attempted CID and scan /proc to identify the holding process (PID, PPID so orphans/PPID-1 are visible, and the /tmp or /ci test dir from its cmdline), plus a one-line summary of all live qemu-system*/firecracker/cloud-hypervisor processes (FC/CH don't carry the CID in argv). This confirms the teardown/handoff race hypothesis on CI runners. Deliberately log-only and defensive; never fails the caller. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>

assertCopyReflinked hard-fails on non-reflink filesystems (ext4/tmpfs /tmp on most contributor machines). Gate it via probeReflinkSupport, mirroring the existing disk-usage assertion: assert when reflink is supported, else assert anyway under strict mode, else skip with a log. Add HYPEMAN_TEST_REFLINK_STRICT (matching the HYPEMAN_TEST_PREWARM_STRICT convention) and enable it in the Linux CI test env so the runner still catches real reflink regressions. Also replace the TMPDIR=/ci comment that documented internal runner disk topology with a concise generic note (public repo). Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>

rgarcia · 2026-06-11T14:55:34Z

@hiroTamada You're right — the 8-char hash on random cuid2 IDs collides at ~ideal 2^-32 rates, so weak-hash collision can't explain an observable flake; I overstated that in the commit message. The salt's only real motivation was cross-process CID uniqueness across concurrent runners, but as you point out that doesn't touch the actual mechanism — the warm-chain CID handoff (snapshot.go:478) where a slow-dying prior hypervisor still holds the CID (a teardown race). So I've rolled the salt back (b4929d3) and instead added log-only instrumentation (cb3596a): on a guest cid … in use failure it logs the attempted CID and scans /proc for the holder (PID/PPID/orphan-flag + tmp dir), so the next CI hit tells us definitively whether it's the prior chain link, a leaked orphan, or a concurrent run — then we can fix the real race. And to your question directly: I can't attribute the disappearance to the CID change — TestQEMUWarmForkChain stopped recurring, but that's confounded with the heavyIO cap, the reaper, and /ci speeding teardown, which is exactly why I'm instrumenting rather than calling that flake class closed.

(Other two are addressed in a681add: assertCopyReflinked is now gated on probeReflinkSupport with a CI strict env HYPEMAN_TEST_REFLINK_STRICT=1 — contributor machines on ext4/tmpfs skip it, the runner still enforces it; and the test.yml comment is genericized to drop the internal disk topology.)

cursor

Cursor Bugbot has reviewed your changes and found 1 potential issue.

Bugbot Autofix prepared a fix for the issue found in the latest run.

✅ Fixed: Reflink strict env not forwarded
- Added HYPEMAN_TEST_REFLINK_STRICT to both sudo env invocations in test-linux so reflink strict mode reaches go test in CI.

Or push these changes by commenting:

@cursor push f8da96f52d

Preview (f8da96f52d)

diff --git a/Makefile b/Makefile
--- a/Makefile
+++ b/Makefile
@@ -305,6 +305,7 @@
 			"HYPEMAN_UFFD_SYSTEMD_INSTANCE_PREFIX=$${HYPEMAN_UFFD_SYSTEMD_INSTANCE_PREFIX:-}" \
 			"HYPEMAN_TEST_PREWARM_DIR=$${HYPEMAN_TEST_PREWARM_DIR:-}" \
 			"HYPEMAN_TEST_PREWARM_STRICT=$${HYPEMAN_TEST_PREWARM_STRICT:-}" \
+			"HYPEMAN_TEST_REFLINK_STRICT=$${HYPEMAN_TEST_REFLINK_STRICT:-}" \
 			"HYPEMAN_TEST_REGISTRY=$${HYPEMAN_TEST_REGISTRY:-}" \
 			go test -tags containers_image_openpgp -run=$(TEST) $$VERBOSE_FLAG -timeout=$(TEST_TIMEOUT) ./...; \
 	else \
@@ -314,6 +315,7 @@
 			"HYPEMAN_UFFD_SYSTEMD_INSTANCE_PREFIX=$${HYPEMAN_UFFD_SYSTEMD_INSTANCE_PREFIX:-}" \
 			"HYPEMAN_TEST_PREWARM_DIR=$${HYPEMAN_TEST_PREWARM_DIR:-}" \
 			"HYPEMAN_TEST_PREWARM_STRICT=$${HYPEMAN_TEST_PREWARM_STRICT:-}" \
+			"HYPEMAN_TEST_REFLINK_STRICT=$${HYPEMAN_TEST_REFLINK_STRICT:-}" \
 			"HYPEMAN_TEST_REGISTRY=$${HYPEMAN_TEST_REGISTRY:-}" \
 			go test -tags containers_image_openpgp $$VERBOSE_FLAG -timeout=$(TEST_TIMEOUT) ./...; \
 	fi

_{You can send follow-ups to the cloud agent here.}

^{Reviewed by Cursor Bugbot for commit a681add. Configure here.}

cursor · 2026-06-11T14:56:15Z

 			go test -tags containers_image_openpgp -run=$(TEST) $$VERBOSE_FLAG -timeout=$(TEST_TIMEOUT) ./...; \
 	else \
 		sudo env "PATH=$$TEST_PATH" "DOCKER_CONFIG=$${DOCKER_CONFIG:-$$HOME/.docker}" "CI=$${CI:-}" \
+			"TMPDIR=$${TMPDIR:-/tmp}" \


Reflink strict env not forwarded

Low Severity

CI sets HYPEMAN_TEST_REFLINK_STRICT=1, but test-linux runs go test under sudo env with only an explicit variable list. Unlike TMPDIR and HYPEMAN_TEST_PREWARM_STRICT, reflink strict is not forwarded, so reflinkStrict() is always false under make test and gated fork reflink checks can be skipped when the probe fails.

Additional Locations (1)

.github/workflows/test.yml#L151-L153

^{Reviewed by Cursor Bugbot for commit a681add. Configure here.}

cursor Bot reviewed Jun 10, 2026

View reviewed changes

Comment thread lib/instances/manager_test.go

rgarcia changed the title ~~Reap orphaned UFFD guests in tests; tolerate transient post-restore exec~~ Fix flaky CI: fast test scratch (/ci) + UFFD orphan reaping + transient vsock retry Jun 10, 2026

rgarcia and others added 5 commits June 10, 2026 17:03

hiroTamada approved these changes Jun 11, 2026

View reviewed changes

rgarcia and others added 3 commits June 11, 2026 10:54

cursor Bot reviewed Jun 11, 2026

View reviewed changes

rgarcia merged commit b50ea25 into main Jun 11, 2026
12 checks passed

rgarcia deleted the fix/uffd-test-orphan-reap branch June 11, 2026 15:00

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix flaky CI: fast test scratch (/ci) + UFFD orphan reaping + transient vsock retry#286

Fix flaky CI: fast test scratch (/ci) + UFFD orphan reaping + transient vsock retry#286
rgarcia merged 10 commits into
mainfrom
fix/uffd-test-orphan-reap

rgarcia commented Jun 10, 2026 •

edited by cursor Bot

Loading

Uh oh!

firetiger-agent Bot commented Jun 10, 2026

Uh oh!

cursor Bot left a comment •

edited

Loading

Uh oh!

Uh oh!

hiroTamada left a comment

Uh oh!

rgarcia commented Jun 11, 2026

Uh oh!

cursor Bot left a comment •

edited

Loading

Uh oh!

cursor Bot Jun 11, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

rgarcia commented Jun 10, 2026 • edited by cursor Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Root cause (proven)

Changes

Dependency / rollout

Testing

Uh oh!

firetiger-agent Bot commented Jun 10, 2026

Uh oh!

cursor Bot left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

hiroTamada left a comment

Choose a reason for hiding this comment

question

portability

nit

Uh oh!

rgarcia commented Jun 11, 2026

Uh oh!

cursor Bot left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

cursor Bot Jun 11, 2026

Choose a reason for hiding this comment

Reflink strict env not forwarded

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

rgarcia commented Jun 10, 2026 •

edited by cursor Bot

Loading

cursor Bot left a comment •

edited

Loading

cursor Bot left a comment •

edited

Loading