fix(copilot): dedupe transcript replay blocks by Abhi1992002 · Pull Request #13071 · Significant-Gravitas/AutoGPT

Abhi1992002 · 2026-05-10T04:35:39Z

Summary

Deduplicate transcript replay blocks in the copilot chat so repeated/echoed transcript messages do not surface twice during replay.
Scope is limited to autogpt_platform/frontend/src/app/(platform)/copilot/helpers.ts and its colocated unit test.

Test Plan

pnpm test:unit 'src/app/(platform)/copilot/__tests__/helpers.test.ts'
pnpm format
pnpm lint
pnpm types

CLAassistant · 2026-05-10T04:35:46Z

Thank you for your submission! We really appreciate it. Like many open source projects, we ask that you sign our Contributor License Agreement before we can accept your contribution.

abhi seems not to be a GitHub user. You need a GitHub account to be able to sign the CLA. If you have already a GitHub account, please add the email address used for this commit to your account.
_{You have signed the CLA already but the status is still pending? Let us recheck it.}

coderabbitai · 2026-05-10T04:35:57Z

Important

Review skipped

Draft detected.

Please check the settings in the CodeRabbit UI or the .coderabbit.yaml file in this repository. To trigger a single review, invoke the @coderabbitai review command.

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

Run ID: 12ed97ac-f057-4dcc-b1d7-805f85701479

You can disable this status message by setting the reviews.review_status to false in the CodeRabbit configuration file.

Use the checkbox below for a quick retry:

🔍 Trigger review

Walkthrough

This PR refactors the copilot message deduplication logic to handle SSE transcript replays. The implementation extracts fingerprint helpers, adds transcript-prefix replay detection, and restructures deduplicateMessages into a three-stage pipeline with a new integration test validating the full flow.

Changes

Message Deduplication Refactor

Layer / File(s)	Summary
Fingerprint Helpers `autogpt_platform/frontend/src/app/(platform)/copilot/helpers.ts`	New `messageContentFingerprint` and `messageReplayFingerprint` helpers compute stable content hashes by preferring `text` and `toolCallId` fields, falling back to `JSON.stringify` for other part types.
Transcript Replay Detection `autogpt_platform/frontend/src/app/(platform)/copilot/helpers.ts`	`removeTranscriptPrefixReplays` function identifies and removes repeated transcript-prefix blocks by comparing role+content fingerprints across multi-message sequences (minimum 4 messages per block).
Deduplication Pipeline `autogpt_platform/frontend/src/app/(platform)/copilot/helpers.ts`	`deduplicateMessages` refactored to three stages: exact ID deduplication, transcript-prefix replay removal, and assistant-turn content fingerprint deduplication scoped to preceding user message ID.
Test Coverage `autogpt_platform/frontend/src/app/(platform)/copilot/__tests__/helpers.test.ts`	New test case validates deduplication of whole-conversation SSE prefix replays where replayed messages have fresh client-side IDs but identical role and text content.

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~25 minutes

Possibly related PRs

Significant-Gravitas/AutoGPT#12759: Related deduplication logic modifications using content fingerprints and SSE-replayed assistant message deduplication tests.

Suggested labels

size/l, platform/frontend

Suggested reviewers

0ubbe
Bentlybro

Poem

🐰 Messages dance through a thrice-refined refactor,
First duplicates by ID fall away,
Then transcript echoes fade from SSE's slower voice,
Fingerprints keep assistant turns in place—
A test ensures the conversation stays complete. ✨

🚥 Pre-merge checks | ✅ 4 | ❌ 1

❌ Failed checks (1 warning)

Check name	Status	Explanation	Resolution
Docstring Coverage	⚠️ Warning	Docstring coverage is 60.00% which is insufficient. The required threshold is 80.00%.	Write docstrings for the functions missing them to satisfy the coverage threshold.

✅ Passed checks (4 passed)

Check name	Status	Explanation
Title check	✅ Passed	The title clearly and specifically describes the main change: fixing deduplication of transcript replay blocks in the copilot chat.
Description check	✅ Passed	The description is directly related to the changeset, explaining the purpose (deduplicating transcript replay blocks) and scope (helpers.ts and its test file).
Linked Issues check	✅ Passed	Check skipped because no linked issues were found for this pull request.
Out of Scope Changes check	✅ Passed	Check skipped because no linked issues were found for this pull request.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests
Commit unit tests in branch fix/frontend-deduplicate-chat-messages

Tip

💬 Introducing Slack Agent: The best way for teams to turn conversations into code.

Slack Agent is built on CodeRabbit's deep understanding of your code, so your team can collaborate across the entire SDLC without losing context.

Generate code and open pull requests
Plan features and break down work
Investigate incidents and troubleshoot customer tickets together
Automate recurring tasks and respond to alerts with triggers
Summarize progress and report instantly

Built for teams:

Shared memory across your entire org—no repeating context
Per-thread sandboxes to safely plan and execute work
Governance built-in—scoped access, auditability, and budget controls

One agent for your entire SDLC. Right inside Slack.

👉 Get started

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

codecov · 2026-05-10T04:41:09Z

Codecov Report

❌ Patch coverage is 86.66667% with 4 lines in your changes missing coverage. Please review.
✅ Project coverage is 70.56%. Comparing base (2624b6f) to head (1b67cc2).

Additional details and impacted files

@@            Coverage Diff             @@
##              dev   #13071      +/-   ##
==========================================
- Coverage   70.59%   70.56%   -0.04%     
==========================================
  Files        2193     2193              
  Lines      164774   164800      +26     
  Branches    16822    16828       +6     
==========================================
- Hits       116325   116290      -35     
- Misses      45093    45146      +53     
- Partials     3356     3364       +8

Flag	Coverage Δ
platform-frontend	`31.23% <89.65%> (+0.01%)`	⬆️
platform-frontend-e2e	`31.49% <14.28%> (-0.54%)`	⬇️

Flags with carried forward coverage won't be shown. Click here to find out more.

Components	Coverage Δ
Platform Backend	`79.58% <ø> (ø)`
Platform Frontend	`37.78% <86.66%> (-0.15%)`	⬇️
AutoGPT Libs	`∅ <ø> (∅)`
Classic AutoGPT	`28.43% <ø> (ø)`

🚀 New features to boost your workflow:

📦 JS Bundle Analysis: Save yourself from yourself by tracking and limiting bundle sizes in JS merges.

coderabbitai

🧹 Nitpick comments (1)

autogpt_platform/frontend/src/app/(platform)/copilot/helpers.ts (1)
358-386: 💤 Low value

Prefix-replay algorithm is correct; a small bound tightening would skip provably useless iterations.

Traced through the documented scenario [u1, a1, u2, a2, u1', a1', u2', a2']:

The early return guards length < MIN_PREFIX_REPLAY_MESSAGES * 2.

For i ∈ [1, MIN_PREFIX_REPLAY_MESSAGES - 1], the inner while is capped by replayLength < i, so replayLength can never reach MIN_PREFIX_REPLAY_MESSAGES and the outer iteration is a no-op.

At i = 4, all four fingerprints match and the block 4–7 is dropped; i += replayLength - 1 followed by i++ correctly advances past the dropped range, so multiple consecutive replays are handled.

Optional micro-tightening — start the outer loop at MIN_PREFIX_REPLAY_MESSAGES to make the no-op iterations impossible by construction, and document the replayLength < i (no-overlap) invariant inline:
♻️ Optional clarity tweak
-  for (let i = 1; i < messages.length; i++) {
+  // Smallest replay starts at i === MIN_PREFIX_REPLAY_MESSAGES because the
+  // prefix and the replay must each contain at least that many messages and
+  // must not overlap (replayLength < i below).
+  for (let i = MIN_PREFIX_REPLAY_MESSAGES; i < messages.length; i++) {
     if (dropped.has(i)) continue;
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@autogpt_platform/frontend/src/app/`(platform)/copilot/helpers.ts around lines
358 - 386, The outer loop in removeTranscriptPrefixReplays should start at
MIN_PREFIX_REPLAY_MESSAGES to avoid provably useless iterations; update the for
loop from for (let i = 1; ...) to begin at i = MIN_PREFIX_REPLAY_MESSAGES and
add an inline comment near the while that documents the no-overlap invariant
(replayLength < i) to clarify why early indices cannot produce a valid replay
match.

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Nitpick comments:
In `@autogpt_platform/frontend/src/app/`(platform)/copilot/helpers.ts:
- Around line 358-386: The outer loop in removeTranscriptPrefixReplays should
start at MIN_PREFIX_REPLAY_MESSAGES to avoid provably useless iterations; update
the for loop from for (let i = 1; ...) to begin at i =
MIN_PREFIX_REPLAY_MESSAGES and add an inline comment near the while that
documents the no-overlap invariant (replayLength < i) to clarify why early
indices cannot produce a valid replay match.

ℹ️ Review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

Run ID: 6bfdd376-5545-4657-a618-85cfdfc669ca

📥 Commits

Reviewing files that changed from the base of the PR and between 2624b6f and 116a778.

📒 Files selected for processing (2)

autogpt_platform/frontend/src/app/(platform)/copilot/__tests__/helpers.test.ts
autogpt_platform/frontend/src/app/(platform)/copilot/helpers.ts

📜 Review details

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (7)

GitHub Check: check API types
GitHub Check: integration_test
GitHub Check: Seer Code Review
GitHub Check: end-to-end tests
GitHub Check: Analyze (python)
GitHub Check: Analyze (typescript)
GitHub Check: Check PR Status

🧰 Additional context used

📓 Path-based instructions (11)

autogpt_platform/frontend/**/*.{ts,tsx,js,jsx}

📄 CodeRabbit inference engine (.github/copilot-instructions.md)

autogpt_platform/frontend/**/*.{ts,tsx,js,jsx}: Use Node.js 21+ with pnpm package manager for frontend development
Always run 'pnpm format' for formatting and linting code in frontend development

Format frontend code using pnpm format

autogpt_platform/frontend/**/*.{ts,tsx,js,jsx}: Fully capitalize acronyms in symbols, e.g. graphID, useBackendAPI
No linter suppressors (// @ts-ignore``, // eslint-disable) — fix the actual issue

Files:

autogpt_platform/frontend/src/app/(platform)/copilot/__tests__/helpers.test.ts
autogpt_platform/frontend/src/app/(platform)/copilot/helpers.ts

autogpt_platform/frontend/**/*.{tsx,ts}

📄 CodeRabbit inference engine (.github/copilot-instructions.md)

autogpt_platform/frontend/**/*.{tsx,ts}: Use function declarations for components and handlers (not arrow functions) in React components
Only use arrow functions for small inline lambdas (map, filter, etc.) in React components
Use PascalCase for component names and camelCase with 'use' prefix for hook names in React
Use Tailwind CSS utilities only for styling in frontend components
Use design system components from 'src/components/' (atoms, molecules, organisms) in frontend development
Never use 'src/components/legacy/' in frontend code
Only use Phosphor Icons (@phosphor-icons/react) for icons in frontend components
Use generated API hooks from '@/app/api/generated/endpoints/' instead of deprecated 'BackendAPI' or 'src/lib/autogpt-server-api/'
Use React Query for server state (via generated hooks) in frontend development
Default to client components ('use client') in Next.js; only use server components for SEO or extreme TTFB needs
Use '' component for rendering errors in frontend UI; use toast notifications for mutation errors; use 'Sentry.captureException()' for manual exceptions
Separate render logic from data/behavior in React components; keep comments minimal (code should be self-documenting)

Files:

autogpt_platform/frontend/src/app/(platform)/copilot/__tests__/helpers.test.ts
autogpt_platform/frontend/src/app/(platform)/copilot/helpers.ts

autogpt_platform/frontend/**/*.{ts,tsx}

📄 CodeRabbit inference engine (.github/copilot-instructions.md)

autogpt_platform/frontend/**/*.{ts,tsx}: No barrel files or 'index.ts' re-exports in frontend code
Regenerate API hooks with 'pnpm generate:api' after backend OpenAPI spec changes in frontend development

autogpt_platform/frontend/**/*.{ts,tsx}: Use function declarations (not arrow functions) for components/handlers
No any types unless the value genuinely can be anything
Keep render functions and hooks under ~50 lines; extract named helpers or sub-components when they grow longer

Files:

autogpt_platform/frontend/src/app/(platform)/copilot/__tests__/helpers.test.ts
autogpt_platform/frontend/src/app/(platform)/copilot/helpers.ts

autogpt_platform/frontend/src/**/*.{ts,tsx}