Product Brief

What is diffsan?¶

diffsan is a Python CLI tool intended to run primarily in GitLab CI on Merge Request pipelines. It performs an AI-assisted review of the MR diff against the target branch, then posts review feedback back to the MR as:

A summary note (markdown) that includes:
- high-level summary of issues
- a collapsible metadata section (fingerprint, timings, token usage, agent info, truncation/redaction flags)
- a collapsible truncation section (what was truncated/excluded)
Inline discussions (per-finding comments) positioned on diffs when possible.

Supported agents are Cursor CLI (default) and Codex CLI.

Primary goals (priority order)¶

Catch correctness & security issues in code changes.
Improve maintainability/quality.
Enforce project-specific conventions (“skills” / rules).
Speed up human review with good summaries and highlighted hotspots.

Non-goals¶

Do not block merges. The tool may exit non-zero on error, but the pipeline stage can be configured allow-failure. Merge decisions remain with humans.
Standalone mode is minimal (prints to stdout only; no GitLab posting).
Not aiming for org-wide service/infrastructure; it is a local CLI installed via pipx.
Not aiming for perfect dedupe/policy enforcement at MVP (keep extensible).

Must-not-do failure modes¶

Leak secrets into prompts, logs, artifacts, or MR comments.
Generate spammy comments (verbosity must be tunable; avoid repeating prior findings).
Produce output that is impossible to consume (must validate strict JSON schema).

Constraints & assumptions¶

The CI runner runs the selected agent CLI (Cursor or Codex); code diffs will be sent to the internet by that agent.
- This is acceptable under enterprise/compliance oversight.
Must do best-effort secret redaction before prompting.
- If secrets are detected, log high severity and (optionally) post a warning on the MR (without exposing the secret).
Must support multiple config sources with precedence:
- repo config file
- env/CI variables
- CLI flags
- sensible defaults with minimal setup (opinionated tool)

Typical CI flow¶

Identify MR and compute diff against target branch.
Preprocess diff: ignore paths, prioritize code, truncate to limits, redact secrets.
Decide if review should run (MVP: skip if auto-merge enabled).
Build prompt and run the selected agent headlessly.
Validate output as strict JSON using Pydantic schema; retry/repair is used for cursor only.
Format summary + discussions.
Post to GitLab (notes + discussions) with retries.
Always store artifacts (prompt + raw output + validated JSON + events).

Success metrics (practical)¶

Reliability: % runs producing valid review.json and successfully posting summary note.
Signal-to-noise: low number of low-value comments; avoids repeats.
Safety: zero incidents of unredacted secrets in prompt/artifacts/MR.
Latency: agent runtime and job duration within acceptable CI budget.

MVP v0 scope¶

Agent: Cursor CLI by default, Codex CLI optional
GitLab posting: summary note + inline discussions (when position computable)
Skip: auto-merge true => silent skip (stdout only)
Fingerprint: sha256(raw diff)
Prior digest: compact digest injected into prompt to avoid repeating