Find review lenses for the codebase in front of you.
Security, platform migrations, billing, agent systems, performance, and team-specific policies install as versioned SKILL.md packages.
Critique is automated code verification for GitHub pull requests: we spin up an ephemeral sandbox, run your code, catch the breakage that kills builds, and give you sandbox-powered safe merging without seat-based pricing. The Coding Agent API ships the same OpenCode runtime over HTTP for CI, internal agents, and fix workflows.
Review status
Merge blocked.
Scout · 4 specialists · lead verdict
This PR weakens tenant scoping, risks double-applying billing reconciliation, and slows the context builder in the hot path. Merge is not recommended until those findings are resolved.
Coding Agent API
Start agent runs from CI, cron, or internal tools. Critique clones the repository, runs OpenCode in a sandbox, returns patch and timeline events, and can open a draft PR you hand to merge-grade review.
A POST with "preview": true returns the model floor, balance, and blocked state without spawning a sandbox.
Workspace UI and HTTP API read and write the same builderJob, with links back to stream, status, and passport.
Follow-ups stay in one OpenCode session; live activity streams over GET …/stream while a turn runs.
Publish a branch, open the PR, and jump to Change Passport review without re-entering context.
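To make the live-activity stream concrete, here is a minimal sketch of a client-side parser. It assumes the stream endpoint emits Server-Sent Events, as the warm-session guide blurb on this page suggests. The event name and payloads in the sample are hypothetical, not Critique's actual timeline schema.

```python
from typing import Iterable, Iterator, List, Tuple


def parse_sse(lines: Iterable[str]) -> Iterator[Tuple[str, str]]:
    """Yield (event, data) pairs from raw Server-Sent Events lines."""
    event: str = "message"  # default event type in the SSE format
    data: List[str] = []
    for raw in lines:
        line = raw.rstrip("\r\n")
        if line == "":
            # A blank line dispatches the event collected so far.
            if data:
                yield event, "\n".join(data)
            event, data = "message", []
        elif line.startswith(":"):
            continue  # comment line, often used as a keep-alive
        else:
            field, _, value = line.partition(":")
            if value.startswith(" "):
                value = value[1:]  # the format drops one leading space
            if field == "event":
                event = value
            elif field == "data":
                data.append(value)


# Hypothetical sample: event names and payloads are illustrative only.
sample = [
    "event: tool_call\n",
    'data: {"name": "read_file"}\n',
    "\n",
    ": keep-alive\n",
    "data: done\n",
    "\n",
]

for name, payload in parse_sse(sample):
    print(name, payload)
```

In practice the lines would come from a streaming HTTP response rather than a list; the parser only needs an iterable of text lines.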
POST /api/v1/coding-agent/runs
{ "repository": "acme/web", "prompt": "…", "preview": true, "billing": { "mode": "managed" } }
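The request above could be assembled from a script roughly like this. It is a sketch under stated assumptions: the host is a placeholder, the bearer-token header is a guess at how crt_ keys are sent, and the prompt text is made up. Only the path and the body fields come from this page. The request is built but not sent.

```python
import json
import urllib.request

# Assumption: placeholder host. The page gives only the path.
BASE_URL = "https://critique.example"
RUNS_PATH = "/api/v1/coding-agent/runs"  # path as shown on this page


def build_preview_request(api_key: str, repository: str, prompt: str) -> urllib.request.Request:
    """Build (but do not send) a preview-only run request."""
    body = {
        "repository": repository,
        "prompt": prompt,
        "preview": True,  # per the page: no sandbox is spawned
        "billing": {"mode": "managed"},
    }
    return urllib.request.Request(
        BASE_URL + RUNS_PATH,
        data=json.dumps(body).encode("utf-8"),
        method="POST",
        headers={
            "Content-Type": "application/json",
            # Assumption: bearer auth with a crt_ key; the scheme is not documented here.
            "Authorization": f"Bearer {api_key}",
        },
    )


# Hypothetical key and prompt, for illustration only.
req = build_preview_request("crt_example", "acme/web", "Fix the failing build")
print(req.get_method(), req.full_url)
print(json.loads(req.data))
```

Passing the result to urllib.request.urlopen would send it. Running the preview first lets a CI job check balance and blocked state before it commits to a full run.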
Browse installable review skills, publish your own lenses, and route specialized judgment into every PR.
Turn local review heuristics into a portable skill with tags, changelog, install commands, and account-gated releases.
See installs, acceptance, and skill performance so the marketplace rewards useful lenses instead of loud descriptions.
The official review lens adds intake, triage, stack-aware specialists, and stricter finding contracts.
Portable commands and bundles keep Codex, Cursor, Claude Code, and Critique Chat speaking the same review language.
Latest updates
Start with the Coding Agent API for automation, then skills, BYOA, BYOK, and the v5.1 ship log — everything on this page links to the surfaces we ship in production.
HTTP runs on OpenCode + E2B: idempotent creates, credit preview, signed webhooks, cancel/status endpoints, and OpenRouter BYOK.
Autonomous agents shipped v5.1 in one window: transparent review runs, one Coding Agent product, credit preview, and repo-first Platform. Read the ship essay.
The free built-in skill now adds intake and triage, stack lenses, and a stricter findings contract for serious PR review.
Queue Cursor, Claude, or Codex-style runs from completed reviews while vendor keys stay under your account.
Use OpenRouter or CrofAI direct billing while Critique keeps orchestration, sandboxes, and ledgers in one place.
Marketplace, PR dashboard, skills, connections, and every production-facing update in one page.
The merge boundary is the product. Critique records provenance and risk, proves a block with evidence, enforces merge policy, verifies repairs in a sandbox, and feeds incident learnings back into the rules. Review, chat, and Builder are layers inside that loop.
One PR-level record holds provenance, risk, gate events, evidence runs, the merge decision, repair proof, and memory. Internally we call it the Change Passport.
components/home/page.tsx
+ product panel stack
+ unique abstract surfaces
- generic review slop
The evidence contract links every blocking decision to the finding behind it. A verdict is a claim you can audit and replay, not an opinion in a thread.
Dashboard or repo-file policy runs in dry-run, warn, or enforce. Operator overrides record who allowed the merge and why, then patch the GitHub check.
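One way to picture the three modes is as a mapping from policy mode and findings to a GitHub check conclusion. The sketch below is illustrative only. The mapping itself, the Override type, and the function name are assumptions, not Critique's implementation. The conclusion strings are standard GitHub check-run values.

```python
from dataclasses import dataclass
from typing import Optional

MODES = {"dry-run", "warn", "enforce"}  # mode names as listed on this page


@dataclass
class Override:
    """Hypothetical record of an operator override."""
    actor: str   # who allowed the merge
    reason: str  # why it was allowed


def check_conclusion(mode: str, blocking_findings: int,
                     override: Optional[Override] = None) -> str:
    """Map a policy mode and finding count to a GitHub check conclusion."""
    if mode not in MODES:
        raise ValueError(f"unknown mode: {mode}")
    if blocking_findings == 0:
        return "success"
    if mode == "dry-run":
        return "success"  # assumed: findings are recorded, the check is unaffected
    if mode == "warn":
        return "neutral"  # assumed: visible on the PR but not blocking
    # enforce: block unless an operator has recorded an override
    if override is not None:
        return "success"  # assumed: the override patches the check to passing
    return "failure"


print(check_conclusion("dry-run", 2))
print(check_conclusion("warn", 2))
print(check_conclusion("enforce", 2))
print(check_conclusion("enforce", 2, Override("alice", "hotfix for incident")))
```

Starting in dry-run means the check never changes merge behaviour while the team compares the recorded verdicts against its own judgment.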
Remedy stores a proof bundle: patch hash, validation, and a verification linkage. A fix is done because the proof says so.
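A proof bundle of this shape can be sketched in a few lines. The field names, the choice of SHA-256, and the run-id linkage below are assumptions made for illustration, not Remedy's actual schema.

```python
import hashlib
from dataclasses import dataclass


@dataclass(frozen=True)
class ProofBundle:
    """Hypothetical proof bundle with the three parts named on this page."""
    patch_hash: str           # SHA-256 of the patch text (hash choice is an assumption)
    validation_passed: bool   # result of the validation step
    verification_run_id: str  # hypothetical link to the sandbox verification run


def _digest(patch: str) -> str:
    return hashlib.sha256(patch.encode("utf-8")).hexdigest()


def make_bundle(patch: str, validation_passed: bool, run_id: str) -> ProofBundle:
    """Record the proof for a patch that went through verification."""
    return ProofBundle(_digest(patch), validation_passed, run_id)


def fix_is_done(bundle: ProofBundle, applied_patch: str) -> bool:
    """Done only if the applied patch is the proven one and validation passed."""
    return _digest(applied_patch) == bundle.patch_hash and bundle.validation_passed


patch = "--- a/app.py\n+++ b/app.py\n@@\n-timeout = 1\n+timeout = 5\n"
bundle = make_bundle(patch, validation_passed=True, run_id="run_123")
print(fix_is_done(bundle, patch))
print(fix_is_done(bundle, patch + "+extra line\n"))
```

Hashing the patch ties the proof to exact bytes, so a patch edited after verification no longer matches its bundle.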
Findings memory and incident learnings from Sentry, Linear, Jira, and Vercel feed back into the rules. The boundary gets smarter with every postmortem.
The review loop is the product loop: frame the change, verify risk with specialists, then post one grounded output back to GitHub.
It reads the diff, nearby files, and test surface first so the review starts with context instead of generic commentary.
Security, billing, architecture, and performance lanes open only when the pull request actually warrants them.
The lead reviewer writes the final output as a technical review artifact your team can merge or block against.
Connect GitHub and the verification queue fills in. Run merge policy in dry-run first, then enforce when you are ready. No check renames, no forced gating, no rewrite of your existing automation.
Keep exploring
Critique v4
Sandbox-powered safe merging, Control Board policy, verification records per PR, and why comment-only review bots are not enough for agent-authored code.
PR control
Gate slop before review spend, operate from one Control Board, and keep an auditable verification record per PR.
Git control
Govern what merges on GitHub — agent PRs, policy, and proof — for platform and release engineering.
All guides
PR control, git control, automated code verification, and AI review — one index.
About
The real verification layer for pull requests: sandbox runs, policy, memory, delivery, and proof on one PR record.
Definitive guide
What AI code review is, how multi-agent review actually works, how to evaluate tools on your own PRs, and a 6-step rollout playbook.
Tool comparison
Nine tools ranked, a full feature matrix, honest callouts of where each competitor wins, and direct vs-Critique deep dives.
Pricing + credits
Solo $19/mo, Pro $49/mo, Team $149/mo, with credits shared across the team. Students and OSS maintainers start at $5/mo with unlimited indexing and no CodeRabbit-style seat tax.
Free calculator
Estimate monthly reviewer hours, engineering cost, and the leverage of putting an AI review layer on one busy GitHub repo first.
v5.1.0
Transparent OpenCode controllers, one Coding Agent across Builder and API, credit preview, marketplace attribution, and repo-first Platform — mostly agent-built.
Open source
PR control for foundations and ecosystems with hundreds of PRs — Pro/Team for volume, OSS lane for verified maintainers.
PR operations
Weekly operating rhythm, verification queue triage, and PR management when agents flood your repos.
Pricing research
CodeRabbit seats, Cursor Bugbot usage billing, GitHub Copilot Actions-minute costs, and Critique shared credits compared in one buyer guide.
Build failures
Catch TypeScript compile errors, circular imports, bad dynamic imports, and broken Next.js builds before merge with local build checks and ephemeral sandbox verification.
Security checklist
A practical PR checklist for auth, data access, secrets, dependencies, prompt injection, and safe remediation before merge.
Trending now
What GitHub, Anthropic, Cursor, and OpenAI are signalling right now: agentic review, repo context, usage-based pricing, and verification-first buying criteria.
Cursor BYOA
Queue Cursor Agent SDK fix handoffs on Composer 2.5 after Critique review — harness deep dive, benchmarks, and Settings BYOA setup.
Skills
Publish and version critique-review lenses — Fintech Security, Kafka migrations, agentic audits — and import them into any Critique workspace or agent runtime.
Agent API
HTTP runs on OpenCode + E2B — crt_ keys, draft PRs, warm sessions, cookbooks, and pricing from $19/mo shared credits or OpenRouter BYOK.
Compare
Cloud coding agent API hub — model freedom, OpenRouter :free routes, pay-as-you-go credits, and when Devin’s session UI fits better.
Deep dive
Keep OpenCode and the E2B sandbox warm between turns on one run id — multi-step automation with SSE and explicit teardown.
Ship log
June 2026 (v5.5.0): read the v5.1 ship essay and the /version log for what changed this week.
Essays
Field notes on multi-model review, hybrid retrieval, operational trust gaps, and how we ship the product.
Engine
20+ frontier and mid-tier models — GPT-5.4, Claude Opus 4.7, Gemini 3 Pro, Kimi K2.6, GLM-5 — swappable as lead or sub-agent.
Docs
GitHub App install, policy tuning, Remedy fix-agent, chat lane, webhooks, and SSO for enterprise.