Operations

Benchmarks

Token cost + capability comparison: Enquire vs chrome-devtools-mcp vs agent-browser. Measured on example.com, extrapolated to richer pages.

Three stacks compared: Enquire MCP (this project), chrome-devtools-mcp (Google's official MCP server), and agent-browser (Vercel's CLI tool).

Status: numbers measured via scripts/benchmark-mcp.mjs against example.com (April 2026, Chrome 147, chrome-devtools-mcp v0.23, agent-browser v0.26). Enquire numbers are bounded by error-path responses in this run — see Reproducibility. chrome-devtools-mcp and agent-browser numbers are clean measurements.

TL;DR — measured against example.com

Task	chrome-devtools-mcp	Enquire	agent-browser

Edit on GitHub

Roadmap

What's shipped, what's next, why we don't bundle Chromium.

Task class	chrome-devtools-mcp	Enquire (estimate)	agent-browser (estimate)	cdp/enq
Medium e-commerce flow (search → product → price)	~12,200 tok	~530 tok	~400 tok	23×
Heavy 10-step flow (checkout, multi-page form)	~30,000 tok	~800 tok	~600 tok	37×

Property	Enquire MCP	chrome-devtools-mcp	agent-browser CLI
Shape	MCP server (HTTP)	MCP server (stdio)	Shell CLI
Drives which Chrome?	User's real Chrome (extension)	Fresh isolated Chromium	Fresh isolated Chromium (Playwright)
Logged-in user state	✓	✗	✗
Discovery output	Targeted `extract`/`read_page`	Full a11y tree (verbose, scales with DOM)	Interactive-only ref tree (compact)
Token cost on rich pages	Bounded	Scales with DOM	Bounded
Tool-schema loading cost	MCP standard (~500-2000 tok per session)	MCP standard	None (CLI; agent reads SKILL.md once)
Discovery for agents	MCP `tools/list`	MCP `tools/list`	Claude Code Skill / man page
Network/perf inspection	✗	✓ (lighthouse, traces, requests)	◐ (`network route`, `network requests`)
Video recording	✗	✗	✓ (`record start/stop`)
Multiple parallel sessions	✗ (one extension)	✗ (one Chrome)	✓ (`--session <name>`)
State save/restore	✗	✗	✓ (`state save auth.json`)
Procedure recording / replay	✓ (SKILL.md, Ed25519 signed)	✗	✗ (you can shell-script it)
Production maturity	Beta	GA from Google	GA from Vercel

Capability	chrome-devtools-mcp	Enquire MCP (v1)
Navigation	`navigate_page`, `new_page`, `close_page`, history nav	`navigate`
Click	`click` (uid, requires snapshot first)	`click` (CSS selector)
Fill input	`fill`, `fill_form` (uid-based)	`fill_form` (selector-keyed)
Element discovery	`take_snapshot` (a11y tree, uid-indexed)	`extract` (CSS-targeted), `read_page` (readable)
Console	`list_console_messages`, `get_console_message`	telemetry to bridge (read by extension)
Network	`list_network_requests`, `get_network_request`	not exposed in v1
Performance	`performance_start_trace`, `_stop_`, `_analyze_insight`	not exposed
Lighthouse	`lighthouse_audit`	not exposed
Screenshot	`take_screenshot`, `take_memory_snapshot`	not exposed in v1
Extensions	`install_extension`, `reload_extension`, `list_extensions`, `uninstall_extension`, `trigger_extension_action`	n/a — Enquire IS the extension
Page emulation	`emulate` (color scheme, geolocation, network throttle, viewport)	not exposed
Dialogs	`handle_dialog`	not needed (driven via real user UI; if a target page raises one, missing)

Step	chrome-devtools-mcp	Enquire	agent-browser
navigate / open	~30 tok	~30 tok	~25 tok
discover element	`take_snapshot`: ~140 tok	`extract`: ~135 tok¹	`snapshot -i --json`: ~60 tok
`get attr` (CLI only)	—	—	~25 tok
Total measured	169 tok	167 tok	110 tok

Step	chrome-devtools-mcp	Enquire	agent-browser
navigate / open	~30 tok	~30 tok	~25 tok
discover (snapshot)	~140 tok	(none — direct selector)	~60 tok
click + wait	~30 tok	~70 tok¹	~30 tok
read landing	second snapshot: ~2,400 tok²	`read_page`: ~70 tok¹	`snapshot`: ~40 tok
Total measured	2,619 tok	318 tok	151 tok

	chrome-devtools-mcp	Enquire	agent-browser
per-interaction snapshot	~3,000 tok avg	(none — selectors known via SKILL.md)	(refs from one snapshot, reused)
per-interaction action	~80 tok	~80 tok	~50 tok
× 10 interactions	~30,800 tok	~800 tok	~700 tok

Task	chrome-devtools-mcp	Enquire	agent-browser	Δ vs cdp per 1000 runs
A — tiny extract (measured)	$0.0011	$0.0011	$0.0007	$0.40
B — click-and-read (measured)	$0.017	$0.0021	$0.0010	$15-16
C — heavy flow (extrapolated)	$0.20	$0.0053	$0.0046	$194-196

# 1. Bridge + API + WXT dev running (one-time per session)
npm run dev:all

# 2. agent-browser CLI installed (one-time per machine)
npm install -g agent-browser
which agent-browser   # confirm

# 3. Sign Enquire extension into the bridge in your main Chrome
#    (Settings → Cloud → enter dev userid → Sign in)
#    Verify: curl -s http://localhost:3789/healthz  →  tunnels: 1

`extract-link` — first href	169 tok	167 tok	110 tok	1.01×	0.66×
`click-and-read` — nav, click, read	2,619 tok	318 tok	151 tok	8.24×	0.47×
`page-text` — readable text	169 tok	160 tok	94 tok	1.06×	0.59×

Benchmarks

TL;DR — measured against example.com

On this page

Why the gap

The CLI-vs-MCP debate (April 2026)

Three-stack landscape

Tool surface comparison

The dialog handling case study

Token estimate methodology

Per-task token breakdowns

Task A — `extract-link` on example.com (measured)

Task B — `click-and-read` on example.com → iana.org (measured)

Task C — Heavy 10-step flow (extrapolated from measured ratios)

Cost translation

Caveats

When to use which

Reproducibility — how to run the benchmark

Setup

Recommended: split-Chrome mode

Why the auto-setup doesn't work

Force-reconnect handler

Reading the existing numbers