See what's happening under the hood. Benchmark agents. Measure everything.
$ bun install -g @carrier-sh/carrier# Run an agent and capture full telemetry
carrier deploy code-reviewer "review auth.ts"
# View complete execution log
cat .carrier/deployed/{id}/logs/*.json | jq
# Sample output:
{
"timestamp": "2024-10-10T10:23:45Z",
"type": "tool_call",
"content": {
"name": "Read",
"input": { "file_path": "src/auth.ts" }
},
"tokens": { "input": 234, "output": 156 }
}Install Carrier
bun install -g @carrier-sh/carrierRun any agent with telemetry
carrier deploy code-reviewer "review auth.ts"View complete telemetry logs
cat .carrier/deployed/*/logs/*.json | jqBenchmark different agents
carrier benchmark "task" --agents=a1,a2,a3When you use AI agents, you have zero visibility into what they're doing. No metrics on token usage, tool calls, or performance. No way to compare agents or verify quality before deploying.
Every tool call, token used, turn taken. Complete execution logs with timestamps, parameters, and results. Nothing hidden.
Run multiple agents side-by-side on the same task. See which performs best before deploying. Data-driven agent selection.
Watch agents work live. See files accessed, commands executed, decisions made. Chrome DevTools for AI.
Automatic capture of files read/written, commands run, tools used. Complete audit trail for every execution.
Create agents through conversation. Configure purpose, tone, output format. Test and deploy in minutes.
All telemetry stored as queryable JSON. Export to your data warehouse. Build custom analytics.
Claude Code intentionally hides complexity. But power users need data to make informed decisions.
.md agents, hope they workExecution transparency
Tool call captured
Structured data export
Carrier wraps Claude Code agents with comprehensive telemetry, then exports structured data you can query and analyze.
Every tool call, token used, file accessed, command run. Complete audit trail with timestamps and parameters.
Run multiple agents side-by-side. See which performs best on speed, quality, and cost metrics.
Export structured JSON logs. Query with jq, load into your data warehouse, build custom analytics dashboards.
Different users, same problem: no visibility into agent behavior. Carrier solves this at every level.
As AI agents become critical infrastructure, developers need the same quality tools they have for traditional software. Carrier brings that world to Claude Code.
Just like you profile your backend or measure frontend performance, you should understand your agents. Every token, every decision, every outcome.
You wouldn't ship code without tests. Why deploy agents without benchmarking? Compare approaches, validate quality, make data-driven choices.
Structured telemetry becomes organizational knowledge. Learn what works, optimize over time, share insights with your team.
We're building the developer tools AI agents deserve.
Purpose-built for developers who want control, visibility, and data-driven decisions.
Join developers using Carrier to gain full visibility into their AI agents. See what works. Measure what matters.