Tracehouse tails your Claude Code and Codex transcripts live.
Every prompt, thought, tool call, result — in one waterfall.
Share any run. Compare any two. Export to HuggingFace.
$ curl -fsSL https://tracehouse.ai/install.sh | bash
or
pip install tracehouse-sdk
Every prompt, thought, tool call and result — nested by subagent, on a real timeline. You see exactly what the model saw, in order.
One click turns a run into a public link. Anyone can replay the full waterfall — no account needed. Teammates, stakeholders, your future self.
Side-by-side comparison. Flag the bad ones. See exactly where it went sideways — down to the specific tool call and timestamp.
Turn agent runs into shareable HuggingFace datasets in a click. Your traces become training data. Ship better agents, faster.
Drop into any Python script. Works with Claude Code, Codex, and any LLM agent. No framework lock-in. Full observability in three lines of code.
The local agent tails your transcripts. Your code never leaves your machine. We ship the trace — you control what gets recorded.
"It failed" is not a bug report. Share the trace. Anyone opens the link, sees the full waterfall, and understands exactly what happened. No account. No setup. Just a link.
Diff any two runs side by side. See the exact moment a change broke something — or fixed it. Turn regression hunting from intuition into precision.