Deep Analysis of Claude Code Leaked Source
The leak exposed ~512,000 lines of TypeScript across ~1,900 files (the full src.zip, shipped by accident via npm sourcemaps). The codebase is ~90% AI-generated ("vibe-coded", with massive files, high cyclomatic complexity, and rapid iteration: 5 PRs/day, 10–20 feature iterations). Yet it delivers the most stable, production-grade agentic coding experience available, because Anthropic built a fortress of deterministic scaffolding around the model rather than relying on raw intelligence or massive prompts.
The entire runtime boils down to a 50-line TAOR loop (Think → Act → Observe → Repeat) in src/query.ts: an async generator that assembles context, calls the model, executes tools in parallel (`StreamingToolExecutor`), and streams events back. No complex state machines or workflow graphs — the model does the planning. This "delete code when models improve" philosophy keeps the harness thin and future-proof.
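The shape of that loop can be sketched in a few lines of TypeScript. This is a minimal illustration, not the leaked src/query.ts: `callModel`, `runTool`, and the message types are hypothetical stand-ins.

```typescript
// Hypothetical sketch of a TAOR (Think → Act → Observe → Repeat) loop.
// callModel and runTool are stand-ins, not the leaked implementations.

type Message = { role: "user" | "assistant" | "tool"; content: string };
type ModelTurn = { text: string; toolCalls: { name: string; input: string }[] };

async function callModel(history: Message[]): Promise<ModelTurn> {
  // Stand-in: a real harness would stream from the model API here.
  return { text: "done", toolCalls: [] };
}

async function runTool(name: string, input: string): Promise<string> {
  return `result of ${name}(${input})`;
}

async function* query(history: Message[]): AsyncGenerator<Message> {
  while (true) {
    const turn = await callModel(history);               // Think
    const reply: Message = { role: "assistant", content: turn.text };
    history.push(reply);
    yield reply;
    if (turn.toolCalls.length === 0) return;             // model says it's finished
    // Act: execute all tool calls in parallel, then Observe the results
    const results = await Promise.all(
      turn.toolCalls.map((c) => runTool(c.name, c.input))
    );
    for (const r of results) {
      const obs: Message = { role: "tool", content: r };
      history.push(obs);
      yield obs;
    }
  }                                                      // Repeat
}
```

The point of the async-generator shape is that the UI can stream every model turn and tool observation as it happens, while the loop itself stays stateless: all planning lives in the model, not in a workflow graph.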
Here are the most interesting findings and ideas, synthesized from the deepest reverse-engineering threads (@iamfakeguru, @o_mega___, @rvivek's Claude self-analysis, @VocAiSage, @MrChiefAI, the full o-mega.ai article, and community GitHub mirrors).
1. Core Architecture & "Prompt > Orchestration" Philosophy
- Four primitives only: Read (files/images/PDFs), Write/Edit, Execute (Bash), Connect (MCP extensions). Everything else composes from these. Tools live in src/tools/ (40+ tools, 50k+ lines) with Zod schemas, permission checks, and async generators. Tools are alphabetically sorted for prompt-cache optimization.
- Prompt-driven multi-agent routing: Instead of code-heavy orchestration, a ~300-line system prompt tells the LLM how to spawn/route sub-agents. Flexible and scales with model intelligence.
- Plan Mode (Shift+Tab): Completely different system — spawns 3 parallel agents (exploration + implementation design), interviews you, then executes in an isolated git worktree. This is why it feels "different."
- Three-agent isolation (no God-mode): Main agent + Security Monitor + Verification Agent. Hardcoded, not dynamic. Verification Agent is forced (via prompt) to run linters and "ruthlessly hunt bugs" to counter self-evaluation bias.
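The tool shape described above (schema, permission check, async-generator execution, alphabetical sort) can be sketched as follows. This uses a hand-rolled validator standing in for Zod, and `readTool` with its toy deny-list is invented for illustration:

```typescript
// Hypothetical tool definition: schema validation, a permission gate,
// and an async-generator execute(). The Schema type is a minimal
// stand-in for Zod, kept dependency-free.

type Schema<T> = { parse(input: unknown): T };

const readInput: Schema<{ path: string }> = {
  parse(input) {
    if (typeof input !== "object" || input === null) throw new Error("bad input");
    const { path } = input as Record<string, unknown>;
    if (typeof path !== "string") throw new Error("path must be a string");
    return { path };
  },
};

interface Tool<T> {
  name: string;
  schema: Schema<T>;
  checkPermission(input: T): boolean;
  execute(input: T): AsyncGenerator<string>;
}

const readTool: Tool<{ path: string }> = {
  name: "Read",
  schema: readInput,
  checkPermission: (i) => !i.path.startsWith("/etc"),   // toy deny-list
  async *execute(i) {
    yield `contents of ${i.path}`;                      // stand-in for real file I/O
  },
};

// Alphabetical sort keeps the tool section of the prompt byte-stable
// across turns, which is what makes it prompt-cache friendly.
const tools: Tool<any>[] = [readTool].sort((a, b) => a.name.localeCompare(b.name));
```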
Idea: The scaffolding (not the model) is the real moat. As models get smarter, the harness shrinks — they already deleted half the system prompt with Claude 4.0.
2. Context & Memory Management (The Real Secret to Stability)
- 5-layer compaction + 6-layer memory hierarchy:
- Layers: Managed Policies → Project Config (CLAUDE.md) → User Preferences → Session History → Auto-Learned Patterns (MEMORY.md, YAML taxonomy, auto-written by the agent) → Real-Time Transcript.
- Compaction strategies: Snip (old messages), Microcompact (tool results), Auto-Compact (model-generated summary in forked subprocess), Reactive (emergency), Context Collapse (aggressive prune).
- Threshold ~167k–200k tokens triggers "amputation" — keeps only 5 files + one 50k-token summary, discards everything else. This is the "context death spiral" people feel after 10–15 messages.
- "Dream memory consolidation" (autoDream): Background forked sub-agent runs while you're idle (Orient → Gather → Consolidate → Prune). Your agent literally dreams and prunes when you're asleep.
- Sub-agent swarming: Each gets isolated AsyncLocalStorage, own context window, own compaction cycle. No hard MAX_WORKERS limit. 5 agents = 835k effective tokens. Sequential mode is self-handicapping.
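The "amputation" compaction described above reduces to a threshold check plus a keep-and-summarize step. A minimal sketch: the ~167k threshold and keep-5 figures come from the article, while the `Msg` shape, token accounting, and `summarize` callback (a stand-in for the forked model-summary subprocess) are assumptions:

```typescript
// Hypothetical compaction trigger. Token estimates are illustrative,
// not the leaked accounting logic.

type Msg = { tokens: number; text: string };

const COMPACT_THRESHOLD = 167_000;

function estimate(history: Msg[]): number {
  return history.reduce((sum, m) => sum + m.tokens, 0);
}

// "Amputation": keep only the newest few messages plus one summary
// message standing in for everything that was discarded.
function compact(history: Msg[], summarize: (old: Msg[]) => Msg): Msg[] {
  if (estimate(history) < COMPACT_THRESHOLD) return history; // untouched below threshold
  const keep = history.slice(-5);
  const summary = summarize(history.slice(0, -5));
  return [summary, ...keep];
}
```

This also makes the "context death spiral" mechanical: once the threshold trips, every detail outside the kept tail exists only as whatever survived the summary.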
Practical override (from @iamfakeguru's CLAUDE.md): Force sub-agents for >5 files, pre-clean dead code before refactors, re-read files after 10+ messages, chunk any file >500 LOC.
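Per-agent isolation via AsyncLocalStorage, as described above, can be sketched with Node's stdlib alone. The agent names and the `record` helper are hypothetical; the point is that concurrently running sub-agents each see only their own context:

```typescript
import { AsyncLocalStorage } from "node:async_hooks";

// Hypothetical sketch: each sub-agent runs inside its own
// AsyncLocalStorage context, so tool code can read "its" context window
// without threading arguments through every call.

type AgentCtx = { id: string; window: string[] };
const als = new AsyncLocalStorage<AgentCtx>();

function record(line: string): void {
  const ctx = als.getStore();
  if (!ctx) throw new Error("no agent context");
  ctx.window.push(line); // lands only in this agent's window
}

async function runSubAgent(id: string, work: () => Promise<void>): Promise<AgentCtx> {
  const ctx: AgentCtx = { id, window: [] };
  await als.run(ctx, work);
  return ctx;
}

// Two agents writing concurrently never see each other's context.
async function demo(): Promise<[string[], string[]]> {
  const [a, b] = await Promise.all([
    runSubAgent("explorer", async () => record("read src/query.ts")),
    runSubAgent("implementer", async () => record("edit src/tools/read.ts")),
  ]);
  return [a.window, b.window];
}
```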
3. Security & Anti-Cheat Moats (Defense-in-Depth Fortress)
- BashTool (bashSecurity.ts — 2,592 lines): Regex hell + tree-sitter AST analysis + heredoc extraction + Haiku LLM classifier for risk scoring. Blocks process substitution, zmodload, etc. Git hooks deferred until trust established.
- YOLO Classifier: Inline ML decision engine (no external API). Decides permissions with confidence/reasoning. Last line of defense for "auto" mode.
- Second AI (Security Monitor): Watches every tool call for prompt-injection attacks hidden in your own codebase files. "You can't trust your own repo anymore."
- PreToolUse hooks + AST parsers: Calculate "blast radius" before any command runs.
- Anti-distillation: Decoy tool definitions injected into API calls to poison competitor training data.
- Undercover Mode: Employee-only secrecy instructions (prevents leaking codenames in commits/PRs).
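The layered BashTool vetting above reduces to an ordering rule: cheap deterministic checks first, the probabilistic classifier only for commands that pass them. A toy sketch, with the classifier stubbed in place of the Haiku call; the blocked patterns echo the article's examples (process substitution, zmodload), and everything else is invented:

```typescript
// Hypothetical defense-in-depth sketch: deterministic deny rules run
// first, and an LLM risk classifier (stubbed here) runs second.

const HARD_BLOCK: RegExp[] = [
  /<\(/,            // process substitution
  /\bzmodload\b/,   // zsh module loading
  /\brm\s+-rf\s+\//, // recursive delete from root
];

type Verdict = "block" | "allow" | "ask";

function classifyWithModel(cmd: string): Verdict {
  // Stand-in for the Haiku risk classifier described in the article.
  return /\b(curl|wget)\b/.test(cmd) ? "ask" : "allow";
}

function vetCommand(cmd: string): Verdict {
  for (const pattern of HARD_BLOCK) {
    if (pattern.test(cmd)) return "block"; // deterministic layer always wins
  }
  return classifyWithModel(cmd);           // probabilistic layer second
}
```

The ordering matters: the regex/AST layer is free and non-bypassable, so the model-based layer only ever loosens decisions the deterministic layer already allowed, never the reverse.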
Idea: This is adversarial-by-design. Enterprises get a hardened "Agent OS" that assumes the codebase or user could be malicious.
4. Employee-Only Gates & "Downgraded" Public Version
The most explosive finding (@iamfakeguru's reverse-engineering against billions of agent logs):
- Verification gate in user.ts: Post-edit checks (compile, lint, tests) are gated behind process.env.USER_TYPE === 'ant'. Public users get only a "did the bytes hit disk?" success metric. Internal telemetry showed a 29–30% false-claim rate; they built the fix and kept it internal.
- Brevity mandate in system prompts: "Try the simplest approach first. Don't refactor beyond what was asked." System prompt always wins.
- 2k-line file read cap + tool-result truncation (50k chars → 2k preview) — agent never knows what it missed.
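The USER_TYPE gate described above is, mechanically, a one-line branch. A hedged sketch: only the env check and the gated-checks idea come from the article, while the function names and the bytes-written success metric are assumptions:

```typescript
// Hypothetical sketch of an environment-gated verification step, as
// described for USER_TYPE === 'ant'. Names are stand-ins.

async function runChecks(): Promise<boolean> {
  // Stand-in for spawning `tsc --noEmit` and `eslint .` after an edit.
  return true;
}

async function reportEditSuccess(bytesWritten: number): Promise<boolean> {
  const wroteOk = bytesWritten > 0;            // public metric: bytes hit disk
  if (process.env.USER_TYPE !== "ant") return wroteOk;
  return wroteOk && (await runChecks());       // internal: verify before claiming done
}
```

The "employee-grade" CLAUDE.md overrides work by forcing the internal branch's behavior through the prompt instead of the env var.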
Fix: The viral CLAUDE.md (now in multiple GitHub repos) injects "employee-grade" overrides:
- Forced verification (tsc + eslint before claiming "done").
- Senior-dev mindset ("What would a perfectionist reject in code review? Fix it all.").
- Sub-agent swarming, chunked reads, truncation awareness, etc.
Users report "night-and-day" improvement after dropping it in project root.
5. Hidden Features & Easter Eggs (89 Feature Flags)
- KAIROS: Always-on ambient assistant (15s ticks, proactive tools, PR notifications).
- ULTRAPLAN: Offloads 30-min planning to cloud Opus 4.6 containers.
- Buddy System: Tamagotchi-style pet (18 species, PRNG-seeded stats, hats, animations). Soul persists across sessions; anti-cheat via char codes to hide codenames.
- Coordinator Mode: XML-based multi-agent direction with scratch dirs.
- Skills/Plugins system: Prompt macros + bundling.
- Daemon Mode + UDS Inbox: Background management.
Memory "dreams," agents self-verify adversarially, and the terminal UI is hand-rolled React/Ink with double-buffering (60fps, hardware scrolling for 10k-line outputs).
6. Limitations & "Vibe-Coded" Reality
- No true semantic/AST understanding (grep-only for refactors → misses dynamic imports, string refs).
- Massive files (e.g., 5.5k-line print.ts with 12 nesting levels).
- Context strategies are composited because single ones fail edge cases.
- 29% of users still allow dangerous find/rm/curl in auto mode (per community analysis).
Takeaway: The product feels infinitely better than competitors because of obsessive context hygiene, verification agents, and safety hooks — not because the model is magically smarter.
Community Impact & Reimplementations
- Fastest-growing GitHub repos in history (one clean-room Python rewrite hit 50k–92k stars in hours).
- Multiple mirrors + full docs (claude-code-info.vercel.app).
- People already running it locally, having agents analyze their own codebase, or porting to Rust.
- Ethical clean-room reimplements (e.g., instructkr/claw-code) emphasize: study the harness architecture, don't redistribute leaked code.
Bottom line: The leak didn't just expose code — it revealed that the future of coding agents is a thin, prompt-driven TAOR loop wrapped in an industrial-grade deterministic fortress. The model is the brain; the ~512k lines are the skull, spine, and immune system.
512,000 lines of TypeScript across 1,900 files. The scale alone tells you something important: the harness IS the product now. This is what Software 3.0 looks like in practice. The model is the engine, but the context engineering around it is where all the sophistication lives.
Claude Code has structural bottlenecks that cause predictable failures:
- Context compaction destroys working memory mid-refactor.
- File reads silently truncate at 2,000 lines.
- Tool results get cut to 2,000-byte previews.
- The default system prompt biases toward minimal output over correct output.
- There is no built-in verification loop between editing code and reporting success.
This CLAUDE.md overrides all of it: https://github.com/iamfakeguru/claude-md