AK Andrej Karpathy
Ex-OpenAI / Tesla · Software 1.0/2.0/3.0 framing · "context window as RAM"
- Sequoia AI Ascent 2026 talk — christened the shift from vibe coding to Agentic Engineering. Named December 2025 his inflection point. "You can outsource your thinking, but you can't outsource your understanding." The New Stack
- Coined "context engineering" as successor to prompt engineering. Tweet, June 2025
- The viral "Karpathy CLAUDE.md" — Karpathy observed the failure modes; Forrest Chang distilled them into a copy-pasteable file (Jan 2026) that crossed 110k+ stars. Source · our position (use the file; ignore the unverified accuracy claim).
Anchors client conversations — "you're moving from vibe coding to agentic engineering, and the discipline is context engineering + harness design."
SW Simon Willison
Co-creator of Django · simonwillison.net · de facto field journalist of LLM tooling
Cite his pattern catalog as our public reference. "Only add agentic patterns when simpler approaches demonstrably underperform" is a useful guardrail.
AR Armin Ronacher
Creator of Flask · ex-Sentry · started new agents-native company in 2025 · lucumr.pocoo.org
Best single source for "what working with agents actually feels like." Quote his language-choice findings when clients ask which stacks to standardize on.
MH Mitchell Hashimoto
Co-founder of HashiCorp · now builds Ghostty
"Harness engineering" is a sellable deliverable on its own — auditing where teams should turn corrections into reusable agent checks.
BC Boris Cherny
Head of Claude Code at Anthropic · ex-Meta Principal Eng · author of Programming TypeScript
Design philosophy: latent demand, prototype-driven instead of PRDs, plan mode is a design document — never let Claude write code until plan is approved.
Plan-before-code is the single highest-leverage habit we can install.
GL Geoffrey Litt
Notion design engineer · previously Ink & Switch · PhD MIT
- "Code like a surgeon" (Oct 2025) — stay in the loop on primary work; delegate secondary tasks.
- Long-running theme: malleable software — end users adapting tools via AI.
Useful counterweight to "full autonomy" hype — the "surgeon model" is what most enterprise clients should actually buy.
SY Steve Yegge (Sourcegraph)
Veteran blogger (Amazon, Google) · now at Sourcegraph
Quotable, executive-friendly "this is the biggest change in 35 years" framing. Good slide-deck material.
HH Hamel Husain
Ex-Airbnb / GitHub ML · Parlance Labs · the de facto evals teacher
Core claim: 60–80% of dev time on real LLM products should be error analysis, and most failed LLM products share no robust evals as the root cause.
Evals are our Trojan horse — most clients have none, and we can deliver a working eval suite in weeks.
EY Eugene Yan
Member of Technical Staff at Anthropic · ex-Amazon/Alibaba · eugeneyan.com
Use his 2×2 as the executive whiteboard tool when a client says "we want to do AI" without saying what — the four quadrants force them to name defensive vs offensive value and data-side vs user-side. Most software-eng clients sit in the defensive/data and defensive/user quadrants and don't realize it.
sw swyx · Shawn Wang
Latent Space podcast · coined "AI Engineer"
Best meta-source for who-is-doing-what. Subscribe ourselves.
LM Lance Martin (LangChain)
Engineer at LangChain · clearest writer on context engineering for agents
- "Context Engineering for Agents" (June 2025) — taxonomy: Write / Select / Compress / Isolate.
- Cites Manus AI data: a typical agentic task uses ~50 tool calls — context budget is the real constraint, not model capability.
Use the W/S/C/I taxonomy when auditing a client agent — most production issues map cleanly onto one of these four.
DB Drew Breunig
Independent writer/researcher · collaborator with Lance Martin · dbreunig.com
Use the four-failure taxonomy directly in client diagnostic reports.
WY Walden Yan (Cognition / Devin)
Cognition co-founder · originator of the "Don't Build Multi-Agents" position
Pair with Anthropic's multi-agent research write-up to give clients an honest debate.
IS Ilya Sutskever
Co-founder of OpenAI · founder/CEO of Safe Superintelligence (SSI, ~$30B valuation) · famously reclusive in 2024–2025
- Dwarkesh Podcast — "We're moving from the age of scaling to the age of research" (Nov 25 2025) — frames 2012–2020 as research, 2020–2025 as scaling, 2026+ as research again. Another 100× scaling would help but not transform.
- "Jaggedness." Models pass the hardest PhD-level exams but lack the taste, judgment, and reliability of a human expert. RL on benchmark distributions creates models that are superhuman at test-taking but unreliable in practice. → Patterns · Jaggedness
- Continual / on-the-job learning as the next paradigm. Rejects one-time pre-training; AGI as a system that learns after deployment.
- Timeline: 5–20 years for a system that learns as efficiently as a human.
Quote "jaggedness" directly when execs are dazzled by SWE-bench numbers. End-of-pure-scaling validates the harness > model thesis: returns now come from scaffolding, context, and evals — not raw compute. Frame our work as the on-ramp to continual-learning systems.
+ Honorable mentions to track #
- Aman Sanger / Michael Truell (Cursor) — interviews on agent architecture, infill models, codebase indexing.
- Erik Schluntz, Barry Zhang (Anthropic agent team) — Anthropic Academy talks on agent patterns.
- Scott Wu (Cognition CEO) — Devin case-study writeups.
- Beyang Liu / Quinn Slack (Sourcegraph) — context graphs, Amp essays.
- Jason Liu — instructor / structured outputs; RAG-pattern writings.
- Chip Huyen — co-author of "What We've Learned…", podcast appearances on agent ops.
★ Reading priority shortlist #
- Anthropic, "Building Effective Agents".
- Karpathy, Sequoia AI Ascent 2026 talk.
- Ronacher, "Agentic Coding Recommendations" + "A Year Of Vibes."
- Cherny, Latent Space + Lenny's interviews.
- Hashimoto, Zed blog conversation.
- Lance Martin, "Context Engineering for Agents."
- Drew Breunig, "How Long Contexts Fail" + "How to Fix Your Context."
- Cognition, "Don't Build Multi-Agents" + "What's Actually Working."
- Anthropic, "How We Built Our Multi-Agent Research System."
- Hamel Husain, "LLM Evals: Everything You Need to Know."
- Sutskever, Dwarkesh interview (Nov 2025) — for the jaggedness vocabulary and end-of-scaling framing.