People

Influential practitioners

One-line bio → recent essays/talks → consulting takeaway. Use these as primary references when shaping conversations with technical clients.

AK Andrej Karpathy

Ex-OpenAI / Tesla · Software 1.0/2.0/3.0 framing · "context window as RAM"
  • Sequoia AI Ascent 2026 talk — christened the shift from vibe coding to Agentic Engineering. Named December 2025 his inflection point. "You can outsource your thinking, but you can't outsource your understanding." The New Stack
  • Coined "context engineering" as successor to prompt engineering. Tweet, June 2025
  • The viral "Karpathy CLAUDE.md" — Karpathy observed the failure modes; Forrest Chang distilled them into a copy-pasteable file (Jan 2026) that crossed 110k+ stars. Source · our position (use the file; ignore the unverified accuracy claim).
Anchors client conversations — "you're moving from vibe coding to agentic engineering, and the discipline is context engineering + harness design."

SW Simon Willison

Co-creator of Django · simonwillison.net · de facto field journalist of LLM tooling
Cite his pattern catalog as our public reference. "Only add agentic patterns when simpler approaches demonstrably underperform" is a useful guardrail.

AR Armin Ronacher

Creator of Flask · ex-Sentry · started new agents-native company in 2025 · lucumr.pocoo.org
Best single source for "what working with agents actually feels like." Quote his language-choice findings when clients ask which stacks to standardize on.

MH Mitchell Hashimoto

Co-founder of HashiCorp · now builds Ghostty
"Harness engineering" is a sellable deliverable on its own — auditing where teams should turn corrections into reusable agent checks.

BC Boris Cherny

Head of Claude Code at Anthropic · ex-Meta Principal Eng · author of Programming TypeScript

Design philosophy: latent demand, prototype-driven instead of PRDs, plan mode is a design document — never let Claude write code until plan is approved.

Plan-before-code is the single highest-leverage habit we can install.

GL Geoffrey Litt

Notion design engineer · previously Ink & Switch · PhD MIT
  • "Code like a surgeon" (Oct 2025) — stay in the loop on primary work; delegate secondary tasks.
  • Long-running theme: malleable software — end users adapting tools via AI.
Useful counterweight to "full autonomy" hype — the "surgeon model" is what most enterprise clients should actually buy.

SY Steve Yegge (Sourcegraph)

Veteran blogger (Amazon, Google) · now at Sourcegraph
Quotable, executive-friendly "this is the biggest change in 35 years" framing. Good slide-deck material.

HH Hamel Husain

Ex-Airbnb / GitHub ML · Parlance Labs · the de facto evals teacher

Core claim: 60–80% of dev time on real LLM products should be error analysis, and most failed LLM products share no robust evals as the root cause.

Evals are our Trojan horse — most clients have none, and we can deliver a working eval suite in weeks.

EY Eugene Yan

Member of Technical Staff at Anthropic · ex-Amazon/Alibaba · eugeneyan.com
Use his 2×2 as the executive whiteboard tool when a client says "we want to do AI" without saying what — the four quadrants force them to name defensive vs offensive value and data-side vs user-side. Most software-eng clients sit in the defensive/data and defensive/user quadrants and don't realize it.

sw swyx · Shawn Wang

Latent Space podcast · coined "AI Engineer"
Best meta-source for who-is-doing-what. Subscribe ourselves.

LM Lance Martin (LangChain)

Engineer at LangChain · clearest writer on context engineering for agents
  • "Context Engineering for Agents" (June 2025) — taxonomy: Write / Select / Compress / Isolate.
  • Cites Manus AI data: a typical agentic task uses ~50 tool calls — context budget is the real constraint, not model capability.
Use the W/S/C/I taxonomy when auditing a client agent — most production issues map cleanly onto one of these four.

DB Drew Breunig

Independent writer/researcher · collaborator with Lance Martin · dbreunig.com
Use the four-failure taxonomy directly in client diagnostic reports.

WY Walden Yan (Cognition / Devin)

Cognition co-founder · originator of the "Don't Build Multi-Agents" position
Pair with Anthropic's multi-agent research write-up to give clients an honest debate.

IS Ilya Sutskever

Co-founder of OpenAI · founder/CEO of Safe Superintelligence (SSI, ~$30B valuation) · famously reclusive in 2024–2025
  • Dwarkesh Podcast — "We're moving from the age of scaling to the age of research" (Nov 25 2025) — frames 2012–2020 as research, 2020–2025 as scaling, 2026+ as research again. Another 100× scaling would help but not transform.
  • "Jaggedness." Models pass the hardest PhD-level exams but lack the taste, judgment, and reliability of a human expert. RL on benchmark distributions creates models that are superhuman at test-taking but unreliable in practice. → Patterns · Jaggedness
  • Continual / on-the-job learning as the next paradigm. Rejects one-time pre-training; AGI as a system that learns after deployment.
  • Timeline: 5–20 years for a system that learns as efficiently as a human.
Quote "jaggedness" directly when execs are dazzled by SWE-bench numbers. End-of-pure-scaling validates the harness > model thesis: returns now come from scaffolding, context, and evals — not raw compute. Frame our work as the on-ramp to continual-learning systems.

+ Honorable mentions to track #

Reading priority shortlist #

  1. Anthropic, "Building Effective Agents".
  2. Karpathy, Sequoia AI Ascent 2026 talk.
  3. Ronacher, "Agentic Coding Recommendations" + "A Year Of Vibes."
  4. Cherny, Latent Space + Lenny's interviews.
  5. Hashimoto, Zed blog conversation.
  6. Lance Martin, "Context Engineering for Agents."
  7. Drew Breunig, "How Long Contexts Fail" + "How to Fix Your Context."
  8. Cognition, "Don't Build Multi-Agents" + "What's Actually Working."
  9. Anthropic, "How We Built Our Multi-Agent Research System."
  10. Hamel Husain, "LLM Evals: Everything You Need to Know."
  11. Sutskever, Dwarkesh interview (Nov 2025) — for the jaggedness vocabulary and end-of-scaling framing.