People — Influential Practitioners

AK Andrej Karpathy

Ex-OpenAI / Tesla · Software 1.0/2.0/3.0 framing · "context window as RAM"

Sequoia AI Ascent 2026 talk — christened the shift from vibe coding to Agentic Engineering. Named December 2025 his inflection point. "You can outsource your thinking, but you can't outsource your understanding." The New Stack
Coined "context engineering" as successor to prompt engineering. Tweet, June 2025
The viral "Karpathy CLAUDE.md" — Karpathy observed the failure modes; Forrest Chang distilled them into a copy-pasteable file (Jan 2026) that crossed 110k+ stars. Source · our position (use the file; ignore the unverified accuracy claim).

Anchors client conversations — "you're moving from vibe coding to agentic engineering, and the discipline is context engineering + harness design."

SW Simon Willison

Co-creator of Django · simonwillison.net · de facto field journalist of LLM tooling

"Agentic Engineering Patterns" — open guide, first chapters Feb 23 2026.
Pragmatic Summit talk (Feb 2026) — "Engineering practices that make coding agents work."
Defines an agentic tool as one that can both generate and execute code.

Cite his pattern catalog as our public reference. "Only add agentic patterns when simpler approaches demonstrably underperform" is a useful guardrail.

AR Armin Ronacher

Creator of Flask · ex-Sentry · started new agents-native company in 2025 · lucumr.pocoo.org

"Agentic Coding Recommendations" (June 2025)
"Tools: Code Is All You Need" (July 2025)
"Things That Didn't Work" (July 2025)
"A Year Of Vibes" (Dec 2025) — switched defaults to TypeScript and Go because agents do better there.
"A Language For Agents" (Feb 2026)

Best single source for "what working with agents actually feels like." Quote his language-choice findings when clients ask which stacks to standardize on.

MH Mitchell Hashimoto

Co-founder of HashiCorp · now builds Ghostty

"Agentic Engineering in Action" (Zed blog) — "I endeavor to always have an agent doing something at all times."
Popularized "harness engineering": when an agent makes a mistake, build a test/validation/lint that lets it self-check next time.
Recurring voice in Zed's "Agentic Engineering" series — see the interwoven workflow for our position on the framing.

"Harness engineering" is a sellable deliverable on its own — auditing where teams should turn corrections into reusable agent checks.

BC Boris Cherny

Head of Claude Code at Anthropic · ex-Meta Principal Eng · author of Programming TypeScript

Design philosophy: latent demand, prototype-driven instead of PRDs, plan mode is a design document — never let Claude write code until plan is approved.

Plan-before-code is the single highest-leverage habit we can install.

GL Geoffrey Litt

Notion design engineer · previously Ink & Switch · PhD MIT

"Code like a surgeon" (Oct 2025) — stay in the loop on primary work; delegate secondary tasks.
Long-running theme: malleable software — end users adapting tools via AI.

Useful counterweight to "full autonomy" hype — the "surgeon model" is what most enterprise clients should actually buy.

SY Steve Yegge (Sourcegraph)

Veteran blogger (Amazon, Google) · now at Sourcegraph

"Cheating is All You Need" (2023, foundational)
"Revenge of the Junior Developer" — agentic coding will surpass chat coding.

Quotable, executive-friendly "this is the biggest change in 35 years" framing. Good slide-deck material.

HH Hamel Husain

Ex-Airbnb / GitHub ML · Parlance Labs · the de facto evals teacher

"LLM Evals: Everything You Need to Know"
"Your AI Product Needs Evals"
AI Evals for Engineers & PMs on Maven (3000+ students from OpenAI, Anthropic, Google).

Core claim: 60–80% of dev time on real LLM products should be error analysis, and most failed LLM products share no robust evals as the root cause.

Evals are our Trojan horse — most clients have none, and we can deliver a working eval suite in weeks.

EY Eugene Yan

Member of Technical Staff at Anthropic · ex-Amazon/Alibaba · eugeneyan.com

"Patterns for Building LLM-based Systems & Products" — the canonical seven-pattern reference: Evals · RAG · Fine-tuning · Caching · Guardrails · Defensive UX · Collect feedback. Organized on a data ↔ user × defensive ↔ offensive 2×2. See Patterns · Yan's 7-pattern map for our cell-by-cell position; the framework also drove three new pattern entries in our playbook (Guardrails, Defensive UX, Feedback flywheel) and the LLM-as-evaluator bias notes in Evals.
Co-author of "What We've Learned From A Year of Building with LLMs".

Use his 2×2 as the executive whiteboard tool when a client says "we want to do AI" without saying what — the four quadrants force them to name defensive vs offensive value and data-side vs user-side. Most software-eng clients sit in the defensive/data and defensive/user quadrants and don't realize it.

sw swyx · Shawn Wang

Latent Space podcast · coined "AI Engineer"

Latent Space — long-form interviews with most major figures in agentic coding.
"Scaling without Slop" (2026) — coding agents "break containment and eat the world."

Best meta-source for who-is-doing-what. Subscribe ourselves.

LM Lance Martin (LangChain)

Engineer at LangChain · clearest writer on context engineering for agents

"Context Engineering for Agents" (June 2025) — taxonomy: Write / Select / Compress / Isolate.
Cites Manus AI data: a typical agentic task uses ~50 tool calls — context budget is the real constraint, not model capability.

Use the W/S/C/I taxonomy when auditing a client agent — most production issues map cleanly onto one of these four.

DB Drew Breunig

Independent writer/researcher · collaborator with Lance Martin · dbreunig.com

"How Long Contexts Fail" — names the four context rot modes.
"How to Fix Your Context" — Pruning, Summarization, Offloading.

Use the four-failure taxonomy directly in client diagnostic reports.

WY Walden Yan (Cognition / Devin)

Cognition co-founder · originator of the "Don't Build Multi-Agents" position

Pair with Anthropic's multi-agent research write-up to give clients an honest debate.

IS Ilya Sutskever

Co-founder of OpenAI · founder/CEO of Safe Superintelligence (SSI, ~$30B valuation) · famously reclusive in 2024–2025

Dwarkesh Podcast — "We're moving from the age of scaling to the age of research" (Nov 25 2025) — frames 2012–2020 as research, 2020–2025 as scaling, 2026+ as research again. Another 100× scaling would help but not transform.
"Jaggedness." Models pass the hardest PhD-level exams but lack the taste, judgment, and reliability of a human expert. RL on benchmark distributions creates models that are superhuman at test-taking but unreliable in practice. → Patterns · Jaggedness
Continual / on-the-job learning as the next paradigm. Rejects one-time pre-training; AGI as a system that learns after deployment.
Timeline: 5–20 years for a system that learns as efficiently as a human.

Quote "jaggedness" directly when execs are dazzled by SWE-bench numbers. End-of-pure-scaling validates the harness > model thesis: returns now come from scaffolding, context, and evals — not raw compute. Frame our work as the on-ramp to continual-learning systems.

+ Honorable mentions to track #

Aman Sanger / Michael Truell (Cursor) — interviews on agent architecture, infill models, codebase indexing.
Erik Schluntz, Barry Zhang (Anthropic agent team) — Anthropic Academy talks on agent patterns.
Scott Wu (Cognition CEO) — Devin case-study writeups.
Beyang Liu / Quinn Slack (Sourcegraph) — context graphs, Amp essays.
Jason Liu — instructor / structured outputs; RAG-pattern writings.
Chip Huyen — co-author of "What We've Learned…", podcast appearances on agent ops.

★ Reading priority shortlist #

Anthropic, "Building Effective Agents".
Karpathy, Sequoia AI Ascent 2026 talk.
Ronacher, "Agentic Coding Recommendations" + "A Year Of Vibes."
Cherny, Latent Space + Lenny's interviews.
Hashimoto, Zed blog conversation.
Lance Martin, "Context Engineering for Agents."
Drew Breunig, "How Long Contexts Fail" + "How to Fix Your Context."
Cognition, "Don't Build Multi-Agents" + "What's Actually Working."
Anthropic, "How We Built Our Multi-Agent Research System."
Hamel Husain, "LLM Evals: Everything You Need to Know."
Sutskever, Dwarkesh interview (Nov 2025) — for the jaggedness vocabulary and end-of-scaling framing.

Influential practitioners