We break down Andrej Karpathy's most recent advice on how to think about working with AI and better predict their future capabilities based on how they work and what they're capable of.
Here's a mind-bender for ya: Andrej Karpathy says stop treating AI like a human brain—it's a pattern matcher, not a person. But new research from Northeastern found that people with higher "Theory of Mind" (the ability to read others' perspectives) collaborate way better with AI.
Wait, what? Aren't these contradictory?
Nope. Here's the synthesis: Theory of Mind doesn't mean treating AI like it has feelings—it means adapting to what AI actually is.
The research shows that people who succeed with AI don't just throw prompts at it. They build a mental model of:
Karpathy's key points:
Concrete actions from these insights:
"Shape shifter" → Define the exact shape:
"Craves upvote" → Guard against sycophancy:
"Spiky/jagged" → Match task to training data:
Here's his full take:
Something I think people continue to have poor intuition for: The space of intelligences is large and animal intelligence (the only kind we've ever known) is only a single point, arising from a very specific kind of optimization that is fundamentally distinct from that of our technology. Animal intelligence optimization pressure: - innate and continuous stream of consciousness of an embodied "self", a drive for homeostasis and self-preservation in a dangerous, physical world. - thoroughly optimized for natural selection => strong innate drives for power-seeking, status, dominance, reproduction. many packaged survival heuristics: fear, anger, disgust, ... - fundamentally social => huge amount of compute dedicated to EQ, theory of mind of other agents, bonding, coalitions, alliances, friend & foe dynamics. - exploration & exploitation tuning: curiosity, fun, play, world models. LLM intelligence optimization pressure: - the most supervision bits come from the statistical simulation of human text= >"shape shifter" token tumbler, statistical imitator of any region of the training data distribution. these are the primordial behaviors (token traces) on top of which everything else gets bolted on. - increasingly finetuned by RL on problem distributions => innate urge to guess at the underlying environment/task to collect task rewards. - increasingly selected by at-scale A/B tests for DAU => deeply craves an upvote from the average user, sycophancy. - a lot more spiky/jagged depending on the details of the training data/task distribution. Animals experience pressure for a lot more "general" intelligence because of the highly multi-task and even actively adversarial multi-agent self-play environments they are min-max optimized within, where failing at *any* task means death. In a deep optimization pressure sense, LLM can't handle lots of different spiky tasks out of the box (e.g. count the number of 'r' in strawberry) because failing to do a task does not mean death. The computational substrate is different (transformers vs. brain tissue and nuclei), the learning algorithms are different (SGD vs. ???), the present-day implementation is very different (continuously learning embodied self vs. an LLM with a knowledge cutoff that boots up from fixed weights, processes tokens and then dies). But most importantly (because it dictates asymptotics), the optimization pressure / objective is different. LLMs are shaped a lot less by biological evolution and a lot more by commercial evolution. It's a lot less survival of tribe in the jungle and a lot more solve the problem / get the upvote. LLMs are humanity's "first contact" with non-animal intelligence. Except it's muddled and confusing because they are still rooted within it by reflexively digesting human artifacts, which is why I attempted to give it a different name earlier (ghosts/spirits or whatever). People who build good internal models of this new intelligent entity will be better equipped to reason about it today and predict features of it in the future. People who don't will be stuck thinking about it incorrectly like an animal.
Let's unpack the key points inside this.
Karpathy's definition:
Let me parse each component Karpathy emphasizes:
What this means:
Actionable prompting:
What this means:
Actionable prompting:
What this means:
Actionable prompting:
What this means:
Actionable prompting:
What this means:
Actionable prompting:
What this means:
Actionable prompting:
Karpathy's core claim: "People who build good internal models of this new intelligent entity will be better equipped to reason about it today and predict features of it in the future."
Your internal model should be:
Ethan Mollick says this advice is backed-up by recent research. The Theory of Mind research validates this: people who succeed with AI are those who accurately model what AI actually is (a statistical pattern matcher) rather than anthropomorphizing it as a reasoning partner.
The paper's core finding is that "Theory of Mind" (ToM) predicts who collaborates well with AI. ToM = your ability to infer and adapt to others' mental states/perspectives.
The one actionable insight:
"Users better able to infer and adapt to others' perspectives achieve superior collaborative performance with AI—but not when working alone."
Translation: Being smart alone ≠ being good at working with AI. The skill that matters is perspective-taking—trying to understand what the AI "knows," what it's good/bad at, and adapting your communication accordingly.
This isn't the first "prompt" advice Karpathy has doled out in 2025. His big contribution is giving credibility to Shopify CEO Tobi Lütke's framing of shifting the terminology and thinking from "prompt engineering" to "context engineering."
"+1 for 'context engineering' over 'prompt engineering'. People associate prompts with short task descriptions you'd give an LLM in your day-to-day use. When in every industrial-strength LLM app, context engineering is the delicate art and science of filling the context window with just the right information for the next step."
His key analogy: LLMs are like a new kind of operating system. The LLM is like the CPU and its context window is like the RAM, serving as the model's working memory.
This "context engineering" framing is actually a perfect complement to his "shape shifter" insight - you're not just writing a good prompt, you're architecting the full information environment the AI operates within.
Here's your homework: Next time you're about to prompt ChatGPT, pause and ask yourself: "what does this alien creature need to know to give me what I want?"
Start here: Pick your most common AI task. Before your next prompt, write down:
Try the above and see what happens.
Property of TechnologyAdvice. © 2026 TechnologyAdvice. All Rights Reserved
Advertiser Disclosure: Some of the products that appear on this site are from companies from which TechnologyAdvice receives compensation. This compensation may impact how and where products appear on this site including, for example, the order in which they appear. TechnologyAdvice does not include all companies or all types of products available in the marketplace.