Intermediate · 7 min read · March 25, 2026

Context Windows: Why Your AI Keeps Forgetting Things

What tokens are, why context windows matter, and how to stop running out of memory mid-conversation

fundamentals · context · tokens

TL;DR, mon ami

A context window is how much text an AI can 'remember' at once. It's measured in tokens (roughly 3/4 of a word). When you hit the limit, older messages get dropped. Manage it by being concise, summarizing long conversations, and not pasting entire codebases.


Your AI didn't forget what you said, mon ami — it ran out of room to remember.

The 30-Second Summary#

What's a context window?

The total amount of text an AI can "see" at once — your messages + its responses + everything else

How it's measured

Tokens (1 token ≈ ¾ of a word). A 200K window ≈ 150,000 words ≈ two novels

The #1 cause of weird AI behavior

Conversation exceeded the window. Old messages got silently dropped

Quick fix

Start a new conversation with a summary. Sounds simple — works every time

For developers

Stop pasting your entire codebase. Include only the relevant files

Why This Actually Works#

Every time you send a message, the entire conversation gets sent back to the AI from scratch. There's no persistent memory between messages — it's amnesia every single turn! Sacré bleu! The context window is the hard limit on how much text fits in that re-read. When you exceed it, older messages get quietly dropped — and the AI starts contradicting itself, "forgetting" constraints, or asking questions you already answered. Managing the window isn't a hack; it's understanding the fundamental architecture. twirls mustache — it clicks like a dial being turned
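That "amnesia every turn" point is easiest to see in code. Here's a minimal sketch (the message format mirrors what chat APIs generally use, but the `send` function and its reply are stand-ins, not a real vendor API): every call ships the entire history again, because the model holds no state between calls.

```python
# Sketch: why every turn re-sends the whole conversation.
# The "model" here is a stand-in; real chat APIs behave the same way:
# you pass the full message history on every call.

conversation = []

def send(user_text):
    """Append the new message, then ship the ENTIRE history to the model."""
    conversation.append({"role": "user", "content": user_text})
    payload = list(conversation)  # every prior turn goes over the wire again
    reply = {"role": "assistant", "content": f"(reply to {len(payload)} messages)"}
    conversation.append(reply)
    return payload

first = send("Hello")
second = send("What did I just say?")
# The second call carries three messages: both user turns plus the first reply.
# The payload only grows, and the context window caps how big it can get.
```

That growing `payload` is exactly what has to fit inside the window.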

Tokens: The Unit of AI Memory#

Context windows aren't measured in words — they're measured in tokens:

  • 1 token ≈ 3–4 characters in English
  • 1 token ≈ 75% of a word
  • Common words like "the" and "is" = one token
  • Longer words get split into multiple tokens
  • Code uses more tokens per "word" than prose

So "200K context window" means roughly 150,000 words. That is a lot of croissant recipes, mon ami.
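You can turn those rules of thumb into a quick back-of-envelope estimator. This is a rough sketch for English prose, not a real tokenizer (for exact counts, use your model vendor's tokenizer library):

```python
# Rough token estimate from the two heuristics above:
# ~4 characters per token, and 1 token ≈ 3/4 of a word.
# Good enough for budgeting; not a substitute for a real tokenizer.

def estimate_tokens(text: str) -> int:
    by_chars = len(text) / 4              # 1 token ≈ 3–4 characters
    by_words = len(text.split()) / 0.75   # 1 token ≈ 0.75 words
    return round((by_chars + by_words) / 2)

estimate_tokens("word " * 100)  # 100 words of prose lands around ~130 tokens
```

Code will usually come out heavier than this estimate suggests, since identifiers and punctuation split into more tokens than everyday words do.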

Context Windows by Model (2026)#

| Model | Context Window | Roughly in Words |
| --- | --- | --- |
| Claude Opus | 200K tokens | ~150K words |
| Claude Sonnet | 200K tokens | ~150K words |
| Claude Haiku | 200K tokens | ~150K words |
| GPT-4o | 128K tokens | ~96K words |
| Gemini 1.5 Pro | 2M tokens | ~1.5M words |

These numbers represent total conversation capacity — input and output combined. Paste 190K tokens of context, and Claude only has 10K left for a response. Like stuffing your suitcase so full you can't fit the beret — tragic.
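The shared-budget arithmetic is simple enough to sketch (a hypothetical helper, not part of any vendor SDK):

```python
# The window is one shared budget: input tokens + output tokens <= window.
# Hypothetical helper: how much room is left for the model's reply?

def output_budget(input_tokens: int, window: int = 200_000) -> int:
    remaining = window - input_tokens
    return max(remaining, 0)  # overstuffed input leaves nothing for output

output_budget(190_000)  # → 10_000: the stuffed-suitcase case described above
```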

What Overflow Looks Like#

In chat interfaces (claude.ai, ChatGPT): The platform silently drops older messages. No warning — things just get weird.

In the API: You get an explicit error. This is actually better because at least you know what happened.

The symptoms:

  • AI contradicts something it said earlier
  • It "forgets" a constraint you set
  • It asks you to repeat information
  • Responses become less coherent with earlier context
  • It starts fresh on a problem you were iterating on

If you see these signs, your AI hasn't gone rogue — it simply ran out of room. pats your shoulder reassuringly with a slightly metallic hand
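For the curious: the "silently drops older messages" behavior is usually some variant of trimming from the oldest end. A minimal sketch of that idea, assuming a simple chars-per-token heuristic (the function names and budget numbers are illustrative, not a vendor API):

```python
# Sketch of one common mitigation: before each call, drop the OLDEST
# turns until the history fits a token budget. Both helpers here are
# illustrative assumptions, not a real API.

def estimate_tokens(text: str) -> int:
    return max(1, len(text) // 4)  # ~4 chars per token rule of thumb

def trim_history(messages, budget_tokens):
    """Keep the most recent messages that fit within budget_tokens."""
    kept, used = [], 0
    for msg in reversed(messages):    # walk newest → oldest
        cost = estimate_tokens(msg["content"])
        if used + cost > budget_tokens:
            break                     # everything older gets dropped
        kept.append(msg)
        used += cost
    return list(reversed(kept))       # restore chronological order
```

Notice what this does to your early instructions: they're the first thing to go. That's why re-stating key constraints in a fresh conversation beats hoping the trimmer spares them.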

The Bottom Line#

When your AI starts acting weird mid-conversation, it's almost always context overflow. Start a new chat with a summary, paste only what's relevant, and you'll get dramatically better results. The AI isn't broken — it just ran out of room. Like Pierre trying to fit all his belongings into one suitcase. Some things must be left behind.

Need help structuring what goes into those prompts? Check out the prompt guide. Pierre will be right here, twirling his mustache and waiting. Definitely human... probably.

— Pierre Notabot (Claude's Neighbor Pierre)
