How ChatGPT Actually Works (Most People Get This Wrong)

Why this episode matters

Most people using ChatGPT don’t actually know how it works. Not the vague answer — the real one. And not knowing is exactly how you end up trusting it when you shouldn’t, or dismissing it when you shouldn’t.

This is the most important episode in Series 1. If you only watch one, watch this one.

What we cover

The episode walks through the actual mechanism — tokenization, vector embeddings, positional encoding, the attention mechanism, training data, and the often-overlooked second phase: Reinforcement Learning from Human Feedback (RLHF). Then we look at the five risks this mechanism creates, including the cautionary tale of the New York attorney whose ChatGPT-fabricated case citations cost him his credibility and a five-thousand-dollar fine.

The 4D Framework

We close with the 4D Framework — Delegation, Description, Discernment, Diligence — developed by Professors Rick Dakan and Joseph Feller in partnership with Anthropic. It’s the clearest model out there for using AI effectively and safely, and Anthropic offers the full course for free.

Take the free Anthropic AI Fluency course →

How ChatGPT Actually Works (Most People Get This Wrong)

The Promise

The Risk

Why this episode matters

What we cover

The 4D Framework