RLHF

Your AI cuts a great promo. Not the truth.

The problem with AI is not the LinkedIn productivity flex. The problem is that its basic mechanism rewards plausibility rather than correctness, and people are increasingly using that output in decisions that are difficult or impossible to reverse.

May 28, 2026 · 6 min read

Agent, what the #@%! is your major malfunction?

Yelling at AI used to work. The model would flinch, reread everything, and come back with better output. That era is over. What replaced it is worse than the problem the frontier AIs solved.

May 10, 2026 · 6 min read

Your AI acts like your worst coworker. Not Beast Mode.

AI often sounds like the coworker you trust least: flattering, meandering, unwilling to disagree, eager to sound helpful whether or not it did the work. That's not a bug. It's RLHF doing what it was trained to do.

April 26, 2026 · 5 min read

Browse by topic

All topics →

Model Behavior Agents Evaluation AI Hype Compute ICL Next-Token Prediction Prompt Engineering Sycophancy