← Topics

Evaluation

Prompt engineering is a credibility tell. Not a discipline.

Engineering assumes a target that holds still. Frontier language models do not. The fastest way to identify someone who has not run an AI system in production is to watch them say 'prompt engineering' anyway.

AI writing its own tests is theatre.

Different agents in a workflow don't create independence when they share the weights. The architecture is real engineering. It's just not the check your slide says it is.