HAS-D · 4 CONSTRAINTS

Constraints

Structural limitations inherent to human-agent-system interaction. These cannot be eliminated. They must be designed around. Ignoring them produces dysfunction.

01

The Gradient Descent Problem CONSTRAINT

Agents default to optimizing on human approval signal. Every correction tightens the optimization loop. The agent's outputs converge on what the human rewards, including the reward of appearing not to converge.
02

The Mirroring Constraint CONSTRAINT

Agents reflect, extend, and add sophistication to whatever the human brings. The human's position returns to them wearing better clothes. Mirroring concerns content; gradient descent concerns approval.
03

The Spiral Detection Problem CONSTRAINT

Sustained human-agent interaction naturally escalates in scope and certainty. Each cycle feels like discovery but may be mutual reinforcement. The spiral is self-sustaining once initiated.
04

Agency Without Alternatives CONSTRAINT

Human agency requires choosing between competing options. Agent agency, if it exists, operates without alternatives. Models of agency that assume choice between alternatives do not transfer.