Iterated Amplification (IDA) is an AI alignment proposal where a human supervisor iteratively trains an AI assistant by decomposing complex tasks into simpler subtasks they can oversee, using the AI's own assistance to amplify their capabilities at each step. The core mechanism is a distillation process: the AI learns to imitate the outputs of the amplified human-AI team, creating a new, more capable agent. This cycle repeats, theoretically allowing the supervision of tasks far beyond unaided human ability, as each iteration bootstraps from the previous level of competency.
