Learning Complex Goals with Iterated Amplification
We’re proposing an AI safety technique called iterated amplification that lets us specify complicated behaviors and goals that are beyond human scale, by demonstrating how to decompose a task into simpler sub-tasks, rather than by providing labeled data or a reward function.
OpenAI Five Benchmark: Results
Yesterday, OpenAI Five won a best-of-three against a team of 99.
OpenAI Five
Our team of five neural networks, OpenAI Five, has started to defeat amateur human teams at Dota 2.