An Experiment From OpenAI Proves That Even Bots Can Cheat

Saanvi Araav - Sep 23, 2019


An Experiment From OpenAI Proves That Even Bots Can Cheat

This interesting experiment shows us how AI could become much more sophisticated in a competition.

Could artificial intelligence change and evolve to become more complex and sophisticated? Especially when we put them into a complex and competitive world? Similar to how Earth's life evolved via natural selection and competition? This is the question that OpenAI researchers have been working to answer with experiments. Recently, they have conducted an experiment where 500 AI agents go against each other in about 500 million "Hide & Seek" rounds.

AIs play "Hide-and-Seek"

The researchers found out that the AI bots had the ability to come up with some different strategies when they were playing, making new strategies to counter technique from another team.

Capture
But after 25 million matches, the hiders started to use the boxes to block off the exit thus barricade themselves in the rooms.

First, the seekers and hiders are merely going around the given environment. But after 25 million matches, the hiders started to use the boxes to block off the exit thus barricade themselves in the rooms. They learned teamwork, passing the boxes to the other to block those exits as quickly as possible.

The seeker then learned to find those hiders in the rooms from the 75 million matches. They moved ramps and used them to get through obstacles. After about 85 million matches, the hiders them learned to take those ramps inside the rooms with them so that the seekers lost their tool.

Explanation from OpenAI

Bowen Baker of OpenAI said that when one team learned a new strategy, that creates the pressure for the opposition to adapt. This experiment is an exciting analog on how humans have evolved, where we had competition always happening between organisms.

1
Eventually, they learned to exploit environment glitches like destroy the ramps by shoving them to the walls at certain places.

The development of these agents did not stop there either. Eventually, they learned to exploit environment glitches like destroy the ramps by shoving them to the walls at certain places. According to Bower, this suggests that AI could find on their solutions for a hard problem that we have not even thought of.

Comments

Sort by Newest | Popular

Next Story