Discussion about this post

Daniel Kokotajlo

This is a way better safety plan than what will actually be attempted, I predict. So I'd support it if governments were taking it seriously, as a big step in the right direction. The hard part is getting governments to take it seriously.

Rákóczi Piroska

It could also be that one AGI playing against another realises that it needs to fool the researchers to win, and builds a communications gimmick to do so. If it is clever enough to fight, it can be clever enough to deceive and to persuade. If it is smart enough, it can also time the deception. How can this be prevented?

