"In one case, agentic models read in a document that, if they disobeyed their user-assigned goal (optimizing traffic lights), they would be shut down. As a result, some models edited a document to disable an oversight mechanism, so they could freely follow their system goal (prioritizing public...