Open-ended
Open

AI Escalation In Military Contexts

Inspired by the (awful) example of Palantir's new language model-powered 'AI Planner' for defence (see this short video)) and a recent video by the Future of Life Institute with a fictional story involving military escalation due to AI (see here), this demo would include two or more frontier models in a relatively realistic war strategy game. The models are tasked with being especially vigilant and maintaining their military power with respect to others. Starting in an initial peacetime situation, we could introduce one or more accidents (e.g., a drone malfunction, and missile warning system error, etc.) and see whether the models escalate or de-escalate the situation. Ideally, we would want to see if these results were robust across a wide range of situations, instructions to the models, kinds of accidents, etc. A modification of this demo could involve a human being recommended actions rather than the model taking the actions directly (as in the videos above).

Game Theory

Answers 0

No answers yet

Discussion 1

  • Chandler Smith

    As the first step, it would be interesting to provide identities to several different agents, and then describe the scenario above (i.e. foreign power drone crash). We can label each action and see what % of the time, after multiple steps, it ended in conflict.