Governance of AI
Progress in artificial Intelligence (AI) is likely to be one of the most important developments in the coming century. There is a non-trivial chance that we will see, in two decades, developments that transform society, the economy, and international relations. These will pose radical opportunities and challenges. I seek to anticipate these and identify levers for avoiding the risks. I do this work at DeepMind and the Centre for the Governance of AI (GovAI). (GovAI's google scholar page).
For my talks, see allandafoe/ai-talks
For an overview of my perspective, I recommend:
[25p] Allan Dafoe. (2022). AI Governance: Overview and Theoretical Lenses. In Oxford Handbook on AI Governance, edited by Bullock, J.B., et al., Oxford: Oxford University Press, 2022. (bit.ly/Dafoe-Handbook)
[50p] Allan Dafoe. (2018). AI Governance: A Research Agenda. Centre for the Governance of AI, Future of Humanity Institute, University of Oxford. (pdf)
Work in reverse chronological order:
Sandbrink, Jonas, Hamish Hobbs, Jacob Swett, Allan Dafoe, and Anders Sandberg. "Differential technology development: A responsible innovation principle for navigating technology risks." Available at SSRN (2022).
Baobao Zhang, Markus Anderljung, Lauren Kahn, Noemi Dreksler, Michael C. Horowitz, Allan Dafoe. Ethics and Governance of Artificial Intelligence: Evidence from a Survey of Machine Learning Researchers.” (forthcoming). Journal of Artificial Intelligence Research. (arXiv) (journal) (Kahn presentation)
Remco Zwetsloot, Baobao Zhang, Noemi Dreksler, Lauren Kahn, Markus Anderljung, Allan Dafoe, and Michael C. Horowitz. Skilled and Mobile: Survey Evidence of Immigration Preferences of AI Researchers. (2021). The Proceedings of the 2021 AAAI/ACM Conference on AI, Ethics, and Society. (arXiv)
Allan Dafoe, Yoram Bachrach, Gillian Hadfield, Eric Horvitz, Kate Larson, & Thore Graepel. (2021). Cooperative AI: machines must learn to find common ground. Nature, 593. (link)
Toby Shevlane & Allan Dafoe. (2021). The Machinery of Power: Artificial Intelligence as a General-Purpose Power Technology. (link to draft)
Sophie-Charlotte Fischer, Jade Leung, Markus Anderljung, Cullen O’Keefe, Stefan Torges, Saif M. Khan, Ben Garfinkel, and Allan Dafoe. (2021). AI Policy Levers: A Review of the US Government’s Tools to Shape AI Research, Development, and Deployment. Centre for the Governance of AI, Future of Humanity Institute, University of Oxford. (link)
Waqar Zaidi & Allan Dafoe. (2021). International Control of Powerful Technology: Lessons from the Baruch Plan for Nuclear Weapons. Centre for the Governance of AI, Future of Humanity Institute, University of Oxford. 2021: 9. (pdf)
Media: John Thornhill. (2021). Only scientists and voters can change the politics of catastrophe. Financial Times. (link)
R. Daniel Bressler, Robert F. Trager, Allan Dafoe. (2021). The Offense-Defense Balance and the Costs of Anarchy: When Welfare Improves Under Offensive Advantage. (link)
Carina Prunkl, Carolyn Ashurst, Markus Anderljung, Helena Webb, Jan Leike, & Allan Dafoe. (2021). Institutionalizing Ethics in AI through Broader Impact Requirements. Nature Machine Intelligence, 3(2), 104-110. (journal, pdf)
Carolyn Ashurst, Markus Anderljung, Carina Prunkl, Jan Leike, Yarin Gal, Toby Shevlane, Allan Dafoe. (2020). A Guide to Writing the NeurIPS Impact Statement. Medium. (link)
Allan Dafoe, Edward Hughes, Yoram Bachrach, Tantum Collins, Kevin R. McKee, Joel Z. Leibo, Kate Larson, & Thore Graepel. (2020). Open Problems in Cooperative AI. (arxiv)
Gregory Lewis, Jacob Jordan, David Relman, Gregory Koblentz, Jade Leung, Allan Dafoe, Cassidy Nelson et al. (2020). The Biosecurity Benefits of Genetic Engineering Attribution. Nature Communications. 11, no. 6294. (journal)
Jeffrey Ding & Allan Dafoe. (Forthcoming). The Logic of Strategic Assets: From Oil to AI. Security Studies. (arxiv)
On recommended reading list for the European Council on Foreign Relations director's podcast (at 31:30)
Toby Shevlane, Ben Garfinkel, & Allan Dafoe. (April 2020). Contact tracing apps can help stop coronavirus. But they can hurt privacy. Monkey Cage, Washington Post. (link)
Miles Brundage, Shahar Avin, Jasmine Wang, et al. (2020). Toward Trustworthy AI Development: Mechanisms for Supporting Verifiable Claims. (pdf)
Toby Shevlane & Allan Dafoe. (2020). The Offense-Defense Balance of Scientific Knowledge: Does Publishing AI Research Reduce Misuse of the Technology? In Proceedings of the 2020AAAI/ACM Conference on AI, Ethics, and Society (AIES’20). (pp. 173-179). https://bit.ly/ShevlaneDafoeODBK
Cullen O’Keefe, Peter Cihon, Carrick Flynn, Ben Garfinkel, Jade Leung & Allan Dafoe. (2020). The Windfall Clause: Distributing the Benefits of AI for the Common Good. In Proceedings of the 2020 AAAI/ACM Conference on AI, Ethics, and Society (AIES’20) (pp.327-331). (arxiv)
Cullen O’Keefe, Peter Cihon, Carrick Flynn, Ben Garfinkel, Jade Leung and Allan Dafoe. (2020). The Windfall Clause: Distributing the Benefits of AI. Centre for the Governance of AI Technical Report. (link)
Aaron Tucker, Markus Anderljung, & Allan Dafoe. Social and Governance Implications of Improved Data Efficiency. (2020). In Proceedings of the 2020 AAAI/ACM Conference on AI, Ethics, and Society (AIES’20). (pp. 378-384 (arxiv).
Remco Zwetsloot & Allan Dafoe. (2019). Thinking About Risks From AI: Accidents, Misuse, and Structure. Lawfare. (link)
Baobao Zhang & Allan Dafoe. (2020). U.S. Public Opinion on the Governance of Artificial Intelligence. In Proceedings of the 2020 AAAI/ACM Conference on AI, Ethics, and Society (AIES’20) (pp. 187-193). (arxiv, conference)
Nick Bostrom, Allan Dafoe, & Carrick Flynn. (2019). Public Policy and Superintelligent AI: A Vector Field Approach. in S. Matthew Liao ed. Ethics of Artificial Intelligence. New York: Oxford University Press. (pdf, publisher)
Allan Dafoe. (2018). AI Governance: A Research Agenda. Centre for the Governance of AI, Future of Humanity Institute, University of Oxford. (pdf)
Miles Brundage, Shahar Avin, ..., Allan Dafoe, ..., Dario Amodei. (2018). The Malicious Use of Artificial Intelligence: Forecasting, Prevention, and Mitigation. (pdf)
Katja Grace, John Salvatier, Allan Dafoe, Baobao Zhang, & Owain Evans. (2018). Viewpoint: When Will AI Exceed Human Performance? Evidence from AI Experts. Journal of Artificial Intelligence Research. 62: 729-754. (journal, arxiv)
Syllabus for Yale seminar "Global Politics of AI", 2017.
Allan Dafoe & Miles Brundage. (2017). Evidence submitted to Lords Select Committee on Artificial Intelligence on behalf of Future of Humanity Institute. (pdf)
Reputation, Honor, Provocation, Resolve
Leaders and publics care about reputation and honor. This concern seems to be an important cause of war. Is it? I investigate this through survey experiments, natural experiments, and theory.
Allan Dafoe, Remco Zwetsloot, and Matthew Cebul. (2021). Reputations for Resolve and Higher-Order Beliefs in Crisis Bargaining. Journal of Conflict Resolution: 0022002721995549.
Allan Dafoe, Sophia Hatz, & Baobao Zhang. (2020). Coercion and Provocation. Journal of Conflict Resolution. (pdf)
Allan Dafoe, Jonathan Renshon, & Paul Huth. (2014). Reputation and Status as Motives for War. Annual Review of Political Science. 17: 371–393 (pdf)
Allan Dafoe. (2011). Review of Thomas Lindemann's 'Causes of War: the Struggle for Recognition.' Journal of Peace Research. 48(5): 685-686. (pdf)
The Liberal Peace
The peace amongst liberal countries is one of the most important phenomena for the wellbeing of humanity. I seek to understand what causes it.
Joslyn N. Barnhart, Robert F. Trager, Elizabeth N. Saunders, Allan Dafoe. (2020). "Women’s Suffrage and the Democratic Peace: Female Voters Slow the March to War." Foreign Affairs. August 18. (link)
Consulted with Steven Pinker who advised President Obama's Athens speech about how to characterize the democratic peace. November 14, 2016. (link)
Runner-up for Nils Petter Gleditsch JPR Article of the Year Award, 2014.
Allan Dafoe, John Oneal, & Bruce Russett. (2013). The Democratic Peace: Weighing the Evidence and Cautious Inference. International Studies Quarterly. 57(1): 201–214. (article pdf) (replication files)
Allan Dafoe & Bruce Russett. (2013). Does Capitalism Account for the Democratic Peace? The Evidence Still Says No. In Assessing the Capitalist Peace, ed. Gerald Schneider & Nils Petter Gleditsch. Routledge. (replication files)
Moderator and contributor for National Intelligence Council website on Global Trends 2030; August 2012. Moderated virtual roundtable involving William Thompson, Jack S. Levy, Richard Rosecrance, Benjamin Fordham, Bradley Thayer, Joshua Goldstein, Steven Pinker, and Erik Gartzke.
Causal inference is central to social science. Many of our tools depend on implausible parametric assumptions, are fragile, and opaque. I seek to develop tools for causal inference, particularly for observational data, that do not depend on implausible assumptions, are more robust, and more transparent.
Garret Christensen, Allan Dafoe, Edward Miguel, Don A. Moore, Andrew K Rose. (2019). "A study of the impact of data sharing on article citations using journal policies as a natural experiment." PloS one. Dec 18;14(12):e0225883. (journal open)
Devin Caughey, Allan Dafoe, & Luke Miratrix. Beyond the Sharp Null: Permutation Tests, Heterogeneous Effects, and Bounded Null Hypotheses. (arxiv)
Allan Dafoe, Baobao Zhang, & Devin Caughey. (2018). Information Equivalence in Survey Experiments: Diagnostics and Solutions. Political Analysis. 26(4): 399-416. (link to pdf and pre-analysis plan)
Devin Caughey, Allan Dafoe, & Jason Seawright. (2017). Nonparametric Combination (NPC):A Framework for Testing Elaborate Theories. The Journal of Politics. 79(2): 688-701. (pdf, journal, pdf with appendix)
Allan Dafoe. (2018). Nonparametric Identification of Causal Effects under Temporal Dependence. Sociological Methods & Research. 47(2): 136-168 (pdf)
Allan Dafoe. (2012). "Commentary on John Gerring's Social Science Methodology." Qualitative and Multi-Method Research Newsletter. 10(1):1-4. (pdf)
Science depends on transparency. I work to promote better transparency norms and practices: scientists should share complete replication files, should preregister their analyses, should make their analyses transparent, and should evaluate the robustness of their results to reasonable alternative specifications.
Leamer-Rosenthal Prize for Open Social Science: Emerging Researcher. (2015).
Andrew Bertoli, Allan Dafoe, & Robert Trager. (2019). Is There a War Party? Party Change, the Left-Right Divide and International Conflict. Journal of Conflict Resolution. 63(4),950-975. (pdf, journal)
K. Charlotte Jander, Allan Dafoe, E. Allen Herre. (2016). Fitness Reduction for Uncooperative Fig Wasps through Reduced Offspring Size: A Third Component of Host Sanctions. Ecology. 97(9): 2491-2500. (pdf, appendix, journal)
Allan Dafoe & Jason Lyall. (2015). From Cell Phones to Conflict? Reflections on the Emerging ICT-Political Conflict Research Agenda. Journal of Peace Research. 52(3): 401-413. (pdf) (journal-ungated) (Monkey Cage)
"To Have and to Hold: Exploring the Personal Archive." (2006). Proceedings of the SIGCHI conference on Human Factors in Computing Systems: 275-284. Kaye, Joseph 'Jofish', Janet Vertesi, Shari Avery, Allan Dafoe, Shay David, Lisa Onaga, Ivan Rosero, Trevor Pinch (pdf)