Abstract
There is no strong reason to believe that human-level intelligence represents an upper limit on the capacity of artificial intelligence, should it be realized. This poses serious safety issues, since a superintelligent system would have great power to direct the future according to its possibly flawed goals or motivation systems. Oracle AIs (OAIs), confined AIs that can only answer questions, are one particular approach to this problem. However, even Oracles are not particularly safe: humans are still vulnerable to traps, social engineering, or simply becoming dependent on the OAI. But OAIs are still strictly safer than general AIs, and there are many extra layers of precaution that can be added on top of the basic design. This paper looks at some of them and analyses their strengths and weaknesses.
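The confinement pattern the abstract describes, an untrusted system reachable only through an answer-only interface, with extra precautions stacked on the output channel, can be made concrete. The following is a minimal, purely illustrative Python sketch, not taken from the paper: the names ContainedOracle and toy_model, and the truncation limit, are all hypothetical choices standing in for whatever precautions an actual design would use.

    # Hypothetical sketch of an "answer-only" interface around an untrusted
    # model, with one crude precaution layered on the output channel.
    # All names and limits here are illustrative, not from the paper.

    class ContainedOracle:
        """Expose an untrusted question-answering callable through a single
        method: callers submit a question and read a bounded answer. No
        other channel to the underlying model is exposed."""

        def __init__(self, model, max_answer_chars=280):
            self._model = model          # untrusted QA callable
            self._max = max_answer_chars # crude output-bandwidth limit

        def ask(self, question: str) -> str:
            raw = self._model(question)
            # Precaution layer: truncate the answer, limiting how much
            # information (or persuasion) can flow out per query.
            return raw[: self._max]

    def toy_model(question: str) -> str:
        # Stand-in for the actual AI; here just a placeholder.
        return f"Echo: {question}"

    oracle = ContainedOracle(toy_model)
    print(oracle.ask("Is P equal to NP?"))

The truncation step is one toy instance of the "extra layers of precaution" idea: each layer restricts the output channel independently of whether the underlying model is trustworthy.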
Copyright information
© 2013 Springer-Verlag GmbH Berlin Heidelberg
Cite this chapter
Armstrong, S. (2013). Risks and Mitigation Strategies for Oracle AI. In: Müller, V. (ed.) Philosophy and Theory of Artificial Intelligence. Studies in Applied Philosophy, Epistemology and Rational Ethics, vol 5. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-31674-6_25
DOI: https://doi.org/10.1007/978-3-642-31674-6_25
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-31673-9
Online ISBN: 978-3-642-31674-6