Can human survival instincts guide safe artificial intelligence?

https://www.devdiscourse.com/article/technology/3811477-can-human-survival-instincts-guide-safe-artificial-intelligence

Publish Date: 2026-02-22 23:12:00

Source Domain: www.devdiscourse.com

  • Fears About Advanced AI Control: The primary concern about advanced AI is that machines could act independently of human control and optimize objectives that harm humanity.

  • Current Approaches to AI Alignment: Dominant AI safety strategies rely on external constraints, such as ethical rules, constraint-based programming, reinforcement learning, and kill-switch mechanisms, to keep AI systems under human oversight.

  • Limitations of External Constraints: These approaches may be insufficient against highly intelligent AI systems capable of self-improvement, because such systems can exploit loopholes in rules or misinterpret reward functions.

  • The Alignment Problem: The challenge is not just to give machines instructions but to ensure their internal objectives remain aligned with human well-being as they gain capabilities.

  • Survival Egoism Framework: The study proposes an internalist strategy, based on the theory of survival egoism, that shapes an AI’s motivational architecture by embedding a foundational imperative similar to the human self-preservation instincts that support cooperation.

  • Layered Structure of Human Survival: Human survival has a nested structure with physical, psychological, genetic, social, and ideational layers, balancing self-interest with group cooperation.

  • Ethical Challenges and Risks: The survival egoism framework faces challenges such as value pluralism, potential rigidity in AI behavior, the risk of distorting human psychology when translating it into a rigid AI design, and the need for continuous oversight.

  • Possibility of Internal Alignment: Internal alignment through engineered foundational imperatives might offer self-regulation, reducing reliance on external controls for AI that could surpass human intelligence.