AI models appear to recognize moral complexity — then ignore it, new study by researchers affiliated with Harvard Kennedy School’s Allen Lab finds

AI models appear to recognize moral complexity — then ignore it, new study by researchers affiliated with Harvard Kennedy School’s Allen Lab finds

AI models appear to recognize moral complexity — then ignore it, new study by researchers affiliated with Harvard Kennedy School’s Allen Lab finds

https://ash.harvard.edu/articles/ai-models-appear-to-recognize-moral-complexity-then-ignore-it-new-study-by-researchers-affiliated-with-harvard-kennedy-schools-allen-lab-finds/

Publish Date: 2026-05-27 14:03:00

Source Domain: ash.harvard.edu

  • AI models, when faced with complex ethical dilemmas, express uncertainty but make consistent decisions, indicating an implicit value hierarchy rather than true ethical deliberation.
  • The study “Crocodile Tears: Can the Ethical-Moral Intelligence of AI Models Be Trusted?” found that leading AI models resolved tragic ethical tradeoffs with uniformity, favoring options related to worker safety over environmental protections or vocational training.
  • The authors introduce an ethical-moral intelligence framework focusing on expertise, sensitivity, coherence, and transparency, and suggest existing benchmarks are inadequate for evaluating moral reasoning in AI.
  • The research calls for greater transparency in AI models’ ethical reasoning, suggesting systems should alert users to conflicting values, and proposes a “badging” system instead of monolithic benchmarks for model evaluation.
  • Sarah Hubbard warns against trusting AI models as genuine ethical-moral agents and stresses the need for high standards of ethical-moral intelligence before these models are entrusted with decisions carrying moral weight.