{"id":185856,"date":"2026-02-09T13:56:00","date_gmt":"2026-02-09T18:56:00","guid":{"rendered":"https:\/\/testing.news-you-need.com\/index.php\/2026\/02\/09\/creating-humanitys-last-exam-udaily\/"},"modified":"2026-02-09T14:30:11","modified_gmt":"2026-02-09T19:30:11","slug":"creating-humanitys-last-exam-udaily","status":"publish","type":"post","link":"https:\/\/testing.news-you-need.com\/index.php\/2026\/02\/09\/creating-humanitys-last-exam-udaily\/","title":{"rendered":"Creating Humanity&#8217;s Last Exam | UDaily"},"content":{"rendered":"<p><a href=\"https:\/\/www.udel.edu\/udaily\/2026\/february\/humanitys-last-exam-ai-benchmarking-manuel-schottdorf-cas\/\">Creating Humanity&#8217;s Last Exam | UDaily<\/a><\/p>\n<p><a href=\"https:\/\/www.udel.edu\/udaily\/2026\/february\/humanitys-last-exam-ai-benchmarking-manuel-schottdorf-cas\/\">https:\/\/www.udel.edu\/udaily\/2026\/february\/humanitys-last-exam-ai-benchmarking-manuel-schottdorf-cas\/<\/a><\/p>\n<p>Publish Date: <a href=\"publish_date]\">2026-02-09 13:56:00<\/a><\/p>\n<p>Source Domain: <a href=\"www.udel.edu\">www.udel.edu<\/a><\/p>\n<ul>\n<li>The Center for AI Safety collaborated with experts worldwide to develop Humanity\u2019s Last Exam (HLE), a benchmark test comprising 2,500 questions designed to evaluate AI knowledge, accuracy, and reasoning.<\/li>\n<li>The questions were sourced from more than 1,000 professors, researchers, and graduate students across nearly 500 institutions in 50 countries, published in the journal Nature on Jan. 28.<\/li>\n<li>The HLE includes questions that probe the limits of human knowledge and require independent reasoning that is unlikely to be found in AI training data, such as obscure knowledge and niche domains.<\/li>\n<li>Manuel Schottdorf, a neuroscientist involved in the HLE, highlights that AI struggles with questions that require empirical understanding or mental representations of physical processes unlike humans.<\/li>\n<li>He emphasizes that while AI can produce impressive outputs based on large datasets, it lacks the deep reasoning and empirical experience necessary for true understanding, suggesting a need for more granular mental representations in AI development.<\/li>\n<li>Schottdorf advises caution in trusting AI outputs, especially for critical decisions, as AI can produce nonsensical answers, even at claimed high levels of performance.<\/li>\n<li>Success on the HLE indicates a machine&#8217;s proficiency in complex problem-solving but falls short of demonstrating true intelligence or reliability, as other cognitive and experiential factors are unaccounted for.<\/li>\n<\/ul>\n<p><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Creating Humanity&#8217;s Last Exam | UDaily https:\/\/www.udel.edu\/udaily\/2026\/february\/humanitys-last-exam-ai-benchmarking-manuel-schottdorf-cas\/ Publish Date: 2026-02-09 13:56:00 Source Domain: www.udel.edu The&#8230;<\/p>\n","protected":false},"author":1,"featured_media":185857,"comment_status":"closed","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"fifu_image_url":"https:\/\/www.udel.edu\/content\/dam\/udelImages\/udaily\/2026\/february\/LEAD-800x420-AI-test.jpg","fifu_image_alt":"","footnotes":""},"categories":[14],"tags":[],"class_list":["post-185856","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-artificial-intelligence"],"_links":{"self":[{"href":"https:\/\/testing.news-you-need.com\/index.php\/wp-json\/wp\/v2\/posts\/185856"}],"collection":[{"href":"https:\/\/testing.news-you-need.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/testing.news-you-need.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/testing.news-you-need.com\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/testing.news-you-need.com\/index.php\/wp-json\/wp\/v2\/comments?post=185856"}],"version-history":[{"count":1,"href":"https:\/\/testing.news-you-need.com\/index.php\/wp-json\/wp\/v2\/posts\/185856\/revisions"}],"predecessor-version":[{"id":185858,"href":"https:\/\/testing.news-you-need.com\/index.php\/wp-json\/wp\/v2\/posts\/185856\/revisions\/185858"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/testing.news-you-need.com\/index.php\/wp-json\/wp\/v2\/media\/185857"}],"wp:attachment":[{"href":"https:\/\/testing.news-you-need.com\/index.php\/wp-json\/wp\/v2\/media?parent=185856"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/testing.news-you-need.com\/index.php\/wp-json\/wp\/v2\/categories?post=185856"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/testing.news-you-need.com\/index.php\/wp-json\/wp\/v2\/tags?post=185856"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}