{"id":200651,"date":"2026-03-30T14:41:00","date_gmt":"2026-03-30T18:41:00","guid":{"rendered":"https:\/\/testing.news-you-need.com\/index.php\/2026\/03\/30\/artificial-intelligence-ready-to-score-full-marks-on-one-of-worlds-most-challenging-tests\/"},"modified":"2026-03-30T14:45:14","modified_gmt":"2026-03-30T18:45:14","slug":"artificial-intelligence-ready-to-score-full-marks-on-one-of-worlds-most-challenging-tests","status":"publish","type":"post","link":"https:\/\/testing.news-you-need.com\/index.php\/2026\/03\/30\/artificial-intelligence-ready-to-score-full-marks-on-one-of-worlds-most-challenging-tests\/","title":{"rendered":"Artificial intelligence ready to score full marks on one of world&#8217;s most challenging tests"},"content":{"rendered":"<p><a href=\"https:\/\/www.gbnews.com\/news\/artificial-intelligence-full-marks-test-google-gemini\">Artificial intelligence ready to score full marks on one of world&#8217;s most challenging tests<\/a><\/p>\n<p><a href=\"https:\/\/www.gbnews.com\/news\/artificial-intelligence-full-marks-test-google-gemini\">https:\/\/www.gbnews.com\/news\/artificial-intelligence-full-marks-test-google-gemini<\/a><\/p>\n<p>Publish Date: <a href=\"publish_date]\">2026-03-30 14:41:00<\/a><\/p>\n<p>Source Domain: <a href=\"www.gbnews.com\">www.gbnews.com<\/a><\/p>\n<ul>\n<li>Google&#8217;s Gemini model has achieved 45.9 percent on &#8220;Humanity&#8217;s Last Exam,&#8221; a significant leap from previous performances.<\/li>\n<li>The test, designed to measure the divide between machine learning and human intellect, comprises 2,500 questions across roughly 100 disciplines requiring doctoral-level comprehension.<\/li>\n<li>The test was collaboratively developed by Scale and the Centre for AI Safety, drawing from over 70,000 questions proposed by experts from approximately 50 countries.<\/li>\n<li>The benchmark&#8217;s purpose is to evaluate both breadth and depth of knowledge and reasoning in AI systems, comparing them to the capability of universal experts.<\/li>\n<li>AI models&#8217; recent rapid advancement, noted by researchers like Calvin Zhang of Scale, has led to predictions that full marks could be achieved within twelve months.<\/li>\n<li>While some models, like Google&#8217;s Gemini and Anthropic&#8217;s Claude, show improving performance, others still lag, indicating persistent gaps in AI&#8217;s understanding.<\/li>\n<li>Experts like Dr. Tung Nguyen stress that the test highlights the importance of human expertise in depth, context, and specialized knowledge.<\/li>\n<li>There is optimism from industry representatives, such as Kate Olszewska, that full marks could be quickly achieved if enough resources and focus are directed toward this goal.<\/li>\n<\/ul>\n<p><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Artificial intelligence ready to score full marks on one of world&#8217;s most challenging tests https:\/\/www.gbnews.com\/news\/artificial-intelligence-full-marks-test-google-gemini&#8230;<\/p>\n","protected":false},"author":1,"featured_media":200652,"comment_status":"closed","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"fifu_image_url":"https:\/\/www.gbnews.com\/media-library\/image.jpg?id=65426822&width=1200&height=600&coordinates=0%2C92%2C0%2C241","fifu_image_alt":"","footnotes":""},"categories":[14],"tags":[20],"class_list":["post-200651","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-artificial-intelligence","tag-artificial-intelligence"],"_links":{"self":[{"href":"https:\/\/testing.news-you-need.com\/index.php\/wp-json\/wp\/v2\/posts\/200651"}],"collection":[{"href":"https:\/\/testing.news-you-need.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/testing.news-you-need.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/testing.news-you-need.com\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/testing.news-you-need.com\/index.php\/wp-json\/wp\/v2\/comments?post=200651"}],"version-history":[{"count":1,"href":"https:\/\/testing.news-you-need.com\/index.php\/wp-json\/wp\/v2\/posts\/200651\/revisions"}],"predecessor-version":[{"id":200653,"href":"https:\/\/testing.news-you-need.com\/index.php\/wp-json\/wp\/v2\/posts\/200651\/revisions\/200653"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/testing.news-you-need.com\/index.php\/wp-json\/wp\/v2\/media\/200652"}],"wp:attachment":[{"href":"https:\/\/testing.news-you-need.com\/index.php\/wp-json\/wp\/v2\/media?parent=200651"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/testing.news-you-need.com\/index.php\/wp-json\/wp\/v2\/categories?post=200651"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/testing.news-you-need.com\/index.php\/wp-json\/wp\/v2\/tags?post=200651"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}