{"id":186960,"date":"2026-02-13T03:34:00","date_gmt":"2026-02-13T08:34:00","guid":{"rendered":"https:\/\/testing.news-you-need.com\/index.php\/2026\/02\/13\/yao-shunyus-google-debut-new-gemini-model-shatters-sota-records-only-7-humans-defending-carbon\/"},"modified":"2026-02-13T04:15:14","modified_gmt":"2026-02-13T09:15:14","slug":"yao-shunyus-google-debut-new-gemini-model-shatters-sota-records-only-7-humans-defending-carbon","status":"publish","type":"post","link":"https:\/\/testing.news-you-need.com\/index.php\/2026\/02\/13\/yao-shunyus-google-debut-new-gemini-model-shatters-sota-records-only-7-humans-defending-carbon\/","title":{"rendered":"Yao Shunyu&#8217;s Google Debut: New Gemini Model Shatters SOTA Records, Only 7 Humans Defending Carbon"},"content":{"rendered":"<p><a href=\"https:\/\/eu.36kr.com\/en\/p\/3681358416129668\">Yao Shunyu&#8217;s Google Debut: New Gemini Model Shatters SOTA Records, Only 7 Humans Defending Carbon<\/a><\/p>\n<p><a href=\"https:\/\/eu.36kr.com\/en\/p\/3681358416129668\">https:\/\/eu.36kr.com\/en\/p\/3681358416129668<\/a><\/p>\n<p>Publish Date: 2026-02-13 03:34:00<\/p>\n<p>Source Domain: <a href=\"https:\/\/eu.36kr.com\">eu.36kr.com<\/a><\/p>\n<ul>\n<li>Google launched Gemini 3 Deep Think to counter advances by competitors such as Claude Opus 4.6 and GPT Codex 5.3.<\/li>\n<li>On Codeforces, Gemini 3 Deep Think achieved an Elo rating of 3455, which ranks 8th globally.<\/li>\n<li>That rating surpasses the previous top Elo score of 2727, held by o3. The model also set a record of 84.6% on ARC-AGI-2, significantly higher than previous models.<\/li>\n<li>On Humanity&#8217;s Last Exam (HLE), Gemini 3 Deep Think scored 48.4%, a new state-of-the-art (SOTA) result.<\/li>\n<li>The new version of Deep Think is designed to excel in scientific research and engineering, with capabilities including analyzing sketches, modeling shapes, and generating 3D printing files.<\/li>\n<li>Notable achievements include successfully 
identifying a logical flaw in a specialized mathematical paper that had eluded previous manual reviews, and optimizing processes for growing new semiconductor materials.<\/li>\n<li>The upgrade reduced reasoning cost from $77.16 to $13.62 per task.<\/li>\n<li>Gemini 3 Deep Think has set numerous SOTA results and earned gold medals on difficult benchmarks such as the ARC-AGI series and the International Mathematical Olympiad, while also performing well in physics and chemistry Olympiads.<\/li>\n<li>The development team includes prominent Chinese researchers, such as Yi Tay and Shunyu Yao, both of whom have made significant contributions to AI and physics.<\/li>\n<\/ul>\n","protected":false},"excerpt":{"rendered":"<p>Yao Shunyu&#8217;s Google Debut: New Gemini Model Shatters SOTA Records, Only 7 Humans Defending Carbon&#8230;<\/p>\n","protected":false},"author":1,"featured_media":186961,"comment_status":"closed","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"fifu_image_url":"https:\/\/img.36krcdn.com\/hsossms\/20260213\/v2_67ec71b9c0c84ef8b681292cfc8a2ab1@46958@ai_oswg807579oswg1053oswg495_img_png~tplv-1marlgjv7f-ai-v3:600:400:600:400:q70.jpg","fifu_image_alt":"","footnotes":""},"categories":[14],"tags":[22],"class_list":["post-186960","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-artificial-intelligence","tag-artificial-general-intelligence"],"_links":{"self":[{"href":"https:\/\/testing.news-you-need.com\/index.php\/wp-json\/wp\/v2\/posts\/186960"}],"collection":[{"href":"https:\/\/testing.news-you-need.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/testing.news-you-need.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/testing.news-you-need.com\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/testing.news-you-need.com\/index.php\/wp-json\/wp\/v2\/comments?post=186960"}],"version-history":[{
"count":1,"href":"https:\/\/testing.news-you-need.com\/index.php\/wp-json\/wp\/v2\/posts\/186960\/revisions"}],"predecessor-version":[{"id":186962,"href":"https:\/\/testing.news-you-need.com\/index.php\/wp-json\/wp\/v2\/posts\/186960\/revisions\/186962"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/testing.news-you-need.com\/index.php\/wp-json\/wp\/v2\/media\/186961"}],"wp:attachment":[{"href":"https:\/\/testing.news-you-need.com\/index.php\/wp-json\/wp\/v2\/media?parent=186960"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/testing.news-you-need.com\/index.php\/wp-json\/wp\/v2\/categories?post=186960"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/testing.news-you-need.com\/index.php\/wp-json\/wp\/v2\/tags?post=186960"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}