{"id":238473,"date":"2026-06-29T05:13:00","date_gmt":"2026-06-29T09:13:00","guid":{"rendered":"https:\/\/testing.news-you-need.com\/index.php\/2026\/06\/29\/gpt-5-6-gets-better-at-cybersecurity\/"},"modified":"2026-06-29T05:30:07","modified_gmt":"2026-06-29T09:30:07","slug":"gpt-5-6-gets-better-at-cybersecurity","status":"publish","type":"post","link":"https:\/\/testing.news-you-need.com\/index.php\/2026\/06\/29\/gpt-5-6-gets-better-at-cybersecurity\/","title":{"rendered":"GPT-5.6 gets better at cybersecurity"},"content":{"rendered":"<p><a href=\"https:\/\/www.helpnetsecurity.com\/2026\/06\/29\/openai-gpt-5-6-models-preview\/\">GPT-5.6 gets better at cybersecurity<\/a><\/p>\n<p><a href=\"https:\/\/www.helpnetsecurity.com\/2026\/06\/29\/openai-gpt-5-6-models-preview\/\">https:\/\/www.helpnetsecurity.com\/2026\/06\/29\/openai-gpt-5-6-models-preview\/<\/a><\/p>\n<p>Publish Date: <a href=\"publish_date]\">2026-06-29 05:13:00<\/a><\/p>\n<p>Source Domain: <a href=\"www.helpnetsecurity.com\">www.helpnetsecurity.com<\/a><\/p>\n<p>Author: <a href=\"\"><\/a><\/p>\n<p> Using an unordered list, summarize the following article with between 4 and 8 key points.<br \/>\n        OpenAI has started rolling out the GPT-5.6 series models in limited preview to a small group of trusted partners through the API and  Codex. The series includes Sol as the flagship model, Terra as a balanced option, and Luna as the fastest and most cost-efficient model. The rollout is being coordinated with the U.S. government before expanding to ChatGPT, Codex, and API users in the coming weeks.<\/p>\n<p>\u201cGPT-5.6 Sol launches with our most robust safety stack to date. We strengthened protections for higher-risk activity, sensitive cyber requests, and repeated misuse, and spent multiple weeks finding weaknesses, pressure-testing our system, and hardening it against real-world attacks,\u201d the company said.<br \/>\nKey capabilities<br \/>\nSol introduces improved agentic capabilities for coding, biology, and cybersecurity. OpenAI also published a system card, a technical report that explains what the model can do, how it was tested, the risks identified, the safeguards added, and its known limitations.<br \/>\nGPT-5.6 introduces max reasoning effort and ultra mode, which uses subagents to speed up complex tasks. In coding, Sol tops the Terminal-Bench 2.1 benchmark, which evaluates command-line workflows requiring tool coordination, planning, and iteration. The model uses fewer tokens for biology workflows.<br \/>\nIn cybersecurity, GPT-5.6 advances the performance-efficiency frontier on long-horizon security tasks, including vulnerability research and exploitation.<br \/>\nSafety and safeguards<br \/>\nOpenAI says it developed safeguards tailored to each model\u2019s capabilities. The goal is to make prohibited offensive activity more difficult, uncertain, and detectable while preserving legitimate uses.<br \/>\nSol can identify security flaws and components of an exploit, but in OpenAI\u2019s tests it could not carry out a complete cyberattack on its own. The company notes that no evaluation can cover every real-world scenario.<br \/>\nGPT-5.6 uses multiple layers of safety instead of relying on a single safeguard. The model is trained to refuse prohibited cyber and biology assistance, even when users attempt to disguise their intent. Responses are screened for potentially harmful content during generation, and high-risk requests may be paused for review by a more capable model before they are delivered.<br \/>\nOpenAI monitors patterns of misuse across accounts to distinguish malicious activity from legitimate security research. During the preview, some legitimate requests may be blocked or delayed while these safeguards are tested and refined.<br \/>\n\u201cWe are also working with enterprise customers on longer-term approaches\u2014including privacy-preserving detection, customer-operated safety controls, and access calibrated to the risk of a customer, user, or workload\u2014to advance safety while supporting enterprise privacy requirements,\u201d OpenAI continued.<br \/>\nRed teaming and security testing<br \/>\nTo test the models\u2019 safeguards, OpenAI conducted automated red teaming to find universal jailbreaks that work across many prompts and contexts. The testing explored attack patterns beyond what human testing alone could cover, helped identify failure patterns earlier, and shortened the time needed to address newly discovered weaknesses.<br \/>\nThe company worked with third-party experts to conduct human red teaming, testing the models with creative attack techniques that automated systems might not anticipate.<br \/>\nAI security lab Irregular evaluated GPT-5.6 Sol on real-world offensive security benchmarks and found that it performs slightly better than GPT-5.5, particularly on longer, more complex hacking tasks. The model discovered previously unknown vulnerabilities in widely used software and mobile devices, while continuing to struggle with well-defended targets and complete end-to-end attacks.<\/p>\n<p><\/p>\n","protected":false},"excerpt":{"rendered":"<p>GPT-5.6 gets better at cybersecurity https:\/\/www.helpnetsecurity.com\/2026\/06\/29\/openai-gpt-5-6-models-preview\/ Publish Date: 2026-06-29 05:13:00 Source Domain: www.helpnetsecurity.com Author: Using&#8230;<\/p>\n","protected":false},"author":1,"featured_media":238474,"comment_status":"closed","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"fifu_image_url":"https:\/\/img.helpnetsecurity.com\/wp-content\/uploads\/2026\/06\/08084558\/openai_texture-1500.webp","fifu_image_alt":"","footnotes":""},"categories":[15],"tags":[26,24,31,27],"class_list":["post-238473","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-cybersecurity","tag-ai","tag-cybersecurity","tag-exploit","tag-vulnerability"],"_links":{"self":[{"href":"https:\/\/testing.news-you-need.com\/index.php\/wp-json\/wp\/v2\/posts\/238473"}],"collection":[{"href":"https:\/\/testing.news-you-need.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/testing.news-you-need.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/testing.news-you-need.com\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/testing.news-you-need.com\/index.php\/wp-json\/wp\/v2\/comments?post=238473"}],"version-history":[{"count":1,"href":"https:\/\/testing.news-you-need.com\/index.php\/wp-json\/wp\/v2\/posts\/238473\/revisions"}],"predecessor-version":[{"id":238475,"href":"https:\/\/testing.news-you-need.com\/index.php\/wp-json\/wp\/v2\/posts\/238473\/revisions\/238475"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/testing.news-you-need.com\/index.php\/wp-json\/wp\/v2\/media\/238474"}],"wp:attachment":[{"href":"https:\/\/testing.news-you-need.com\/index.php\/wp-json\/wp\/v2\/media?parent=238473"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/testing.news-you-need.com\/index.php\/wp-json\/wp\/v2\/categories?post=238473"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/testing.news-you-need.com\/index.php\/wp-json\/wp\/v2\/tags?post=238473"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}