Anthropic limits Mythos AI rollout over fears hackers could use model for cyberattacks

Published TueApr 7 20262:00 PM EDTUpdated TueApr 7 20263:52 PM EDT

Ashley Capoot @/in/ashley-capoot/

WATCH LIVE

Key Points

Anthropic announced Claude Mythos Previewwhich it said is an advanced AI model that excels at identifying weaknesses and security flaws within software.
MicrosoftAmazonAppleCrowdStrikePalo Alto Networks and roughly 40 other companies will be able to use the model for defensive security workAnthropic said.
These companies will use and test the model as part of a new cybersecurity initiative called Project Glasswingit said.

In this article

Anthropic CEO and co-founder Dario Amodei speaks during the 56th annual World Economic Forum meeting in DavosSwitzerlandJan. 202026.

Denis Balibouse | Reuters

Anthropic on Tuesday announced an advanced artificial intelligence model that will roll out to a select group of companies as part of a new cybersecurity initiative called Project Glasswing.

The modelClaude Mythos Previewexcels at identifying weaknesses and security flaws within softwareand Anthropic is limiting access to try to prevent bad actors from exploiting that capabilitythe company said.

Anthropic said AppleGoogleMicrosoftNvidia and Amazon Web Services are among the project's initial launch partners and will be able to use the model for defensive security work. More than 40 other companiesincluding CrowdStrike and Palo Alto Networksare also participatingAnthropic said.

"There was a lot of internal deliberation," Dianne PennAnthropic's head of research product managementtold CNBC in an interview. "We really do view this as a first step for giving a lot of cyber defenders a head start on a topic that will be increasingly important."

Anthropic's announcement comes after descriptions of the model were discovered by Fortune in a publicly accessible data cache late last month. Cybersecurity stocks fell on the reportwhich said that the model had advanced cyber capabilities that also posed a significant risk.

The iShares Cybersecurity ETF was mostly flat during intraday trading on Tuesday.

"The dangers of getting this wrong are obviousbut if we get it rightthere is a real opportunity to create a fundamentally more secure internet and world than we had before the advent of AI-powered cyber capabilities," CEO Dario Amodei wrote in a post on X touting the rollout of Project Glasswing.

Anthropic was founded in 2021 by a group of researchers and executives who defected from OpenAI over concerns about its direction and attitude toward safety.

The company spent years carefully constructing its reputation as a firm that was more dedicated to responsible AI deploymentand it unveiled Project Glasswing just weeks after its high-profile clash over safety with the Defense Department spilled into public view.

Anthropic said it's been in "ongoing discussions" with U.S. government officials about Claude Mythos Preview's cyber capabilities. That includes conversations with the Cybersecurity and Infrastructure Security Agency and the Center for AI Standards and Innovationamong othersaccording to an Anthropic official.

Read more CNBC tech news

Employees at Anthropic decided on the name Project Glasswinga metaphor that likens a transparent butterfly to software vulnerabilitieswhich are "relatively invisible," Penn said.

Claude Mythos Preview is able to find bugssome of which are criticalthat have previously been difficult to detectAnthropic said. In one casethe company saidthe model identified a 27-year old bug in OpenBSDwhichaccording to its websiteis an operating system that emphasizes security.

Anthropic said Claude Mythos Preview is a general-purpose model that was not specifically trained for cybersecurityand its improved cyber capabilities are a result of its strong coding and reasoning skills. The company said it does not plan to make the model generally availablebut the goal is to learn how it could eventually deploy Mythos-class models at scale.

Companies that participate in Project Glasswing all build or maintain critical software infrastructureand Anthropic said they will use its models to try to secure both first-party and open-source systems. Anthropic said it has committed up to $100 million in usage credits for these effortsbut partners will pay to use the model past that threshold.

Newton ChengAnthropic's Frontier Red Team cyber leadsaid the company wants the firms to get used to leveraging these capabilities before they become widely available. The company said it's trying to avoid "recklessly or irresponsibly" releasing a model that adversaries could take advantage of.

"Cybersecurity is just going to be an area where this broad increase in capabilities has potential for riskand thus we have to keep a really close eye on what's going on there," Cheng said in an interview.

WATCH: Panic on cyber stocks feels overdonesays Big Technology’s Alex Kantrowitz