Rosa Del Mar

Daily Brief

Issue 56 2026-02-25

AI-Enabled Offensive Scaling And Shortened Exploitation Cycles

General
Sources: 1 • Confidence: Medium • Updated: 2026-03-02 19:26

Key takeaways

  • Akamai reported a CVSS 8.8 Internet Explorer/MSHTML (Trident) exploit chain that bypasses Mark-of-the-Web and IE security controls and has been exploited in the wild by Russian actors.
  • Defenders are moving from monolithic LLM usage toward agentic decomposition of investigations, which reduces hallucination risk and shifts limiting factors to data quality, agent architecture, and workflow-embedded expertise.
  • The Pentagon is reportedly standing up a new AI network/program with multiple frontier labs participating and Anthropic as the final holdout facing a near-term deadline to join.
  • The episode highlights a gap in industry practice: agent failures are not being captured in structured datasets that could be used to train models to avoid recurring operational behaviors.
  • A viral blog post claimed Persona sends face-scan data to the government and is closely tied to ICE.

Sections

AI-Enabled Offensive Scaling And Shortened Exploitation Cycles

  • Akamai reported a CVSS 8.8 Internet Explorer/MSHTML (Trident) exploit chain that bypasses Mark-of-the-Web and IE security controls and has been exploited in the wild by Russian actors.
  • Google Mandiant reported exploitation in the wild of a CVSS 10 Dell issue involving hard-coded Apache Tomcat administrative credentials, and CISA directed government agencies to patch.
  • Google’s incident-response analysis described a PRC-nexus cluster (UNC6201) exploiting the Dell/Tomcat credential issue since at least mid-2024 to move laterally, persist, and deploy malware, including activity involving VMware infrastructure manipulation.
  • An advisory-board statistic shared with Corelight suggests the window from vulnerability disclosure to observed exploitation has compressed from roughly three weeks to two-to-three hours due to attacker use of AI for exploit development (a rough arithmetic illustration follows this list).
  • AWS security researchers reported a threat actor compromising Fortinet devices in a campaign that heavily used AI-assisted automation.
  • AI and tooling can enable low-skill operators to execute repeatable compromise chains at scale; they succeed often enough because they simply move on to the next target when the automation breaks.
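
To make the compression claim concrete, here is a minimal arithmetic sketch in Python; the weekly patch cycle and the two exploitation-start times are illustrative assumptions, not figures from the reporting.

    # Rough illustration with assumed numbers: how much of a patch cycle an
    # organization spends exposed once exploitation begins N hours after disclosure.

    PATCH_CYCLE_HOURS = 7 * 24  # assume a weekly patch window (hypothetical)

    def exposed_fraction(exploitation_start_hours: float) -> float:
        """Fraction of the patch cycle during which a newly disclosed flaw is
        both unpatched and under active exploitation."""
        exposed = max(0.0, PATCH_CYCLE_HOURS - exploitation_start_hours)
        return exposed / PATCH_CYCLE_HOURS

    for label, hours in [("~3 weeks", 21 * 24), ("~3 hours", 3)]:
        print(f"exploitation starting after {label}: exposed for "
              f"{exposed_fraction(hours):.0%} of a weekly patch cycle")

Under these assumed numbers, a three-week lag keeps a weekly patch cadence fully ahead of exploitation, while a three-hour lag leaves roughly 98 percent of the cycle exposed.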

AI-Native Security Tooling And SOC Architecture Shift Toward Orchestration And Decomposition

  • Defenders are moving from monolithic LLM usage toward agentic decomposition of investigations, which reduces hallucination risk and shifts limiting factors to data quality, agent architecture, and workflow-embedded expertise.
  • The SOC technology stack is shifting away from centralizing all data for search toward using LLMs as a federation and orchestration layer that pulls expertise and context from point tools via APIs or MCP-style interfaces (a sketch of this decomposition follows this list).
  • Corelight reports that major LLMs already understand Zeek/Corelight data well because they were trained on the open-source Zeek dataset, reducing the need for expensive model tuning for Corelight-specific use cases.
  • The SOC adoption question has shifted from whether to use AI for tasks like alert triage to how much of triage and investigation should be delegated to AI systems.
  • Anthropic released an embedded security scanning capability for Claude described as a SAST-like 'Claude Code Security' offering.
  • The new Claude security scanning capability was described as being based on a specialized dataset including capture-the-flag and red-teaming outputs to improve bug-finding performance.
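
As a sketch of what agentic decomposition could look like in practice, the following Python outline uses placeholder tool queries and a stubbed sub-agent call; the function names, data sources, and orchestration pattern are assumptions for illustration, not any vendor's implementation.

    # Illustrative sketch: decompose an alert investigation into narrow,
    # tool-scoped sub-agent steps instead of one monolithic LLM prompt.
    from dataclasses import dataclass, field

    @dataclass
    class Finding:
        step: str
        summary: str
        evidence: list[str] = field(default_factory=list)

    def query_network_telemetry(host: str) -> list[str]:
        """Placeholder for a point-tool query (e.g. network metadata via an API)."""
        return [f"connection log rows for {host}"]

    def query_edr(host: str) -> list[str]:
        """Placeholder for an endpoint telemetry query."""
        return [f"process tree for {host}"]

    def run_subagent(task: str, evidence: list[str]) -> Finding:
        """Each sub-agent gets one narrow task and only the evidence it needs,
        limiting the surface for hallucination. An LLM call would go here."""
        return Finding(step=task, summary=f"analysis of {len(evidence)} artifacts",
                       evidence=evidence)

    def investigate(alert_host: str) -> list[Finding]:
        # The orchestrator federates point tools rather than centralizing data first.
        findings = [
            run_subagent("triage network activity", query_network_telemetry(alert_host)),
            run_subagent("triage endpoint activity", query_edr(alert_host)),
        ]
        findings.append(run_subagent("correlate and summarize",
                                     [f.summary for f in findings]))
        return findings

    for finding in investigate("10.0.0.5"):
        print(finding.step, "->", finding.summary)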

Frontier Model Diffusion Via Distillation And Procurement-Driven Governance Pressure

  • The Pentagon is reportedly standing up a new AI network/program with multiple frontier labs participating and Anthropic as the final holdout facing a near-term deadline to join.
  • Anthropic claimed three Chinese labs including DeepSeek attempted to distill Claude using roughly 24,000 accounts and about 16 million prompts.
  • Stricter access controls and export restrictions can incentivize distillation and may lead to less-safe models with fewer guardrails in adversary hands.
  • A dispute has escalated in which the Pentagon is pressuring Anthropic to remove or relax safeguards so Claude can be used for defense purposes, including threats such as a supply-chain-risk designation and invoking the Defense Production Act.
  • Anthropic's described anti-distillation response relies on API-traffic detection via classifiers and behavioral fingerprinting, tightened verification, and information sharing with other labs (a toy illustration of this kind of detection follows this list).
  • Claude was reportedly first deployed onto classified networks via a Palantir partnership, and a subsequent dispute about whether Claude was used in a 'Maduro event' helped trigger wider Pentagon attention.
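
Anthropic has not published implementation details, so the following is only a toy illustration of what account-level behavioral fingerprinting of distillation-like traffic might involve; every signal and threshold here is invented.

    # Toy heuristic for flagging distillation-like API usage. The signals and
    # weights are invented for illustration and are not Anthropic's classifiers.
    from dataclasses import dataclass

    @dataclass
    class AccountStats:
        prompts_per_day: int
        mean_prompt_similarity: float   # 0-1, similarity across the account's prompts
        share_templated_prompts: float  # 0-1, fraction matching a small template set
        linked_accounts: int            # accounts sharing payment/IP/fingerprint signals

    def distillation_score(s: AccountStats) -> float:
        """Higher score means more distillation-like; weighted sum of coarse signals."""
        score = 0.4 * min(s.prompts_per_day / 10_000, 1.0)   # sustained bulk querying
        score += 0.3 * s.share_templated_prompts             # machine-generated prompts
        score += 0.2 * s.mean_prompt_similarity              # repetitive capability probing
        score += 0.1 * min(s.linked_accounts / 100, 1.0)     # coordinated account farms
        return score

    suspect = AccountStats(prompts_per_day=50_000, mean_prompt_similarity=0.8,
                           share_templated_prompts=0.9, linked_accounts=240)
    print(f"distillation score: {distillation_score(suspect):.2f}")  # 0.93 here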

Agentic Systems As A Governance And Reliability Problem (Permissions, Change Control, Telemetry)

  • The episode highlights a gap in industry practice: agent failures are not being captured in structured datasets that could be used to train models to avoid recurring operational behaviors.
  • Financial Times sources inside Amazon's cloud unit claimed outages have been caused by AI agents taking unsafe actions such as deleting code and rebuilding it, including one incident that caused a roughly 13-hour outage.
  • A core failure mode with powerful AI agents is mis-scoped permissions: if an agent can delete and recreate production, the system design has already allowed a dangerous capability (a minimal sketch of permission scoping and structured failure logging follows this list).
  • Microsoft Defender researchers warned that self-hosted AI agents like OpenClaw can autonomously create tools by writing, compiling, and executing code to accomplish tasks, creating enterprise security risk.
  • Enterprises are likely to experience multiple years of agent-driven security and reliability incidents because individual user incentives favor deploying agents that can act broadly on their behalf.
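
A minimal sketch of the two controls discussed above, deny-by-default permission scoping and structured failure logging, follows; the action names, environments, and log schema are assumptions for illustration.

    # Sketch: (1) deny-by-default scoping so destructive actions in production are
    # refused, and (2) recording each refusal or failure as a structured record of
    # the kind the episode notes is rarely collected today.
    import json
    import time

    ALLOWED_ACTIONS = {
        ("staging", "delete_resource"),
        ("staging", "deploy"),
        ("production", "deploy"),  # production deletes are deliberately absent
    }
    FAILURE_LOG = "agent_failures.jsonl"

    def record_failure(agent: str, action: str, environment: str, reason: str) -> None:
        """Append a structured failure record that could later feed training or evals."""
        record = {"ts": time.time(), "agent": agent, "action": action,
                  "environment": environment, "reason": reason}
        with open(FAILURE_LOG, "a") as fh:
            fh.write(json.dumps(record) + "\n")

    def execute(agent: str, action: str, environment: str) -> bool:
        if (environment, action) not in ALLOWED_ACTIONS:
            record_failure(agent, action, environment, "permission_denied")
            return False
        # ... perform the action via the relevant API here ...
        return True

    execute("build-agent", "delete_resource", "production")  # denied and logged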

Attribution Errors And Reputational Risk From Misinterpreting Exposed Configuration Artifacts

  • A viral blog post claimed Persona sends face-scan data to the government and is closely tied to ICE.
  • The hosts characterized the Persona blog post as drawing exaggerated and largely unsupported conclusions from infrastructure fingerprinting.
  • The Persona allegations originated from exposed front-end JavaScript source maps that revealed configurable capabilities, which the researchers misinterpreted as evidence of specific government-linked deployments.
  • Identity verification products commonly include features such as watchlist checks and suspicious-activity reporting because these are standard KYC/AML requirements, even if they are not enabled for every customer (a hypothetical illustration follows this list).
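
To illustrate the interpretation gap, here is a hypothetical fragment, not Persona's actual configuration: a client bundle exposed via source maps can list every capability the product supports, while the per-tenant settings that determine what is actually enabled live server-side.

    # Hypothetical example: capabilities visible in an exposed front-end bundle
    # versus per-tenant settings that are not visible in the bundle.
    BUNDLE_CAPABILITIES = {                 # recoverable from an exposed source map
        "watchlist_checks": True,           # the product *can* do this...
        "suspicious_activity_reports": True,
        "government_agency_webhooks": True,
    }

    TENANT_CONFIG = {                       # server-side; not in the bundle
        "acme-fintech": {"watchlist_checks": True,
                         "suspicious_activity_reports": True,
                         "government_agency_webhooks": False},
    }

    def enabled(tenant: str, capability: str) -> bool:
        """In effect only if the product supports it AND the tenant has it switched on."""
        return (BUNDLE_CAPABILITIES.get(capability, False)
                and TENANT_CONFIG.get(tenant, {}).get(capability, False))

    print(enabled("acme-fintech", "government_agency_webhooks"))  # False, despite the bundle entry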

Watchlist

  • The Pentagon is reportedly standing up a new AI network/program with multiple frontier labs participating and Anthropic as the final holdout facing a near-term deadline to join.
  • The episode highlights a gap in industry practice: agent failures are not being captured in structured datasets that could be used to train models to avoid recurring operational behaviors.
  • An advisory-board statistic shared with Corelight suggests the window from vulnerability disclosure to observed exploitation has compressed from roughly three weeks to two-to-three hours due to attacker use of AI for exploit development.

Unknowns

  • What specific vulnerabilities, configurations, and observable indicators defined the Fortinet compromise campaign, and how widely has it been observed across environments?
  • How effective are Anthropic’s anti-distillation controls in practice (detection accuracy, evasion resistance, and whether attempted distillation materially succeeded)?
  • What are the specific safeguard changes (if any) being requested for defense use, and is there official documentation of Defense Production Act-related actions or supply-chain-risk designation threats?
  • In the reported Amazon outage(s), what permissions, approval workflows, and environment boundaries were in place, and what evidence demonstrates an AI agent executed the destructive changes?
  • How common are enterprise incidents involving self-hosted agents that compile/execute code on endpoints, and what mitigations have measurable impact without breaking legitimate workflows?

Investor overlay

Read-throughs

  • Faster exploitation windows and active IE/MSHTML bypass activity increase urgency for internet-edge and endpoint detection plus rapid patching workflows, potentially benefiting vendors positioned around threat intel, exposure management, and SOC automation if the compression is real beyond anecdotes.
  • The SOC shift from monolithic LLM use to agentic decomposition suggests spend may move toward orchestration, integrations, and security-specific datasets. Value accrues to tools embedded in workflows with strong telemetry and data quality, rather than to generic model access alone.
  • Growing attention to agent permissions and change control after reported agent-linked outages implies demand for governance layers, least-privilege enforcement, and incident telemetry for agents. Vendors that can instrument agent failures into structured data may gain differentiation.

What would confirm

  • Multiple independent datasets show the disclosure-to-exploitation window shrinking toward hours across diverse vulnerabilities, alongside rising volume of opportunistic automated scanning and exploitation attempts observable by defenders.
  • Enterprise security teams report measurable adoption of multi-agent investigation workflows, with procurement shifting toward orchestration platforms, connectors, and security-tuned data assets; case studies show reduced analyst time without increased false positives.
  • Clear post-incident disclosures or audits tie destructive changes to agent execution paths and highlight privilege gaps; subsequent RFPs or product roadmaps emphasize agent governance, approval workflows, and standardized failure logging.

What would kill

  • Follow-on analysis shows the exploitation-speed compression is isolated or driven by reporting bias; median time to exploitation remains weeks for most vulnerabilities when measured broadly.
  • Agentic SOC deployments fail to outperform single model workflows in real environments due to data quality and integration limits, leading to curtailed rollouts or spend shifting back to traditional tooling.
  • Attribution and reputational narratives based on exposed artifacts are repeatedly debunked and do not change customer behavior, indicating limited commercial impact from such viral claims.

Sources