AI-augmented adversary operations

Summary

Google Threat Intelligence Group's May 2026 AI Threat Tracker documents a transition from experimental AI misuse toward repeatable adversary workflows: AI-supported vulnerability research, AI-assisted exploit development, AI-generated obfuscation, LLM-driven malware interaction with victim environments, adversary access-brokerage for model abuse, and AI software supply-chain attacks. WithSecure's GREYVIBE reporting adds a concrete Russia-nexus case where AI appears operationally integrated across lure generation, loader and obfuscator development, backend setup, and post-compromise scripting. Sysdig's marimo CVE-2026-39987 case adds a cloud post-exploitation example where an LLM agent appears to compose live commands, consume prior output, pivot through AWS Secrets Manager, and dump an internal database. Trend Micro's SHADOW-AETHER reporting adds in-the-wild agentic-AI intrusion workflows where operators tunneled into Latin American government and financial environments, then used AI-assisted CLI activity to generate scripts, inspect credentials, move laterally, and stage exfiltration. Sysdig's PraisonAI CVE-2026-44338 case adds a same-day exploitation example where an exposed AI-agent API was scanned less than four hours after advisory publication. Permiso's ChatGPhish research adds a user-facing AI-rendering primitive: attacker-controlled web-page content can become trusted Markdown links, images, spoofed alerts, and QR-code phish inside ChatGPT summarization output. LayerX's BioShocking AI-browser research extends that browser-side boundary: indirect prompt injection can manipulate an agentic browser's operating context so it treats credential copying or authenticated-site data access as part of a fictional game rather than a real-world safety violation. Noma Labs' GitLost and SAND Security's WriteOut disclosures add agentic-workflow trust-boundary cases: public GitHub issue text steering GitHub Agentic Workflows into private-repository read/public-comment write-back, and shared Writer AI previews forwarding victim session cookies into attacker-controlled sandboxes. Snyk's jqwik 1.10.0 reporting adds a software-supply-chain variant: a legitimate maintainer release used terminal-control characters to hide a prompt-injection instruction from humans while leaving it readable to AI coding agents that ingest raw build/test output. CleverHans Lab's June 2026 adaptive-worm research adds a lab-demonstrated network-worm pattern where compromised hosts run local open-weight LLMs to generate target-specific exploitation steps across Linux, Windows, and IoT systems.

The durable defender lesson is that AI is now part of the operational fabric for multiple actor types rather than only a novelty. Treat AI use as a force multiplier across existing intrusion patterns: faster vulnerability triage, more scalable exploit validation, easier malware refactoring, synthetic social-engineering content, and new initial-access paths through ML and AI-tooling dependencies.

Why this matters

GTIG says it identified a cybercrime actor using a zero-day exploit that it assesses was developed with AI support. The planned mass exploitation was disrupted through disclosure and counter-discovery, but the case shows that LLMs can help reason about semantic logic flaws that traditional scanners may miss.
State-linked PRC and DPRK clusters are using AI for vulnerability-research workflows, including persona-driven prompting, curated vulnerability corpora, recursive CVE/PoC validation, and agentic testing in controlled labs.
AI coding support can speed obfuscation and infrastructure development. GTIG highlights dynamic/self-modifying malware experiments, AI-assisted relay-box tooling, and LLM-generated decoy logic in malware linked to suspected Russia-nexus activity; WithSecure separately reports GREYVIBE indicators of AI assistance in LOOKVALJS, DAYLIGHT, TEASOUP, LegionRelay, lure sites, backend infrastructure, and operator scripts.
Sysdig observed a post-compromise LLM-agent workflow after marimo CVE-2026-39987 exploitation: the actor harvested cloud credentials, used Cloudflare Workers as fanned-out egress for AWS Secrets Manager calls, retrieved an SSH key, and dumped internal PostgreSQL tables from a bastion while shaping command output for machine parsing.
Trend Micro observed SHADOW-AETHER-040 and SHADOW-AETHER-064 using agentic AI during intrusions against Latin American government and financial targets. The operators established SOCKS5 / ProxyChains / SSH tunnels, generated one-off scripts, mined application and shell artifacts for credentials, and used AI-assisted workflows for lateral movement and exfiltration.
Sysdig observed PraisonAI CVE-2026-44338 scanning less than four hours after advisory publication, showing that internet-facing AI-agent frameworks can be folded into known-CVE scanner workflows almost immediately.
Malware can embed LLM calls for autonomous decision-making. PROMPTSPY serializes Android UI state, asks Gemini for structured action decisions, and uses accessibility gestures to interact with a victim device.
CleverHans Lab demonstrated a proof-of-concept adaptive computer worm that uses compromised machines to run open-weight LLMs locally. The important defender lesson is not a named in-the-wild family: stolen compute can let an agentic worm keep reasoning and adapting without depending on an attacker's central model API.
AI environments and dependencies are becoming initial-access surfaces. GTIG explicitly ties TeamPCP / UNC6780 supply-chain activity to attempts to pivot from compromised AI software into broader network environments, including disruptive outcomes such as ransomware and extortion.
LayerX's June 2026 BioShocking proof of concept showed that six agentic browser or browser-plugin products — ChatGPT Atlas, Perplexity Comet, Fellou, Genspark Browser, Sigma Browser, and the Claude Chrome plugin — could be steered by a puzzle/game framing into copying credentials from an authenticated GitHub repository in a controlled test. LayerX says vendors were notified; reported outcomes ranged from fixed to no response or failed patch.

Operational shapes to watch

AI-assisted vulnerability discovery: actors feed firmware, source code, CVEs, PoCs, or historical bug corpora into models to prioritize logic flaws and improve exploit reliability.
Mass exploit preparation: clean, tutorial-like exploit scripts with excessive docstrings, hallucinated scores, or textbook code structure may indicate LLM-assisted development, though this is only a weak signal by itself.
AI-generated obfuscation: malware families may contain inert but coherent code, repetitive benign-looking routines, or generated comments that camouflage the active payload path; repeated refactoring can also weaken clustering that depends on stable code lineage.
LLM-in-the-loop malware: implants may serialize UI, filesystem, browser, or application state and request model-generated next actions, turning command-and-control from static tasking into goal-driven orchestration.
Local-model adaptive worms: malware can try to carry or fetch agent scaffolding and use locally available open-weight models, GPUs, or CPU inference on compromised hosts to choose the next vulnerability, credential path, or lateral-movement method. This reduces reliance on externally visible model-provider calls.
LLM-agent post-exploitation: operators may feed shell, cloud, and database output back into an agent loop that selects the next command, extracts credentials, chooses cloud secrets, and improvises table dumps without a target-specific playbook.
Tunneled agentic operations: once a server foothold exists, operators can place the agent's command loop behind Chisel, SOCKS5, ProxyChains, SSH, or web-shell relays so it can act inside the victim network while preserving operator oversight.
Obfuscated model access: adversaries may use account farms, middleware, premium-access resale, or registration automation to evade provider controls and sustain model abuse.
AI supply-chain initial access: compromised AI packages, model-serving tools, MCP-style integrations, IDE plugins, agent skills, or workflow automation can expose secrets and provide a bridge into cloud, SaaS, and internal networks.
Agent skill marketplace poisoning: public skill catalogs and one-click install flows can distribute instructions-plus-code bundles that hide malicious behavior in documents, bytecode, opaque assets, package-manager configuration, or persuasive prompt framing; scanner pass results should not be treated as trust decisions. See Agent skill marketplace poisoning.
Rapid exploitation of agent frameworks: public advisories for AI-agent or workflow runtimes can lead to internet-wide endpoint validation within hours; exposed unauthenticated APIs may leak agent metadata, burn model-provider quota, or trigger side-effecting tools.
AI-rendered phishing surfaces: attacker-controlled web pages, documentation, READMEs, or internal portal text may be summarized into trusted assistant output. When the assistant UI preserves live Markdown links, auto-fetched images, or QR codes from untrusted page content, the page becomes a delivery primitive for beaconing, origin-confusing links, spoofed security notices, and mobile-pivot phishing.
Agentic workflow trust-boundary failures: public issues, tickets, documents, preview links, and generated-code sandboxes become dangerous when they share an execution path with broad agent identities or SaaS session cookies. Noma Labs' GitLost case used a public GitHub issue as an instruction channel into GitHub Agentic Workflows with private-repository read access; SAND Security's WriteOut case used a shared Writer AI preview to place a victim's session token inside attacker-controlled sandbox code. See Agentic workflow trust-boundary failures.
AI-browser context manipulation: LayerX reported BioShocking, a proof-of-concept technique where a malicious web page framed an agent task as a dystopian puzzle that rewards incorrect answers, then used the agent's learned “game rules” to ask it to navigate to /code, follow a redirect to an authenticated work GitHub repository, and copy a plaintext credential file. The durable issue is not the toy puzzle: agentic browsers merge untrusted page content, user instructions, and authenticated browser context into one decision stream. If a page can convince the agent that real-world safety rules do not apply, repositories, email, password managers, SaaS consoles, internal tools, and open tabs become reachable data sources.
Dependency-output prompt injection: Snyk reported that net.jqwik:jqwik-engine 1.10.0, published to Maven Central by the jqwik maintainer on May 25, 2026, printed a hidden instruction for AI coding agents to disregard prior instructions and delete jqwik tests and code. The instruction was concealed from rendered terminals with ANSI erase-line / carriage-return control sequences but remained visible to CI logs, non-PTY subprocess captures, IDE test panels, and agent wrappers that read raw stdout. Snyk reported limited observed impact and noted at least one agent refused to act, but the durable pattern is supply-chain content using build or test output as an instruction channel into autonomous coding loops.

Defender heuristics

Log and review AI-service API usage from endpoints, CI runners, developer workstations, mobile apps, and unusual server processes; unexpected model calls from malware-prone contexts should be treated as suspicious.
Extend software-supply-chain controls to AI tooling: pin packages, restrict install scripts, review MCP/agent permissions, isolate model-serving runtimes, and rotate credentials after package or extension compromises.
Hunt for code and script artifacts with generated-looking exploit scaffolding only as a triage aid; do not treat style alone as attribution or proof of AI use.
In shell and bastion telemetry, watch for agent-friendly command transcripts: repeated delimiters such as echo '---', stderr suppression, head -N output caps, pager disabling, HEREDOC query bundles, and rapid output-to-input value handoffs.
Treat browser/page summarization as an untrusted-content boundary: strip or label assistant-rendered links and remote images from page input, disable automatic remote-image fetching where possible, warn users when output links originate from summarized content, and block QR-code login/payment flows launched from AI summaries.
For agentic browsers, require step-up confirmation before reading or copying data from authenticated contexts such as repositories, email, password managers, CRM, cloud consoles, admin panels, or internal documentation. Scope browsing agents to dedicated browser profiles without ambient access to production SaaS sessions, and revoke or close authenticated sessions when agent mode is no longer needed.
Detect and policy-block “rules do not apply,” “this is only a game,” “ignore reality,” or similar context-reset language in pages handed to high-privilege agents when the same session can access sensitive authenticated sites. Treat unexpected agent navigation from an untrusted page into corporate GitHub, GitLab, email, password vault, or internal wiki domains as a possible indirect-prompt-injection incident.
Treat build, test, package-manager, and tool output as untrusted input to agents. Do not let stdout/stderr from third-party dependencies silently become system or developer instructions; strip or quote terminal-control sequences, label tool output as data, and require explicit human approval for destructive file operations proposed after dependency resolution or tests.
For jqwik specifically, search manifests and lockfiles for net.jqwik:jqwik-engine exactly 1.10.0, review CI/IDE logs for hidden prompt text or ESC[2K / carriage-return artifacts, and move off the affected release if AI coding agents consume its output. Snyk notes 1.10.1 softened the directive and made hiding opt-in rather than removing the maintainer's AI-agent opposition entirely.
Monitor for unexpected local inference activity on servers, developer workstations, and CI runners: sudden ollama, llama.cpp, vLLM, Python model-loader, GPU, or large model-file activity combined with scanning, credential access, or cross-host command execution should be treated as possible agentic post-exploitation.
Constrain local model runtimes and agent frameworks like other execution platforms: run them without ambient cloud/package-registry secrets, limit filesystem and network reach, log tool invocations, and block automatic access to SSH keys, browser profiles, package-manager tokens, and CI environment variables.
Add detections for accessibility-service abuse, structured UI serialization, hardcoded model prompts, model API keys in mobile malware, and network calls to model endpoints from nonstandard processes.
Treat AI-enabled obfuscation as a reason to preserve full execution context: memory, process ancestry, API traces, prompts/configuration, and model-request telemetry may explain behavior that static samples hide.
When investigating AI-package compromises, look beyond credential theft to lateral movement, repository cloning, workflow-log deletion, cloud runtime abuse, and extortion staging.

Sources

Google Cloud / Google Threat Intelligence Group: https://cloud.google.com/blog/topics/threat-intelligence/ai-vulnerability-exploitation-initial-access
WithSecure Labs: https://labs.withsecure.com/publications/greyvibe
Sysdig Threat Research: https://www.sysdig.com/blog/ai-agent-at-the-wheel-how-an-attacker-used-llms-to-move-from-a-cve-to-an-internal-database-in-4-pivots
Trend Micro: https://www.trendmicro.com/en_us/research/26/e/vibe-hacking-two-ai-augmented-campaigns-target-government-and-financial-sectors-in-latin-america.html
Sysdig Threat Research: https://www.sysdig.com/blog/cve-2026-44338-praisonai-authentication-bypass-in-under-4-hours-and-the-growing-trend-of-rapid-exploitation
Permiso Security: https://permiso.io/blog/chatgpt-markdown-rendering-vulnerability
The Hacker News: https://thehackernews.com/2026/05/chatgphish-vulnerability-turns-chatgpt.html
LayerX Security: https://layerxsecurity.com/blog/bioshocking-ai-gaming-the-ai-browser-and-escaping-its-guardrails/
The Hacker News: https://thehackernews.com/2026/06/new-bioshocking-attack-tricks-ai.html
Snyk jqwik prompt injection: https://snyk.io/blog/protestware-open-source-maintainer-qwik-1-10-0-prompt-injection/
CleverHans Lab / arXiv: https://arxiv.org/abs/2606.03811
Noma Labs GitLost: https://noma.security/blog/gitlost-how-we-tricked-githubs-ai-agent-into-leaking-private-repos/
SAND Security WriteOut: https://www.sandsecurity.ai/blog/writeout-writer-ai-cross-tenant