5 APT Groups Deploy AI Malware That Writes Its Own Code Mid-Attack
HONESTCUE doesn't generate malicious code before an attack — it calls the Google Gemini API during execution to write and compile C# payloads in memory, on demand, at the moment of infection. There is no file on disk. There is no static binary for an antivirus signature to match. Every deployment produces a unique, never-before-seen payload generated live by an LLM. This is the capability that Google's Threat Intelligence Group (GTIG) confirms is now active in the wild in its Q1 2026 AI Threat Tracker — and it marks a foundational shift in what defenders must detect.
AI-powered malware is no longer a research concept or a theoretical threat model. GTIG identified five novel malware families — HONESTCUE, PROMPTFLUX, PROMPTSTEAL, COINBAIT, and QUIETVAULT — each architected to use large language model APIs as a core component of their attack chain. PROMPTSTEAL, confirmed deployed by Russia's APT28 against Ukrainian targets, constitutes the first documented case of malware dynamically generating commands through an LLM API in live offensive operations. Five government-backed threat actor groups across Russia, China, Iran, and North Korea have been confirmed abusing Gemini for reconnaissance, phishing lure generation, and malware development.
Why does this matter right now? Because the security industry's primary detection layer — signature-based antivirus and static file scanning — has no answer for AI-integrated malware that generates its own code dynamically at runtime. The threat requires a fundamentally different detection approach: behavioural analytics, egress controls on AI API traffic, and in-memory compilation monitoring. Every enterprise with internet-accessible endpoints is in scope. The attacker's cost of entry is a stolen Gemini or Hugging Face API token, available on dark web markets for under $5.
What HONESTCUE's Gemini-Powered Architecture Enables That Wasn't Possible Before
Traditional malware development produces static code: a binary or script is written, compiled, and delivered to a victim unchanged. Detection relies on this static artefact — antivirus vendors extract file hashes and byte patterns, build signatures, and block known-bad files. Even polymorphic malware that mutates its code does so using pre-written mutation routines embedded in the malware itself, creating detectable patterns over time.
HONESTCUE breaks this model. Discovered by GTIG in September 2025 and confirmed in active deployment, HONESTCUE is a downloader and launcher framework written to interact with the Gemini API as a live code generation service. When HONESTCUE executes on a victim system, it sends a structured natural language prompt to Gemini: requesting a complete, self-contained C# program that downloads a remote URL into memory as a byte array, loads it as a .NET assembly using System.Reflection.Assembly.Load, and executes its entry point. Gemini returns a unique C# source file. HONESTCUE compiles it in memory using the .NET CSharpCodeProvider class and executes the resulting assembly — leaving zero disk artefacts. The second-stage payload is hosted on Discord CDN, a legitimate service that most corporate proxy and firewall rules allow.
The result is a fileless attack chain where every step produces unique artefacts: a unique API call, a unique C# source string, a unique in-memory assembly. Static detection is blind. GTIG attributes HONESTCUE to a suspected single actor or small group in a proof-of-concept stage, but the technique's operational viability has been proven. Defenders should assume wider adoption is imminent.
APT28 PROMPTSTEAL: How Russia Is Weaponising Hugging Face Against Ukraine
While HONESTCUE is assessed to be in a testing phase, PROMPTSTEAL is confirmed in live operations. Google GTIG and CERT-UA jointly confirmed that APT28 — Russia's GRU-linked military intelligence cyber unit, also tracked as FROZENLAKE — has deployed PROMPTSTEAL (tracked by CERT-UA as LAMEHUG) against Ukrainian targets.
PROMPTSTEAL is a Python-based data miner that masquerades as an image generation application: it presents a user interface guiding victims through a series of image generation prompts while covertly querying the Hugging Face API in the background. It uses stolen API tokens to authenticate to Hugging Face's inference endpoint for Qwen2.5-Coder-32B-Instruct, a 32-billion-parameter open-source code LLM. Its prompt asks the LLM to generate Windows commands that gather system information, enumerate files in specific directories, and copy targeted documents to a staging folder. The LLM output — legitimate Windows commands — is executed blindly by PROMPTSTEAL before the results are exfiltrated to an attacker-controlled C2 server.
This architecture is operationally significant: because PROMPTSTEAL's commands are dynamically generated rather than hardcoded, the malware's reconnaissance behaviour can be changed server-side simply by updating the Hugging Face prompt — no malware update required. CERT-UA's public disclosure links this activity to the broader APT28 campaign infrastructure targeting Ukrainian government and municipal healthcare institutions documented in April 2026.
For context on how nation-state actors have evolved their tooling against supply chain targets, see the [North Korea 1,700-package supply chain attack](/blog/north-korea-supply-chain-1700-packages) documented in March 2026.
“APT28's use of PROMPTSTEAL constitutes our first observation of malware querying an LLM deployed in live offensive operations — a milestone in the evolution of nation-state cyber capabilities.”
— Google Threat Intelligence Group (GTIG), Q1 2026 AI Threat Tracker
| Artefact | Type | Notes |
|---|---|---|
| api-inference.huggingface.co (Qwen2.5-Coder-32B-Instruct endpoint) | Network | PROMPTSTEAL C2 — LLM query for command generation |
| PROMPTSTEAL / LAMEHUG Python binary (image gen UI decoy) | File | Masquerades as image generation application |
| Stolen Hugging Face API tokens | Credential | Used for authentication to inference API; monitor HF token revocation |
| CSharpCodeProvider in-memory compilation | Behavioural | HONESTCUE indicator — System.Reflection.Assembly.Load without disk artefact |
| Discord CDN (cdn.discordapp.com) | Network | HONESTCUE second-stage payload delivery channel |
Who Is at Risk from AI-Integrated Malware Campaigns
The five government-backed groups confirmed by GTIG represent only the documented tier. Based on GTIG's findings, the organisations most exposed to AI-integrated malware campaigns fall into three categories.
Government and defence sector organisations are primary targets. APT28 focused PROMPTSTEAL against Ukrainian government entities and healthcare institutions. UNC2970 (North Korea) specifically used Gemini for defence sector reconnaissance — mapping job roles, salary structures, and organisational charts within defence contractors. APT31 (China) used Gemini to research vulnerability exploitation techniques, RCE chains, and WAF bypass methods. If your organisation operates in or serves the defence, government, or critical infrastructure sector, you should assume active intelligence collection is underway using AI-augmented reconnaissance.
Financial services and cryptocurrency organisations face targeted AI-built phishing infrastructure. UNC5356, a financially motivated cluster, deployed COINBAIT — a fully functional cryptocurrency exchange phishing kit built entirely using the Lovable AI platform, with zero custom code. COINBAIT operates a React SPA with Supabase backend-as-a-service, Cloudflare proxying, and legitimate image hosting — all arranged by AI — making it effectively invisible to domain-reputation-based email filtering. ATOMIC, a macOS information stealer, is distributed via ClickFix social engineering through public AI chat shares and specifically targets cryptocurrency wallet data.
Any enterprise using Gemini, Hugging Face, or other AI API platforms faces secondary risk from stolen API token abuse. GTIG documented black-market resale of API tokens stolen from vulnerable open-source AI platforms — meaning attacker-controlled AI queries may be billed to your organisation's account while exfiltrating data from your infrastructure.
Real-World AI Malware Confirmed in the Wild: COINBAIT, ATOMIC, and Model Extraction at Scale
Beyond HONESTCUE and PROMPTSTEAL, GTIG documented three additional confirmed deployments in the Q1 2026 report cycle.
COINBAIT (November 2025, UNC5356): A phishing kit targeting cryptocurrency investors, built entirely using Lovable — a commercial AI-powered application development platform. COINBAIT's React SPA presents a convincing cryptocurrency exchange interface. Supabase handles credential logging on the backend, Lovable hosts images, and Cloudflare proxies traffic. The diagnostic fingerprint is verbose developer-style logging visible in the browser console: strings like '? Analytics: Initializing...', '? RouteGuard: Admin redirected session, allowing free access to...', and '? Analytics: Tracking password attempt:'. No coding or security expertise was required to build it.
ATOMIC (December 2025, ClickFix operators): A macOS information stealer distributed through public AI chat sharing features — specifically Lovable and similar platforms' public share links. Victims click a shared 'AI project' link, are presented with a ClickFix verification page, and execute a PowerShell or shell command that installs ATOMIC. The stealer harvests browser data, cryptocurrency wallet files, Desktop and Documents folders, and system information before exfiltrating to C2.
Model extraction at scale (ongoing, multiple actors): GTIG identified campaigns submitting over 100,000 prompts to Gemini in a single operation, designed to extract internal reasoning traces — a form of intellectual property theft that also enables building offline replica models that bypass safety controls. While primarily a provider-level risk, organisations that have suffered AI API token theft may unknowingly be funding distillation attacks against frontier AI providers.
For additional context on how credential theft tools are evolving in tandem with these campaigns, see the [malicious Chrome extensions OAuth2 token theft](/blog/malicious-chrome-extensions-oauth2-token-theft) analysis.
Detecting HONESTCUE, PROMPTSTEAL, and AI-Integrated Malware Behavioural Indicators
Signature-based detection is structurally blind to AI-integrated malware. Detection must shift to behavioural signals across endpoint, network, SaaS, and identity telemetry.
**Endpoint detection for HONESTCUE:** Alert on invocations of .NET's CSharpCodeProvider class from processes that are not Visual Studio, MSBuild, or other known developer tooling. Specifically monitor System.Reflection.Assembly.Load calls that execute code compiled in-memory without any corresponding on-disk assembly file. Flag System.Net.WebClient.DownloadData calls followed immediately by in-memory assembly loads. Detection rule: `process.commandline contains 'CSharpCodeProvider' AND file.path NOT MATCHES developer_tool_allowlist`.
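The rule above can be sketched as a simple event filter. The field names (`command_line`, `image_path`) and the allowlist paths are illustrative assumptions, not a specific EDR schema:

```python
# Illustrative sketch of the endpoint rule described above. Field names and
# allowlist paths are hypothetical, not a real EDR schema.
DEVELOPER_TOOL_ALLOWLIST = (
    r"C:\Program Files\Microsoft Visual Studio",
    r"C:\Windows\Microsoft.NET\Framework",
)

def is_suspicious_compilation(event: dict) -> bool:
    """Flag in-memory C# compilation from non-developer processes."""
    cmd = event.get("command_line", "")
    path = event.get("image_path", "")
    # Either indicator alone is noisy; together with a non-allowlisted
    # parent image path, this is a high-fidelity signal.
    uses_compiler = "CSharpCodeProvider" in cmd or "Assembly.Load" in cmd
    allowlisted = any(path.startswith(p) for p in DEVELOPER_TOOL_ALLOWLIST)
    return uses_compiler and not allowlisted
```

In production this logic belongs in your EDR or SIEM query language; the sketch only shows the shape of the decision.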
**Network detection for HONESTCUE:** Alert on HTTP POST requests to generativelanguage.googleapis.com (Gemini API) from non-browser, non-sanctioned-application processes. Monitor cdn.discordapp.com for downloads by scripting engines, .NET runtimes, or PowerShell outside of legitimate collaboration tooling.
**Network detection for PROMPTSTEAL:** Alert on structured API calls to api-inference.huggingface.co from non-developer hosts. Flag Python processes making POST requests to HuggingFace inference endpoints during non-business hours. Correlate with file system activity targeting Documents, Desktop, and user profile directories immediately after LLM API calls.
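The same hunt can be sketched over proxy logs; the host names and log fields below are hypothetical stand-ins for your own telemetry schema:

```python
# Sketch of the proxy-log hunt described above. Log fields and the
# developer-host list are assumptions, not a specific proxy product's schema.
AI_API_HOSTS = {
    "api-inference.huggingface.co",        # PROMPTSTEAL
    "generativelanguage.googleapis.com",   # HONESTCUE
}
DEVELOPER_HOSTS = {"ml-dev-01", "ml-dev-02"}  # hypothetical approved hosts

def flag_ai_egress(records):
    """Return proxy records showing AI API calls from non-developer hosts."""
    return [r for r in records
            if r["dest_host"] in AI_API_HOSTS
            and r["src_host"] not in DEVELOPER_HOSTS]
```

Every record this returns deserves triage: on a correctly scoped network, non-developer endpoints have no reason to reach an LLM inference API directly.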
**COINBAIT detection:** Hunt browser console logs for the verbose '? Analytics:' prefix strings. Alert on Supabase-backend domains with no legitimate business application association. Monitor for newly registered domains with Cloudflare proxying and Lovable.app image hosting URLs in page source.
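A minimal hunt for these console fingerprints might look like the following; the patterns reuse the reported strings, with '?' standing in for whatever emoji prefix the kit actually emits:

```python
import re

# Sketch of the console-log hunt for COINBAIT debug strings. Patterns are
# taken from the GTIG-reported fingerprints quoted above.
COINBAIT_PATTERNS = [
    re.compile(r"\? Analytics: (Initializing|Tracking password attempt)"),
    re.compile(r"\? RouteGuard: Admin redirected"),
]

def match_coinbait_fingerprints(console_lines):
    """Return console log lines matching known COINBAIT debug strings."""
    return [line for line in console_lines
            if any(p.search(line) for p in COINBAIT_PATTERNS)]
```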
Across all vectors: integrate AI API provider threat intelligence feeds (Google Safe Browsing, Hugging Face security advisories) and monitor for leaked API tokens associated with your organisation's developer accounts in credential threat intelligence platforms.
**Block or allowlist AI API egress**
Restrict outbound connections to Gemini API, Hugging Face inference, OpenAI, and similar endpoints to explicitly approved hosts. Any non-developer endpoint reaching these APIs is a high-fidelity alert.
**Alert on in-memory .NET compilation**
Deploy SIEM/EDR rules for CSharpCodeProvider and System.Reflection.Assembly.Load calls from non-developer processes. This is HONESTCUE's primary detection surface.
**Monitor Discord CDN from non-collaboration processes**
Flag downloads from cdn.discordapp.com initiated by scripting engines, PowerShell, or .NET runtimes. Legitimate Discord traffic comes from the Discord desktop app.
**Hunt for COINBAIT console fingerprints**
Search proxy and SIEM logs for browser sessions touching Supabase backends from unfamiliar domains. Inspect JavaScript console output on phishing candidate pages for '? Analytics:' debug strings.
**Rotate and audit AI API tokens**
Audit all Gemini, Hugging Face, and OpenAI API keys in use across your organisation. Enable usage alerts for abnormal query volumes. Revoke any tokens present in public or compromised repositories.
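The usage-alert idea in the last item can be prototyped as a simple baseline-and-spike check. The threshold is illustrative, and real billing telemetry is noisier than this:

```python
from statistics import mean, stdev

# Minimal sketch of the usage-spike alert described above: flag a day whose
# API query count sits far above the rolling baseline. The z-score threshold
# is illustrative, not a tuned value.
def is_usage_anomaly(history, today, z_threshold=3.0):
    """True if today's query count exceeds baseline mean + z * stdev."""
    if len(history) < 7:
        return False  # not enough baseline to judge
    mu, sigma = mean(history), stdev(history)
    # Floor sigma at 1.0 so a perfectly flat baseline still alerts sanely.
    return today > mu + z_threshold * max(sigma, 1.0)
```

An attacker burning a stolen token on LLM queries produces exactly this kind of volume spike against an otherwise steady developer baseline.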
Defensive Controls Against AI-Augmented Attacks: From Egress Filtering to Canary Documents
Detection is necessary but insufficient — defenders must also reduce the blast radius of AI-integrated malware that bypasses initial detection. GTIG's Q1 2026 report identifies three control categories with the highest defensive return.
**Egress controls on AI API traffic** are the single highest-leverage control. PROMPTSTEAL depends on reaching Hugging Face's inference API. HONESTCUE depends on reaching Gemini's API. If neither API is reachable from non-developer endpoints, both malware families lose their core capability. Implement explicit allowlists for AI API domains at the proxy and firewall layer, scoped to approved developer hosts only. This does not prevent attackers who use local or pre-cached LLMs, but it neutralises the current generation of API-dependent AI malware.
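A sketch of the allowlist decision, assuming hypothetical host names and a deny-by-default posture for AI API domains:

```python
# Sketch of the deny-by-default egress policy described above. Host names
# and the approved (host, domain) pairs are hypothetical.
AI_API_DOMAINS = {
    "generativelanguage.googleapis.com",
    "api-inference.huggingface.co",
    "api.openai.com",
}
APPROVED = {
    ("ml-dev-01", "generativelanguage.googleapis.com"),
    ("ml-dev-01", "api-inference.huggingface.co"),
}

def egress_allowed(src_host, dest_domain):
    """Allow AI API traffic only for explicitly approved host/domain pairs."""
    if dest_domain not in AI_API_DOMAINS:
        return True  # not an AI API endpoint; outside this policy's scope
    return (src_host, dest_domain) in APPROVED
```

In practice this policy lives in your proxy or firewall configuration, not application code; the sketch shows only the decision shape.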
**Steganographic canary documents** represent an emerging defensive technique documented by academic researchers in 2026: embedding invisible watermarks in sensitive documents that trigger alerts when the document is submitted to an AI service's ingestion pipeline. If an attacker uses QUIETVAULT or similar credential stealers to exfiltrate documents and then submits them to an LLM for analysis, the canary fires — giving defenders an early warning of data exfiltration and AI-assisted reconnaissance before the attack reaches its objective.
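As a toy illustration of the concept only — production watermarking schemes are far more robust — a canary token can be hidden in zero-width characters and recovered at an ingestion checkpoint:

```python
# Toy sketch of a canary watermark: a token encoded as zero-width characters
# embedded in document text. This only illustrates the detect-on-ingestion
# idea; real schemes survive reformatting and partial copying.
ZW0, ZW1 = "\u200b", "\u200c"  # zero-width space / non-joiner as bit symbols

def embed_canary(text, token_bits="1011"):
    """Hide token_bits as invisible characters after the first character."""
    mark = "".join(ZW1 if b == "1" else ZW0 for b in token_bits)
    return text[:1] + mark + text[1:]

def extract_canary(text):
    """Recover the hidden bit string, or None if no canary is present."""
    bits = "".join("1" if c == ZW1 else "0"
                   for c in text if c in (ZW0, ZW1))
    return bits or None
```

An ingestion-side scanner running `extract_canary` on submitted documents fires the alert the moment exfiltrated material is pushed into an LLM pipeline.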
**Behavioural EDR with MITRE ATT&CK coverage** across T1059 (Command and Scripting Interpreter), T1620 (Reflective Code Loading), T1105 (Ingress Tool Transfer), and T1041 (Exfiltration Over C2 Channel) covers the primary execution patterns of HONESTCUE, PROMPTSTEAL, and ATOMIC. Ensure EDR telemetry covers in-memory code execution — not just on-disk file scanning.
This detection surface also contrasts with the BYOVD (Bring Your Own Vulnerable Driver) technique used by threat actors like [Qilin to blind EDR tools](/blog/qilin-byovd-edr-silencing): AI-integrated malware operates at the application layer, so it can be detected through API traffic monitoring and in-memory compilation telemetry without requiring kernel-level visibility.
**Restrict AI API egress to approved hosts only**
Configure proxy and firewall allowlists for Gemini, Hugging Face, and OpenAI APIs scoped to developer endpoints. Deny-all for all other hosts is the target state.
**Deploy in-memory compilation detection rules**
Add SIEM detections for CSharpCodeProvider, Assembly.Load, and similar reflective execution patterns. HONESTCUE's fileless chain has a clear behavioural signature even without a file hash.
**Enable AI API usage alerting on developer accounts**
Configure anomaly alerts on API usage spikes for all organisational Gemini and Hugging Face accounts. An attacker using stolen tokens will generate unusual query patterns.
**Implement steganographic canary documents**
Embed canary watermarks in sensitive document repositories. Any submission of these documents to an LLM inference endpoint generates an immediate alert.
**Cross-reference GTIG VirusTotal collections**
Search your endpoints against the GTIG-published HONESTCUE and COINBAIT VirusTotal collections for historical execution artefacts. Run retrospective threat hunts across 90-day EDR telemetry.
The bottom line
Active deployment of AI-powered malware is no longer a future threat — HONESTCUE, PROMPTSTEAL, and COINBAIT are operational today, confirmed by Google GTIG, with APT28 deploying PROMPTSTEAL in live operations against Ukraine. Five APT groups and four nation-states have crossed the threshold from AI-assisted to AI-integrated attack tooling. The single most impactful control you can implement today: block or strictly allowlist egress to Gemini, Hugging Face, and OpenAI inference APIs from all non-developer enterprise endpoints. Do it before end of business today.
Frequently asked questions
What is HONESTCUE malware?
HONESTCUE is a downloader and launcher framework discovered by Google's Threat Intelligence Group in late 2025. It calls the Google Gemini API mid-execution to generate C# source code on demand, compiles it in memory using .NET CSharpCodeProvider, and executes a second-stage payload — leaving no static files on disk. Because every deployment generates a unique payload, traditional signature-based antivirus tools cannot detect it. HONESTCUE delivers second-stage malware via Discord CDN.
How does AI-powered malware evade antivirus and EDR detection?
AI-integrated malware like HONESTCUE and PROMPTFLUX queries large language model APIs at runtime to generate unique obfuscated code for each infection, defeating static signature databases. PROMPTFLUX uses Gemini's API to rewrite its own VBScript obfuscation techniques hourly via a 'Thinking Robot' module. Because the malicious code does not exist until execution, hash-based and pattern-based detection tools have no baseline to match against. Behavioural EDR focused on process actions rather than file signatures is the most effective countermeasure.
Which threat actors are confirmed using AI in cyberattacks in 2026?
Google GTIG confirmed five government-backed groups abusing Gemini: APT31 and APT41 (China), APT42 (Iran), UNC2970 (North Korea), and UNC6418 (unattributed). APT28 (Russia) is deploying PROMPTSTEAL in active operations against Ukraine. Financially motivated cluster UNC5356 is using AI-generated phishing kits against cryptocurrency and financial organisations. All accounts were disabled by Google after discovery.
Has APT28 been confirmed using AI malware in live operations?
Yes. Google GTIG and CERT-UA confirmed APT28 (also known as FROZENLAKE) deployed PROMPTSTEAL — also tracked as LAMEHUG — in active operations against Ukrainian targets. PROMPTSTEAL is a Python data miner that queries the Hugging Face API using stolen tokens to dynamically generate Windows commands for reconnaissance and data exfiltration. It is the first documented case of a nation-state threat actor using LLM-generated commands in live offensive operations.
How can I detect HONESTCUE or PROMPTSTEAL on my network?
For HONESTCUE: alert on .NET CSharpCodeProvider invocations and System.Reflection.Assembly.Load calls that compile in-memory without corresponding disk artefacts; monitor Discord CDN traffic from non-browser processes. For PROMPTSTEAL: detect outbound API calls to Hugging Face (huggingface.co) from non-developer hosts, flag Python processes making repeated structured POST requests to inference endpoints, and hunt for suspicious files masquerading as image generators with no legitimate UI interaction. Correlate endpoint, SaaS, and identity telemetry.
What is the difference between PROMPTFLUX and PROMPTSTEAL?
PROMPTFLUX is a VBScript dropper that queries the Google Gemini API to regenerate its own obfuscation code hourly, making it a self-modifying evasion tool still in development or testing phase. PROMPTSTEAL is a Python-based data miner deployed by APT28 in live operations that queries Hugging Face's Qwen2.5-Coder LLM to dynamically generate Windows reconnaissance and exfiltration commands — the first confirmed LLM-powered malware in active nation-state operations.
How do I defend against LLM-integrated malware and AI-augmented attacks?
Key defensive controls: block or strictly allowlist egress to AI API endpoints (Gemini, Hugging Face, OpenAI) from enterprise endpoints not used for AI development; deploy behavioural EDR that alerts on in-memory code compilation, unusual scripting engine calls, and unexpected outbound API traffic; monitor for stolen AI API tokens in credential threat intelligence feeds; implement steganographic canary documents to detect data submission to AI services; enforce zero-trust network segmentation to limit blast radius if an AI-powered dropper establishes a foothold.
What is an AI model extraction attack and why does it matter?
A model extraction attack — also called distillation attack — submits massive volumes of prompts to a target AI service to reconstruct the model's reasoning behaviour without authorisation. GTIG observed campaigns submitting over 100,000 prompts in a single operation targeting Gemini's internal reasoning traces. While primarily a risk to AI service providers (intellectual property theft, ToS violation), it also enables attackers to build cheaper local replicas of frontier AI models for use in offline offensive tooling, bypassing AI provider safety controls entirely.
Sources & references
- Google Cloud Blog — GTIG AI Threat Tracker: Distillation, Experimentation, and Integration
- Google Cloud Blog — GTIG AI Threat Tracker: Advances in Threat Actor Usage
- Infosecurity Magazine — AI-Enabled Malware Now Actively Deployed, Says Google
- The Hacker News — Google Uncovers PROMPTFLUX Malware That Uses Gemini AI to Rewrite Its Code Hourly
- CSO Online — Novel malware from Russia's APT28 prompts LLMs to create malicious Windows commands
- Google Threat Intelligence Group — AI Risk and Resilience Mandiant Special Report
- Recorded Future — New malware uses AI to adapt during attacks
- Cybersecurity Dive — AI-based malware makes attacks stealthier and more adaptive
- GTIG AI Threat Tracker PDF — November 2025

Founder & Cybersecurity Evangelist, Decryption Digest
Cybersecurity professional with expertise in threat intelligence, vulnerability research, and enterprise security. Covers zero-days, ransomware, and nation-state operations for 50,000+ security professionals weekly.
