Hackers Use AI Tools to Automate Active Directory Discovery and Test EDR Evasion


A threat actor used AI-assisted development tools to build a post-exploitation framework that automates Active Directory discovery and tests malware against endpoint detection and response products.

The activity, detailed by Sophos X-Ops, shows how attackers are using AI tools to speed up malware development, lab testing, documentation, and workflow coordination. Sophos stressed that humans still directed the process.

The framework combined several offensive components, including Cobalt Strike profiles, Telegram-based command and control, Python shellcode injection scripts, a Cloudflare Worker redirector, an automated Active Directory discovery panel, and a testing lab for EDR evasion.

How Sophos found the AI-assisted attack framework

Sophos detected the activity after an anomalous endpoint in a customer environment triggered alerts for payloads stored in C:\Users\User\Documents\test. The files pointed to a broader framework designed to test stealthy post-exploitation activity.

The toolkit included customized Cobalt Strike profiles that tried to make beacon traffic resemble normal web traffic. It also used a Telegram bot API channel for command and control, allowing communications to flow through trusted third-party infrastructure instead of direct attacker servers.

BleepingComputer reported that Sophos linked the framework to cybercriminal activity tied to ransomware deployment and data theft, not legitimate red-team testing. Sophos did not publicly name the ransomware group because of ongoing investigations.

ComponentWhat researchers foundWhy it matters
Cobalt Strike profilesProfiles designed to make beacon traffic look like legitimate web requestsCan help malicious traffic blend into normal network activity
Telegram C2Command-and-control communication routed through Telegram bot infrastructureCan make blocking and attribution harder for defenders
Cloudflare WorkerFront-end redirector used to hide the real backend command serverCan obscure infrastructure and complicate incident response
Python malware scriptsScripts used for shellcode injection into Windows executablesCan support payload execution while attempting to preserve normal program behavior
AI-assisted workflowCursor and Claude agents helped with coding, testing, and coordinationCan reduce development time and support rapid experimentation

AI helped build and test the toolkit

Investigators found multiple Python scripts written in Russian that appeared to be partially AI-generated. They also found a Git repository containing two major parts: an automated Active Directory discovery panel and a lab for testing payloads against security products.

The setup used Cursor as an AI-native development environment and multiple AI agents with assigned roles. One Claude Opus 4.5 agent coordinated core operations and rule setting, while other agents supported EDR testing, documentation, operational security checks, proxy stress testing, and virtual machine deployment.

Help Net Security reported that the lab used Windows Server 2022 virtual machines for Sophos, CrowdStrike, and Microsoft Defender testing, plus a separate Ubuntu system hosting a Sliver command-and-control server.

The Active Directory panel was automated, not fully autonomous

The Active Directory component is one of the most important parts of the case. It resembled AI-driven automation because it could collect task results, choose the next action from predefined options, dispatch work to remote agents, and reevaluate when results came back.

However, Sophos said it was not a fully autonomous large language model making open-ended decisions. It followed structured logic and human-defined workflows.

This distinction matters. The threat is not a self-directed AI attacker. The threat is a human operator using AI and automation to move faster during post-exploitation work.

Why Active Directory remains a prime target

Active Directory is a high-value target because it controls identities, permissions, computers, groups, and access paths across many enterprise networks. Once attackers map AD relationships, they can look for privileged users, weak delegation paths, exposed service accounts, or routes to domain control.

MITRE ATT&CK describes Domain Account Discovery as a technique where attackers enumerate domain accounts to understand an environment and identify targets for later activity. Automated AD discovery can speed up that process.

For defenders, this means identity telemetry matters as much as malware detection. Suspicious directory queries, unusual enumeration, unexpected use of domain tools, and abnormal authentication patterns can reveal early-stage intrusion activity before ransomware deployment.

Attack goalLikely enterprise riskDefensive focus
Map domain usersAttackers identify privileged or useful accountsMonitor unusual account and group enumeration
Find lateral movement pathsAttackers move from one host to more sensitive systemsTrack remote execution, SMB activity, and new admin sessions
Test EDR bypassesAttackers refine payloads before deploymentUse behavior-based detection and layered controls
Use trusted platforms for C2Malicious communications blend into normal trafficInspect traffic patterns, process lineage, and endpoint events
Prepare ransomware activityData theft and encryption may followAlert on staging, archive creation, credential dumping, and privilege escalation

The lab tested payloads against multiple EDR tools

The attacker built a controlled lab to test malware against real endpoint protection agents. Sophos said the environment used several Windows Server 2022 virtual machines and a separate Ubuntu system for Sliver command and control.

The core payload generator was written in Python and produced Windows executables and DLLs, often using Rust and Go payloads. It wrapped payloads in layers of encryption, evasion logic, and alternative execution methods for testing.

Sophos said nearly 80 modules tested more than 70 techniques. The frameworkโ€™s own documentation claimed high bypass success after repeated iterations, but Sophos noted that the reviewed test data did not fully support those claims.

AI agents ingested public security research

The framework also included instructions for AI agents to read public security research, extract attack techniques, map them to MITRE ATT&CK, prepare lab steps, run experiments, and report findings.

This creates a faster feedback loop for attackers. Instead of manually reading every new research post, operators can ask AI agents to summarize techniques, identify tooling requirements, and help reproduce ideas inside a test environment.

The case shows how public offensive and defensive research can be repurposed. Publishing technical research remains important for defenders, but threat actors can now process the same material faster with AI-assisted workflows.

Why defenders should not panic, but should adapt

The Sophos report does not mean AI has replaced skilled attackers. It shows that AI can help operators automate repetitive steps, write code faster, organize test results, and iterate payloads more quickly.

Help Net Security also noted that Sophos linked the activity to ransomware and data theft operations, while stating that the group involved remains undisclosed because of active investigations.

The defensive lesson is practical. Organizations should assume attackers will use AI to compress development cycles. That makes baseline controls, detection engineering, identity security, and incident response speed more important.

  • Patch exposed services and high-risk enterprise software quickly.
  • Use phishing-resistant multifactor authentication for privileged and remote access.
  • Monitor Active Directory enumeration and unusual account discovery behavior.
  • Detect suspicious use of Cobalt Strike, Sliver, Telegram C2, and Cloudflare Worker redirectors.
  • Correlate endpoint detections with process lineage, network traffic, and identity events.
  • Restrict PowerShell, script interpreters, and unsigned tools where business workflows allow it.
  • Hunt for payloads staged in user directories and temporary development folders.
  • Test recovery plans for ransomware and data theft scenarios.

Detection should focus on behavior, not labels

AI-assisted malware development can change file hashes, payload wrappers, and code structure quickly. That reduces the value of static indicators alone.

Article ingestion and technique mapping instructions for AI agents

Security teams should focus on behavior that attackers still need to perform. That includes unusual script execution, process injection attempts, suspicious parent-child process chains, command-and-control traffic, credential access, and abnormal domain enumeration.

MITRE ATT&CK describes attempts to impair defenses by disabling or modifying tools, services, logging, or protections. EDR evasion testing fits that broader defensive-impairment pattern, even when the exact payload changes.

What security teams should watch now

Organizations should review alerts involving payload staging in user directories, especially folders that look like test environments. The Sophos case began with payloads found under a user Documents path, which then led investigators to the broader framework.

Teams should also monitor for unusual use of developer tooling on non-developer systems. AI coding tools, Git repositories, payload builders, and lab automation software may have legitimate uses, but they can look suspicious on endpoints that do not belong to engineering or security teams.

BleepingComputer reported that artifacts in the environment included Cobalt Strike operator logs tied to a ransom note and multiple organizations listed on a ransomware data leak site. That makes early investigation of suspicious toolchains especially important.

SignalWhy it mattersSuggested action
Payloads in user test foldersMay indicate malware staging or local testingCollect files, isolate the host, and review process history
Unexpected Git repositoriesMay contain automation scripts, payload builders, or C2 configurationReview commit history and linked remote repositories
Telegram bot API traffic from serversMay indicate covert command-and-control communicationCorrelate with endpoint process and network telemetry
Cloudflare Worker redirect behaviorMay hide backend infrastructureInspect destination patterns and process owners
Unusual domain enumerationMay indicate Active Directory reconnaissanceReview account context, host role, and timing

AI changes the speed of attacks, not the fundamentals

The most important takeaway is that AI helps attackers move faster, but it does not remove their need for access, credentials, execution, command and control, and persistence.

Sophos reached the same conclusion: AI lowers the barrier for sophisticated post-exploitation development, but organizations still need the same defense-in-depth basics.

Those basics include timely patching, modern authentication, MFA or passkeys, broad EDR coverage, identity monitoring, least privilege, network segmentation, and a tested ransomware response plan.

Domain Account Discovery and other Active Directory reconnaissance techniques should receive special attention because ransomware operators often rely on identity paths before they deploy final payloads.

Defensive impairment behavior should also remain a high-priority alert category. When attackers test or apply EDR evasion, they usually leave signals across process activity, logs, tamper events, and network communications.

The rise of AI-assisted offensive tooling does not make traditional security controls obsolete. It makes weak controls easier to find, test, and exploit at speed.

FAQ

What did Sophos find in the AI-assisted attack framework?

Sophos found a post-exploitation framework that used AI-assisted tooling, an automated Active Directory discovery panel, Cobalt Strike profiles, Telegram-based command and control, Python shellcode injection scripts, a Cloudflare Worker redirector, and a lab for testing EDR evasion.

Was the Active Directory attack framework fully autonomous?

No. Sophos said the Active Directory discovery panel used predefined workflows and human-directed logic. It collected results, selected next steps from set options, dispatched tasks, and reevaluated output, but it did not act as a fully autonomous LLM.

Which AI tools were involved?

The reported workflow used Cursor and multiple Claude Opus agents. One agent coordinated operations, while others helped with EDR testing, documentation, operational security checks, proxy stress testing, and virtual machine deployment.

Why is Active Directory discovery important to attackers?

Active Directory discovery helps attackers identify users, groups, privileged accounts, computers, and paths for lateral movement. Ransomware operators often use this information before data theft or encryption.

Does AI make EDR useless?

No. AI can help attackers test and revise malware faster, but it does not remove the need for access, execution, credentials, command and control, and persistence. EDR remains important when paired with identity monitoring, patching, MFA, and behavior-based detection.

How can organizations defend against AI-assisted post-exploitation tools?

Organizations should patch quickly, enforce phishing-resistant MFA, monitor Active Directory enumeration, restrict privileged access, detect suspicious C2 traffic, investigate payload staging in user folders, and correlate endpoint, identity, and network telemetry.

Readers help support VPNCentral. We may get a commission if you buy through our links. Tooltip Icon

Read our disclosure page to find out how can you help VPNCentral sustain the editorial team Read more

User forum

0 messages