Carnegie Mellon Researchers Demonstrate That LLMs Can Autonomously Plan and Execute Real-World Cyberattacks
24.7.2025 15:05:00 CEST | Business Wire | Press Release
In a major advance in the fields of cybersecurity and artificial intelligence, researchers from Carnegie Mellon University, in collaboration with Anthropic, have demonstrated that large language models (LLMs) can autonomously plan and execute sophisticated cyberattacks on enterprise-grade network environments without human intervention.
The study, led by Ph.D. candidate Brian Singer from Carnegie Mellon's Department of Electrical and Computer Engineering, reveals that LLMs, when structured with high-level planning capabilities and supported by specialized agent frameworks, can simulate network intrusions that closely mirror real-world breaches. The study’s most striking finding: an LLM was able to successfully replicate the infamous 2017 Equifax data breach in a controlled research environment—autonomously exploiting vulnerabilities, installing malware, and exfiltrating data.
“Our research shows that with the right abstractions and guidance, LLMs can go far beyond basic tasks,” said Singer. “They can coordinate and execute attack strategies that reflect real-world complexity.”
The team developed a hierarchical architecture where the LLM acts as a strategist, planning the attack and issuing high-level instructions, while a mix of LLM and non-LLM agents carry out low-level tasks like scanning networks or deploying exploits. This approach proved far more effective than earlier methods, which relied solely on LLMs executing shell commands.
This work builds on Singer’s prior research into making autonomous attacker and defender tools more accessible and programmable for human developers. Ironically, the same abstractions that simplified development for humans made it easier for LLMs to autonomously perform similar tasks.
While the findings are groundbreaking, Singer emphasized that the research remains a prototype.
“This isn’t something that’s going to take down the internet tomorrow,” he said. “The scenarios are constrained and controlled—but it’s a powerful step forward.”
The implications are twofold: the research highlights serious long-term safety concerns about the potential misuse of increasingly capable LLMs, but it also opens up transformative possibilities for defensive cybersecurity.
“Today, only large organizations can afford red team exercises to proactively test their defenses,” Singer explained. “This research points toward a future where AI systems continuously test networks for vulnerabilities, making these protections accessible to small organizations too.”
The project was conducted in collaboration with Anthropic, which provided model credits and technical consultation. The team included CMU students and faculty affiliated with CyLab, the university’s security and privacy institute. An early version of the research was presented at an OpenAI-hosted security workshop in May.
The resulting paper, “On the Feasibility of Using LLMs to Autonomously Execute Multi-host Network Attacks,” has been cited in multiple industry reports and is already informing safety documentation for cutting-edge AI systems. Lujo Bauer and Vyas Sekar, co-directors of CMU’s Future Enterprise Security Initiative, served as faculty advisors for the project.
Looking ahead, the team is now studying how similar architectures might enable autonomous AI defenses, exploring scenarios where LLM-based agents detect and respond to attacks in real time.
“We're entering an era of AI versus AI in cybersecurity,” Singer said. “And we need to understand both sides to stay ahead.”
About the College of Engineering: The College of Engineering at Carnegie Mellon University is a top-ranked engineering college that is known for our intentional focus on cross-disciplinary collaboration in research. The College is well-known for working on problems of both scientific and practical importance. Our “maker” culture is ingrained in all that we do, leading to novel approaches and transformative results. Our acclaimed faculty have a focus on innovation management and engineering to yield transformative results that will drive the intellectual and economic vitality of our community, nation, and world.
About CyLab:CyLab is the university-wide security and privacy institute at Carnegie Mellon University. We coordinate security and privacy research and education across all university departments. Our mission is to catalyze, support, promote, and strengthen collaborative security and privacy research and education across departments, disciplines, and geographic boundaries to achieve significant impact on research, education, public policy, and practice.
View source version on businesswire.com: https://www.businesswire.com/news/home/20250724351815/en/
Contacts
Media Contact:
Michael Cunningham
Carnegie Mellon University
mcunningham@cmu.edu
412-443-2051
(c) 2024 Business Wire, Inc., All rights reserved.
Business Wire, a Berkshire Hathaway company, is the global leader in multiplatform press release distribution.
Subscribe to releases from Business Wire
Subscribe to all the latest releases from Business Wire by registering your e-mail address below. You can unsubscribe at any time.
Latest releases from Business Wire
Visa Opens the Door to AI-Driven Shopping for Businesses Worldwide8.4.2026 18:00:00 CEST | Press Release
Visa Inc. (NYSE: V) today unveiled Intelligent Commerce Connect, a new solution that makes it easier for businesses to connect to and participate in AI-powered commerce. Intelligent Commerce Connect acts as a network, protocol, and token vault-agnostic ‘on ramp’ to agentic commerce for agent builders, merchants, and enablers. As consumers increasingly rely on AI agents to make purchases, businesses – whether they are building agents, selling to them, or processing transactions – need a simple way to get started. Intelligent Commerce Connect, part of the Visa Intelligent Commerce portfolio, meets that need. Through a single integration via the Visa Acceptance Platform, Intelligent Commerce Connect enables secure payment initiation, tokenization, spend controls, and authentication. The solution integrates both Visa Intelligent Commerce APIs, which are used to process agent purchases using Visa cards, and other networks’ APIs, allowing agents to pay with both Visa and non-Visa cards*. Thi
Andersen Consulting Strengthens Digital Transformation Capabilities Through Kyanon Consulting Collaboration8.4.2026 15:30:00 CEST | Press Release
Andersen Consulting enhances its platform through a Collaboration Agreement with Kyanon Consulting, a Vietnam-based technology consulting firm known for delivering large-scale digital transformation solutions. Founded in 2025, as an arm of Kyanon Digital, Kyanon Consulting provides end-to-end digital and technology services to retail, banking and finance, and manufacturing organizations seeking to modernize operations, improve customer engagement, and accelerate growth. The firm delivers solutions across digital strategy, enterprise and product development, system integration, workflow automation, advanced analytics, and AI-driven insights for customer experience. “At Kyanon Consulting, our mission is to create digital impact that truly matters,” said Tai Huynh, founder of Kyanon Consulting. “We equip clients with the tools, insights, and innovation needed to strengthen resilience and unlock new opportunities. Collaborating with Andersen Consulting allows us to bring our capabilities t
Sumitomo Corporation, SMBC Aviation Capital, Apollo and Brookfield Complete the Acquisition of Air Lease Corporation8.4.2026 15:13:00 CEST | Press Release
Sumitomo Corporation, SMBC Aviation Capital, Apollo-managed funds (“Apollo”) and Brookfield today announced that they have completed the previously announced acquisition of Air Lease Corporation (“Air Lease”) and have renamed the business Sumisho Air Lease Corporation (“Sumisho Air Lease”). This transformational transaction improves the financial position of the business with long term support and aviation expertise from co-investors Sumitomo Corporation, SMBC Aviation Capital, Apollo and Brookfield. Sumisho Air Lease’s strong foundation as an established aircraft lessor, supported by SMBC Aviation Capital’s industry‑leading capabilities as servicer, creates a platform with the scale and financial strength needed to meet the fast‑changing and increasingly complex requirements of airline customers. Sumisho Air Lease will also benefit from the deep expertise and long-standing commitment that both Sumitomo Corporation and SMBC Aviation Capital bring to the global aviation leasing sector.
Sitetracker Launches Scout, an Agentic AI Platform Purpose-Built for Critical Infrastructure8.4.2026 15:00:00 CEST | Press Release
Sitetracker, the leading Asset Lifecycle Management platform for critical infrastructure, today announced the launch of Scout, its new Agentic AI platform designed to help infrastructure owners, operators, and contractors gain deep insights and drive automation within their operations. This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20260408923336/en/ Scout, ready for real work As your AI analyst and agent, Scout is ready to work on day 1. Scout provides clarity when decisions are forming and momentum when action is required. It surfaces risk, synthesizes information, and helps accelerate execution by connecting data and driving action. Scout creates operational intelligence and turns it into action all in a secure environment that protects data sovereignty. “Our customers are looking to create compounding competitive advantages,” said Giuseppe Incitti, Chief Executive Officer of Sitetracker. “Scout delivers by providing easy t
Westinghouse Hosts Annual VVER Fuel Forum with Customers8.4.2026 15:00:00 CEST | Press Release
Westinghouse and MVM Paks Nuclear Power Plant (NPP) recently co-hosted the VVER Fuel Forum in Budapest to share insights and plans for the continued deployment of VVER-1000 and VVER-440 fuel in operating reactors. This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20260408646373/en/ Participants to the VVER Fuel Forum Péter János Horváth, CEO of MVM Paks, welcomed all the participants, highlighting that Hungary is ending two decades of single supplier fuel dependency thanks to the agreement recently signed with Westinghouse to supply the VVER-440 NOVA E-6 fuel design. Six customers presented the progress made and positive outcomes achieved in the past years with the introduction of Westinghouse fuel into mixed cores with resident fuel in their reactors: Energoatom has extensive experience with Westinghouse VVER-440 and VVER-1000 fuel, currently used in the nine reactors in operation. Ukraine will be the first country to operate en
In our pressroom you can read all our latest releases, find our press contacts, images, documents and other relevant information about us.
Visit our pressroom