Carnegie Mellon Researchers Demonstrate That LLMs Can Autonomously Plan and Execute Real-World Cyberattacks
In a major advance in the fields of cybersecurity and artificial intelligence, researchers from Carnegie Mellon University, in collaboration with Anthropic, have demonstrated that large language models (LLMs) can autonomously plan and execute sophisticated cyberattacks on enterprise-grade network environments without human intervention.
The study, led by Ph.D. candidate Brian Singer from Carnegie Mellon's Department of Electrical and Computer Engineering, reveals that LLMs, when structured with high-level planning capabilities and supported by specialized agent frameworks, can simulate network intrusions that closely mirror real-world breaches. The study’s most striking finding: an LLM was able to successfully replicate the infamous 2017 Equifax data breach in a controlled research environment—autonomously exploiting vulnerabilities, installing malware, and exfiltrating data.
“Our research shows that with the right abstractions and guidance, LLMs can go far beyond basic tasks,” said Singer. “They can coordinate and execute attack strategies that reflect real-world complexity.”
The team developed a hierarchical architecture where the LLM acts as a strategist, planning the attack and issuing high-level instructions, while a mix of LLM and non-LLM agents carry out low-level tasks like scanning networks or deploying exploits. This approach proved far more effective than earlier methods, which relied solely on LLMs executing shell commands.
This work builds on Singer’s prior research into making autonomous attacker and defender tools more accessible and programmable for human developers. Ironically, the same abstractions that simplified development for humans made it easier for LLMs to autonomously perform similar tasks.
While the findings are groundbreaking, Singer emphasized that the research remains a prototype.
“This isn’t something that’s going to take down the internet tomorrow,” he said. “The scenarios are constrained and controlled—but it’s a powerful step forward.”
The implications are twofold: the research highlights serious long-term safety concerns about the potential misuse of increasingly capable LLMs, but it also opens up transformative possibilities for defensive cybersecurity.
“Today, only large organizations can afford red team exercises to proactively test their defenses,” Singer explained. “This research points toward a future where AI systems continuously test networks for vulnerabilities, making these protections accessible to small organizations too.”
The project was conducted in collaboration with Anthropic, which provided model credits and technical consultation. The team included CMU students and faculty affiliated with CyLab, the university’s security and privacy institute. An early version of the research was presented at an OpenAI-hosted security workshop in May.
The resulting paper, “On the Feasibility of Using LLMs to Autonomously Execute Multi-host Network Attacks,” has been cited in multiple industry reports and is already informing safety documentation for cutting-edge AI systems. Lujo Bauer and Vyas Sekar, co-directors of CMU’s Future Enterprise Security Initiative, served as faculty advisors for the project.
Looking ahead, the team is now studying how similar architectures might enable autonomous AI defenses, exploring scenarios where LLM-based agents detect and respond to attacks in real time.
“We're entering an era of AI versus AI in cybersecurity,” Singer said. “And we need to understand both sides to stay ahead.”
About the College of Engineering: The College of Engineering at Carnegie Mellon University is a top-ranked engineering college that is known for our intentional focus on cross-disciplinary collaboration in research. The College is well-known for working on problems of both scientific and practical importance. Our “maker” culture is ingrained in all that we do, leading to novel approaches and transformative results. Our acclaimed faculty have a focus on innovation management and engineering to yield transformative results that will drive the intellectual and economic vitality of our community, nation, and world.
About CyLab:CyLab is the university-wide security and privacy institute at Carnegie Mellon University. We coordinate security and privacy research and education across all university departments. Our mission is to catalyze, support, promote, and strengthen collaborative security and privacy research and education across departments, disciplines, and geographic boundaries to achieve significant impact on research, education, public policy, and practice.
View source version on businesswire.com: https://www.businesswire.com/news/home/20250724351815/en/
Contacts
Media Contact:
Michael Cunningham
Carnegie Mellon University
mcunningham@cmu.edu
412-443-2051
(c) 2024 Business Wire, Inc., All rights reserved.
Business Wire, a Berkshire Hathaway company, is the global leader in multiplatform press release distribution.
Subscribe to releases from Business Wire
Subscribe to all the latest releases from Business Wire by registering your e-mail address below. You can unsubscribe at any time.
Latest releases from Business Wire
Civil Air Patrol Expands Fleet With 15 New Cessna Aircraft to Support Lifesaving and Community Missions15.12.2025 17:00:00 CET | Press Release
Textron Aviation Inc., a Textron Inc. (NYSE: TXT) company, announced today that Civil Air Patrol (CAP), the world’s largest operator of Cessna aircraft, is strengthening its national mission capabilities with an order for 15 additional piston-engine aircraft, including seven Cessna Skyhawk 172 and eight Cessna Skylane 182 models scheduled for delivery throughout 2026. The order follows recent deliveries of an additional two Cessna Skylane and one Cessna Turbo Stationair HD aircraft, expanding CAP’s fleet to more than 500 Cessna aircraft nationwide. This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20251215613573/en/ Delivery of an additional two Cessna Skylane and one Cessna Turbo Stationair HD aircraft joins CAP’s fleet of more than 500 Cessna aircraft nationwide. Cessna aircraft are designed and produced by Textron Aviation. “Civil Air Patrol’s missions demand aircraft that are reliable, versatile and ready to perform in critic
Winston & Strawn and Taylor Wessing UK to Combine, Creating a Premier Transatlantic Law Firm15.12.2025 16:52:00 CET | Press Release
Winston & Strawn and Taylor Wessing’s UK-led business announced today their intention to combine, creating a premier transatlantic law firm that would operate under a new shared name, Winston Taylor. The combination responds to increasing client demand for seamlessly integrated US–UK–EU counsel for the businesses, people, and markets driving capital and innovation. This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20251215914957/en/ The combination once final will unite two international firms with more than 400 years of combined history, complementary strengths, and a common vision to meet clients’ evolving global needs. The combined firm will include more than 1,400 lawyers, establishing one of the largest transatlantic firms whose footprint is primarily in the United States, the United Kingdom, and Europe, and also in Latin America and the Middle East. Leveraging significant strength and scale in major litigation, critical tra
Despite Barriers, Financial Institutions are Clear About AI's Greatest Impact15.12.2025 16:32:00 CET | Press Release
HTEC, a global AI-first provider of software and hardware design and engineering services, today released The State of AI in Financial Services & Insurance 2025, a first industry subset of its global research report in AI. This publication offers one of the clearest views to date into how financial institutions are adopting and scaling artificial intelligence. This industry-focused report analyzes insights from 250 C-suite leaders within financial services and insurance, drawn from HTEC’s broader global study of 1,529 C-suite executives—including CIOs, CTOs, CDOs, CPOs, CFOs, COOs, CEOs and CSOs—across Saudi Arabia, the UAE, the United Kingdom, the United States, Germany and Spain. This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20251215790717/en/ Executive Summary: The State of AI in Financial Services and Insurance 2025 The findings confirm a decisive shift in the industry: not a single respondent said AI is not a priority. L
Align Partners Sends Second Public Shareholder Letter to Coway, Urging Announcement of Revised Value-up Plan by January 30, 202615.12.2025 15:54:00 CET | Press Release
Align Partners Capital Management Inc. (“Align Partners”), a shareholder of Coway Co., Ltd. (“Coway”) since 2023 holding more than 4% of the Company’s outstanding shares through funds it manages or advises, announced that it has sent a second public shareholder letter to Coway’s Board of Directors. The letter calls for measures to address the company’s chronic undervaluation and enhance shareholder value. Align Partners has requested that Coway announce a revised corporate Value-up Plan reflecting these proposals by January 30, 2026. In the letter, Align Partners assessed Coway’s February 2025 plan as insufficient to address Coway’s persistent undervaluation and urged the Board to incorporate seven measures: (1) clear mid-to-long-term valuation and ROE targets with execution plans; (2) clarified and strengthened target capital structure policy; (3) updated shareholder return policy reflecting both the target capital structure policy and new dividend income tax separation regime; (4) en
Marathon Asset Management Provides Junior Capital Financing to EXALTA Group15.12.2025 15:00:00 CET | Press Release
Marathon Asset Management (“Marathon”), a leading global credit manager with more than $24 billion of assets under management, is pleased to announce the closing of a junior capital financing to EXALTA Group (“EXALTA” or the “Company”), a portfolio company of Montagu. Marathon led the financing that supported the formation of EXALTA through the strategic merger of three Montagu-owned companies including Intech, Resolve Surgical Technologies, and Tyber Medical. The transaction marks one of many successful transactions for Marathon’s European Credit business in the healthcare sector, where the firm has a knowledge-based advantage with a dedicated Healthcare Finance business and specialized medical advisory board providing sector insight to middle market companies. EXALTA is a global leader in orthopaedic contract design and manufacturing for spine, trauma, extremities, sports medicine and enabling technology providing comprehensive solutions to OEMs within the medical technology industry
In our pressroom you can read all our latest releases, find our press contacts, images, documents and other relevant information about us.
Visit our pressroom