Business Wire

AI Alignment Lab Achieves Major Milestone in Step Towards Agentic AI

Share

Aligned AI, a leader in artificial intelligence (AI) research, has announced a groundbreaking AI advancement in misgeneralization, a critical challenge in the field of AI. It is the first to surpass a key benchmark called CoinRun by teaching an AI to “think” in human-like concepts. The technology underpinning the achievement opens the door to more precise, reliable, and controllable AI for a wide variety of real world applications.

By teaching AI models to generalize in a manner more akin to agentic human cognition, Aligned AI’s innovation enables AI to correctly identify concepts across new situations and environments, reducing the need for prolonged production, testing, and retraining.

Misgeneralization occurs when AI systems learn incorrect patterns and behaviors from their training data, and are not able to correctly adapt when presented with new information. This leads to unexpected, and often harmful, outcomes. Today’s foundation models suffer from varying degrees of misgeneralization, as evidenced by users’ ability to “jailbreak” them, or there is a trade off between functionality and undesired behavior. The challenge of misgeneralization also prevents the industry as a whole from moving forward. For instance, generalization is required for truly autonomous vehicles and applying AI to critical applications. Otherwise, AIs cannot operate well enough in unfamiliar environments or discern the correct goals without human intervention.

To achieve this milestone, Aligned AI used the 2021 CoinRun misgeneralization benchmark, an Atari-style game released by researchers at Google DeepMind, the University of Cambridge, the University of Tubingen, and the University of Edinburgh. The goal of the benchmark is to test whether an AI can deduce a complex goal when that goal is spuriously correlated with a simpler goal in its training environment. The AI is rewarded for getting a coin, which is always placed at the end of the level during the training period, but is placed in a random location during the testing period, without additional reward information being provided.

Prior to Aligned AI’s innovation, AIs trained on CoinRun believed the best way to play the game was to go to the right, while avoiding monsters and holes. Because the coin was always at the end of the level during training, this strategy seemed effective. When the AI encountered a new level where the coin was placed elsewhere in the level but without being given new information, it would ignore the coin and either miss it or get it only by accident. ACE (which stands for “Algorithm for Concept Extrapolation”), the new AI developed by Aligned AI, notices the changes in the test environment and figures out to go for the coin, even without new reward information - just as a human would.

The key benefits of this breakthrough include:

  • Enhanced Safety: By reducing misgeneralization, AI systems become more reliable, ensuring they operate safely in a wide range of scenarios, from autonomous vehicles to robotics.
  • Improved Capabilities: It enables AI to better understand human intentions and make decisions that align with those intentions, significantly boosting its capabilities.
  • Ethical AI: It enhances the ethical aspects of AI by promoting fairness, transparency, and non-discrimination. AI systems that are precise, reliable, and interpretable are more likely to make ethical decisions by avoiding bias and aligning with human values.
  • Industry Impact: It’s poised to transform industries such as robotics, autonomous vehicles, and foundation models, making them more practical and applicable in various real-world settings.

“This isn't just a game-changer for the world of AI, it's a seismic shift for countless industries,” said Rebecca Gorman, Co-Founder and CEO of Aligned AI. “By significantly reducing misgeneralization and enhancing AI's ability to understand and adapt to unforeseen scenarios, we're opening doors to unparalleled opportunities across the board. From autonomous vehicles that can navigate from San Francisco to Phoenix on streets it's never seen before, to robots that can operate effectively in a range of changing and unforeseen environments, this benchmark is the linchpin that will make these futuristic visions a reality. It's not just about improving AI; it's about revolutionizing how industries operate, innovate, and serve humanity.”

Aligned AI’s innovation addresses a critical problem facing all AI systems. When confronted with new environments, current AIs tend to incorrectly extend the training data. This is why 70% of models don’t make it into production or face prolonged production and testing time, hindering scalability and often requiring retraining within the first year of release.

“As AI increases in power and widespread use, generalization remains a challenge,” said John Sviokla, a pioneering researcher in AI and current co-founder of GAI Insights, an advisory firm that helps companies achieve ROI with generative AI. “Aligned AI’s research is a critical step forward in the safe, ethical, and effective use of AI across industries.”

Since it was founded, Aligned AI has been at the forefront of addressing the critical challenges facing AI development and deployment. In 2022, Aligned AI was the leader in ChatGPT-jailbreak prevention, releasing the first prompt-evaluator as an open-source project. In September 2023, Aligned AI was awarded the CogX prize for the “Best Innovation in Mitigating Algorithm Bias” for EquitAI, an algorithm that constrains LLMs to output gender unbiased text, and faAIr, its algorithm for measuring and ranking gender bias in foundation models. Aligned AI’s previous work on concept extrapolation improves the performance of AI on out-of-distribution datasets and helps models behave safely while waiting for human feedback.

To learn more about Aligned AI and its misgeneralization breakthrough, please visit buildaligned.ai.

About Aligned AI:

Founded in Oxford by Rebecca Gorman and Dr. Stuart Armstrong, Aligned AI is a deep-tech startup that is enabling the next step change in AI by teaching AIs to understand and hold human-like concepts. Its core technology of “concept extrapolation” enables AIs to extend its trainers’ intent beyond its training data, meaning it operates as it should even in new scenarios. Aligned AI believes that safety and capability are not trade-offs, but rather an AI that is more precise and controllable is also more powerful.

To view this piece of content from cts.businesswire.com, please give your consent at the top of this page.

Contact information

Media:
Alana Bannan
Matter Communications
360-975-1812
AlignedAI@matternow.com

About Business Wire

Business Wire
Business Wire
24 Martin Lane
EC4R 0DR London

+44 20 7626 1982http://www.businesswire.com

Subscribe to releases from Business Wire

Subscribe to all the latest releases from Business Wire by registering your e-mail address below. You can unsubscribe at any time.

Latest releases from Business Wire

Civil Air Patrol Expands Fleet With 15 New Cessna Aircraft to Support Lifesaving and Community Missions15.12.2025 17:00:00 CET | Press Release

Textron Aviation Inc., a Textron Inc. (NYSE: TXT) company, announced today that Civil Air Patrol (CAP), the world’s largest operator of Cessna aircraft, is strengthening its national mission capabilities with an order for 15 additional piston-engine aircraft, including seven Cessna Skyhawk 172 and eight Cessna Skylane 182 models scheduled for delivery throughout 2026. The order follows recent deliveries of an additional two Cessna Skylane and one Cessna Turbo Stationair HD aircraft, expanding CAP’s fleet to more than 500 Cessna aircraft nationwide. This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20251215613573/en/ Delivery of an additional two Cessna Skylane and one Cessna Turbo Stationair HD aircraft joins CAP’s fleet of more than 500 Cessna aircraft nationwide. Cessna aircraft are designed and produced by Textron Aviation. “Civil Air Patrol’s missions demand aircraft that are reliable, versatile and ready to perform in critic

Winston & Strawn and Taylor Wessing UK to Combine, Creating a Premier Transatlantic Law Firm15.12.2025 16:52:00 CET | Press Release

Winston & Strawn and Taylor Wessing’s UK-led business announced today their intention to combine, creating a premier transatlantic law firm that would operate under a new shared name, Winston Taylor. The combination responds to increasing client demand for seamlessly integrated US–UK–EU counsel for the businesses, people, and markets driving capital and innovation. This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20251215914957/en/ The combination once final will unite two international firms with more than 400 years of combined history, complementary strengths, and a common vision to meet clients’ evolving global needs. The combined firm will include more than 1,400 lawyers, establishing one of the largest transatlantic firms whose footprint is primarily in the United States, the United Kingdom, and Europe, and also in Latin America and the Middle East. Leveraging significant strength and scale in major litigation, critical tra

Despite Barriers, Financial Institutions are Clear About AI's Greatest Impact15.12.2025 16:32:00 CET | Press Release

HTEC, a global AI-first provider of software and hardware design and engineering services, today released The State of AI in Financial Services & Insurance 2025, a first industry subset of its global research report in AI. This publication offers one of the clearest views to date into how financial institutions are adopting and scaling artificial intelligence. This industry-focused report analyzes insights from 250 C-suite leaders within financial services and insurance, drawn from HTEC’s broader global study of 1,529 C-suite executives—including CIOs, CTOs, CDOs, CPOs, CFOs, COOs, CEOs and CSOs—across Saudi Arabia, the UAE, the United Kingdom, the United States, Germany and Spain. This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20251215790717/en/ Executive Summary: The State of AI in Financial Services and Insurance 2025 The findings confirm a decisive shift in the industry: not a single respondent said AI is not a priority. L

Align Partners Sends Second Public Shareholder Letter to Coway, Urging Announcement of Revised Value-up Plan by January 30, 202615.12.2025 15:54:00 CET | Press Release

Align Partners Capital Management Inc. (“Align Partners”), a shareholder of Coway Co., Ltd. (“Coway”) since 2023 holding more than 4% of the Company’s outstanding shares through funds it manages or advises, announced that it has sent a second public shareholder letter to Coway’s Board of Directors. The letter calls for measures to address the company’s chronic undervaluation and enhance shareholder value. Align Partners has requested that Coway announce a revised corporate Value-up Plan reflecting these proposals by January 30, 2026. In the letter, Align Partners assessed Coway’s February 2025 plan as insufficient to address Coway’s persistent undervaluation and urged the Board to incorporate seven measures: (1) clear mid-to-long-term valuation and ROE targets with execution plans; (2) clarified and strengthened target capital structure policy; (3) updated shareholder return policy reflecting both the target capital structure policy and new dividend income tax separation regime; (4) en

Marathon Asset Management Provides Junior Capital Financing to EXALTA Group15.12.2025 15:00:00 CET | Press Release

Marathon Asset Management (“Marathon”), a leading global credit manager with more than $24 billion of assets under management, is pleased to announce the closing of a junior capital financing to EXALTA Group (“EXALTA” or the “Company”), a portfolio company of Montagu. Marathon led the financing that supported the formation of EXALTA through the strategic merger of three Montagu-owned companies including Intech, Resolve Surgical Technologies, and Tyber Medical. The transaction marks one of many successful transactions for Marathon’s European Credit business in the healthcare sector, where the firm has a knowledge-based advantage with a dedicated Healthcare Finance business and specialized medical advisory board providing sector insight to middle market companies. EXALTA is a global leader in orthopaedic contract design and manufacturing for spine, trauma, extremities, sports medicine and enabling technology providing comprehensive solutions to OEMs within the medical technology industry

In our pressroom you can read all our latest releases, find our press contacts, images, documents and other relevant information about us.

Visit our pressroom
World GlobeA line styled icon from Orion Icon Library.HiddenA line styled icon from Orion Icon Library.Eye