Business Wire

AI Alignment Lab Achieves Major Milestone in Step Towards Agentic AI


Aligned AI, a leader in artificial intelligence (AI) research, has announced a breakthrough on misgeneralization, a critical challenge in the field of AI. The company is the first to surpass a key benchmark called CoinRun, which it did by teaching an AI to “think” in human-like concepts. The technology underpinning the achievement opens the door to more precise, reliable, and controllable AI for a wide variety of real-world applications.

By teaching AI models to generalize in a manner more akin to agentic human cognition, Aligned AI’s innovation enables AI to correctly identify concepts across new situations and environments, reducing the need for prolonged production, testing, and retraining.

Misgeneralization occurs when AI systems learn incorrect patterns and behaviors from their training data and cannot adapt correctly when presented with new information. This leads to unexpected, and often harmful, outcomes. Today’s foundation models suffer from varying degrees of misgeneralization, as evidenced by users’ ability to “jailbreak” them and by the trade-off between functionality and undesired behavior. The challenge of misgeneralization also prevents the industry as a whole from moving forward. For instance, generalization is required for truly autonomous vehicles and for applying AI to critical applications. Without it, AIs cannot operate well enough in unfamiliar environments or discern the correct goals without human intervention.

To achieve this milestone, Aligned AI used the 2021 CoinRun misgeneralization benchmark, an Atari-style game released by researchers at Google DeepMind, the University of Cambridge, the University of Tübingen, and the University of Edinburgh. The goal of the benchmark is to test whether an AI can deduce a complex goal when that goal is spuriously correlated with a simpler goal in its training environment. The AI is rewarded for getting a coin. During the training period the coin is always placed at the end of the level; during the testing period it is placed in a random location, and no additional reward information is provided.

Prior to Aligned AI’s innovation, AIs trained on CoinRun learned that the best way to play the game was to go to the right while avoiding monsters and holes. Because the coin was always at the end of the level during training, this strategy seemed effective. When such an AI encountered a new level where the coin was placed elsewhere, and it was given no new reward information, it would ignore the coin and either miss it or get it only by accident. ACE (which stands for “Algorithm for Concept Extrapolation”), the new AI developed by Aligned AI, notices the changes in the test environment and figures out that it should go for the coin, even without new reward information, just as a human would.
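The train/test setup described above can be illustrated with a toy simulation. The sketch below is not CoinRun itself and is not ACE: the grid size, the function names, and both policies are hypothetical, and the coin-seeking policy is simply handed the coin's position as a stand-in for an agent that has learned the intended goal. It only shows why a "go right" strategy looks perfect in training and fails when the coin moves.

```python
import random

# Hypothetical toy level: a small grid, agent starts at the bottom-left corner.
WIDTH, HEIGHT = 10, 3


def make_level(train, rng):
    """Return the coin's (x, y) position for one level."""
    if train:
        # Training: the coin always sits at the far right end of the ground row.
        return (WIDTH - 1, 0)
    # Testing: the coin is placed at a random location.
    return (rng.randrange(1, WIDTH), rng.randrange(HEIGHT))


def go_right_policy(pos, coin):
    """Proxy goal learned in training: walk right and ignore the coin."""
    x, y = pos
    return (min(x + 1, WIDTH - 1), y)


def seek_coin_policy(pos, coin):
    """Intended goal: move toward the coin (its position is given here)."""
    x, y = pos
    cx, cy = coin
    if x != cx:
        return (x + (1 if cx > x else -1), y)
    if y != cy:
        return (x, y + (1 if cy > y else -1))
    return pos


def run_episode(policy, coin, max_steps=WIDTH + HEIGHT):
    """Return True if the agent reaches the coin within max_steps moves."""
    pos = (0, 0)
    for _ in range(max_steps):
        if pos == coin:
            return True
        pos = policy(pos, coin)
    return pos == coin


def success_rate(policy, train, episodes=1000, seed=0):
    """Fraction of levels in which the policy collects the coin."""
    rng = random.Random(seed)
    wins = sum(run_episode(policy, make_level(train, rng)) for _ in range(episodes))
    return wins / episodes


if __name__ == "__main__":
    for name, policy in [("go right", go_right_policy), ("seek coin", seek_coin_policy)]:
        print(name, "train:", success_rate(policy, True), "test:", success_rate(policy, False))
```

In this toy, both policies collect every training coin, so reward alone cannot tell them apart. Only the randomized test levels reveal that the "go right" policy collects the coin just when it happens to lie on its path.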

The key benefits of this breakthrough include:

  • Enhanced Safety: By reducing misgeneralization, AI systems become more reliable, ensuring they operate safely in a wide range of scenarios, from autonomous vehicles to robotics.
  • Improved Capabilities: It enables AI to better understand human intentions and make decisions that align with those intentions, significantly boosting its capabilities.
  • Ethical AI: It enhances the ethical aspects of AI by promoting fairness, transparency, and non-discrimination. AI systems that are precise, reliable, and interpretable are more likely to make ethical decisions by avoiding bias and aligning with human values.
  • Industry Impact: It’s poised to transform industries such as robotics, autonomous vehicles, and foundation models, making them more practical and applicable in various real-world settings.

“This isn't just a game-changer for the world of AI, it's a seismic shift for countless industries,” said Rebecca Gorman, Co-Founder and CEO of Aligned AI. “By significantly reducing misgeneralization and enhancing AI's ability to understand and adapt to unforeseen scenarios, we're opening doors to unparalleled opportunities across the board. From autonomous vehicles that can navigate from San Francisco to Phoenix on streets it's never seen before, to robots that can operate effectively in a range of changing and unforeseen environments, this benchmark is the linchpin that will make these futuristic visions a reality. It's not just about improving AI; it's about revolutionizing how industries operate, innovate, and serve humanity.”

Aligned AI’s innovation addresses a critical problem facing all AI systems. When confronted with new environments, current AIs tend to extrapolate incorrectly from their training data. According to the company, this is why 70% of models either never make it into production or face prolonged production and testing time, which hinders scalability and often requires retraining within the first year of release.

“As AI increases in power and widespread use, generalization remains a challenge,” said John Sviokla, a pioneering researcher in AI and current co-founder of GAI Insights, an advisory firm that helps companies achieve ROI with generative AI. “Aligned AI’s research is a critical step forward in the safe, ethical, and effective use of AI across industries.”

Since it was founded, Aligned AI has been at the forefront of addressing the critical challenges facing AI development and deployment. In 2022, Aligned AI was the leader in ChatGPT-jailbreak prevention, releasing the first prompt-evaluator as an open-source project. In September 2023, Aligned AI was awarded the CogX prize for the “Best Innovation in Mitigating Algorithm Bias” for EquitAI, an algorithm that constrains LLMs to output gender-unbiased text, and faAIr, its algorithm for measuring and ranking gender bias in foundation models. Aligned AI’s previous work on concept extrapolation improves the performance of AI on out-of-distribution datasets and helps models behave safely while waiting for human feedback.

To learn more about Aligned AI and its misgeneralization breakthrough, please visit buildaligned.ai.

About Aligned AI:

Founded in Oxford by Rebecca Gorman and Dr. Stuart Armstrong, Aligned AI is a deep-tech startup that is enabling the next step change in AI by teaching AIs to understand and hold human-like concepts. Its core technology of “concept extrapolation” enables an AI to extend its trainers’ intent beyond its training data, meaning it operates as it should even in new scenarios. Aligned AI believes that safety and capability are not trade-offs; rather, an AI that is more precise and controllable is also more powerful.


Contact information

Media:
Alana Bannan
Matter Communications
360-975-1812
AlignedAI@matternow.com

About Business Wire

Business Wire
24 Martin Lane
EC4R 0DR London

+44 20 7626 1982
http://www.businesswire.com
