Logical Intelligence Achieves 76 Percent on Putnam Benchmark, Highlighting Shift Beyond Large Language Models to Language-free, Mathematically Grounded Models

2.12.2025 15:15:00 CET | Business Wire | Press Release

Over the last decade, artificial intelligence (AI) has been largely built around large language models (LLMs). These systems are based on a language and guess words in a chain in the form of tokens. As a result, they frequently hallucinate and require vast compute and power infrastructure to solve tasks. The moment systems like public safety, national infrastructure, and industrial automation need logic, LLMs break and introduce safety risks. Token-free language independent models represent a new direction for AI. They do not predict words. They search for correct solutions and require less compute. Logical Intelligence is the first company building exclusively around mathematically derived, non autoregressive EBM (Energy Based Model) reasoning.

Today, Logical Intelligence announced that its Aleph tool achieved a 76 percent score on the Putnam Benchmark, one of the most demanding mathematical reasoning tests in artificial intelligence. The benchmark measures a model’s ability to solve formal mathematics problems by producing verified proofs rather than relying on text generation. While Aleph is an internal tool built on top of an LLM, its performance places it ahead of all publicly evaluated LLMs and the hybrid EBM systems that still depend on LLM scaffolding. The results are a strong signal that native EBM architectures offer a clear path to trustworthy AI.

“We built Aleph as an internal tool to test the mathematical rigor of the environment we are creating, not to be our core model,” said Eve Bodnia, founder and CEO of Logical Intelligence. “Aleph’s performance proves that our foundations are strong, even though Aleph itself was developed on top of an LLM. The tool represents a fraction of what we expect our core model to accomplish.”

Why Logical Intelligence Uses EBMs Instead of LLMs

Most AI systems reason the same way they write: one word at a time. This produces long, fragile chains of tokens that can fall apart with a single incorrect step. The model receives a “final grade” only at the end of the chain, with no idea where the reasoning failed. This makes LLMs unpredictable and unsuitable for environments that require guaranteed correctness.

Logical Intelligence uses EBMs because they operate on a different principle. An EBM does not think in words. It reasons in continuous mathematical states shaped by the structure of the problem. Instead of producing text token by token, the model updates its entire internal state at once. This allows it to correct course, explore alternatives, and converge on stable, verifiable answers. The system behaves closer to a trained mathematician than a predictive text engine.

EBMs are positioned to become the backbone of the systems where uncertainty is unacceptable. These include true self-driving vehicles, advanced aviation, automated manufacturing, power grids, defense systems, autonomous robotics, chip design, and national infrastructure. Any environment that depends on logic behaving the same way every time will require the type of deterministic reasoning that EBMs can provide.

“If you need certainty, you cannot rely on word prediction,” Bodnia said. “You need a system that works through the structure of a problem. EBMs give us the foundation for that.”

Why Aleph Matters

Aleph was created for one purpose. It is a tool that converts mathematical problems into formal statements and generates proofs that can be checked by a machine. This allows researchers to verify that an answer is mathematically correct. Even as an internal tool built on an LLM, Aleph’s ability to generate large volumes of verifiable proofs is a meaningful advancement. Most AI systems can describe mathematics. Very few can prove anything.

“Aleph gives us a new level of certainty in AI today,” Bodnia said. “It is the first signal of what is possible when you build systems around mathematical truth.”

Logical Intelligence is already working with a small group of organizations to test early applications of Aleph in controlled environments across key vertical industries. These pilots are designed to explore how mathematical verification can support real systems.

Logical Intelligence will release its general purpose model with formal machine verifiable reasoning in 2026. This system will go far beyond Aleph and demonstrate how mathematical reasoning can support complex, high-assurance environments at scale. The company will show how its approach can serve industries where perfect logic is the requirement.

“Aleph is our first milestone,” Bodnia said. “The full system is coming in 2026.”

For more information and to read the Aleph white paper, visit www.logicalintelligence.com/aleph-prover.html.

About Logical Intelligence

Logical Intelligence is an artificial intelligence research company building the first fully language-free, mathematically grounded Energy Based Models. These systems differ from LLMs and hybrid EBM approaches by reasoning directly in structured state space and generating proofs that can be checked for correctness. Logical Intelligence is designing its models to underpin critical infrastructure, advanced automation, and high-reliability computing. Its team includes researchers with advanced degrees in mathematics and computer science, ICPC and IMC medalists, contributors to major proof systems, a Fields Medalist, and a Turing Award laureate who guides the company’s long-term scientific direction. For more information, visit www.logicalintelligence.com or follow us on X at @logic_int and our founder & CEO at @EveLovesOlive.

View source version on businesswire.com: https://www.businesswire.com/news/home/20251202089385/en/

Contacts

media@logicalintelligence.com

Business Wire, a Berkshire Hathaway company, is the global leader in multiplatform press release distribution.

Subscribe to releases from Business Wire

Subscribe to all the latest releases from Business Wire by registering your e-mail address below. You can unsubscribe at any time.

Latest releases from Business Wire

The Estée Lauder Companies Fully Establishes Its “One ELC” Operating Model and Reaches Milestone in Its Profit Recovery and Growth Plan1.4.2026 23:00:00 CEST | Press Release

The Estée Lauder Companies Inc. (NYSE: EL) today announced WPP as its first-ever global media partner, marking a significant advancement of its One ELC operating model, a scalable system designed to operate faster, execute with greater discipline, and drive growth. In fully establishing One ELC, the Company also reached a significant milestone in its Profit Recovery and Growth Plan’s (PRGP) Restructuring Program — a key action plan priority of Beauty Reimagined. Stéphane de La Faverie, President and Chief Executive Officer, The Estée Lauder Companies, said, “With the appointment of WPP as our first-ever global media partner, our One ELC operating model is now fully established. This more unified and scalable system will enable us to be faster, more agile and efficient, and support unlocking additional growth. Together with our execution progress, we are confident that we are on a trajectory to deliver sustainable, profitable long-term growth.” de La Faverie added, “Building on our stro

Visual Bank Expands “Qlean Dataset” to Support Large-Scale Japanese Speech Foundation Models1.4.2026 21:45:00 CEST | Press Release

Visual Bank Inc. (CEO: Saneyuki Nagai), through its subsidiary amanaimages Inc., one of the largest digital asset providers for the marketing and advertising industry in Japan with over 40 years of history, today announced the expansion of its Qlean Dataset, a premium AI training data solution designed for developers building high-performance Japanese speech foundation models. This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20260401752248/en/ Visual Bank Group, leveraging over 40 years of expertise through amanaimages Inc., expands Qlean Dataset, delivering high quality, rights cleared Japanese language corpora, including 100,000+ hours of commercially usable audio. A new development within the Qlean Dataset division, which focuses on providing datasets for institutions engaged in research and development, with rights cleared for AI training and large-scale data applications, has positioned the company as a leading provider of

Manna Air Delivery Raises $50Million Series B as It Announces Plans to Expand in the United States1.4.2026 18:00:00 CEST | Press Release

Manna Air Delivery, a global leader in consumer drone delivery, has announced a $50 million funding round to scale its proven operations further in the United States and Europe. The round brings Manna’s total funding to $110million. Manna now operates one of the most active consumer drone delivery networks in the world, with more than 250,000 regulated commercial UAV flights completed. This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20260310714366/en/ Manna Air Delivery raises $50m Series B Investors in the round include ARK Invest, known for backing companies such as OpenAI, Anthropic, Tesla and SpaceX, the Ireland Strategic Investment Fund (ISIF) and Schooner Capital, alongside existing investors Coca-Cola HBC and Molten Ventures. As an unmanned aerial vehicle (UAV) delivery pioneer, Manna has operated in six locations across its native Ireland, as well as in Finland and Texas over the past seven years, delivering items inclu

Bureau Veritas Launches an Independent AI Assessment Offering for European Enterprises, Developed in Partnership with Amazon Web Services (AWS)1.4.2026 17:45:00 CEST | Press Release

Bureau Veritas, a global leader in Testing, Inspection, and Certification services (TIC), announces the launch of an AI systems audit to help European enterprises assess and demonstrate their compliance with the European Union's "AI Act" regulatory requirements. This offering combines on-site audits, document analysis, and direct testing to deliver an independent maturity report. Since the EU's AI regulation came into force in 2024, companies have faced major implementation challenges. According to a recent report*, 68% of them struggle to interpret the provisions of the text, while 60% have yet to put in place the governance needed to comply. Non-compliance can cost them up to 7% of annual revenue. Bureau Veritas has developed this new audit offering to help companies identify their compliance gaps and remedy them. Bureau Veritas's new audit offering comprises a pre-audit, document review, on-site audit, and direct testing, resulting in an independent report on the client's AI maturit

Greenland Resources Signs Eight Year Off-take Agreement With SSAB to Supply High Quality Molybdenum1.4.2026 16:29:00 CEST | Press Release

Greenland Resources Inc. (TSX:MOLY, FSE:M0LY) (“Greenland Resources” or the “Company”) is pleased to announce the Company has signed a binding off-take agreement with SSAB, a Nordic and US-based steel producer headquartered in Sweden. The company is a leading producer on the global market for advanced high-strength steels providing solutions to the defence, automotive, infrastructure and energy industries. A stock exchange press release from SSAB can be found on their website at www.ssab.com This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20260401270749/en/ The off-take agreement provides an established price floor and price ceiling and will allow SSAB to secure high quality low carbon emission ferromolybdenum extracted in Greenland and refined in Belgium. SSAB will be able to ensure a stable and responsibly sourced long term secured primary molybdenum supply with high sustainability standards and low scope 1&2 emissions from a

In our pressroom you can read all our latest releases, find our press contacts, images, documents and other relevant information about us.

Visit our pressroom