The content on this page was provided by an independent third party and syndicated by XPR Media. Members of the editorial and news staff of the USA TODAY Network were not involved in the creation of this content.

First Benchmark for Legacy Code Comprehension Shows Specialized AI Approach Outperforms General-PurposeModels

LegacyCodeBench tests whether AI can understand COBOL well enough to document itaccurately not just generate plausible text

NEW YORK, NY, UNITED STATES, January 13, 2026 /EINPresswire.com/ — A new benchmark designed to measure whether AI systems can actuallyunderstand legacy enterprise code shows that specialized approaches significantlyoutperform general-purpose models. LegacyCodeBench, developed by Kalmantic (anapplied AI research lab) in collaboration with Hexaview Technologies, evaluates AIcomprehension of COBOL the language still processing 95% of ATM transactions and $3trillion in daily global transactions.
The benchmark finds that domain-specialized systems like Hexaview’s Legacy Insightsachieve 92% accuracy, compared to 86-90% for general-purpose models like GPT-4o andClaude Sonnet 4.

-Why This Matters
Over 220 billion lines of COBOL remain in production worldwide, but the engineers whowrote it are retiring. Modernization projects fail at rates exceeding 60%, and the pattern isusually the same: organizations try to replace systems they never fully understood.

“The risk everyone focuses on is the legacy technology itself, but that’s not actually whereprojects fall apart,” said Ankit Agarwal, Founder and CTO of Hexaview. “What kills these programs is undocumented business logic. We needed an objective way to measurewhether AI can actually understand these systems well enough to trust the output.”


-How It Works
Most AI benchmarks use another LLM to judge output quality, which creates reproducibilityproblems. LegacyCodeBench takes a different approach: it verifies claims against theoriginal program’s behavior.The process extracts specific behavioral claims from AI-generated documentation -statements like “PREMIUM is calculated by multiplying BASE-RATE by RISK-FACTOR” – andthen verifies them by executing the original COBOL program with test inputs. If the claimdoesn’t match what the code actually does, it fails.”We’re not testing whether documentation reads well,” said Nikita, co-author of the paper.”We wanted to know if you could actually trust it. There’s a difference.”The benchmark also penalizes gaming. Documentation that avoids making testable claimsscores zero on the behavioral track, which carries 50% of the total weight. And if the AIhallucinates variables that don’t exist in the source code, the entire task fails

-Results


| System | LCB Score | Structural | Doc Quality | Behavioral | T1 Basic | T4 Enterprise |
| ————————— | ——— | ———- | ———– | ———- | ——– | ————- |
| Legacy Insights (Hexaview) | 92% | 94% | 96% | 90% | 96% | 90% |
| Claude Sonnet 4 (Anthropic) | 90% | 96% | 78% | 91% | 92% | 92% |
| AWS Transform Mainframe | 88% | 98% | 68% | 91% | 88% | 87% |
| IBM Granite 13B | 87% | 93% | 72% | 90% | 89% | 84% |
| GPT-4o (OpenAI) | 86% | 92% | 71% | 89% | 91% | 82% |


Specialized systems (Legacy Insights, AWS Transform) outperform general-purposemodels, particularly on documentation quality. All models maintain reasonably strongperformance from basic programs (T1) to enterprise-scale COBOL (T4), though GPT-4oshows the largest drop (9 points).

“General-purpose models have gotten quite good at parsing legacy code, which is realprogress,” Agarwal said. “But there’s still a gap between understanding the syntax andunderstanding what the code is actually doing in a business context. That’s wherespecialization matters.”

-Open Source
LegacyCodeBench is fully open source with deterministic evaluation. The publicleaderboard is at legacycodebench.com, and the team welcomes submissions via GitHub

-Resources
• Website: legacycodebench.com
• Paper: Available at legacycodebench.com
• GitHub: github.com/kalmantic/legacycodebench
• Legacy Insights: legacyip.hexaview.ai


-About Hexaview
Hexaview is a strategic implementation partner for regulated enterprises, specializing inlegacy system preservation and modernization. Learn more: hexaviewtech.com

-About Kalmantic Labs Kalmantic is an applied AI research lab studying the challenges that emerge when AI meetsproduction systems. They publish research openly and build tools based on their findings.Learn more: kalmantic.com

LegacyCodeBench is open source under MIT license.

Ankit Agarwal
Hexaview Technologies
+1 845-653-3855
email us here

Legal Disclaimer:

EIN Presswire provides this news content “as is” without warranty of any kind. We do not accept any responsibility or liability
for the accuracy, content, images, videos, licenses, completeness, legality, or reliability of the information contained in this
article. If you have any complaints or copyright issues related to this article, kindly contact the author above.

Information contained on this page is provided by an independent third-party content provider. XPRMedia and this Site make no warranties or representations in connection therewith. If you are affiliated with this page and would like it removed please contact pressreleases@xpr.media

Keeper Security Strengthens Atlassian Williams F1 Team’s Cyber Defences With KeeperPAM

Keeper Security Strengthens Atlassian Williams F1 Team’s Cyber Defences With KeeperPAM

LONDON, UNITED KINGDOM, January 13, 2026 /EINPresswire.com/ — Keeper’s unified, cloud-native PAM platform enables

January 18, 2026

New DA+ Platform Gives Superintendents 24/7 Access to Peer Network and Crisis-Ready Resources

New DA+ Platform Gives Superintendents 24/7 Access to Peer Network and Crisis-Ready Resources

DA+, a next-generation leadership platform, launches today, connecting superintendents and senior district leaders to a

January 18, 2026

Amatrium Announces GPT-5.1 Upgrade and AI Agents for AmatriumGPT

Amatrium Announces GPT-5.1 Upgrade and AI Agents for AmatriumGPT

Amatrium begins 2026 with a GPT-5.1 upgrade and multi-agent intelligence, delivering faster, more reliable results

January 18, 2026

DeVivo Companies Promotes Kevin DeVivo to Executive Vice President

DeVivo Companies Promotes Kevin DeVivo to Executive Vice President

DeVivo Companies promotes Kevin DeVivo to Executive Vice President, announcing strategic operational and commercial

January 18, 2026

Hush Express Freely Launches Groundbreaking Followers and Profile Visibility Features, Redefining Anonymous Social Media

Hush Express Freely Launches Groundbreaking Followers and Profile Visibility Features, Redefining Anonymous Social Media

Hush Express Freely, World’s Largest Anonymous Social App (2.5M users), launches Followers feature to drive lasting

January 18, 2026

Qualified Records Ends 2025 with Strong Roots Music Report Charting, Artist Recognition, and Sync Library Expansion

Qualified Records Ends 2025 with Strong Roots Music Report Charting, Artist Recognition, and Sync Library Expansion

2025 was a year of steady, artist-driven momentum marked by national #1 position radio charting, thoughtful critical

January 18, 2026

CraftyCrafty.tv Launches an Expansive Tutorials Hub to Inspire Creators Worldwide

CraftyCrafty.tv Launches an Expansive Tutorials Hub to Inspire Creators Worldwide

CraftyCrafty.tv today announced the continued expansion of its Tutorials section. SC, UNITED STATES, January 13, 2026

January 18, 2026

AIIR Consulting Names Dr. Joy Nissen as Managing Partner, Expanding Leadership Across EMEA and Asia

AIIR Consulting Names Dr. Joy Nissen as Managing Partner, Expanding Leadership Across EMEA and Asia

In her new role, Joy will lead the firm’s expanding presence across Europe, the Middle East, and Africa (EMEA) as well

January 18, 2026

Pathway Productions Announces ‘The Not So Naughty List’ Has Been Greenlit for Publication

Pathway Productions Announces ‘The Not So Naughty List’ Has Been Greenlit for Publication

Not just a children’s book—a timeless reminder that forgiveness belongs under every tree. A story that reminds us: the

January 18, 2026

Socrait Wins FETC Pitchfest 2.0, Capturing Hearts of Educators and Judges Alike

Socrait Wins FETC Pitchfest 2.0, Capturing Hearts of Educators and Judges Alike

Innovative edtech startup takes top honors in gamified competition featuring thousands of educator votes Winning FETC

January 18, 2026

BENX Launches Independent Online Media Highlighting the Realities of Freelancing and Digital Nomad Life in Indonesia

BENX Launches Independent Online Media Highlighting the Realities of Freelancing and Digital Nomad Life in Indonesia

BENX announces the launch of an independent online media platform documenting freelancing and digital nomad realities

January 18, 2026

Dr. Prabhat Sinha of Toms River, New Jersey Named NJ Top Doc For Ninth Consecutive Year

Dr. Prabhat Sinha of Toms River, New Jersey Named NJ Top Doc For Ninth Consecutive Year

NJ Top Docs has reviewed and approved Prabhat Sinha, MD of Ocean Family & Geriatric Associates, LLC in Toms River

January 18, 2026

MSSP Alert Unveils 2025 List of Top MSSPs

MSSP Alert Unveils 2025 List of Top MSSPs

Ninth annual list reveals leading MSSP, MDR and MSP security companies NEW YORK CITY, NY, UNITED STATES, January 13,

January 18, 2026

SXTC-DYADICA Global Brand Consulting Declare the New Battlefield of Branding: The Creation of their Brand Warfare Unit™

SXTC-DYADICA Global Brand Consulting Declare the New Battlefield of Branding: The Creation of their Brand Warfare Unit™

SXTC Global & DYADICA Global Declare New Battlefield of Branding: The Creation of their Brand Warfare Unit™. Brand

January 18, 2026

Firefighting’s Finest Moving & Storage Releases 2026 Cost-Benefit Analysis: The Hidden Inflationary Risks of DIY Moves

Firefighting’s Finest Moving & Storage Releases 2026 Cost-Benefit Analysis: The Hidden Inflationary Risks of DIY Moves

FORT WORTH, TX, UNITED STATES, January 13, 2026 /EINPresswire.com/ — As inflation continues to reshape the economic

January 18, 2026

Miraki Jewels Reveals the Meaning Behind Its Name and Mission: ‘Elegance with Purpose’

Miraki Jewels Reveals the Meaning Behind Its Name and Mission: ‘Elegance with Purpose’

Derived from 3 words Miraculous, Rare, and Kindred, the name Miraki reflects the brand’s belief that jewelry embodies

January 18, 2026

3 Simple Steps to Get Rid of Holiday Weight Gain and Stubborn Pounds

3 Simple Steps to Get Rid of Holiday Weight Gain and Stubborn Pounds

Zarina Del Mar’s 3D Movement System provides an effective, low-impact way to drop holiday weight gain and improve

January 18, 2026

K.A. Griffin Pushes Past Genre with The Accidental World, a Story-Driven Series Focused on Character and Choice

K.A. Griffin Pushes Past Genre with The Accidental World, a Story-Driven Series Focused on Character and Choice

K.A. Griffin’s The Accidental World series blends emotion, philosophy, and tension into a genre-fluid series where

January 18, 2026

Too Lost Acquires MAYK’s ‘Social Meme Music’ Catalog—Cementing Role in UGC Music Era

Too Lost Acquires MAYK’s ‘Social Meme Music’ Catalog—Cementing Role in UGC Music Era

Too Lost has acquired the full catalog of MAYK, the UGC music creation platform behind a wave of viral ‘social meme

January 18, 2026

Mike Chavez is Helping Redefine the Modern Painting Industry

Mike Chavez is Helping Redefine the Modern Painting Industry

Mike Chavez Painting highlights its approach, emphasizing fire-resistant coatings and sustainable materials. In our

January 18, 2026

Dallas Family Discovers $500,000 Coin Collection Hidden in Late Father’s Home

Dallas Family Discovers $500,000 Coin Collection Hidden in Late Father’s Home

Dallas family nearly donates $500K coin collection to Goodwill before discovering late father's 50-year treasure trove.

January 18, 2026

New Cognitive Fitness iPhone App Launches to Strengthen Critical Thinking and Reduce Political Partisanship

New Cognitive Fitness iPhone App Launches to Strengthen Critical Thinking and Reduce Political Partisanship

At a certain point, worrying about the political environment without doing anything about it just becomes exhausting”—

January 18, 2026

Marine Ingredient Enriched Nutraceutical Packaging Market to Reach $9.12B by 2033 – Strategic Revenue Insights (SRI)

Marine Ingredient Enriched Nutraceutical Packaging Market to Reach $9.12B by 2033 – Strategic Revenue Insights (SRI)

Market valued at $4.27B in 2024, projected 8.80% CAGR growth driven by fish oil, algae integration, ocean

January 18, 2026

Cantabile Youth Singers of Silicon Valley Announces New Executive Leadership

Cantabile Youth Singers of Silicon Valley Announces New Executive Leadership

Cantabile Youth Singers of Silicon Valley has appointed VanNessa Hulme Silbermann as Cantabile’s next Executive

January 18, 2026

RevlTek Announces New Product Launch with Collegiate

RevlTek Announces New Product Launch with Collegiate

Providing Credit Union Members with Low-Cost Education Finance Options GALVESTON, TX, UNITED STATES, January 13, 2026

January 18, 2026

Horse Stall Odor Eliminator Technologies for Modern Barn Environments

Horse Stall Odor Eliminator Technologies for Modern Barn Environments

PHILADELPHIA, PA, UNITED STATES, January 13, 2026 /EINPresswire.com/ — Aquelyst develops environmental and molecular

January 18, 2026

Genetic LifeSpan Named Global 100 Award Recipient for Best Health, Wellness, and Fitness Business

Genetic LifeSpan Named Global 100 Award Recipient for Best Health, Wellness, and Fitness Business

Genetic LifeSpan has received a Global 100 Award in the category of Best Health, Wellness, and Fitness Business. This

January 18, 2026

Curiex Announces Formal Launch as a Next-Generation Site Management Organization for Clinical Trials

Curiex Announces Formal Launch as a Next-Generation Site Management Organization for Clinical Trials

Site-led, performance-driven SMO unites top independent research sites within a harmonized, enterprise-grade

January 18, 2026

Applied Energetics to Present at the 28th Annual Needham Growth Conference

Applied Energetics to Present at the 28th Annual Needham Growth Conference

Applied Energetics, Inc. (OTCQB:AERG)TUCSON, AZ, UNITED STATES, January 13, 2026 /EINPresswire.com/ — Applied

January 18, 2026

Military Spouse Jobs & VetJobs Grow Employer and Job Seeker Impact Through Virtual Career Fairs

Military Spouse Jobs & VetJobs Grow Employer and Job Seeker Impact Through Virtual Career Fairs

Building on strong engagement, upcoming virtual career fairs are set for January 28 and March 4. FT. MYERS, FL, UNITED

January 18, 2026

4 Seasons Mobile Detailing Shares Their Prices For Services for Drivers in Royersford, Phoenixville, & Surrounding Areas

4 Seasons Mobile Detailing Shares Their Prices For Services for Drivers in Royersford, Phoenixville, & Surrounding Areas

HARLEYSVILLE, PA, UNITED STATES, January 13, 2026 /EINPresswire.com/ — Local business Four Seasons Mobile Detailing

January 18, 2026

GA-ASI and USN Test Expanded Sonobuoy Dispensing System For MQ-9B SeaGuardian(R)

GA-ASI and USN Test Expanded Sonobuoy Dispensing System For MQ-9B SeaGuardian(R)

SAN DIEGO, CALIFORNIA / ACCESS Newswire / January 13, 2026 / General Atomics Aeronautical Systems, Inc. (GA-ASI) and

January 18, 2026

Language Scientific Outlines What “Quality” Means in Medical Device Labeling Translation

Language Scientific Outlines What “Quality” Means in Medical Device Labeling Translation

January 13, 2026 – PRESSADVANTAGE – In the highly regulated field of medical device labeling translation, ensuring that

January 18, 2026

Language Scientific Examines Common Risks in Pharmaceutical Marketing Translation

Language Scientific Examines Common Risks in Pharmaceutical Marketing Translation

January 13, 2026 – PRESSADVANTAGE – n the fast-paced world of pharmaceutical marketing, translating content for global

January 18, 2026

Cloudonix and Dograh Partner to Bring Open-Source Agentic Voice to the Enterprise

Cloudonix and Dograh Partner to Bring Open-Source Agentic Voice to the Enterprise

This partnership unifies open-source voice agents with carrier-grade telephony orchestration enabling “Humans and AI

January 18, 2026

New York Drivers Face Rising Risk as Traffic Enforcement Tightens and Point Penalties Increase

New York Drivers Face Rising Risk as Traffic Enforcement Tightens and Point Penalties Increase

Stricter traffic enforcement and point rules in NYC and New York State are increasing fines, suspensions, and job risks

January 18, 2026

Demetra Dimokopoulos Partners with SuccessBooks® to Co-Author “Relentless” with Lisa Nichols

Demetra Dimokopoulos Partners with SuccessBooks® to Co-Author “Relentless” with Lisa Nichols

TORONTO, ONTARIO, CANADA, January 13, 2026 /EINPresswire.com/ — SuccessBooks® is proud to announce an exciting new

January 18, 2026

Rubenstein Public Relations Named Agency of Record for Global Book Tour of The Jewish Experience by Rabbi Mark Wildes

Rubenstein Public Relations Named Agency of Record for Global Book Tour of The Jewish Experience by Rabbi Mark Wildes

Leading NYC PR Agency to Drive Strategic Campaign to Build Awareness Among Millennials of Book Examining Core Beliefs

January 18, 2026

Atlanta Braves’ Hall of Famer Chipper Jones Named a Keynote Speaker At MSM’s Self-Storage Event, THE Show

Atlanta Braves’ Hall of Famer Chipper Jones Named a Keynote Speaker At MSM’s Self-Storage Event, THE Show

Legendary third baseman Chipper Jones will translate his success on the field into winning strategies for self-storage

January 18, 2026

Uber-for-Content Platform Beige AI Proves Real-Time Marketplace at CES 2026

Uber-for-Content Platform Beige AI Proves Real-Time Marketplace at CES 2026

Los Angeles creative technology company with 4,000+ clients served, addresses critical content challenges for

January 18, 2026