The content on this page was provided by an independent third party and syndicated by XPR Media. Members of the editorial and news staff of the USA TODAY Network were not involved in the creation of this content.

AI Built for Law Outperforms ChatGPT, Claude, and Gemini on Legal Reasoning Benchmark

DescrybeLM answered all 200 bar exam questions correctly. ChatGPT, Claude, and Gemini each missed between 13 and 23—and scored lower on legal reasoning quality.

We had a thesis that purpose-built legal AI produces meaningfully different results. Legal professionals deserve evidence. So we tested ourselves and published our methodology for anyone to replicate.”
— Kara Peterson, Co-Founder and CEO of Descrybe

BOSTON, MA, UNITED STATES, March 5, 2026 /EINPresswire.com/ — When AI gets a legal question wrong, the most dangerous failure isn’t an obvious error. It’s an answer that sounds authoritative: fluent, confident, well-structured, and yet applying the wrong legal standard. The error reads like competent lawyering.

Today, Descrybe launched DescrybeLM — an AI system built specifically for legal reasoning — and published a white paper with benchmark data to show what that difference looks like in practice.

Descrybe ran a controlled benchmark against ChatGPT 5.2, Claude Opus 4.5, and Gemini 3 Pro on 200 multistate bar exam questions. The study measured not just whether each system chose the correct answer, but whether the legal reasoning behind it was sound: Did it identify the right rule? Apply it correctly to the facts? Avoid the traps that produce persuasive but wrong analysis?

“We had a thesis that purpose-built legal AI produces meaningfully different results for legal reasoning tasks. Legal professionals deserve to make tool decisions based on real evidence. So we tested ourselves, published our methodology, and invite anyone to replicate it,” said Kara Peterson, Co-Founder and CEO of Descrybe.

What the benchmark showed

All four systems were tested under standardized, no-external-web conditions using the NCBE MBE Complete Practice Exam (Questions 1–200, no exclusions), producing 800 separate evaluation runs with blinded scoring.

When general-purpose models were wrong, they were confidently wrong. Among 52 incorrect outputs, 49 delivered assertive, well-structured reasoning that did not signal uncertainty — the failure mode that imposes the highest verification burden on practitioners. The dominant patterns were applying the wrong legal standard or misapplying the correct one, while the prose read like competent analysis.

Two models — Claude Opus 4.5 and Gemini 3 Pro — exhibited overconfident tone on correct outputs as well as incorrect ones. DescrybeLM and ChatGPT 5.2 received zero overconfidence flags across all 200 outputs. A system that sounds equally confident whether it is right or wrong gives practitioners no reliable signal from tone alone.

The study also found that cross-checking between general-purpose models is not a reliable substitute for getting the answer right. Across 200 questions, 40 were missed by at least one model, 11 by two or more, and only 1 by all three — meaning errors were largely unpredictable and non-overlapping.

What’s behind the results

DescrybeLM is built on a curated primary-law corpus of more than 100 million structured records, requiring more than 100 billion tokens of preparation.
“Most AI tools are built for general use and adapted for law. DescrybeLM was built differently: from the foundation up, specifically for legal reasoning, on more than 100 million structured records individually cleaned and organized for that purpose. That kind of data work is painstaking and takes years — but it’s the difference between a system that sounds right and one that is right,” said Richard DiBona, Co-Founder and CTO of Descrybe.

Why this matters

The headline problem in legal AI isn’t systems that obviously fail. It’s systems that fail invisibly, confidently, and in a way that reads like competent analysis. In a crowded market, sounding right is easy to mistake for being right. Legal professionals need real evidence to decide which tools to use for which purposes — which is why Descrybe published its methodology and invites independent replication.

“It’s rare to see something that genuinely stops you in your tracks. When I saw DescrybeLM answer all 200 multistate bar exam questions correctly while ChatGPT, Claude, and Gemini each missed double digits — that’s not a marginal difference. That’s a different category of tool,” said Ken Friedman, legal technology pioneer and advisor to Descrybe.

The full white paper, Beyond Confidently Wrong: How Purpose-Built AI Mitigates Legal Reasoning’s Hidden Risk, is available now.

Kara Peterson
Descrybe
+1 617-752-2020
email us here
Visit us on social media:
LinkedIn
YouTube

Descrybe demo

Legal Disclaimer:

EIN Presswire provides this news content “as is” without warranty of any kind. We do not accept any responsibility or liability
for the accuracy, content, images, videos, licenses, completeness, legality, or reliability of the information contained in this
article. If you have any complaints or copyright issues related to this article, kindly contact the author above.

Information contained on this page is provided by an independent third-party content provider. XPRMedia and this Site make no warranties or representations in connection therewith. If you are affiliated with this page and would like it removed please contact pressreleases@xpr.media

Synaptic Inc: Company Overview, Career Opportunities, and Community Impact

Synaptic Inc: Company Overview, Career Opportunities, and Community Impact

Explore Synaptic Inc’s company overview, careers, and community presence. We provide transparent details for job

March 6, 2026

Fullestop Recognized Among Top 12 AI Agencies Globally by DesignRush; Announces Launch of ‘AI Labs’ for SMEs in the USA

Fullestop Recognized Among Top 12 AI Agencies Globally by DesignRush; Announces Launch of ‘AI Labs’ for SMEs in the USA

Businesses have to change, and change is dramatic. AI adoption is critical for SMEs success and consultative approach

March 6, 2026

Marco Furia Media Commence Informative Features On Organising Winter Weddings & Events In Melbourne

Marco Furia Media Commence Informative Features On Organising Winter Weddings & Events In Melbourne

Lifestyle news web portal Marco Furia Media commence publishing a series of features on tips for organising Melbourne

March 6, 2026

Viminal Media Expands Video Production and Photography Services Across Santa Barbara County

Viminal Media Expands Video Production and Photography Services Across Santa Barbara County

Santa Barbara-based Viminal Media now serves all of Santa Barbara County with corporate video, drone footage,

March 6, 2026

Branded Hospitality Media Presents Women’s Leadership Panel on International Women’s Day at New York Restaurant Show

Branded Hospitality Media Presents Women’s Leadership Panel on International Women’s Day at New York Restaurant Show

Panel brings together generations of hospitality leaders to discuss evolving leadership, industry challenges, & the

March 6, 2026

Motherhood and Entrepreneurship: Why Community Is the Missing Strategy for Sustainable Success

Motherhood and Entrepreneurship: Why Community Is the Missing Strategy for Sustainable Success

CHICAGO, IL, UNITED STATES, March 5, 2026 /EINPresswire.com/ — In recent years, motherhood and entrepreneurship have

March 6, 2026

Chinese Vocal Educator Lan Tianyang Presents Lecture on Opera Vocal Techniques at Yale University

Chinese Vocal Educator Lan Tianyang Presents Lecture on Opera Vocal Techniques at Yale University

The lecture explores how traditional Chinese opera vocal methods can influence modern popular singing training.

March 6, 2026

Swiss Diamond® Leads the Charge in PTFE-Free Ceramic Cookware for Healthier, Sustainable Cooking

Swiss Diamond® Leads the Charge in PTFE-Free Ceramic Cookware for Healthier, Sustainable Cooking

NEW YORK, NY, UNITED STATES, March 6, 2026 /EINPresswire.com/ — As consumer awareness around kitchen safety and

March 6, 2026

ProcellaRX Launches Ascend: Decision Quality. Trusted Partners. One Ecosystem

ProcellaRX Launches Ascend: Decision Quality. Trusted Partners. One Ecosystem

A curated partner ecosystem uniting methodology, technology, and managed services to move life sciences from compliance

March 6, 2026

As ‘Agent-First’ AI Falters, Vera Introduces a Closed-Loop Model for Enterprise Execution

As ‘Agent-First’ AI Falters, Vera Introduces a Closed-Loop Model for Enterprise Execution

The real differentiator isn’t whether you have agents. It’s whether those agents are anchored to validated need,

March 6, 2026

Cribs for Kids Invites Professionals Nationwide to Join the 9th National Conference on Infant Safe Sleep

Cribs for Kids Invites Professionals Nationwide to Join the 9th National Conference on Infant Safe Sleep

PITTSBURGH, PA, UNITED STATES, March 6, 2026 /EINPresswire.com/ — For more than 25 years, Cribs for Kids has worked

March 6, 2026

Factory Direct Tiny Homes Highlights Regulatory Considerations for Backyard Short-Term Rentals in Event-Driven Cities

Factory Direct Tiny Homes Highlights Regulatory Considerations for Backyard Short-Term Rentals in Event-Driven Cities

New Orleans among major U.S. markets where homeowners are evaluating accessory dwelling units for flexible lodging use

March 6, 2026

New Orleans Jazz & Heritage Festival to Feature Nearly 60 Local Food Vendors at Fair Grounds

New Orleans Jazz & Heritage Festival to Feature Nearly 60 Local Food Vendors at Fair Grounds

Annual festival highlights authentic Creole and Cajun cuisine alongside live music programming Food is an essential

March 6, 2026

Enterprise-Grade Volunteering Takes Center Stage at Goodera’s Global Volunteering Summit 2026

Enterprise-Grade Volunteering Takes Center Stage at Goodera’s Global Volunteering Summit 2026

SAN FRANCISCO, CA, UNITED STATES, March 6, 2026 /EINPresswire.com/ — The third edition of the Global Volunteering

March 6, 2026

Downey Dental Arts Highlights the Importance of Prompt Emergency Dental Care for Families in Downey

Downey Dental Arts Highlights the Importance of Prompt Emergency Dental Care for Families in Downey

DOWNEY, CA, UNITED STATES, March 6, 2026 /EINPresswire.com/ — Dental emergencies can occur unexpectedly, leaving

March 6, 2026

Sustainability First: YTBIO Sets Standards as the Global Leading Plant Extracts Manufacturer

Sustainability First: YTBIO Sets Standards as the Global Leading Plant Extracts Manufacturer

XIAN, SHANXI, CHINA, March 6, 2026 /EINPresswire.com/ — The modern global health and wellness landscape is undergoing

March 6, 2026

cGMP Certified: The Standard of Excellence Maintained by a Key Chinese Dietary Supplement Supplier

cGMP Certified: The Standard of Excellence Maintained by a Key Chinese Dietary Supplement Supplier

XIAN, SHANXI, CHINA, March 6, 2026 /EINPresswire.com/ — In today's global marketplace, the trust consumers place in

March 6, 2026

Dentist of Long Beach Emphasizes the Importance of Timely Emergency Dental Care for Long Beach Residents

Dentist of Long Beach Emphasizes the Importance of Timely Emergency Dental Care for Long Beach Residents

LONG BEACH, CA, UNITED STATES, March 6, 2026 /EINPresswire.com/ — Dental emergencies can happen when people least

March 6, 2026

How Civil Engineer Aditya Nagtilak Prevented Structural NCRs Using a Disciplined RFI Process

How Civil Engineer Aditya Nagtilak Prevented Structural NCRs Using a Disciplined RFI Process

A field-tested approach to resolving design ambiguities before structural pours — based on real site coordination

March 6, 2026

Dentist of Anaheim Highlights the Role of Emergency Dentistry in Protecting Oral Health for Anaheim Families

Dentist of Anaheim Highlights the Role of Emergency Dentistry in Protecting Oral Health for Anaheim Families

ANAHEIM, CA, UNITED STATES, March 6, 2026 /EINPresswire.com/ — Dental emergencies can occur suddenly and often bring

March 6, 2026

Goodera Redefines Corporate Volunteering With AI-Powered Technology Infrastructure Built to Scale Global Impact

Goodera Redefines Corporate Volunteering With AI-Powered Technology Infrastructure Built to Scale Global Impact

Our goal is to remove the operational friction that limits the scale of enabling intent to impact worldwide.”— Abhishek

March 6, 2026

China Best Precision Drill Bits Manufacturer Outlines Future Strategy for HSS Tooling Production

China Best Precision Drill Bits Manufacturer Outlines Future Strategy for HSS Tooling Production

DANYANG, JIANGSU, CHINA, March 6, 2026 /EINPresswire.com/ — The dawn of Industry 4.0 has fundamentally redefined the

March 6, 2026

Tiberius Aerospace Welcomes Professor Dame Fiona Murray to Advisory Board

Tiberius Aerospace Welcomes Professor Dame Fiona Murray to Advisory Board

LONDON, UNITED KINGDOM, March 6, 2026 /EINPresswire.com/ — Tiberius Aerospace, a modern defence technology company

March 6, 2026

China Top Metal Cobalt Drill Bit Exporter Examines the Role of Cobalt in Stainless Steel Machining

China Top Metal Cobalt Drill Bit Exporter Examines the Role of Cobalt in Stainless Steel Machining

DANYANG, JIANGSU, CHINA, March 6, 2026 /EINPresswire.com/ — The industrial processing of stainless steel presents a

March 6, 2026

Chinese Top 3 Cooling Plate Manufacturers in 2026: Leading the Way in Future Thermal Management Solutions

Chinese Top 3 Cooling Plate Manufacturers in 2026: Leading the Way in Future Thermal Management Solutions

Driven by rapid growth in EVs, energy storage and data centers, China’s leading cooling plate manufacturers are

March 6, 2026

100 Women in Finance Marks 25th Anniversary with Record Breaking London Gala 2026

100 Women in Finance Marks 25th Anniversary with Record Breaking London Gala 2026

The 25th Anniversary celebration convened global leaders across finance to invest in the next generation of women in

March 6, 2026

Pet Poisoning Prevention Month Tips from Modern Pest Services

Pet Poisoning Prevention Month Tips from Modern Pest Services

March is Pet Poisoning Prevention Month, and Modern Pest Services has developed easy tips to prevent pet poisonings and

March 6, 2026

Firefighters Move U Unveils Enhanced Website to Improve User Experience and Streamline Service Access

Firefighters Move U Unveils Enhanced Website to Improve User Experience and Streamline Service Access

Firefighters Move U unveils a new website with improved navigation, mobile optimization, and enhanced tools to

March 6, 2026

Kande VendTech Launches New Website to Bring AI-Powered Smart Vending Solutions to Las Vegas Businesses

Kande VendTech Launches New Website to Bring AI-Powered Smart Vending Solutions to Las Vegas Businesses

LAS VEGAS, NV, UNITED STATES, March 6, 2026 /EINPresswire.com/ — Kande VendTech, a Las Vegas-based smart vending

March 6, 2026

Founder Builds 11-Member AI Team Running 24/7 for Under $300 a Month

Founder Builds 11-Member AI Team Running 24/7 for Under $300 a Month

Palyan AI's open-source Nervous System framework manages autonomous AI agents handling outreach, coaching, legal, real

March 6, 2026

COAST: The China Smart Glass Film Manufacturer Bridging Aesthetics and Energy Efficiency

COAST: The China Smart Glass Film Manufacturer Bridging Aesthetics and Energy Efficiency

SHENZHEN, GUANGDONG, CHINA, March 6, 2026 /EINPresswire.com/ — Modern architecture faces a persistent paradox.

March 6, 2026

A Professional Guide to Choosing a High Quality Rare Earth Heat Blocking Window Film Manufacturer

A Professional Guide to Choosing a High Quality Rare Earth Heat Blocking Window Film Manufacturer

SHENZHEN, GUANGDONG, CHINA, March 6, 2026 /EINPresswire.com/ — The global push for sustainable architecture has

March 6, 2026

Guide to Choosing OEM / ODM Non-Woven Fabric Disposable Wet Wipes Supplier for Businesses

Guide to Choosing OEM / ODM Non-Woven Fabric Disposable Wet Wipes Supplier for Businesses

LECHANG, GUANGDONG, CHINA, March 6, 2026 /EINPresswire.com/ — In the modern consumer landscape, a wet wipe represents

March 6, 2026

Bowinscare: A Leading Wholesale Disposable Makeup Remover Pads Manufacturer with 20 Years of Expertise

Bowinscare: A Leading Wholesale Disposable Makeup Remover Pads Manufacturer with 20 Years of Expertise

LECHANG, GUANGDONG, CHINA, March 6, 2026 /EINPresswire.com/ — The global beauty and personal care industry continues

March 6, 2026

Empowering Green Buildings: China Smart Glass Film Manufacturer Providing Sustainable Privacy Solutions

Empowering Green Buildings: China Smart Glass Film Manufacturer Providing Sustainable Privacy Solutions

SHENZHEN, GUANGDONG, CHINA, March 6, 2026 /EINPresswire.com/ — The evolution of the modern skyline is no longer just a

March 6, 2026

Palm Holdings Ltd Announces Enhanced ESG Framework for Precious Metals Refining

Palm Holdings Ltd Announces Enhanced ESG Framework for Precious Metals Refining

The announcement follows the company’s recent structuring of a USD 300 million private placement designed for

March 6, 2026

Enlighten Designs Earns Migrate Enterprise Apps to Microsoft Azure Specialisation

Enlighten Designs Earns Migrate Enterprise Apps to Microsoft Azure Specialisation

MS Specialisation strengthens Enlighten’s Azure credentials and expands client access to Microsoft-backed funding and

March 6, 2026

Revive Design and Renovation Named #1 Remodeler in Tampa Magazine’s “Best of the City” Awards for Fifth Consecutive Year

Revive Design and Renovation Named #1 Remodeler in Tampa Magazine’s “Best of the City” Awards for Fifth Consecutive Year

The firm’s recognition underscores its leadership and commitment to delivering extraordinary home transformations.

March 6, 2026

Matta Highlights Digital Infrastructure as Key to Accelerating Africa’s Trade Integration

Matta Highlights Digital Infrastructure as Key to Accelerating Africa’s Trade Integration

For too long, African trade has operated with limited visibility across supply,Markets cannot trade what they cannot

March 6, 2026

LOXL2 Enzyme Discovery Offers New Hope for Jaw Arthritis

LOXL2 Enzyme Discovery Offers New Hope for Jaw Arthritis

Researchers uncover the protective role of LOXL2 protein in preventing cartilage damage CHENGDU, SICHUAN, CHINA, March

March 6, 2026