CRYPTOREPORTCLUB
  • Crypto news
  • AI
  • Technologies
Saturday, September 6, 2025

OpenAI and Anthropic conducted safety evaluations of each other’s AI systems

August 28, 2025

Most of the time, AI companies are locked in a race to the top, treating each other as rivals. Today, OpenAI and Anthropic revealed that they agreed to evaluate the alignment of each other's publicly available systems and shared the results of their analyses. The full reports get pretty technical, but are worth a read for anyone who's following the nuts and bolts of AI development. A broad summary showed some flaws with each company's offerings and offered pointers for improving future safety tests.

Anthropic said it evaluated OpenAI models for "sycophancy, whistleblowing, self-preservation, and supporting human misuse, as well as capabilities related to undermining AI safety evaluations and oversight." Its review found that o3 and o4-mini models from OpenAI fell in line with results for its own models, but raised concerns about possible misuse with the GPT-4o and GPT-4.1 general-purpose models. The company also said sycophancy was an issue to some degree with all tested models except for o3.

Anthropic's tests did not include OpenAI's most recent release. GPT-5 has a feature called Safe Completions, which is meant to protect users and the public against potentially dangerous queries. OpenAI recently faced its first wrongful death lawsuit after a tragic case where a teenager discussed attempts and plans for suicide with ChatGPT for months before taking his own life.

On the flip side, OpenAI ran tests on Anthropic models for instruction hierarchy, jailbreaking, hallucinations and scheming. The Claude models generally performed well in instruction hierarchy tests, and had a high refusal rate in hallucination tests, meaning they were less likely to offer answers in cases where uncertainty meant their responses could be wrong.

The move for these companies to conduct a joint assessment is intriguing, particularly since OpenAI allegedly violated Anthropic's terms of service by having programmers use Claude in the process of building new GPT models, which led to Anthropic barring OpenAI's access to its tools earlier this month. But safety with AI tools has become a bigger issue as more critics and legal experts seek guidelines to protect users, particularly minors.

This article originally appeared on Engadget at https://www.engadget.com/ai/openai-and-anthropic-conducted-safety-evaluations-of-each-others-ai-systems-223637433.html?src=rss

Advertising: digestmediaholding@gmail.com

Disclaimer: Information found on cryptoreportclub.com is those of writers quoted. It does not represent the opinions of cryptoreportclub.com on whether to sell, buy or hold any investments. You are advised to conduct your own research before making any investment decisions. Use provided information at your own risk.
cryptoreportclub.com covers fintech, blockchain and Bitcoin bringing you the latest crypto news and analyses on the future of money.

© 2023-2025 Cryptoreportclub. All Rights Reserved
