Cryptocnews-Crypto News, Cryptocurrency News, Blockchain News, NFT News

    Anthropic Apologizes for Claude Fable 5 Secret Censorship—But the Fix Has a Catch

    By admin | 06/12/2026 | 4 Mins Read



    In brief

    • Anthropic admitted its invisible LLM-development safeguards were “the wrong tradeoff” and will replace them with visible fallbacks to Claude Opus 4.8, starting this week.
    • Flagged requests on the API will now return a reason for their refusal, rather than silently delivering a degraded answer.
    • Making the safeguards visible means they’ll be easier to work around.

    Anthropic spent about 48 hours as the AI industry’s villain of the week before blinking.

    The company launched Claude Fable 5 this week to immediate backlash over a safeguard buried in its 319-page system card: The model, the first of the company’s new Mythos class, would secretly degrade its own responses for users it suspected were building competing AI models—no warning, no fallback message, just quietly worse output. By Thursday, Anthropic was apologizing.

    We’re rolling out changes to make Fable 5’s safeguards for frontier LLM development visible.

    Starting this week, flagged requests will visibly fall back to Opus 4.8—the same as our safeguards for cyber and bio. You will see this every time it happens. On the API, any flagged…

    — ClaudeDevs (@ClaudeDevs) June 11, 2026

    “Invisible safeguards can be targeted more narrowly, allowing us to ship quickly with very few false positives. We went with invisible safeguards for this reason—and that was the wrong tradeoff,” the company posted on X. “You should have visibility into the safeguards we have in place, and why.”

    “We’re sorry for not getting the balance right.”

    Starting this week, flagged requests will visibly route to Claude Opus 4.8, a less capable model, instead of silently delivering degraded Fable output. API users will receive a stated reason when a request gets refused. Anthropic says server-side fallback notifications will roll out in the next few days.

    What was actually happening

    For non-technical readers, here’s what the controversy was actually about. Claude Fable 5 already had visible safeguards for cybersecurity and biology research—if you asked something that tripped those filters, you’d get a notification that your request was being rerouted to the older Opus 4.8 model. You knew something had changed. You could adjust your prompt or use a different tool.

    Even these visible safeguards were too extreme, some bio researchers noted.

    The LLM-development safeguard, however, worked differently. If Fable 5 detected you were working on things like pretraining AI systems, building distributed training infrastructure, or designing machine learning chips, the model would silently alter its own behavior—through prompt modification, steering vectors, or parameter tweaks—to give you a worse answer without telling you. You’d get a response. It just wouldn’t be from the Fable 5 you paid for.

    Fable 5 is billed as the public face of Anthropic’s most capable Mythos-class model, and researchers using it for legitimate machine learning work had no way to know their results were contaminated. A failed experiment looks the same whether your hypothesis is wrong or the model was quietly told to underperform. That’s the reproducibility problem that sent the AI research community into full meltdown mode.

    The problem was that the classifier wasn’t that precise. AI research firm SemiAnalysis was among the first to publicly call Anthropic out after seeing its own GPU inference research get flagged.

    BREAKING NEWS: Anthropic’s latest model will NOT help you if it thinks your ML research/ML engineering is interesting, and/or will secretly degrade its IQ so that the average engineer won’t notice. We are already seeing Anthropic’s latest model’s moderation filters our GPU… pic.twitter.com/9sa95cCSvS

    — SemiAnalysis (@SemiAnalysis_) June 9, 2026

    The catch in the fix

    Anthropic’s reversal comes with a direct admission of the tradeoff it’s accepting. Making safeguards visible makes them easier to bypass, which means the classifier has to cast a wider net to remain effective.

    More false positives—legitimate machine-learning work that gets caught and rerouted—are coming while the company tunes its systems. Anthropic said it’s working to reduce false positives “as fast as possible” but offered no timeline.
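    To see why a wider net means more false positives, consider a toy sketch. Everything in it is hypothetical: the scores, labels, and thresholds are invented for illustration and do not come from Anthropic's classifier or data. It only shows the general pattern that lowering a flagging threshold catches more targeted requests while also sweeping in more legitimate ones.

```python
# Hypothetical illustration only: the scores and labels below are made up
# and are not Anthropic's classifier, thresholds, or data.
from dataclasses import dataclass


@dataclass
class Request:
    score: float       # hypothetical classifier score in [0, 1]
    is_targeted: bool  # True if the request really is the restricted kind


def flag_counts(requests, threshold):
    """Return (caught, false_positives) when flagging scores >= threshold."""
    flagged = [r for r in requests if r.score >= threshold]
    caught = sum(1 for r in flagged if r.is_targeted)
    false_positives = sum(1 for r in flagged if not r.is_targeted)
    return caught, false_positives


SAMPLE = [
    Request(0.95, True),
    Request(0.80, True),
    Request(0.60, True),
    Request(0.70, False),
    Request(0.55, False),
    Request(0.30, False),
    Request(0.10, False),
]

# A narrow net flags only the highest-scoring requests.
print("narrow net:", flag_counts(SAMPLE, threshold=0.9))
# A wide net flags far more, including legitimate work.
print("wide net:", flag_counts(SAMPLE, threshold=0.5))
```

    In this made-up sample, the narrow threshold catches one targeted request with no false positives, while the wide threshold catches all three but also flags two legitimate ones.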

    The company is also applying the same cleanup to its biology and cybersecurity classifiers, which had drawn their own complaints about flagging harmless research prompts.

    The remaining concern is that Anthropic isn’t dropping this category of restrictions; it’s only making them visible. For those who believe the restrictions themselves are wrong, Thursday’s apology is a partial fix. Fable 5 remains free on Pro, Max, Team, and Enterprise plans until June 22, after which it shifts to API usage credits only.

    © 2025-2026 CryptocNews.com