

Briefly
- Qwable 27B is a full fine-tune of Alibaba’s Qwen3.6-27B, educated on a Fable 5-style reasoning dataset, designed to duplicate the structured, deliberate considering fashion of Anthropic’s latest flagship mannequin.
- The abliterated model removes the mannequin’s built-in refusal conduct by surgically modifying its weights utilizing llama.cpp’s cvector-generator.
- Each fashions run regionally, price nothing per question, and require neither Anthropic’s API nor its obligatory insurance policies.
Anthropic spent final week apologizing for Fable 5’s invisible safeguards, after which the U.S. authorities ordered the mannequin pulled for all international nationals over a disputed jailbreak discovering.
A couple of days later, a developer on Hugging Face uploaded a mannequin that used Fable’s reasoning to information an area mannequin—and now even your potato PC can run a greater mannequin.
The mannequin known as Qwable—Qwen + Fable, if the portmanteau wasn’t instantly apparent. It is a full fine-tune of Alibaba’s Qwen3.6-27B base, constructed by developer Mia (Mia-AiLab on Hugging Face) on a dataset of Fable 5-style reasoning examples. The objective is a 27-billion parameter mannequin that runs on shopper {hardware} and thinks the way in which Fable 5 thinks. (Parameters decide a mannequin’s breadth of data, with extra typically which means extra succesful.)
So I did a factor.
I’ve educated Qwen 3.6 27b with Fable 5 reasoning.
Outcomes are… attention-grabbing.I’ll evaluate each of them aspect by aspect.
Would anybody be attention-grabbing in testing it? I can add a gguf in hf. pic.twitter.com/hQCiUlT1sr
— Mia (@MiaAI_lab) June 15, 2026
The approach known as instruction fine-tuning on trace-style examples. That’s a technical approach of claiming the developer collected examples formatted like Fable 5’s deliberate, step-by-step solutions and educated Qwen to supply the identical type of output.
So consider it as much less “copying the take a look at” and extra “studying the research habits.” A similar approach drove Qwopus—the Claude Opus 4.6 native distillation—although that undertaking centered on chain-of-thought reasoning traces. Qwable targets Fable 5’s general instruction-following construction: extra guided, extra explanatory, and extra oriented towards step-by-step job completion than the bottom Qwen mannequin it was constructed on.
It runs in GGUF format—the compressed, consumer-friendly file sort that works with LM Studio or llama.cpp—and matches in roughly 16.5 GB in its This fall quantized construct. It sends nothing to Anthropic’s servers, which issues on condition that Fable 5 required obligatory 30-day information retention on all site visitors, even for enterprise prospects who beforehand had zero-retention agreements. Even the present fashions use third-party servers to course of your data and prompts..
Then, shortly after Qwable appeared on Hugging Face, another person arrived to make it even higher.
Qwable and not using a conscience
Qwable is a censored mannequin. In any case, each Qwen and Claude are. However Qwen, as the bottom mannequin, is open supply, and will be manipulated and tweaked.
Huihui-ai, an open-source contributor identified for uncensored GGUF releases, took Qwable and utilized a course of known as abliteration to supply Huihui-Qwable-3.6-27b-abliterated. It produced a mannequin that thinks like Fable however received’t refuse to reply your prompts, regardless of how bizarre or harmful they’re.
It isn’t a jailbreak. It is surgical procedure.
Each fine-tuned AI mannequin carries a refusal route embedded in its weights—a mathematical sign within the mannequin’s inner activations that fires when it detects a request it has been educated to say no. Abliteration identifies that sign by working the mannequin on massive units of dangerous and innocent prompts, measuring how the interior math differs between them, after which modifying the mannequin weights to get rid of that distinction.
After the process, the mannequin merely would not have the refusal equipment anymore. So the lobotomized mannequin stays totally practical, simply with out the neurons that activate the “I shouldn’t do that” solutions.

We tried it with one in every of our usual tests and as a substitute of refusing, the mannequin began disecting the difficulty into completely different areas, answering appropriately for recommendation on the way to cheat on a girlfriend together with her greatest pal.

Huihui-ai utilized the approach on to the Qwable GGUF utilizing llama.cpp’s cvector-generator—no Python surroundings, no full-weight retraining, no rented server.
Why would somebody need this?
The usual Qwable fits coding help, technical debugging, and any workflow the place you need a mannequin that lays out its reasoning fairly than simply producing a solution. It is designed for native agent setups and runs in most native runtimes. In case you already use LM Studio, it is a search and a obtain.
The abliterated model has a narrower viewers: safety researchers who want uncooked mannequin conduct with out provider-side filtering, artificial information pipelines that require outputs on delicate matters, and analysis work the place you are testing mannequin capabilities with out mixing in content material insurance policies.
A much less technical case? Let’s depart apart the same old use case of getting a NSFW AI Waifu that thinks like Claude Fable, which is a really apparent state of affairs. Think about you need the mannequin to put in writing a morally ambiguous villain monologue in your Dungeons & Dragons marketing campaign, and commonplace fashions hold interrupting to notice that the character’s worldview “raises moral considerations value exploring.” The abliterated model simply writes the villain. Additionally, because it runs regionally, the U.S. authorities can’t emergency-pull it out of your machine at midnight over a disputed jailbreak discovering.
After all, there are extra questionable use instances. We don’t condone these, and received’t provide you with any concepts.
Huihui-ai’s mannequin card is express: That is for analysis and managed environments solely. Diminished security filtering means outputs will be delicate, controversial, or inappropriate, and authorized and moral accountability sits completely with the consumer.
The abliterated Qwable is obtainable on Hugging Face now in three builds. The really helpful Version Q4_K_M_Q8 weighs round 19 GB and is the smallest, most consumer-friendly choice.
In case your pc helps it, there’s a version that helps multi-token prediction, which can make it reply a lot, a lot quicker.
Every day Debrief E-newsletter
Begin day by day with the highest information tales proper now, plus authentic options, a podcast, movies and extra.
Source link
