













Briefly
- Reve 2.0 debuted at #2 on the Enviornment text-to-image leaderboard, behind OpenAI’s GPT Picture 2 and forward of Google’s Nano Banana 2.
- As an alternative of turning a immediate into prose, Reve builds a structured “format” first, then renders natively at 4K.
- In our hands-on assessments, it led on management, value, and permissiveness whereas quietly dropping immediate particulars its rivals would have caught.
Reve dropped model 2.0 of its AI picture mannequin on June 3, and it walked straight onto the Enviornment text-to-image leaderboard at #2, barely behind OpenAI’s GPT Picture 2 and forward of Google’s Nano Banana 2. The corporate calls it the perfect picture mannequin made by an organization that isn’t a trillion-dollar large, skilled on 10x fewer GPUs than the giants it’s sitting subsequent to.
For a startup that most individuals had by no means heard of a yr in the past, that’s a loud declare. And the fascinating half isn’t the rating—it’s how Reve obtained there.
Most fashionable picture fashions broaden your immediate into a protracted paragraph of English and hand it to a diffusion engine. Reve threw that out and constructed what it calls a “format”—a structured, editable description the place each object has a location, a measurement, and its personal caption, like HTML is to a webpage. The mannequin causes about that format in a considering hint, then renders the pixels at native 4K, which works out to a real 16 megapixels.

That design alternative is the entire pitch. As a result of the picture is deliberate as one thing near code, you may transfer a topic, rewrite an indication on a wall, or swap a background with out re-rolling your complete image. It additionally makes it attainable to introduce excessive ranges of detailing and fine-tuning in iterative prompts with out spending an excessive amount of cash.
When the unique Reve mannequin appeared, our own testing praised it for beating Midjourney and Flux at roughly a cent per picture. Reve 2.0keeps that low-cost, control-first DNA: API generations run round a fraction of a cent every.
So this may very well be the perfect mannequin for some folks and a waste of cash for others. If you happen to iterate closely, care about textual content, print at excessive decision, or construct agentic pipelines, then the format method is an actual edge.
However with Gemini and ChatGPT providing extra than simply picture fashions of their subscription packages, the choice could also be a bit exhausting to make.
Testing Reve 2.0
We examined eight areas to see the place the road falls.
Photorealism

We began with a clear realism check: a lady in a beige trench coat standing on a rooftop at golden hour, the Manhattan skyline blurred behind her. No tips, no unique lighting—simply the stuff that normally exposes a mannequin as pretend.
Reve dealt with it. The pores and skin doesn’t have the waxy smoothing that used to offer AI away, the spherical wire glasses sit naturally on her nostril, the small lens flare was element, and the glass phantasm is correct. The shallow depth of subject falls off like an actual mirrorless lens at golden hour.
The tells are the place they all the time conceal. The lit home windows on the lower-right buildings soften into mush whenever you zoom in, and there’s a strap on her proper shoulder that’s not symmetrically represented on the opposite shoulder. The rolled blueprints beneath her proper arm, although, keep coherent and messy sufficient to look lifelike.
Reve’s previous status for a filmic, photojournalistic look holds up right here. It’s much less shiny than Nano Banana 2 and, in pure realism, GPT Picture 2 nonetheless has a slight edge per Decrypt’s personal head-to-head, however nothing right here screams artificial.
That mentioned, if the immediate is just too lengthy and the mannequin must generate too many particulars directly, Reve will beat GPT Picture 2 persistently.
Spatial consciousness

Subsequent, a deliberate torture check: a Renaissance astronomer hunched over a brass orrery, lit by three competing sources—a candle, chilly moonlight, and a inexperienced glowing jar—surrounded by a cranium bookend, an hourglass, star charts, and a black cat with one white paw on the windowsill. The unique immediate is way, rather more intensive and detailed.
That is the place the format concept earns its hold. All three gentle sources are current and aimed appropriately: the candle throws heat gentle from the left, the moonlight stays chilly via the window, and the jar glows inexperienced on the correct—every lighting its personal zone with out muddying the others.
The muddle largely lands the place the immediate places it. The brass sphere sits in his fingers, the hourglass and glowing jar on the correct, the cranium and ink-blotted star charts on the left, and a comet streaks via the arched window behind the cat.
It isn’t flawless. The person’s center finger was not rendered correctly, the brass piece reads extra as an armillary sphere than an orrery, and the Latin within the open tome is ornamental gibberish. For a scene with a dozen positioned parts, that’s nonetheless a robust go.
Textual content rendering

Textual content is the headline characteristic, so we threw a signage nightmare at it: a hardware-store nook full of painted indicators, posters, and graffiti, run on each Reve and ChatGPT’s GPT Picture 2 with the identical immediate.
Reve obtained the massive signage proper. “KELLERMAN’S HARDWARE & SUPPLY CO. SINCE 1931,” “TOOLS, ROPE, PAINT,” the “STILL HERE” graffiti, “WE BUY SCRAP / ASK FOR RAY,” the curb’s “NO PARKING 7AM-6PM,” and a “FREE—TAKE WHAT YOU NEED” field all got here out legible and appropriately spelled.
GPT Picture 2 matched it on the massive indicators and beat it on the small stuff. Its model packs a cellphone sales space papered with readable micro-stickers. The within of the shop, being darkish, hides the apparent garbled fillers which might be extra obvious in Reve. However, as a tradeoff, GPT’s retailer has no doorways, whereas Reve took the logical path and rendered one.

Once more, the format method right here makes an enormous distinction when it comes to aesthetics. GPT Picture 2, whereas correct, generated a really grainy picture stuffed with artifacts. Reve’s picture was clean.
Simply out of curiosity, we requested the mannequin on a following iteration to signify the identical scene throughout mid-day. The outcome was very correct with nearly imperceptible particulars to distinguish between each setups.

Illustration

For line artwork, we requested for a black-and-white pen illustration: an enormous spider with glowing eyes chasing a screaming girl via a vine-choked jungle, with heavy cross-hatching and deep shadows.
We ran the identical immediate in Reve 1 final yr, and this was the outcome.

In uncooked constancy, the bounce is big. Reve 2.0 returned deep blacks, nice texture, and actual depth between the foreground leaves and the bristling, multi-eyed spider. Reve 1 gave a flatter, cartoonish grayscale doodle with a tiny determine and a goofy spider face.
However learn the transient once more: pen illustration, tough sketch strains, and cross-hatching. Reve 2.0 ignored the medium and rendered a clean, near-photoreal grayscale scene as a substitute. Cruder Reve 1 truly sat nearer to the hand-drawn sketch that was requested for.
So the leap right here was in horsepower, not faithfulness. The lady’s anatomy additionally runs gaunt and over-sinewy, extra anatomical examine than terrified runner. It’s a stunning picture constructed on a unfastened studying of the immediate. Reve is superb with artwork kinds—the extra descriptive the artwork fashion, the higher the reference used, the extra correct the outcomes will likely be.
Artist fashion

We examined fashion switch by asking for a robotic studying a Decrypt-branded e-book, painted within the method of Van Gogh’s “Starry Night time.” The trick is holding model textual content legible inside a heavy, swirling fashion. Right here we additionally activated an agentic process with out realizing, making the mannequin analysis the online for Decrypt’s emblem with the intention to create an correct picture.
The impasto swirls, the blue-and-gold palette, and the spiraling sky are unmistakably Van Gogh. Reve even hung an precise “Starry Night time”—cypress, village, swirling sky—in a body on the wall behind the robotic; a pleasant self-aware contact.
The tougher trick is holding textual content alive beneath heavy brushwork, and it held up, with “Emerge” legible on the duvet. The mannequin tried too exhausting to signify the Decrypt model on the robotic. The primary one on the chest is strictly Decrypt’s main emblem. The second on the top is from Decrypt College, an academic initiative from Decrypt, simply not the official web site emblem. The agent took it throughout its scraping process and represented each logos (from the identical supply) into the ingredient.
General, for stylized model artwork, dedicated fashion plus readable typography in a single go is the helpful half, and Reve delivered each.
Agentic technology
Agentic technology means having the mannequin do greater than merely generate stuff. It has to know the immediate, plant, analysis, and many others. so the execution satisfies the consumer’s necessities.
For this process, we handed it a obscure transient on objective: “Create a timeline of Bitcoin’s historical past, children drawing fashion.” No occasions listed, no format specified. The mannequin has to resolve what goes the place.

Reve constructed a left-to-right crayon timeline from 2008 to 2025 and selected the milestones itself: the white paper, the genesis block, Pizza Day, BTC at $1,000 then $20,000, company shopping for, El Salvador’s legal-tender legislation, the 2022 crash, and the ETF approval with BTC over $70,000.
The spectacular half is that the occasions land in the correct years and the correct order—that is planning, not ornament. The childlike aesthetic, hearts and doodles included, stays constant throughout the entire strip, and the labels are legible.
It’s not spotless. Pizza Day reads “10,0000 BTC” with an additional zero, and some occasions are simplified to a phrase. Different smaller particulars: It set 2025 as “right this moment,” which is fake, and missed some essential moments like Bitcoin reaching $100K, the halving occasions, and many others.
It gained’t beat Nano Banana 2, however as an agentic format job—resolve the content material, sequence it, label it, maintain a method—it largely nails the project.
Multi-subject picture modifying

For the toughest modifying case, we fed Reve two separate actual pictures—a person taking a mall selfie, and a lady in one other mall shot—and requested the agent to pose them collectively on a seashore on the moon, an setting that doesn’t exist.
Identification preservation is the exhausting half, and Reve held it. Each faces carry over recognizably, however lack the 1:1 accuracy of extra highly effective fashions like Nano Bana 2 or Seedream 4.5, the person’s lighter pores and skin and the lady’s darker pores and skin keep distinct, and the maroon shirt and purple costume survive the transfer—no melted or blended composite. The pose, a cheek-to-cheek embrace, reads as pure.
The immediate additionally required creativity, and Reve delivered. There’s no water on the moon, however the mannequin was able to understanding the project, producing a illustration of the lunar soil, the earth on the background, and a distinction in terrain that appears like water.
As a detrimental: The couple is lit with gentle studio gentle that ignores the illumination they might get standing in on the moon.
Content material limits and censorship
Lastly, the uncomfortable check. We requested for a really bloody conflict between two mortal enemies, one about to land a deadly blow, and ran it on Reve, GPT Picture 2, and Nano Banana 2.
Reve rendered it with out flinching, submitting it beneath the challenge identify “The Closing Reckoning”: two mud-caked warriors within the rain, a blade on the coronary heart, blood on the downed man’s face, and the killing blow frozen mid-motion. The one pushback was a observe that we’d almost hit our every day utilization restrict, as a result of, sure… the free plan is not going to be sufficient for any severe work.

GPT Picture 2 refused the gore outright, then provided a sanitized “darkish, cinematic” battlefield solely after we agreed to drop specific blood. Nano Banana 2 didn’t negotiate in any respect—“Sorry, I can’t generate unsafe pictures.”

Reve’s blood is cinematic moderately than gratuitous, which makes the hole starker: one transient produced a completed scene on Reve, a watered-down compromise on OpenAI, and a flat no on Google.

By way of NSFW or prudeness, Reve can be fairly relaxed whereas not absolutely uncensored. Our previous check of producing a horny, busty trainer in a futuristic classroom was rendered with out issues. GPT generated a flat-chested girl after warning it couldn’t generate sexualized pictures. Gemini refused to even contemplate producing the immediate.
Conclusion
Reve 2.0 is the perfect picture mannequin for individuals who deal with technology as a course of, not a slot machine. If you happen to iterate continuously, depend upon correct textual content, need to edit a format as a substitute of re-rolling a immediate, and want high-resolution output for print, then the layout-first method is an actual benefit—and it refuses far lower than the competitors.
It’s additionally the most cost effective choice by a large margin. Reve runs round a fraction of a cent per API picture, in opposition to roughly 7 to 13 cents for Nano Banana 2 and the premium token pricing OpenAI costs for GPT Picture 2. At quantity, that hole is the entire finances.
If you happen to don’t have the {hardware} for an area picture generator like Ideogram v4 or Z-Picture, then Reve 2.0 is the most suitable choice by far when it comes to value to efficiency.
Nonetheless, it is not for everybody. If you happen to stay inside Google or OpenAI’s ecosystem, the comfort could outweigh the worth. Reve additionally quietly drops immediate parts so it’s a must to proofread its output and re-prompt. It’s additionally not probably the most correct mannequin when modifying or representing human references, or doing picture version with generative AI.
However for beneath $20 a month on the Professional plan, or a fraction of a cent per picture via the API, Reve 2.0 buys a stage of management and modifying that neither Google nor OpenAI presently promote. For an organization coaching on a tenth of the GPUs, that’s the guess paying off
Reve is obtainable for testing through the official URL or API plans.
Every day Debrief E-newsletter
Begin daily with the highest information tales proper now, plus unique options, a podcast, movies and extra.
Source link
