I Received An Early Have A Look At ChatGPT Pictures 2.0, And It Is Spectacular - With One Exception

Elyse Betters Picaro / ZDNET

Observe ZDNET: Add us as a most well-liked supply on Google.

ZDNET’s key takeaways

OpenAI reframes photographs as a visible language.
Considering mode builds context-aware infographics.
Model constancy remains to be inconsistent in early testing.

Immediately, OpenAI introduced ChatGPT Pictures 2.0, its next-generation picture mannequin, which the corporate says is targeted on precision, usability, and complicated visible duties.

Probably the most notable new functionality is the flexibility to mix textual content and pictures to construct complicated, lovely pages. OpenAI is reframing the entire thought of picture technology from a course of that creates decorations (their phrase) to a language (additionally their time period).

Additionally: One of the best AI picture turbines of 2026: There’s just one clear winner now

OpenAI describes it as, “A good image does what a good sentence does — it selects, arranges, and reveals. It can explain a mechanism, stage a mood, test an idea, or make an argument.”

Considering capabilities allow complicated workflows

Along with its vastly improved capacity to combine textual content and graphics, the brand new mannequin makes use of enhanced considering capabilities. It will possibly generate a number of photographs per immediate with continuity throughout outputs. This strategy is feasible as a result of the mannequin truly integrates reasoning into the picture output.

Created by ChatGPT/Screenshot by David Gewirtz/ZDNET

This shift is huge. As an alternative of simply producing a picture that just about matches the immediate particulars, Pictures 2.0 can take a a lot vaguer immediate, like “Generate an infographic about activities I should do with tomorrow’s weather in San Francisco in mind.”

Additionally: Easy methods to change from ChatGPT to Gemini

From this immediate, the AI will collect climate and exercise information about San Francisco, decide actions acceptable to the climate, after which construct a picture or set of photographs that match the outcomes.

In keeping with OpenAI, “In this model, Images 2.0 acts more like a visual thought partner, helping carry a project from rough concept to finished asset with significantly less work on your part.”

Precision and design management enhance usability

Many people have lengthy struggled to persuade ChatGPT to generate photographs in a particular desired side ratio. Typically, the AI stubbornly produces what it needs. However now, with Pictures 2.0, the mannequin has assist for “aspect ratios as wide as 3:1 and as tall as 1:3.”

The mannequin additionally helps higher-fidelity outputs that (largely) produce correct object placement, detailed textual content rendering, and complicated compositions. We’ll see if we are able to take away the phrase “mostly” from that sentence after the product is formally launched.

Additionally: I attempted Private Intelligence, and it was correct (however unsettling)

The AI additionally helps small textual content, UI components, and stylistic constraints at as much as 2K decision. Cool.

Testing the preview

I used to be given entry to a day-before-release preview, and the mannequin is spectacular, largely. I fed it a screenshot of the ZDNET house web page and a draft of the Pictures 2.0 press launch.

Then I instructed, “Based on the contents of the press release, generate a 16:9 infographic about the new image update and generate it using the ZDNET brand style as shown in the ZDNET home page document.”

Additionally: I attempted Google Pictures’ new AI Improve software: The way it crops, relights, and fixes your pictures – typically

The mannequin did an important job on the infographic, however strive as it would, it couldn’t reproduce the ZDNET brand. On its first strive, it rendered the Z in ZDNET with a slight droop.

I attempted quite a lot of requests on the order of, “Fix the ZDNET Logo. The Z droops in your version but is not droopy in the actual logo.” However Pictures 2.0 by no means managed to repair it.

So I began a brand new session. This time, I included the instruction, “Use special care to reproduce the ZDNET logo accurately.”

Additionally: I examined ChatGPT Plus vs. Gemini Professional to see which is best – and if it is value switching

This is the place issues received very odd. For its first run, the mannequin by some means dug up a duplicate of ZDNET’s brand from earlier than our 2022 redesign. This brand is nowhere to be discovered on our present house web page. Weirdly, it rendered that previous brand utilizing the present coloration scheme. The mannequin then pushed the brand and the infographic info off the left fringe of the picture. It additionally selected a lightweight blue for “Images 2.0” that is not a ZDNET model coloration.

I attempted mightily to persuade it to make use of the present brand. I managed to get it to push the picture to the suitable, so nothing was lower off. However including the immediate, “Use the ZDNET logo that is on the provided page. Do not search for an alternative logo,” did nothing to repair the issue.

I took yet one more shot on the problem earlier than deciding to return to ending up this text. As soon as once more, I began a brand new session so the AI did not have muscle reminiscence from its earlier miscalculations.

Additionally: This highly effective Gemini setting made my AI outcomes far more private and correct

The mannequin tousled the brand once more. This time, the AI determined so as to add a rudder form to the stem of the stretched-out capital D.

To be honest, I am utilizing a pre-release model of Pictures 2.0. I will be again with a way more complete take a look at run of the mannequin after the official product launch.

I additionally tried the same take a look at utilizing a special doc with Google’s Nano Banana Professional, however as a result of it did not deal with the synthesis the way in which that this new model of OpenAI’s product does, it wasn’t actually capable of repeat the outcomes I received right here. We’ll know extra as we do extra superior exams

Pricing and availability

The brand new mannequin is offered at this time to all ChatGPT and Codex customers. Superior outputs and the considering functionality can be found to ChatGPT Plus, Professional, Enterprise, and Enterprise customers. You should definitely choose “Thinking” from the ChatGPT dropdown bar on the high of the display.

On the time of writing, earlier than launch, the brand new Pictures 2.0 mannequin is simply obtainable on the desktop. However OpenAI guarantees that these capabilities can be within the cellular model as effectively, together with the flexibility to finger-select photographs utilizing your cellular touchscreen.

The pictures are additionally obtainable through API utilizing the gpt-image-2 mannequin. API pricing varies relying on the standard, thinkiness (my phrase), and desired picture decision.

If an AI can deal with structure and content material together, will that change the way you strategy design tasks? Tell us within the feedback under.

You possibly can comply with my day-to-day challenge updates on social media. You should definitely subscribe to my weekly replace publication, and comply with me on Twitter/X at @DavidGewirtz, on Fb at Fb.com/DavidGewirtz, on Instagram at Instagram.com/DavidGewirtz, on Bluesky at @DavidGewirtz.com, and on YouTube at YouTube.com/DavidGewirtzTV.

Top Posts

15 of the best Prime Day laptop deals (I’d actually buy myself)

Hermes Agent Adds Asynchronous Subagents, So Delegated Work No Longer Blocks the Parent Chat

“Whales Quietly Load Up on $950M in ETH While Bullish Bottom Narratives Clash With a Missing Piece”

I received an early have a look at ChatGPT Pictures 2.0, and it is spectacular – with one exception

15 of the best Prime Day laptop deals (I’d actually buy myself)

AI Red Teaming Decoded: The Essential Guide to Outsmarting Cyber Threats

Mastering Claude Code: The Definitive Alignment Playbook

Sakana Marlin Debuts with AB-MCTS, Empowering Enterprises to Auto-Generate Comprehensive 100-Page Reports and Slide Decks

sktime in Python: A Practical Guide to Building Time-Series Machine Learning Models

Windows Subsystem for Linux 3: The Game-Changer That Makes Developers Loyal to Microsoft

15 of the best Prime Day laptop deals (I’d actually buy myself)

Hermes Agent Adds Asynchronous Subagents, So Delegated Work No Longer Blocks the Parent Chat

“Whales Quietly Load Up on $950M in ETH While Bullish Bottom Narratives Clash With a Missing Piece”

Chinese Hackers Exploit Google Workspace to Pilfer Sensitive Research and Defense Emails

Microsoft Partners with AWS for GitHub Resources as Azure Faces Legal Battle

Industrial AI Evolves: Prioritizing Knowledge Preservation Over Predictive Maintenance

AI Red Teaming Decoded: The Essential Guide to Outsmarting Cyber Threats

The Protocol That Transformed Our Agent Architecture

Trending

15 of the best Prime Day laptop deals (I’d actually buy myself)

Hermes Agent Adds Asynchronous Subagents, So Delegated Work No Longer Blocks the Parent Chat

Latest Posts

Not More Data, but Better World Models – Unite.AI

OpenAI Is Hiring Head of Preparedness, Amid AI Cyberattack Fears

Subscribe to Updates

Top Posts

I received an early have a look at ChatGPT Pictures 2.0, and it is spectacular – with one exception

ZDNET’s key takeaways

Considering capabilities allow complicated workflows

Precision and design management enhance usability

Testing the preview

Pricing and availability

Related Posts