OpenAI unveils chatgpt images 2.0 with visual‑language thinking mode

SiliconFeed EditorialApril 22, 2026

openai image generation ai design visual language api

Sections and tags — in the Topics menu Search the feed

At a glance:

OpenAI releases ChatGPT Images 2.0, a visual‑language model that blends text and graphics for complex infographics
New "thinking" mode lets the model reason across multiple outputs and maintain continuity
Early preview shows strong aspect‑ratio control and high‑fidelity rendering, but brand‑logo fidelity remains inconsistent

What images 2.0 brings to the table

OpenAI announced the next‑generation image model, ChatGPT Images 2.0, positioning it as a visual language rather than a simple decoration tool. The company describes a good image as doing for vision what a good sentence does for text: selecting, arranging, and revealing information. To that end, the model now supports aspect ratios as wide as 3:1 and as tall as 1:3, and it can generate outputs up to 2 K resolution while preserving small‑text legibility, UI elements, and stylistic constraints.

Beyond raw resolution, Images 2.0 adds a “thinking” capability that integrates reasoning into the image generation pipeline. This enables the model to produce multiple images per prompt with continuity, and to pull in external data such as weather forecasts when asked for context‑aware infographics. For example, a vague request like “Generate an infographic about activities I should do with tomorrow’s weather in San Francisco in mind” triggers the model to retrieve weather data, select appropriate activities, and compose a visual layout that reflects the results.

Early testing highlights strengths and hiccups

In a pre‑release preview, the author fed the model a screenshot of the ZDNET homepage and a draft press release, then asked for a 16:9 infographic in ZDNET’s brand style. The resulting graphic displayed accurate layout, proper text rendering, and correct aspect‑ratio handling, but it repeatedly failed to reproduce the current ZDNET logo. The first attempt rendered a drooping “Z”; a later run pulled an outdated 2022 logo and even added a rudder‑shaped flourish to the “D”. Re‑prompting with explicit instructions to use the provided logo did not resolve the issue, suggesting brand‑fidelity remains a challenge in this early version.

The author also compared the experience to Google’s Nano Banana Pro, noting that the latter could not synthesize text‑and‑image workflows in the same way. While the visual quality of Images 2.0 was impressive, the logo inconsistency underscores that the model’s reasoning about brand assets still needs refinement before enterprise designers can rely on it for brand‑critical work.

Availability, pricing and API access

ChatGPT Images 2.0 is live for all ChatGPT and Codex users as of the article’s writing, but advanced “thinking” features are gated behind ChatGPT Plus, Pro, Business, and Enterprise plans. Users must select the “Thinking” option from the dropdown bar in the ChatGPT UI. Currently the model is only accessible on desktop; OpenAI promises a mobile rollout with touch‑based finger‑selection of images.

Developers can call the model via the API using the gpt-image-2 endpoint. Pricing varies by output quality, the degree of “thinkiness,” and resolution, though exact rates were not disclosed in the preview. OpenAI indicates that higher‑resolution and more reasoning‑intensive calls will cost more, aligning with its broader strategy of tiered AI services.

What this means for designers and developers

If a single AI can handle layout, data gathering, and visual synthesis, it could reshape how design teams prototype and iterate. Rather than manually assembling charts, pulling data, and tweaking branding, creators could describe a concept in natural language and receive a near‑final infographic in seconds. However, the logo‑fidelity issue highlighted in the preview suggests that for brand‑sensitive projects, human oversight will remain essential until the model’s visual‑memory and asset‑recognition improve.

Future updates are expected to tighten brand‑recognition, expand mobile capabilities, and possibly introduce fine‑grained style controls. As the model matures, we may see it integrated into broader creative suites, offering a true visual‑thought partner for marketers, product teams, and developers alike.

Editorial SiliconFeed is an automated feed: facts are checked against sources; copy is normalized and lightly edited for readers.

FAQ

What new aspect‑ratio options does Images 2.0 support?

Images 2.0 can generate outputs as wide as a 3:1 ratio and as tall as a 1:3 ratio, giving designers far more flexibility for banners, posters, and mobile graphics compared with earlier models that were limited to square or 16:9 formats.

Is the Images 2.0 model available on mobile devices?

At the time of the preview, Images 2.0 is only accessible on desktop. OpenAI has promised a mobile rollout that will include touch‑based finger‑selection of images, but no release date has been announced.

How does the "thinking" mode change the way prompts are handled?

Thinking mode integrates reasoning into the image generation pipeline, allowing the model to pull in external data (like weather forecasts), maintain continuity across multiple outputs, and produce complex infographics from vague natural‑language prompts.

More in the feed

Prepared by the editorial stack from public data and external sources.

Original article