AI NewsGPT-5.5 Instant

GPT-5.5 Instant: ChatGPT switches its default — what it means and how to use it

GPT-5.5 Instant is now the ChatGPT default. We walk through the capability delta from GPT-5, the latency profile, and what to change in your prompts and tool calls.

中澤圭志

@keishi_nakazawa

Sales Claw maintainer

May 13, 2026·13 min

GPT-5.5 Instant: ChatGPT switches its default — what it means and how to use it

This English article is a concise version of the original. For the full Japanese deep-dive, see the Japanese original.

Key Facts

Release

2026-05-13

Default model

GPT-5.5 Instant

"ChatGPT's responses feel shorter and more grounded lately. Am I imagining it? Or did the model change?" You're not imagining it. On 2026-05-05, OpenAI quietly swapped ChatGPT's default to GPT-5.5 Instant. This article walks through what changed and how it lands on workplace writing like sales-form copy generation, using primary sources.

There was no big launch event. Unlike a Claude Code 2.1.140-style patch or Gemini CLI's same-day dual release, ChatGPT users worldwide were silently migrated to the new model — an industry-scale event by stealth.

Primary sources: OpenAI's GPT-5.5 Instant launch post, System Card, API docs, ChatGPT Release Notes, and OpenAI API Changelog. Secondary outlets (TechCrunch / Bloomberg / Fortune) are referenced only in body text — not in JSON-LD citations.

1. What GPT-5.5 Instant is — the day ChatGPT's default "quietly" switched

The GPT-5.5 family was announced on 2026-04-23, with Thinking / Pro variants rolling out first for Pro / Business / Enterprise customers. About two weeks later, on 2026-05-05, the Instant variant landed all at once as ChatGPT's default model.

As the name suggests, Instant is not the "heavy reasoning" type. It's tuned as the chat default for latency, cost, and workplace-friendly tone. Deep reasoning and complex code generation continue to live with GPT-5.5 (frontier), GPT-5.5 Pro, and the Codex family.

Rollout timeline for GPT-5.5 Instant. Horizontal time axis with four milestones. 2026.04.23 — GPT-5.5 announced (Pro/Business/Enterprise, Thinking/Pro variants, purple marker). 2026.05.05 — GPT-5.5 INSTANT becomes default (rolling out across ChatGPT plans plus API alias chat-latest, green marker). 2026.05.05+ — personalization expansion (Plus/Pro Web first, then Mobile / Free / Go / Business / Enterprise, green marker). ~2026.08 — GPT-5.3 INSTANT retirement (paid users keep settings-based access for 3 months, orange marker). Footnote: rollout varies by plan and region; based on openai.com announcements. Source: openai.com/index/gpt-5-5-instant/, developers.openai.com/api/docs/changelog. — Figure: Figure: GPT-5.5 Instant rollout timeline (per primary sources)

Hero image for the GPT-5.5 Instant article. Dark navy background with the headlines 'GPT-5.5 INSTANT' and 'ChatGPT default model quietly refreshed.' Three stacked cards transition left to right: 'Previous default,' 'Switched silently, no user action,' 'New default.' An abstract editorial illustration evokes the magazine-cover feel of the silent worldwide migration on 2026-05-05. — Figure: Figure 1: Hero — on 2026-05-05, the ChatGPT default switched to GPT-5.5 Instant without a single notification to users

Split of duties between Instant and Thinking / Pro:

項目	GPT-5.5 Instant (default chat)	GPT-5.5 / GPT-5.5 Pro (reasoning)
Primary use	Daily chat / work docs / short writing	Deep reasoning / code / complex analysis
Latency	Fast (true to the Instant name)	Heavier (thinking time)
API price	$5 / $30 per MTok (via chat-latest)	Same $5 / $30 (frontier) / Pro is Responses API only
API ID	chat-latest (auto-tracking)	gpt-5.5 / gpt-5.5-pro
In ChatGPT	Default model across all plans	Plan-tiered option (Plus and above)

Tasks like Sales Claw's "generate one contact-form message" are typically Instant territory. They need tone, brevity, and reliability against misinformation — not deep reasoning.

2. Verifiable changes — numbers and tone

Before / After comparison of response style between GPT-5.3 Instant and GPT-5.5 Instant. Left (GPT-5.3 INSTANT): hand-drawn chat bubbles with 'Of course!', 'That's great!', heavy emoji use, casual vibe. Bottom tags: 'long', 'too many emoji', 'casual'. Right (GPT-5.5 INSTANT): organized business document style, focused bullet points, mint / teal accents. Bottom tags: 'short', 'restrained emoji', 'WORKPLACE-SAFE'. Editorial visualization of OpenAI's official phrasing 'informal, practical, workplace-safe without overexplaining.' — Figure: Figure 2: Same prompt, different response style (Before: 5.3 Instant / After: 5.5 Instant — editorial visualization)

Hallucination drop — 52.5% / 37.3%

OpenAI emphasizes the factuality improvement most. Two numbers from the launch post:

52.5% drop: internal-eval hallucination rate on high-stakes factuality-heavy prompts in medical / legal / financial domains (vs GPT-5.3 Instant)
37.3% drop: inaccurate claims in conversations users flagged as factually wrong (vs GPT-5.3 Instant)

The System Card describes the methodology. A web-search-enabled LLM grader judges "does the response contain factual errors," and compares the rate. The key caveat from OpenAI: these evaluations are deliberately hard and do not represent production prevalence directly.

Shorter responses — 30.2% / 29.2%

Quantification of the tone shift. Same task, responses are ~30% shorter by word count.

Not just "lower token billing." This affects editing cost when drafting work emails or proposals. If you were cutting blocks every time, you should now see fewer blocks to cut.

"Workplace-safe" tone and personalization expansion

OpenAI took an explicit position on tone: "informal, practical, and workplace-safe without overexplaining" is the new default tone.

The launch post is explicit that overexplaining, unnecessary follow-up questions, and excessive emoji are reduced "by default." This is not "tunable via system prompt" — it's built into the model's default behavior.

Personalization across past chats, uploaded files, and connected Gmail is also expanding — Plus / Pro Web first, then mobile / Free / Go / Business / Enterprise. For sales work, "the company context you previously taught ChatGPT" is now more likely to flow into subsequent responses without prompting.

3. Reading the official numbers — what 52.5% does and doesn't mean

Before / After comparison of GPT-5.3 Instant → GPT-5.5 Instant across four metrics. GPT-5.3 Instant baselined to 100, GPT-5.5 Instant values shown as bars. Hallucination rate on high-stakes: 100 → 47.5 (-52.5%). Inaccurate claims on flagged conversations: 100 → 62.7 (-37.3%). Word count: 100 → 69.8 (-30.2%). Line count: 100 → 70.8 (-29.2%). Source: OpenAI launch post — internal evaluation, not production prevalence. — Figure: Figure: GPT-5.3 → GPT-5.5 Instant across four metrics — internal evaluation improvement, not production prevalence

Reading OpenAI's own wording carefully, evaluation was selected on three conditions:

Factuality-heavy — questions where facts matter most
Previous failures — examples earlier models got wrong (a hard-cases pool)
High-stakes scenarios — medical, legal, financial — where misinformation has the biggest impact

In short, the improvements are measured on a worst-case-curated evaluation. The same ratios won't carry into casual chat, summarization, or code-completion error rates.

That doesn't make the improvement bad. "The worst cases got better" means the tail risk of the system shrunk. For sales work that has to write facts pointed at a counterparty, reducing the "catastrophic 1% hallucination" matters more than improving the average case.

4. API usage — the `chat-latest` alias and pricing

Per the OpenAI API docs (developers.openai.com/api/docs/models/chat-latest), chat-latestis "a snapshot of the Instant model currently in use in ChatGPT" — it auto-tracks OpenAI's updates.

On 2026-05-05, the OpenAI API Changelog updated chat-latest to target GPT-5.5 Instant. Custom API clients pointing at chat-latest got the new model the same day with zero code changes.

Pricing and the 272K threshold

Pricing as of 2026-05-13, from the official docs:

Item	Price	Condition
Input tokens (≤ 272K)	$5 / 1M	Standard
Output tokens (≤ 272K)	$30 / 1M	Standard
Input tokens (> 272K)	$10 / 1M	Remaining session 2x
Output tokens (> 272K)	$45 / 1M	Remaining session 1.5x

Generating one contact-form message rarely crosses 272K. Per-company total is typically 3,000–15,000 tokens — standard pricing applies. The 272K case only matters in "full submission history + bulk site reads" configurations.

Minimal request example

Existing OpenAI Chat Completions code works unchanged — just pass chat-latest as model.

# Chat Completions (curl)
curl https://api.openai.com/v1/chat/completions \
  -H "Authorization: Bearer $OPENAI_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "chat-latest",
    "messages": [
      {"role": "system", "content": "You are a workplace-safe sales copywriter."},
      {"role": "user", "content": "Draft a 400-character contact-form message ..."}
    ]
  }'

Calling this from Sales Claw: feed approachObjective and approachGuardrails into the system message, put the Phase A analysis into the user message.

Using GPT-5.5 Instant at work? Frame 'who / what / how to write' with Sales Claw guardrails first.

無料・MIT ライセンス。インストールせずにライブデモも試せます。

無料でダウンロードライブデモを試す GitHub

5. How it lands on sales-form copy — three Sales Claw-perspective effects

Sales-form copy has constraints that casual chat, summaries, and code generation don't. Through those constraints, here's how GPT-5.5 Instant lands.

Effect 1: cuts "catastrophic fact errors"

The worst sales-form failure is getting the counterparty's business wrong. "Regarding your SaaS business..." — except they're a custom-dev shop — escalates from no-meeting to complaint.

The 52.5% high-stakes hallucination drop, with its benchmark caveat, still points in the right direction: suppress fact errors in factuality-heavy contexts. Don't overweight it — OpenAI's caveats are explicit. Always keep the guardrail "use only the facts from the analysis."

Effect 2: less "template-y long" output

With the old 5.3 Instant, even "keep it short" in the system prompt didn't always work — background paragraphs, boilerplate self-introductions, and closing flourishes crept in.

5.5 Instant is 30% shorter by word count. Same system prompt, same instruction strength, shorter result. Sales Claw's existing messageTemplates.approachGuardrails ("mention their business," "don't say everything," "1–2 sharp strengths") now actually works as written.

Effect 3: workplace-safe tone matches the requirement

This is the biggest win. OpenAI's "informal, practical, and workplace-safe without overexplaining" phrasing aligns directly with what corporate contact-form copy needs.

項目	Sales-form copy requirements	GPT-5.5 Instant defaults
Tone	Polite / no pushy sales	workplace-safe
Length	~300–500 characters	30.2% shorter (vs 5.3 Instant)
Emoji	None	Default suppresses gratuitous emoji
Follow-up questions	Unneeded (single send)	Default suppresses unnecessary follow-ups
Overexplaining	Avoid (respect the reader's time)	Reduced by default

Sales Claw's existing approachGuardrails ("start from what the reader wants to do," "avoid template feel," "win-win is a hint, not a slogan") was written in this direction. GPT-5.5 Instant brings the model default closer.

6. Sales Claw integration sketch — `approachObjective` + `chat-latest`

Architecture sketch for adding GPT-5.5 Instant (chat-latest) to Sales Claw. Left: dashboard preferences.aiProvider settings box. Center: Phase A (company analysis + copy generation). Right: Phase B (form fill + screenshot + awaiting_approval). Phase A calls OpenAI Chat Completions API with chat-latest, system prompt carrying approachObjective and approachGuardrails, user prompt carrying analysis. After generation, Sales Claw's existing complianceFooter / do-not-contact detection / opt-out insertion gates run. Dark navy base with teal and emerald accents. — Figure: Figure 3: Sales Claw × GPT-5.5 Instant integration sketch — add chat-latest to the Phase A copy generation route while keeping all automated check gates in place

Minimum additional route

Currently preferences.aiProvider is one of claude / codex / gemini. To add OpenAI Chat Completions, options include:

Add openai as a value with the same managed-PTY architecture, wrapping chat-latest under the hood
Two-stage in Phase A: Claude / Codex drafts first → chat-latest rewrites "workplace-safe and shorter"
Review-only route: existing model writes the copy; chat-latest reviews for "wrong company / pushy tone"

In any configuration, keep the existing safety gates running (complianceFooter auto-append, do-not-contact detection, CAPTCHA-on-detect stop, rate limit, action-log.json audit log). Changing models doesn't remove the responsibility to run the checks. See Configuration reference for details.

Passing approachGuardrails into the system prompt

Sales Claw already auto-injects messageTemplates.approachObjective and messageTemplates.approachGuardrails into CLI prompts. For direct OpenAI calls, pass them as the system message:

// Pseudocode: Sales Claw → chat-latest call (provider extension)
const settings = require('./settings-manager.cjs');
const objective = settings.getMessageTemplates().approachObjective;
const guardrails = settings.getMessageTemplates().approachGuardrails;
const analysis = phaseAResult.analysis;  // Site analysis

const response = await openai.chat.completions.create({
  model: 'chat-latest',
  messages: [
    {
      role: 'system',
      content: [
        'You are a sales copywriter producing contact-form messages.',
        'Strictly follow these rules:',
        `[Objective]\n${objective}`,
        `[Guardrails]\n${guardrails}`,
        'Output only the message body. No subject line, no signature, no emoji.',
      ].join('\n\n'),
    },
    {
      role: 'user',
      content: `Write a ~400-character message to this company:\n${JSON.stringify(analysis)}`,
    },
  ],
});

Using chat-latest means OpenAI updates flow through without client string changes. When the next Instant lands, you'll get it automatically (with regression validation, of course).

7. Misinformation risk that workplace-safe doesn't erase, and safe-by-design

Switching to GPT-5.5 Instant doesn't change Sales Claw's responsibility to "lower misfire and policy-violation risk via automated checks." Sales Claw is OSS designed to do exactly that through policy control, pre-send checks, do-not-contact detection, CAPTCHA-on-detect stop, rate limits, audit logging, and automatic stop conditions.

Legal / compliance

Japan's Specified Electronic Mail Act: auto-append sender info (4 fields) via preferences.complianceFooter: true. Even when GPT-5.5 Instant shortens the body, the footer is appended by a separate path.
Terms: pages with "no sales" notices auto-skip.
No CAPTCHA bypass: stop into awaiting_approval, log on detect.
Rate limit: suppress consecutive sends to the same domain.
Opt-out: auto-insert "reply to opt out."

Don't over-trust "52.5% reduction"

Three operational checkpoints:

Use only facts from analysis: don't let the model bring "imagination" into copy. Be explicit in the system prompt.
Copy proper nouns directly: models tend to auto-complete name variants. Pull company / service names directly from the analysis.
Pre-send check: verify proper nouns in the body exist in the analysis. One added line in the existing check gate.

Don't change the auto-stop conditions

Risks that remain

Workplace-safe tone doesn't structurally eliminate "wrong company" mistakes (proper-noun completion tendency)
Personalization expansion (past chats / Gmail) is great in UI but a confidentiality risk vector. For API business use, note that this is separate from the ChatGPT-UI personalization features.
chat-latest changes behavior when OpenAI updates the snapshot. Maintain a regression suite before production.

8. Pre-production checklist + bottom line

Before adopting GPT-5.5 Instant for Sales Claw / workplace writing

Read the OpenAI launch post + System Card; understood the evaluation conditions behind 52.5% / 37.3% / 30.2% / 29.2%
Added "is the company name correct" check to the pre-send check gate
Defined how to pass approachObjective / approachGuardrails into the system prompt
Have a 30-case regression suite for chat-latest to catch behavior drift
API keys come from a secrets manager — not committed to repos or logs
Sending still goes through awaiting_approval; only items that pass automated checks are submitted
preferences.complianceFooter is on (sender-info 4 requirements auto-appended)
CAPTCHA-on-detect: do not send; log to audit
Understood that ChatGPT-UI personalization (past chats / Gmail) is distinct from API routes
Code review prevents confidential data from entering API requests

Bottom line — how to actually use the "silent default"

GPT-5.5 Instant is not a flashy feature release. It's the "ChatGPT users around the world migrated without noticing"kind of update. Precisely because of that, it's worth re-reading tone, brevity, and factuality on the user side.

For sales writing, all three changes (less hallucination, shorter, workplace-safe) blow in the right direction. But: "don't over-trust the numbers," "don't remove the check gates," and "copy proper nouns straight from the analysis" — these three keep policy-controlled autonomous operation honest.

Next action: add a "chat-latest review" step to your existing analysis → copy generation pipeline. You don't have to swap the whole pipeline. A thin final-check layer is enough to feel the tone and length effects. Sales Claw integration starts at the Quick start; current message-generation design is in Message generation. Free download: Download page.

Even with a workplace-safe model, the line of operational responsibility doesn't move.

無料・MIT ライセンス。インストールせずにライブデモも試せます。

無料でダウンロードライブデモを試す GitHub

この記事の著者

中澤圭志

Sales Claw maintainer

Sales Claw の設計・開発を担当。BtoB 営業自動化と AI 活用の実践者として、現場目線で情報発信中。

note

すべての記事

Share this article

GPT-5.5 Instant: ChatGPT switches its default — what it means and how to use it

1. What GPT-5.5 Instant is — the day ChatGPT's default "quietly" switched