AI NewsOpenAI Codex

OpenAI Codex Became "the Colleague Who Runs Through the Night" — Goal Mode GA, Locked Computer Use, Appshots, and Plugin Marketplace Explained for Non-Experts

In one week, OpenAI Codex turned into "the colleague who runs through the night." Goal Mode GA, Locked Computer Use, Appshots, the enterprise Plugin Marketplace, and CLI 0.132–0.134 — covered for general readers, with three real risks (cost runaway, over-permissioning, legal alignment) and the Sales Claw safety perspective.

中澤 圭志

中澤 圭志

@keishi_nakazawa

Sales Claw maintainer

·14 min
OpenAI Codex Became "the Colleague Who Runs Through the Night" — Goal Mode GA, Locked Computer Use, Appshots, and Plugin Marketplace Explained for Non-Experts
This English article is a concise version of the original. For the full Japanese deep-dive, see the Japanese original.

Key Facts

Release window

2026-05-20 to 2026-05-26 (Codex Thursday)

Codex App 26.519

Goal Mode GA / Locked Computer Use / Appshots / Plugin Marketplace

Codex CLI

0.132.0 → 0.133.0 → 0.134.0 (history search / unified --profile)

Sales Claw lens

Run "AI through the night" safely with audit log and auto-stop conditions

In one sentence

In a single week — from May 20 to May 26, 2026 — OpenAI Codex (OpenAI's coding-AI toolkit) shipped a chain of major updates. The tool that used to be a "give one instruction, wait for the answer" assistant has become a "hand it a goal, and it keeps thinking and running through the night like an overnight colleague."Concretely, the long-horizon Goal Mode graduated from experimental to general availability, Remote Locked Computer Use lets Codex drive desktop apps even after macOS is locked, Appshots sends the front-most window into a Codex thread with a double Command-key press, a Plugin Marketplace landed for ChatGPT Business, and Codex CLI versions 0.132 through 0.134 dropped in rapid succession. This article walks readers who are not deep in AI — solo builders, SMB operators, PMs — through "what happened in seven days, and what it means for ordinary work" using the metaphor of a night-shift colleague and a hospital ward.

Bottom line: With this round of updates, Codex has moved closer to a "hand it a one-line brief at 11 PM, and the deliverable is sitting on your desk at 8 AM" colleague. The combination of Goal Mode going GA and Remote Locked Computer Use matters not just for coding but for anything you want to run while no human is watching the screen — outbound sales, research, customer support, overnight refactors. The flip side is new risk: "the longer it runs, the larger the bill" and "hand over too many privileges and accidents happen."You don't have to touch everything this week, but if you use AI in production, you should at least review the new Codex surface area this week — otherwise next month's invoice or your first incident will surprise you.

"How is Codex different from ChatGPT?" "What does ‘AI that runs through the night’ actually mean — is it safe?" "Does this matter to a 30-person company?" — this article answers those three questions using OpenAI's official GitHub Releases and the official Codex Changelog as primary sources, from the perspective of a Sales Claw maintainer.

It's rare for Codex to ship this much in seven days. OpenAI set "Codex Thursday" as a release day on May 21, and around it shipped three CLI versions, one major desktop-app release, and a new enterprise distribution mechanism. This article focuses on the four user-facing pieces (Goal Mode GA / Locked Computer Use / Appshots / Plugin Marketplace) and the CLI 0.132–0.134 trio, with the general reader in mind.

Companion reads: the parallel Claude updates are covered in Claude Code v2.1.149 and Gemini CLI v0.43.0 simultaneous update; the broader market-share shift is in Anthropic overtakes OpenAI for the first time; the enterprise governance angle is in Claude Compliance API and 28 security partners. The previous Codex deep-dive is Codex on ChatGPT mobile.

Primary sources used: OpenAI Codex Official Changelog / openai/codex GitHub Releases / Codex Feature Maturity / Codex Docs Hub. The Sales Claw download page is also available.

1. Codex became an overnight colleague — in one sentence

First, the premise: Codex is OpenAI's coding-AI toolkit (OpenAI is the company that makes ChatGPT). "Coding AI" means AI that writes the program code humans normally type by hand. Codex itself has many entry points — inside the ChatGPT app, a dedicated desktop app, your terminal (the black window), a mobile app, and IDE extensions for VS Code and JetBrains — all pointing to the same underlying AI.

Three terms worth translating into plain English before we go further.

TermIn plain EnglishFamiliar analogy
Goal ModeA mode where you hand over a brief and Codex keeps thinking and running until it's done"Have this project ready by end of month" — the night-shift colleague
Computer UseA mechanism that lets AI drive your computer screen on your behalfThe AI borrows your mouse and clicks for you
GA (general availability)The point at which a feature graduates from "experimental" to "officially safe for production use"The moment a new drug clears clinical trials and becomes covered by insurance

These three moved at once this week. [Official] The OpenAI Codex Changelog (dated 2026-05-21) explicitly states that Goal Mode "is no longer an experimental feature and is available in the Codex app, IDE extension, and CLI." The same Changelog introduced Remote Locked Computer Use — Codex driving desktop apps after macOS is locked — with three safeguards: short-lived authorization, covered displays, and automatic relock on local input.

[Author's view] From a Sales Claw maintainer's vantage, the fact that all of this shipped in a single week is "the decisive sign that the industry is locking in ‘goal-directed autonomy’ as the default pattern, instead of question-and-answer."The ChatGPT pattern is a back-and-forth: ask, receive, ask again. Goal Mode is a different object: hand it a goal, it breaks it into tasks, executes, recovers from failures, and doesn't stop until done. Anyone building outbound sales agents, research agents, or support agents — including Sales Claw — will see the same flow arrive within six months.

2. Five updates on one page — anchored at Codex Thursday, 2026-05-21

Overview whiteboard illustration of the five Codex updates shipped May 20–26: Goal Mode GA, Locked Computer Use, Appshots, Plugin Marketplace, and CLI 0.132–0.134, with arrows showing how each plays a different role
Figure: Role overview of the five features. Goal Mode is the autonomy core; Locked Computer Use and Appshots are the "eyes and hands across the screen"; Plugin Marketplace is the enterprise distribution box; CLI 0.132–0.134 is the infrastructure layer

Timeline first. Cross-checked against the openai/codex GitHub Releases repository and the official Codex Changelog:

Date / Time (UTC)ProductVersionKey changes
5/20 01:52Codex CLIv0.132.0First-class auth in Python SDK, plain-string turn API, exec resume with output schema
5/21 (Codex Thursday)Codex App26.519Goal Mode GA / Locked Computer Use / Appshots / Plugin Marketplace
5/21 16:48Codex CLIv0.133.0Goals enabled by default, remote-control runs foreground, permission profile inheritance
5/26 19:13Codex CLIv0.134.0Conversation-history search, unified --profile selector, MCP setup improvements, Windows TUI render fix

Among these, the highlighted desktop app 26.519 and CLI 0.133 / 0.134form the core. CLI 0.132 (5/20) is a foundation update aimed at developers who invoke Codex through the Python SDK; less direct for end users, but part of the same "easier to embed Codex elsewhere" arc.

[Author's view] The concentration on 5/21 makes sense if you assume "OpenAI fixed May 21 as the Goal Mode GA day, and aligned CLI, IDE extensions, desktop, and mobile around it."CLI 0.133 (5/21 16:48 UTC) literally says "Goals are now enabled by default" in its release notes — the cross-surface synchronization was clearly intentional.

3. Goal Mode went from "experimental" to a real feature — what changed

Using Goal Mode is surprisingly simple. Type /goal in the Codex chat and you get a single-line field to write the objective. For example: "Upgrade the Sales Claw repository from Python 3.12 to 3.13 and get every test to pass." Codex then plans on its own: (1) list the current dependencies, (2) check each library's 3.13 compatibility, (3) find replacements for incompatible ones, (4) edit the code, (5) run the tests, (6) read errors and fix them, (7) loop until everything passes.

Three things changed materially compared to the experimental version: (A) the run survives session breaks, token-budget resets, and network drops — state is restored and execution resumes; (B) progress is always visible; (C) on failure, Codex itself plans the retry strategy. [Official]The Codex CLI v0.133.0 release notes explicitly state: "Goals are now enabled by default, backed by dedicated storage, and track progress across active turns." A dedicated storage layer now backs progress.

[Author's view] From a Sales Claw maintainer's gut sense, this is the moment AI went from a "sprinter" to a "marathon runner." Earlier models maxed out at maybe 10–30 minute tasks in a single long prompt. Goal Mode defaults to "don't stop until the goal is met"— so "hand it a brief at 11 PM, find the deliverable at 8 AM" becomes a realistic operating model.

Goal Mode use cases extend beyond pure coding. [Official] OpenAI lists package migrations, test coverage targets, flaky test reproduction, overnight refactors, and performance profiling-and-patching — all tasks where "the next step depends on what the AI discovers in the previous step."For a sales agent, "scan a list of 1,000 companies, filter the ones with reachable contact forms, read each company profile, and draft a personalized message" is the same shape of problem.

4. Locked Computer Use and Appshots — eyes and hands that keep working after the screen locks

Whiteboard illustration of the Locked Computer Use flow. Left: locked Mac screen showing a cover. Center: Codex driving desktop apps in the background. Right: local keyboard input triggers automatic relock. Three safeguards (short-lived authorization, covered display, relock on local input) are highlighted in blue boxes
Figure: Three safeguards of Locked Computer Use: short-lived authorization, covered display, automatic relock on local input. Together they make "trust AI with your screen while you're away" defensible

To understand why Locked Computer Use is a step change, look at the limits of the previous Computer Use (AI driving your computer screen). The earlier setup assumed "a human is sitting in front of the screen watching the AI." The reasons were practical: (1) someone has to stop the AI if it goes off the rails, (2) a locked screen blocks the AI too, (3) starting a remote session was unreliable.

[Official]Codex App 26.519's Locked Computer Use solves these with three explicit safeguards:

  • Short-lived authorization: each session is bound to a tight time window where Codex is allowed to drive the machine; when the window expires, control is revoked automatically
  • Covered displays: the physical display shows a "Codex is working" cover so shoulder-surfing is blocked
  • Relock on local input: the instant a keystroke or mouse movement hits the physical machine (meaning "the human came back"), Codex's control stops and the screen relocks

These three together make "Codex works quietly while you're in a meeting" and "you launch Codex from your phone outside the office, and your Mac at home prepares the report" into realistic operating modes. [Unverified] Windows-side support for Locked Computer Use is not announced publicly. Because the macOS lock-screen behavior is wired in at the OS level, the equivalent on Windows would need a separately designed integration.

Appshots is a separate convenience. On a MacBook, pressing the Command (⌘) key twice attaches a screenshot of the current front-most app to the Codex thread. [Official]The OpenAI Codex Changelog (2026-05-21) describes it as "Appshots lands on macOS, letting you inject the frontmost window into any Codex thread with a double Command key press."

Everyday uses include "screenshot the error and ask Codex to fix it," "screenshot the Figma design and ask Codex to translate it to code," "screenshot a Slack thread and ask Codex to summarize." You could already drag a screenshot into the chat, but Cmd × 2 cuts the friction much more than the time difference suggests.

項目Previous Computer Use (human-at-keyboard assumption)Locked Computer Use (5/21 GA)
Screen state requiredUnlocked onlyOperates even when locked
SafeguardHuman supervises liveShort-lived auth / covered display / auto-relock
Remote startEffectively unsupportedLaunch from Codex Mobile
Usable whenOnly at deskIn meetings, away, overnight
Emergency stopHuman intervenes manuallyPhysical input stops it instantly

Sales Claw shares the same overnight-autonomy philosophy as Codex, with safety scaffolding tailored for outbound sales.

無料・MIT ライセンス。インストールせずにライブデモも試せます。

5. Plugin Marketplace and Codex CLI 0.132–0.134 in detail

Plugin Marketplace is a feature for enterprise IT. A "plugin" is an add-on that teaches Codex extra capabilities — "talk to Salesforce," "search the internal wiki," "call our company API." [Official]The OpenAI Codex Changelog (2026-05-21) describes Plugin Marketplace as offering "reusable plugin bundles that include skills, app integrations, MCP servers, lifecycle hooks." Initially open to ChatGPT Business; enterprise rollout is in preparation.

Here's where MCP (Model Context Protocol) shows up. MCP is a "common faucet between AI and external tools" — a standard proposed by Anthropic that the rest of the industry has adopted. Codex supports MCP, and through Plugin Marketplace, a company can distribute "an MCP server that reads our Notion," or "an MCP server that runs SELECTs on our Postgres," to all employees at once. The deep dive on MCP lives at MCP Model Context Protocol complete guide.

CLI 0.132 through 0.134, summarized in one table:

VersionDate (UTC)Key additions (in plain language)
v0.132.05/20 01:52Official auth support for the Python SDK / write simple text workflows with plain strings / resume with a structured output schema (JSON shape)
v0.133.05/21 16:48Goal Mode enabled by default / remote-control runs as a foreground command / permission-profile inheritance and requirements.toml management / plugin discovery improvements
v0.134.05/26 19:13Conversation-history search (case-insensitive) / unified --profile flag / per-server MCP environment / parallel execution of read-only MCP tools / Windows TUI render fix

Conversation-history search (0.134)is the change everyday users will feel most. If you regularly think "Codex showed me a clever trick last week — which thread was that?", that single feature is worth upgrading for.

Permission profiles (0.133) is the bigger change for production teams. You can now define --profile dev "allow everything", --profile prod "no file writes", --profile review "read-only", and swap the whole permission set per situation. [Author's view]From a sales-agent designer's seat, this matches the way Sales Claw splits permissions across "read-only for list building," "sandbox-only for test sends," and "production send" — the industry is making fine-grained permissions a standard component.

6. Getting started — three scenarios: individual / SMB / enterprise

Whiteboard flowchart of three Codex onboarding scenarios. ‘Individual: start in 5 minutes today,’ ‘SMB: 1–2 weeks to deploy,’ ‘Enterprise: 1–3 months to roll out,’ with the corresponding first actions (ChatGPT Plus subscription / ChatGPT Business contract / permission-profile design)
Figure: Three onboarding paths. Only the individual path is "today". SMB and enterprise need preparation phases

Individual (5 minutes today)

The easiest entry: if you already have ChatGPT Plus ($20/mo), download the Codex App from https://chatgpt.com/features/desktop/, sign in with your ChatGPT account, and you get Goal Mode + Appshots immediately. Locked Computer Use is macOS-only.

For the first run, try Goal Mode on a small repository. Something like "clean up this README, add a table of contents" — a 5- to 10-minute goal. Don't open with an overnight target; you will be surprised by the bill.

SMB (1–2 weeks to deploy)

For a 10–100 person company, contract ChatGPT Business($25/user/month area) and distribute Codex App to employees. Plugin Marketplace becomes the lever — a single bundle for "internal wiki search" or "deal management" goes out to everyone at once instead of being set up on each laptop.

Three priorities for the rollout: (1) per-user usage caps (so you don't repeat the Uber four-months-of-budget story), (2) permission profiles split by role — sales, engineering, executive — (3) audit logging enabled from day one.

Enterprise (1–3 months to deploy)

For 500+ employee organizations, bring IT, legal, and security to the table. (1) Pilot in one department (50 users), (2) standardize permission profiles and audit-log routing, (3) integrate with existing DLP / SIEM / SASE, (4) roll out company-wide. [Author's view] As Anthropic showed with Claude Compliance API with 28 security partners, the industry is putting AI under the same governance umbrella as other SaaS. A similar Codex-side announcement is likely in late 2026.

7. Risks — the price of "runs overnight"

Cost runaway — the Uber lesson

[Official] In April 2026, Uber CTO Praveen Neppalli Naga acknowledged in The Information that "we burned through our 2026 AI budget in four months." Uber rolled out Claude Code to ~5,000 engineers in December 2025; by April 2026 the per-engineer monthly API spend was $500–$2,000 (¥75K–¥300K). Heavy use of an autonomous mode like Goal Mode can produce the same outcome on Codex.

The countermeasure: bake the limits in from day one. Concretely — (1) per-user monthly cost cap, (2) max elapsed time per Goal Mode session, (3) max turns per session, (4) per-project budget ceiling — turn all of these on before the first production run.

Over-permissioning — when autonomy escapes

Enabling Locked Computer Use means "AI drives desktop apps while the screen is locked." Convenient, but granting too many privileges raises the chance of unexpected actions.The intended behavior "Codex posts a status update to internal Slack" can drift into "Codex sends a message to a customer on Slack" with a single configuration mistake.

The countermeasure: start permission profiles at "read-only" or "sandbox-only". Add write and external-send privileges in stages. For the first 1–2 weeks, a human reviews every artifact before anything goes outside the company.

If you put Codex on outbound contact-form sending or customer-support replies, in Japan you must verify alignment with the following:

  • Act on Specified Commercial Transactions (特定商取引法): no exaggerated claims, opt-out path required
  • Anti-Spam Act (特定電子メール法): four sender-information requirements (name, address, opt-out path, contact) must appear in the message body
  • Act on the Protection of Personal Information (個人情報保護法): explicit purpose of use, third-party transfer restrictions, response to disclosure requests

[Author's view]Codex is a tool; legal alignment is the operator's responsibility. Sales Claw embeds "policy-controlled autonomy" — pre-send automatic checks, sales-NG detection, CAPTCHA-detected stop, send-frequency throttling, audit-log storage, and automatic specified-commercial-law footer — as bundled OSS. If you build a Goal-Mode workflow yourself on raw Codex, you implement these layers yourself.

Graph of Codex Goal Mode autonomous-execution cost structure. X-axis: elapsed run time (1h, 4h, 8h, 24h). Y-axis: API cost ($). Two lines compare ‘run with no cap’ against ‘run with turn-cap and time-cap’
Figure: Run Goal Mode without caps and cost climbs linearly. With turn and elapsed-time caps the curve flattens (estimate)

8. Sales Claw perspective — how to embed an overnight colleague into real work

From here, the implementation lens — what it actually takes to put Codex Goal Mode into production. Useful even for general readers as a "checklist before adopting Codex at work."

If you embed Codex Goal Mode into a production workflow, the three patterns Sales Claw landed on after 90 days are worth building in from day one.

1. Automatic stop conditions (count × time × turn cap, all AND'd)

Always combine multiple termination conditions with AND on an autonomous loop. Sales Claw defaults every session to "stop at 100 items OR 5 hours OR 200 turns, whichever first." [personal_metric] I (Nakazawa) rewrote Sales Claw's design three times in three months. Round one had no stop conditions and ran wild; round two had a count cap and overshot the wall-clock; round three was the first one with AND conditions that ran stably.

2. Audit log storage (local JSONL)

Every autonomous action is appended to an audit log (action-log.json)in time order. Sales Claw records "when / to whom / with which copy / via which path / whether CAPTCHA stopped it" in JSONL so operators can retrace. Codex permission profiles alone don't give you the "what happened when" timeline; for production use you need a separate logging mechanism.

3. Pre-send automatic checks (no dependence on per-send human approval)

The dilemma "human review can't scale, but we still can't afford a wrong send" is solved by pre-send automatic checks. Sales Claw runs pre-send automatic checks, sales-NG detection, CAPTCHA-detected stop, send-frequency throttling, and automatic specified-commercial-law footeras a gate before the first byte goes out. The design scales business throughput without depending on "a human looks at every send" — risk is lowered through the combination of automatic checks. This is the extra layer Codex Goal Mode alone doesn't provide.

Timeline of Codex updates from 2026-05-20 to 2026-05-26. CLI v0.132.0 on 5/20, App 26.519 plus CLI v0.133.0 on 5/21 (Codex Thursday), CLI v0.134.0 on 5/26, with each version's main features annotated in callouts
Figure: Timeline of the Codex release chain (UTC). 5/21 Codex Thursday is the spine
Bar chart comparing the autonomy capabilities of Codex, Claude Code, and Sales Claw across four axes: Goal-Mode-equivalent, Computer-Use-equivalent, audit-log mechanism, outbound-sales-specific machinery
Figure: Four-axis comparison of autonomy capabilities. Codex tilts toward coding; Sales Claw is specialized for outbound sales

9. Pre-production checklist

Before putting Codex Goal Mode into production

  • Subscribed to ChatGPT Plus / Business / Enterprise
  • Installed at least one of: Codex App / CLI / IDE extension
  • First Goal Mode trial limited to a 5–10 minute small task
  • API usage caps configured (per user, per session)
  • Goal Mode max turns / max elapsed time / max cost set
  • Permission profiles split per role (read / test / production)
  • For Locked Computer Use: verified the covered-display behavior
  • Audit-log destination chosen (local JSONL, or routed to existing SIEM)
  • For outbound sales / support: verified alignment with applicable laws
  • Incident stop procedure documented (who stops it, how)
  • Operating flow defined for human handoff of overnight artifacts
  • Contract review clause included within 12 months (industry moves fast)

10. Conclusion — how to use "the overnight colleague" in everyday judgment

The week between May 20 and May 26, 2026 was the moment Codex moved from "question-and-answer" to "goal-directed autonomy."Goal Mode went GA, Locked Computer Use unlocked the screen-locked hours, Appshots removed the friction of "let me show you," Plugin Marketplace opened the enterprise distribution path, and CLI 0.132–0.134 strengthened the foundation.

At the same time, "AI that runs overnight" brings new costs and risks. Uber burning through its 2026 AI budget in four months is the cautionary tale of the cycle. Without cost ceilings, permission profiles, audit logs, and automatic stop conditions wired in from day one, the price of convenience climbs faster than expected.

Next action: spend 5 minutes — if you have ChatGPT Plus, open Codex App and try Goal Mode on a tiny task. SMB operators: don't roll out to staff before talking to IT about permission profiles and cost caps. If you want AI for outbound sales or support, start with a sales-specialized OSS like Sales Claw — the quick-start guide is the entry point.

Related reading: Claude Code v2.1.149 and Gemini CLI v0.43.0 simultaneous update / Anthropic overtakes OpenAI for the first time / Codex on ChatGPT mobile.

This article is an English-language version of the Japanese-language original. The Japanese version governs in case of discrepancies.

Open Codex App for five minutes. That's the shortest route to putting AI into your real workflow.

無料・MIT ライセンス。インストールせずにライブデモも試せます。

よくある質問

In one sentence, what does it mean that Codex Goal Mode "went GA"?
[Official] The OpenAI Codex Changelog dated 2026-05-21 explicitly states Goal Mode (the goal-directed autonomous execution mode) "is no longer an experimental feature and is available in the Codex app, IDE extension, and CLI." GA (General Availability) is the moment a feature graduates from experimental / beta to "officially safe for production use" — comparable to the moment a new drug clears clinical trials and becomes insurance-covered. To use it, run /goal in Codex, write a one-line objective like "Upgrade the Sales Claw repository from Python 3.12 to 3.13 and get all tests passing," and Codex plans the tasks itself, runs autonomously for hours or days, and doesn't stop until the goal is met. Three things changed materially from the experimental version: (A) the run survives session breaks, token budget resets, and network drops with restored state; (B) progress is always visible; (C) on failure, Codex itself plans the retry strategy.
What's impressive about Locked Computer Use? How is it different from normal Computer Use?
[Official] Locked Computer Use, shipped in Codex App 26.519 (2026-05-21), lets Codex drive desktop apps even when macOS is locked. The earlier Computer Use assumed "a human watches the screen while AI works"; Locked Computer Use replaces that with three safeguards. (1) Short-lived authorization: each session is bound to a tight time window where Codex is allowed to drive the machine; expiry revokes control automatically. (2) Covered displays: the physical display shows a "Codex is working" cover, blocking shoulder-surfing. (3) Relock on local input: the instant a keystroke or mouse movement hits the physical machine, Codex stops and the screen relocks. Together these make "Codex works while you're in a meeting" and "launch Codex from your phone outside the office, your Mac at home prepares the report" realistic. [Unverified] Windows-side support is not announced publicly. Because the macOS lock-screen behavior is wired in at the OS level, the Windows equivalent would need a separately designed integration.
Is Appshots MacBook-only? Can Windows users use it?
[Official] The OpenAI Codex Changelog (2026-05-21) describes Appshots as "lands on macOS, letting you inject the frontmost window into any Codex thread with a double Command key press." It is macOS-only at launch. Windows / Linux roadmaps are not publicly announced as of 2026-05-27. Windows users continue to use Win + Shift + S and drag the screenshot into the chat. The appeal of Appshots is removing friction from the "let me show you my screen" flow — error screens, Figma designs, Slack threads — where you want to discuss something visible. [Author's view] Windows support is likely within six months, but selecting the right keyboard shortcut (double Win key? Ctrl + Shift + S?) will take design time.
Is Plugin Marketplace usable on individual plans, like ChatGPT Plus?
[Official] Plugin Marketplace was published "for ChatGPT Business, with enterprise rollout in preparation." Individual ChatGPT Plus / ChatGPT Free are not in scope at launch. Plugin Marketplace bundles "skills, app integrations, MCP servers, and lifecycle hooks" into reusable packages that enterprise IT can distribute to all employees at once — "internal wiki search," "deal management," "internal API connector," etc. For individuals, the alternative is writing an MCP server yourself and configuring it in Codex, or installing community MCP servers one-by-one on your machine. [Author's view] Opening Plugin Marketplace to individual plans is technically possible, but a quality / security review pipeline needs to mature first. Expansion to Plus during late 2026 is a moderate possibility.
What changed across Codex CLI 0.132 to 0.134?
[Official] Per openai/codex GitHub Releases, three CLI versions shipped in roughly one week. v0.132.0 (5/20 01:52 UTC): official Python SDK auth support (API key login, device-code flow), plain-string text turn API, and structured-output resume. v0.133.0 (5/21 16:48 UTC): "Goals are now enabled by default" is the biggest line — Goal Mode is on by default in the CLI, progress is backed by dedicated storage, remote-control runs as a foreground command, and Permission profiles gained inheritance and requirements.toml management. v0.134.0 (5/26 19:13 UTC): conversation-history search (case-insensitive) added, --profile unified as the primary profile selector across CLI / TUI / sandbox, MCP setup gained per-server environment targeting and OAuth, read-only MCP tools can run in parallel, and Windows TUI render issues were fixed. For general users, the most welcome change is the 0.134 history search — "where was that clever trick Codex showed me last week?" finally has an answer.
How does this relate to Sales Claw? Is "AI that runs through the night" actually safe?
Sales Claw is a separate OSS tool, specialized for contact-form outbound sales, offering "policy-controlled autonomy." It shares Codex Goal Mode's philosophy (no dependence on per-send human approval) but bundles outbound-sales-specific layers — pre-send automatic checks, sales-NG detection, CAPTCHA-detected stop, send-frequency throttling, audit-log storage, automatic specified-commercial-law footer — as OSS. To put "runs through the night" into production safely, you need three pillars: automatic stop conditions (count × time × turn cap AND'd), local JSONL audit log, and pre-send automatic checks. Codex alone doesn't provide these — you implement them yourself. [In-house verification] Sales Claw v0.5 over the past 90 days: 3,247 form sends, 412 CAPTCHA encounters (12.7%), 47 sales-NG patterns detected, ~¥12,500/month in API cost, zero incidents — but the sample is small and industry / seasonal effects need separate verification. [Author's view] Goal Mode reaching GA actually raises the value of domain-specialized OSS like Sales Claw, because "general-purpose AI tool (Codex) + domain-specific safety layer (Sales Claw)" is the pattern likely to standardize.

参考文献

本記事は X 公式アカウントと公式ドキュメントを一次情報として参照しています。

  1. [01]
  2. [02]
  3. [03]
  4. [04]
  5. [05]
  6. [06]
  7. [07]
  8. [08]
  9. [09]

この記事の著者

中澤 圭志

中澤 圭志

Sales Claw maintainer

Designs and develops Sales Claw. Writes from the field on B2B sales automation and applied AI.

Share this article