Codex is gaining steam
Hey folks, I spent most of the bank holiday weekend offline for a change whilst at a wedding in the English countryside. I would've been more online if Codex had a mobile app, but still waiting on that... This morning I did just install this skill that lets you iMessage Codex, which is pretty great: it essentially keeps a thread open in the app that you can message. Just paste the link in Codex and it'll guide you through everything.

Ben's Bites is brought to you by Gravitee. As AI Agents connect to more APIs, security risk gets harder to manage. Gravitee helps teams govern APIs, events, and AI Agents while reducing silos and cost. See what enterprise teams are prioritizing in the State of AI Agent Security report.

OpenAI wants non-technical users to use Codex, and is making it easy to switch. You can now import settings, plugins, agents, project configuration and more into Codex (from tools like Claude Cowork). They are also directly improving features related to everyday work, like creating slides/sheets, plus friendlier UI changes.

Grok 4.3 is out in the API. 1M context, text + image input, reasoning and a December 2025 knowledge cutoff. It's priced at $1.25/$2.50 per million input/output tokens, i.e. much cheaper than Sonnet 4.6 for relatively similar performance.

Entire, the company by GitHub's ex-CEO, released two new things: git-sync, a utility to mirror git repos from a source to a target without needing to clone them locally, and Dispatches, a feature on their web platform to generate release notes from recent ships, commits, and agent sessions by repo/date range.

Charity Majors and Christine Yen headline Honeycomb's Innovation Week (May 12-14), a 3-day virtual event addressing observability for the agent era. Learn how the most forward-thinking engineering teams are rising to meet this challenge. Register now.*

Lightfield - AI-native CRM that learns how you sell. Describe any workflow in English, your CRM runs it on command. 3 mo free w/ BENSBITEST13*
Sauna - learns how you work, remembers everything that matters, and acts on it (portfolio company!)
Shared Brain by Zapier - collective knowledge vault for your team and a personal assistant to complete tasks. Now in early access.
Manus Cloud Computer - always-on cloud machine for Manus so bots, scripts, databases and scheduled jobs keep running when your laptop is off. Files and installed tools persist across sessions.
Proxyuser - test all the core flows of your app via a synthetic user with a real browser, including signups.

Web UI Bench - same UI components built by 20 models, shown side-by-side. GPT-5.5 uses too much bland text in the UI when an icon or control is self-explanatory (compared to Opus 4.7).
Flue - TypeScript framework for building Claude Code-style agents.
deepsec - open-source security harness from Vercel for finding vulnerabilities in your codebase with coding agents.
localterm - run a terminal in your browser with npx localterm@latest start.
open-slide - slide framework built for agents. Visual edits, comments, assets and agent-readable slide structure.
Refero Styles - 2,000+ DESIGN.md files from real products that your agent can use for style references.

How OpenAI delivers low-latency voice AI at scale.
crabbox - run your dirty worktrees in a remote sandbox easily. (tweet)
OpenAI has a new opt-in feature for Advanced Account Security in ChatGPT/Codex.
Base44's Frustration Meter - usage-based model benchmark. Base44 says Opus 4.7 caused 43% more frustration than Opus 4.6.
Cofounder 2 - another "run a company with agents" product that's a combo of vibe coding, finding leads and sending sales emails.
How Posthog plans to change in the AI era.

Read about me and Ben's Bites
📷 thumbnail by @keshavatearth
* sponsors who make this newsletter possible :)
Wanna partner with us for the next quarter? Email us at shanice@bensbites.com or k@bensbites.com