Builders
Hey folks, I spent most of last week in San Francisco, but mostly sat in my hotel obsessing over my Stanford talk while trying to work out where Iād find the specific colour presents my kids had requested. Arabella wanted purple. Max wanted green. Jenni is the main reason I went to SF. She invited me to Stanford and it was such a privilege. Plus, sheās brilliant. Sharp, high-agency, but warm and very personable. She also runs an AI consultancy for CEOs and gets my highest recommendation. If you get the chance to work with her, take it. SF has this energy - how do I go faster, grow bigger, become more successful? I get it. I like that energy. But I also came home thinking that my own version of ambition looks a bit different. Yesterday I finally read the profile of Lenny from Lennyās Newsletter and a lot of it stuck with me. Because it was about building a life where you can care deeply about the work without letting the work eat everything else. That felt so āmeā I had to text him and let him know! I spent an unreasonable amount of time on my Stanford talk. Not because I didnāt know what I wanted to say, but because I cared about how it landed and was presented. And also because I was trying to make the course and the talk at the same time. Working on it also made it clearer to me that I sit in this weird middle ground where non-technical people think Iām technical, and developers donāt. But I think a lot of you are somewhere in that messy middle too. Truth is, itās all a bit messy - everyone is still figuring out agents and AI. Working with agents isnāt about becoming a developer, but understanding the shape of things; files, tools, systems. With a thick dollop of taste on top. If you can steer agents, you can get technical pretty quickly. I did. You just have to be realistic that youāll hit tons of bumps in the road. Your job is to use your agent to figure it out - and youāll probably learn something thatāll come up again and again. Thatās what I want Benās Bites to be more about. Taking you along my exploration: what Iām seeing, what Iām trying, how Iām thinking about it, what bumps I ran into, come along and try whatever sounds useful. Too much āeducationā out there is just thirsty growth-hacks to sell you something. Iāve sold a company. Iāve got three tiny kids at home. I want to do excellent work, make good money, back great companies, and build something useful without accidentally creating a job I donāt want. There are big, valuable companies that want me to work with them. Theyād give me clout, access, money. But I hesitate because I donāt know if I can give that kind of thing my all. Even with my fund. Iām invested in funds who have more money, better process, bigger pipelines. And yet my funds are outperforming. But I struggle to fundraise because my story isnāt presented cleanly and my process doesnāt look like a workflow. I tinker. I talk to developers. I try tools. I see what people click. I back founders building things I think will matter. Itās all connected. The newsletter gives me a read on what builders care about. The fund lets me back the tools that might become important. The course/workshop stuff is me trying to teach the shift Iām living through myself. Devtools built for developers today become the tools agents use. Humans steering, agents operating. Theyāll pick up the tools, compose them, run the commands, change the files, connect the systems. We just gotta understand enough of the shape of the work to direct it well. A lot of what I do comes down to feel. Caring about the thing enough to make it good, but not needing to turn every good thing into a machine. And squeezing every last drop for growthās sake. So I came back from SF without a plan to scale. Mostly I came back thinking, I want to keep building for this new class of builders. People who are curious, increasingly technical, and trying to use AI to become more capable. That feels like a good place to spend my time. Raising the floor, not the ceiling (h/t Jenni/Jenās Bites). Exploring, tinkering and teaching. p.s. the kids got their toys, and Iāll continue working on the ācourseā with care š p.p.s. speaking of care, my brother Adam just launched Hono UI - a UI kit (like Shadcn) but for projects that use Hono. Proud of him! Do me a favour and blow up his launch post š Benās Bites is brought to you by Attio, the AI CRM Honestly, no one gets excited about a CRM. But then they try Attio. It connects to Claude Code and n8n through its MCP server, completely bridging the gap between my customer data and apps. Wait, there's more, like flagging churn risk and turning customer feedback into Linear projects. Try it now. OpenAI released GPT-5.5 - On paper, itās a much better model than Opus 4.7 (minus frontend design - but using their new image gen model and asking 5.5 to turn that into code is a great workaround). This comes with a price increase making it 2x more expensive than GPT-5.4 and even slightly more than Opus 4.7 on a per-token basis. But claims itās 40% more token efficient, so per-task cost doesn't change much (Ramp reported similar results). Itās become my default model since Anthropic cracked down on usage outside of their own harness. Iām in āthinking:lowā camp, super smart and fast. Iām also trying to use the codex desktop app over a terminal and thereās lots to like - but Iām just desperate for cloud/mobile use too. One more thing⦠itās clear OpenAI is focusing on builders, theyāre shipping like crazy, are very vocal on X, responding to users and resetting rate limits. We love to see it. Claude got worse. Itās official. Anthropic posted a breakdown of what and why it happened. Basically, changing the default thinking mode and system prompt changes led to a combined fall in the quality of Claude's responses, especially in Claude Code. They still clarify that they donāt switch to a quantised or āworseā model. Memory on Claude Managed Agents is now in public beta. Context: Managed Agents is Claudeās API product for companies to easily plug in claude code-like agents into their products without managing the hassle of infrastructure. Cursor needs compute and a model. XAI needs users. And competition is higher than ever for coding agents. Cursor was raising $2Bn at $50Bn but has been paused because they struck a deal with SpaceX/XAI that gives SpaceX the option to buy Cursor for $60B later in 2026, or instead pay $10B for the collaboration. Itās a weird structure Iāve not seen before - so we will see what happens. Never a dull day. Your CRM knows nothing because you never update it. Lightfield captures every email, call, and meeting automatically ā then lets you ask anything. āWhy do we keep losing to [competitor]?ā Real answers from real conversations. Get 3 months off with code BENSBITESS13.* QuickCompare helps you choose the right LLM for your use case. Compare 50+ models on your own data in minutes with Trismik. Try free now!* Flipbook - every pixel on your screen, streamed live directly from a model. Another similar demo. Clicky lets you talk to AI and spawn an agent on your Mac. Farza built and open-sourced a version of this a few days ago, but the new clicky is closed source. OpenAI already had a version of this which you can clone. Things I learned working at OpenAI. Exa for Claude - plugin to give your Claude access to websites, people, companies, and more. Ora.run - scan and rank how well agents can find and use your business. Tolaria is another app to read/write markdown files on your desktop with second-brain-esque features. Reelful - from a camera roll to a finished reel in just 10 minutes. trunks by layerbrain - turn your own storage into git remote with a minimal CLI. webpull - instantly pull any website into a directory of clean, searchable markdown files. slacrawl - cli terminal app for slack with sqlite backend. When and how to use MCP for apps built for production. create-agent-tui - skill for building your own agent harness + terminal UI. Amaā¦
Send this story to anyone ā or drop the embed into a blog post, Substack, Notion page. Every play sends rev-share back to Ben's Bites.