đď¸ This week on How I AI: GPT 5.5, Claude Design, and GPT Images 2.0 hands-on reviewsâplus an inside look at Memelord
Listen now on YouTube ⢠Spotify ⢠Apple Podcasts Claire put GPT 5.5 to the test on real, messy problemsâfrom a six-hour autonomous migration to a hardware hack no other model could crack. GPT 5.5 is incredibly smart, but most ChatGPT users donât have problems complex enough to justify its intelligence or cost. Claire struggled to find meaningful use cases in her personal ChatGPT account because everyday tasks donât require super-intelligence. The model spent 17 minutes thinking about how to build a simple subtraction app for her first-graderâimpressive, but overkill. The real value unlocks when you have genuinely hard technical problems. The âI trust you, figure it outâ prompt unlocks autonomous multi-hour workflows. Claire gave GPT 5.5 a complex data migration problem involving 2 million rows of unstructured data with endless edge cases. She told it: âI trust you to make a call, figure out how to spawn a subagent to do this, test it, identify issues, repair them, and get this ready for production.â The model worked autonomously for almost six hours with zero follow-up prompts, zero steering, and only one approval request. This is the first time Claire has seen truly long-running autonomous agent behavior. GPT 5.5 passed the ultimate intelligence test: hacking proprietary hardware. Claire spent months trying to reverse-engineer a Chinese Bluetooth speaker with proprietary encoding. She tried Claude Code, GPT-4, everythingânothing worked. She went full detective mode: downloaded Bluetooth profiling tools, hooked up packet sniffers, crawled Chinese documentation repositories. When she finally threw all this context at GPT 5.5, it cracked the bitmap encoding and Bluetooth transport mechanism. Now she can send messages to the speaker from the terminal and has built Codex notification hooks that display on the device. The model is expensive, but cheaper than human engineering time. GPT 5.5 Pro costs $30 per million input tokens and $180 for output tokensâexpensive. But when Claire reflects on what it accomplished (six hours of autonomous work, 2 million rows validated, six months of tech debt eliminated), the ROI is obvious. Itâs cheaper than her time and cheaper than her engineering teamâs time, and it solved problems that would have required significant human coordination and focus. Fix the âbaked potato personalityâ with slash commands. Out of the box, Codex with GPT 5.5 has what Claire calls a âbaked potato personalityââdull and robotic. But if you type â/personalityâ in Codex, you can change it to something friendlier. Some testers complained it became âtoo Gen Z,â but Claire prefers that over the default bland responses. Itâs a small quality-of-life improvement that makes working with the model more enjoyable during long sessions. My GPT-5.5 ReviewâA 6-Hour Autonomous Task and the Bluetooth Hack No Other Model Could Solve: https://www.chatprd.ai/how-i-ai/openai-gpt-5.5-review âł Reverse-Engineer a Proprietary Hardware Protocol with AI: http://chatprd.ai/how-i-ai/workflows/reverse-engineer-a-proprietary-hardware-protocol-with-ai âł Perform an Autonomous Data Migration with an AI Agent: https://www.chatprd.ai/how-i-ai/workflows/perform-an-autonomous-data-migration-with-an-ai-agent âł Automate Security Vulnerability Remediation with AI: https://www.chatprd.ai/how-i-ai/workflows/automate-security-vulnerability-remediation-with-ai Listen now on YouTube ⢠Spotify ⢠Apple Podcasts Brought to you by: Claire tests Claude Design and ChatGPT Images 2.0 by building real assets like landing pages, decks, and brand kits, showing what actually works, whatâs slow, and where traditional tools like Figma still win. Design systems are now first-class citizens in AI design tools. Claude Designâs entire workflow starts with importing your design systemâfonts, colors, components, brand assetsâand structuring them into a format AI can use consistently. This is a fundamental shift from prototyping tools that ignore your brand. Google just released Design MD as a proposed standard for how to describe design systems to AI agents, signaling that this is where the entire industry is heading. Claude Design excels at marketing assets but struggles with product UX. If youâre building landing pages, marketing sites, or presentation decks that need to match your brand, Claude Design is genuinely impressive. It adheres to design systems well for these use cases. But for app components and complex user experience flows, it doesnât reason as effectively with design system constraints. Know what youâre building before choosing your tool. Figma still wins on iteration speed, and that matters more than you think. Claude Design takes 5 to 10 minutes to generate designs, and every tweak requires another LLM call. Figma lets you drag, change fonts, adjust colors instantlyâno model in the loop. We underestimate how valuable that immediate feedback is when youâre iterating on design. AI design tools are great for getting to a first draft, but traditional tools still dominate the refinement phase. The number one Claude Design slop tell: italicized serif fonts everywhere. Just like Claude Code has its telltale phrases (âin summaryâ), Claude Design has a design signatureâit absolutely loves italicized serif fonts in landing pages. Once you see it, you canât unsee it. This is useful for both identifying AI-generated designs and knowing what to specifically override in your prompts. GPT Images 2.0 finally nailed layout and typography for brand work. The new model can generate multi-page brand kits with proper text rendering, consistent layouts, and sophisticated typographyâthings previous image models completely failed at. For marketers who need brand assets that combine images, text, and layout, this is a real breakthrough. The quality looks expensive, not obviously AI-generated. Let AI run wild without design systems for the most creative results. When Claire asked Claude Design to create a â90s GeoCities version of Lennyâs Newsletter without any design system constraints, it produced âLennyâs Product Zoneâ with Comic Sans, brick backgrounds, and exceptional copy like âYour OKRs are cringe (and seven ways to fix them before Q3).â The lesson: reference styles and creative direction work better than rigid constraints when you want something unexpected. Content-to-slides is Claude Designâs killer practical use case. Take an article, add your design system, and Claude Design generates a beautiful, on-brand presentation deckâcomplete with code-based elements like animated terminals with blinking cursors. For product marketers, enablement teams, and anyone creating customer-facing decks, this workflow is immediately valuable and actually works well. How I Put Claude Design and GPT Images 2.0 to the Test: Building Landing Pages, Slides, and Brand Kits: https://www.chatprd.ai/how-i-ai/claude-design-and-gpt-images-2-building-landing-pages-slides-and-brand-kits âł How to Generate a Professional Brand Kit with GPT Images 2.0: https://www.chatprd.ai/how-i-ai/workflows/how-to-generate-a-professional-brand-kit-with-gpt-images-2-0 âł How to Convert an Article into a Polished Slide Deck with AI: https://www.chatprd.ai/how-i-ai/workflows/how-to-convert-an-article-into-a-polished-slide-deck-with-ai âł How to Build a High-Fidelity Landing Page with Claude Design: https://www.chatprd.ai/how-i-ai/workflows/how-to-build-a-high-fidelity-landing-page-with-claude-design Listen now on YouTube ⢠Spotify ⢠Apple Podcasts Brought to you by: Jason Levin explains how he grew Memelord to $100K ARR without writing code, then rebuilt it as an API-first product for agentsâplus why every marketer should vibe code and what happens when you let them ship. Let your marketers cookâor watch them leave your company. Jason has one rule at Memelord: every marketer has to vibe code. This isnât some abstract CEO mandateâitâs a survival strategy. His free tools section (built entirely by non-technical marketers using Cursor) has generated hundreâŚ
Send this story to anyone â or drop the embed into a blog post, Substack, Notion page. Every play sends rev-share back to Lenny's Newsletter.