Apple's on-device AI changes the math for small apps
Welcome to the first issue of Zero to MVP Weekly. The deal is simple: one signal, one tool, one MVP idea — five minutes, every Saturday. No link dumps. Let's go.
1. The signal of the week
Apple made AI features free for small apps.
At WWDC on Monday, Apple expanded the Foundation Models framework. The headline that matters for builders: developers with fewer than two million first-time App Store downloads now get free access to Apple's models running on Private Cloud Compute. The framework also picked up three things worth noting:
- Image input — so you can feed it photos, receipts, screenshots.
- Provider swapping — call Claude or Gemini through the same Swift API, with no code changes.
- A Python SDK (it's not Swift-only anymore) and a "Dynamic Profiles" system for multi-agent workflows.
It goes open source later this summer.
Why it matters: inference cost is the silent killer of small AI apps — it's the reason almost every indie AI app ships with a subscription from day one. Most indie apps live their entire life under two million downloads (that's not failure, that's the normal shape of a small product). For all of them, the cost of running AI features just dropped to zero. The margin math for small paid apps changed in a single keynote.
One caveat to keep you honest: Apple hasn't published rate limits for the free tier — only the download threshold. "Free" with an unknown ceiling is something you plan around, not something you bet the whole app on yet.
2. My take
The free inference is the obvious headline, but I think the more interesting part is the unified Swift API. In theory, you can now switch between Apple, Claude, and Gemini without rewriting the app logic. That could reduce provider lock-in in one of the most painful places — the application layer.
But I'd keep two caveats in mind. First, these are not frontier models, so I would expect them to be good enough for extraction, summaries, and structured output, not for complex reasoning. And honestly, that is probably enough for many real app features.
Second, this also means building deeper into Apple's ecosystem. And since the free tier limits are not public yet, I would prototype with it, but I would not base a serious cross-platform product on it right away.
If you already build Swift apps, this may become the cheapest AI stack for small apps — because part of it is now built into the OS.
3. Tool worth trying
MiniMax M3 — the open counterpoint to today's signal. Apple gives you free but closed on-device AI; M3 is the frontier-grade model you fully control.
Ran MiniMax M3 for the first time, in the cloud — it's far too big to run locally — and first impressions are strong. The main strength: the model genuinely handles large context well. I threw in an entire project codebase — dozens of files, thousands of lines — and it actually navigated the structure, knew where everything was, and never lost the thread.
Coding is a mixed bag. On medium-complexity tasks — refactoring, tests, small spec-driven features — it performs confidently, sometimes almost on par with Claude Opus. On harder tasks, instability kicks in: it can confidently go in the wrong direction several times in a row. An async bug took three attempts to get right.
Where it really shines is agentic mode. I ran it through Hermes Agent on a deprecated API search-and-replace task: it walked through the files on its own, found the problem spots, made the changes, and verified the result. For methodical, high-volume code work — it's genuinely useful.
The main downside is unpredictability. Sometimes it works brilliantly; other times it burns a huge number of thinking tokens on trivially simple tasks. Sparse attention occasionally causes it to miss relevant context and start making mistakes out of nowhere.
Bottom line: M3 is not a Claude replacement and not a GPT killer. It's a cost-effective workhorse for agentic tasks and high-volume routine work. But for tricky bugs, architecture decisions, and important technical calls — I'd still reach for Claude.
4. MVP idea of the week
The zero-cost-AI receipt scanner.
- Problem: freelancers hate expense tracking, and existing receipt-scanner apps charge $5–10/month — largely because every scan costs the developer cloud-AI money.
- Solution: an iOS receipt scanner where the AI runs free — on-device and on Apple's Private Cloud Compute. Charge once instead of monthly, with "your receipts never leave Apple's infrastructure" as the privacy hook.
- For whom: freelancers and sole proprietors. The EU is a natural first market — privacy culture plus paperwork culture.
- Minimal v1: photo → vendor, amount, date, category → monthly CSV export. Foundation Models handles the extraction (image input is new as of Monday). No backend at all.
- Demand check: a landing page testing "pay once" against the incumbents' subscriptions; App Store keyword volume for "receipt scanner"; twenty honest replies from freelancer communities before you write a line of code.
- Why now: a month ago this was economically dead — inference costs forced a subscription, and the subscription market belongs to incumbents. Free inference reopens the pay-once niche.
5. Useful links
- Claude Fable 5 — the first Mythos-class model in public hands — included in paid Claude plans at no extra cost until June 22, then usage credits. Practical move: test it in Claude Code on a real branch before the clock starts.
- Microsoft's MAI family — seven in-house models — the "cheap and good enough" tier is getting crowded. Good news for your API bills.
- WebMCP — make your web app callable by browser agents — announced at Google I/O (May 19), now in a Chrome 149 origin trial. Early data: 67% fewer errors than visual scraping; Booking, Shopify, Instacart and Intuit are already committing. If you run a SaaS, skim it — "agent-ready" may become this decade's "mobile-friendly".
- Apple's Foundation Models updates in detail — the full WWDC developer picture behind today's signal. Ready to build? Start with the official Foundation Models docs.
- The coding-agent market keeps reshuffling — Windsurf is now Devin Desktop, Copilot's flex billing went live. Six months of the agent wars in one read.
That's issue one. If something here made you open a code editor, hit reply
and tell me what you're building — I read every reply.
— Nick
