building xeve

AI Didn't Make Us Faster. It Changed What We Were Willing to Build.

June 18, 20266 min read

Anthropic's 2026 Agentic Coding Trends Report landed last month with the usual metrics: session lengths up, delegation efficiency improving, context engineering becoming the key skill. Most coverage focused on one stat — developers use AI in 60% of their work but fully delegate only 0-20% of tasks — because it confirmed a frustration most developers already felt. But there's a different number in the report that didn't get much attention: 27% of AI-assisted work is work developers say they wouldn't have attempted at all without AI.

That's not a speed story. That's a scope story.

The entire productivity conversation around AI has been about throughput: are you coding faster, are you shipping more pull requests, is your cycle time improving? Those are real questions. But for someone building a product alone, they're not the most important ones. The constraint on a solo developer isn't usually "can I implement this fast enough" — it's "is this worth attempting given what I can do alone."

AI changes that second constraint more than it changes the first.

What the 27% Actually Means

When Anthropic surveyed developers about which work AI enabled, they found three broad buckets: fixing things that were always nagging at them, building internal dashboards and tooling they'd perpetually deferred, and running experiments that previously weren't worth the effort. In each case, the common thread is that these are decisions, not just tasks. The developer would have looked at the work, assessed it against what they could realistically do, and declined to start it. AI didn't make the work faster — it changed whether the work entered the queue at all.

This is a fundamentally different kind of productivity gain than the ones researchers are trying to measure. You can measure PR throughput and commit velocity and token spend. You cannot measure "the features that stayed on the maybe-someday list indefinitely." The 27% is an invisible number — it lives in decisions, not in any version control history.

When the report also shows that agent usage for new feature development jumped from 14% to 37% of sessions, and code design and architecture work grew from 1% to 10%, that pattern tracks. AI isn't just being used to implement things developers already knew how to do. They're using it to attempt work they would previously have classified as requiring skills or time they didn't have.

The Solo Builder Case

On a large team, speed gains do something specific: they give you effective headcount you didn't hire. If ten engineers each get 20% faster, you've added two engineers worth of output. That math is legible and it's why enterprise productivity arguments focus on throughput.

For a solo developer, the math is different. You can absorb only so much velocity before you become your own bottleneck — reviewing agent output, making architectural decisions, writing specs, handling the integration work that only you have the context to do. Speed beyond a certain point stops compounding. There's no team to absorb the additional output.

What does compound, for a solo developer, is the ability to credibly commit to building things you otherwise couldn't. The product gets to have a Windows tracker. The iOS companion gets a widget layer. The correlation engine runs across 19 metric pairs instead of stopping at 5 because 19 became feasible once you weren't writing every RPC function manually.

These aren't faster versions of what I was going to build anyway. They're categories of work that would have stayed on the roadmap indefinitely, deferred with good reason: I had no prior C# or WinUI 3 experience, and hiring a Windows developer for a free-tier product at early access isn't a decision anyone should make. The iOS companion's HealthKit + CoreLocation + WidgetKit stack represents three separate Apple frameworks I hadn't touched before shipping it. Building the correlation engine required enough custom PostgreSQL functions that the implementation time alone would have made it the kind of project I'd prototype once and ship never.

None of those things entered the roadmap because AI would make them faster. They entered the roadmap because AI made them thinkable.

The Architecture It Enables

There's a specific pattern that makes this possible. The work that AI can unlock most reliably isn't creative or architectural — it's implementation work inside a well-constrained domain where you can describe success criteria clearly, where you know what "correct" looks like, and where the main barrier was time and unfamiliarity with a framework rather than judgment about what to build.

Building a WinUI 3 system tray application fits that description exactly. I understood what the app needed to do. I knew how the macOS equivalent was structured. I had never written C#. Given the right context — architecture description, comparable Swift code, desired behavior for each component — an agentic session could implement the core correctly. My job was specification and review, not line-by-line implementation in a framework I'd need weeks to learn.

The same pattern held for the iOS widget layer. WidgetKit has well-documented behavior and clear constraints. The hard part is knowing what you want the widget to show and how it should update. The implementation, given a clear spec and reference to the HealthKit queries that were already working elsewhere, was tractable in a way it wouldn't have been from zero knowledge with a week deadline.

This is why Anthropic's session length data matters in this context. Average Claude Code sessions went from 4 minutes to 23 minutes between Q1 2025 and Q1 2026. That's not faster autocomplete. That's developers spending sustained time on work that requires the full session — work they wouldn't have started without confidence that a focused agentic session could get to a mergeable result.

The Problem With Measuring This

The 27% figure is interesting partly because of what it implies about how incomplete most productivity measurement is.

If you're tracking developer productivity via coding time, commit frequency, or PR throughput, you're measuring the work that happened. You have no visibility into the decisions about what to attempt. The choice not to build the Windows tracker doesn't appear in any dataset. The judgment that the correlation engine was too expensive to build alone — and then the reversal of that judgment — shows up only in the eventual commit history, which looks the same as any other feature landing.

This is a problem for how organizations think about AI ROI. The standard metrics — throughput up, cycle time down, PRs merged — capture whether AI is accelerating work that was already in the queue. They miss the expansion of what goes into the queue in the first place. If 27% of your team's AI-assisted work is work that otherwise wouldn't have happened, your productivity measurement is missing roughly a quarter of AI's actual contribution.

For a solo developer, the measurement gap is if anything more significant, because the scope decision is yours alone. The judgment of what's buildable, what's worth attempting, what has a realistic path to shipping — that's made in your head, not in a planning doc. The decisions AI reversed are invisible to any tool that only sees what you built.

What This Changes About the Productivity Conversation

The throughput conversation isn't going away, and it shouldn't. PR cycle time and post-merge bug rate and token spend are real signals that tell you real things about how agentic workflows are functioning.

But they answer a specific question: given that you're doing a piece of work, how efficiently does AI help you complete it? They don't answer the question that might matter more, especially for someone building alone: does AI change your assessment of what's worth starting?

We shipped four native apps and a web dashboard in a product that's been live for less than a year. A meaningful fraction of that is accelerated work — things we would have built anyway, finished faster. A smaller but significant fraction is work that entered the roadmap because the calculus changed. iOS widgets instead of a placeholder. Windows native tracking instead of a web stub. Nineteen correlation pairs computed on demand instead of three hard-coded ones.

Those features exist in the product not because we coded them faster. They exist because we started them at all. That's the number that isn't showing up in anyone's productivity dashboard, and it might be the more honest account of what AI is actually doing for people building alone.

Written by Kevin — builder of xeve

++related posts

building xeve

When Everyone Can Build, Taste Is the Moat

6 min read

building xeve

When You Build Alone, Nobody Sees It Coming

6 min read

Track your apps, coding, music, and health — all in one place.

try xeve free