Frozen Cursor & Spinner: Why Inline AI Feels Slow (and How to Fix It)

The pitch was “invisible” editing: select, press, done. The reality is you hit the hotkey, a little spinner appears, your cursor locks up, and you sit there for five, eight, ten seconds watching a loading wheel before anything changes. Do that thirty times a day and the “invisible” tool is the most visible thing on your screen.

The slowness is real, but most of it is avoidable. Here’s what’s actually taking the time, and what separates a tool that feels instant from one that makes you wait.

Where the seconds actually go

When you trigger an inline rewrite, several things happen in sequence:

  1. The tool reads your selection (fast — milliseconds).
  2. It sends your text to a cloud AI model over the internet (network round-trip).
  3. The model generates the rewrite token by token (this is the slow part).
  4. The tool writes the result back into your document (fast).

Step 3 dominates. A frontier cloud model writing a few sentences genuinely takes a handful of seconds — it’s generating the text one piece at a time. Add network latency (step 2) and you’re at 5–10 seconds for a non-trivial rewrite. That’s inherent to using a capable cloud model; you can’t make the model think faster from the client side.

So if the work genuinely takes several seconds, why do some tools feel slow and others feel instant? It’s not the wait — it’s how the wait is presented.

The real problem isn’t latency. It’s blocking.

Slow tools make two mistakes:

The fix isn’t to make the model faster. It’s to stop blocking and start streaming.

How streaming makes the same wait feel instant

Modern AI models can stream their output — they send each piece of the rewrite the moment it’s generated, rather than holding everything back until the whole thing is done. A well-built inline tool uses this:

The total time is the same — but the experience is completely different. Progress you can see doesn’t feel like waiting; it feels like the tool is working with you. This is the single biggest factor in whether inline editing feels “invisible.”

What you can do about a slow tool today

How EditSnappy fixes this at the root

EditSnappy streams the rewrite into place instead of freezing the cursor behind a spinner. The text starts arriving in a fraction of a second and flows in token by token, right where you’re typing — so even a complex cloud rewrite feels like the tool is working with you, not making you wait on it. There’s no modal loading wheel locking your cursor.

[[MISSING: confirm whether EditSnappy’s smart local↔cloud routing — fast/local model for trivial edits, cloud for complex — is in scope for launch; master-sales-copy §5 lists it as a reach goal, not a table-stakes feature.]] Where it applies, routing trivial fixes to a fast path makes the common case feel instant and reserves the cloud wait for edits that actually need it.

See how EditSnappy works and feel the difference streaming makes.


Part of the Why Inline AI Editors Fail troubleshooting hub · EditSnappy home.