The Field Guide

AI & Workflows

AI implementations that actually stick

The pattern behind AI projects that get used six months later — and the ones that quietly die after the kickoff demo.

Read

10 min

Sections

8

Words

~703

Most AI projects die quietly. The demo is impressive, the executive sponsor is excited, the team builds the thing — and six months later nobody is using it. The handful of implementations that stick share a pattern that has almost nothing to do with the model and almost everything to do with the workflow it lives inside. Here's what we've learned shipping AI into operating businesses.

Section 01

Start with the workflow, not the model

Almost every failed AI project starts with 'let's use GPT for X' instead of 'let's understand workflow X, find the bottleneck, and ask whether AI is even the right tool.' Spend the first week mapping the current process step by step — who does what, how long it takes, where the errors happen. Half the time the answer isn't AI, it's a better form or a removed handoff. The other half, you now know exactly where AI fits.

Section 02

Solve a narrow, high-frequency problem first

Broad, low-frequency problems ('summarize any document') sound impressive in a demo but rarely produce measurable value. Narrow, high-frequency problems ('classify these specific support tickets into one of these specific buckets') produce real ROI and build the organizational trust required for bigger swings later.

Section 03

Design for the moment the AI is wrong

Every AI system is wrong some percentage of the time. The question is what happens when it is. The implementations that stick design the wrong-answer path with as much care as the right-answer path: a clear human override, a confidence threshold that routes uncertain cases to a person, a feedback loop that improves the model over time. Without those, one bad answer destroys six months of trust.

Section 04

Put a human in the loop until the data says you don't need to

The fastest way to ship is human-in-the-loop: AI drafts, human approves. The fastest way to build trust is the same. After three months of human-approved outputs, you have a dataset of agreements and disagreements that tells you exactly where the model can be trusted to run autonomously — and where it can't.

Section 05

Don't hide that it's AI

Trying to disguise AI as 'magic' backfires when something goes wrong. Be explicit about what the AI does and doesn't do, what data it can see, and how humans can correct it. Transparency builds the trust required for adoption; opacity guarantees the first error becomes the last conversation about the project.

Section 06

Measure adoption, not just accuracy

A 95%-accurate model used by nobody produces zero value. An 85%-accurate model used daily by every operator produces enormous value. Track weekly active users of the AI feature, not just precision and recall. If adoption is flat, the problem isn't the model — it's the workflow design.

Section 07

Plan for the model to change underneath you

Foundation models are improving and changing pricing every quarter. The implementations that survive are wrapped in abstractions that let the underlying model be swapped without rewriting the integration. Hard-coding to a specific model version is a tax you pay six months later.

Section 08

Run a quarterly ROI review

Every AI feature should have a measurable ROI story by quarter two — hours saved, errors avoided, revenue accelerated. If you can't tell that story by then, the feature is decoration and should be retired. Letting failed AI projects linger is the single biggest reason organizations get skeptical of the next one.

The takeaway

AI sticks when the workflow is mapped first, the problem is narrow, the wrong-answer path is designed deliberately, and adoption is measured every quarter. Everything else is demo theater.

More from the field.

All articles