Home › Embedding drift

Embedding drift is a day-one decision

Most teams treat embedding drift as a future problem. It is not. It costs almost nothing to get right early, and it is catastrophically expensive to retrofit once you have live users.

Future-proof my index →

The one rule

Never query against a mixed-version index. The moment two embedding model versions share an index, your similarity scores stop meaning one thing. Retrieval quality degrades in ways you cannot cleanly explain.

Three config fields and one job

Tag every chunk with its model version

At write time, every chunk records which embedding model produced it. That single field keeps versions separable forever. It costs nothing to add on day one.

Maintain parallel indexes during transitions

When you upgrade models, you build the new index alongside the old one. Queries route to a single, consistent version. No silent mixing, no mystery regressions.

Retire old indexes on a schedule

A background job, not a fire drill

Built in iteration 1, not iteration 7

A scheduled job retires stale indexes once a transition completes. This is three config fields and one background job. Skip it and you will rebuild the entire pipeline later, under pressure, while debugging degradation you cannot name.

Why we build it first

We have watched teams do the painful retrofit twice, and it is always avoidable. Getting this right early turns model upgrades into a routine operation. Your retrieval quality stays explainable as you grow.

The retrofit you are avoiding

Picture the retrofit you are signing up for by waiting. It is iteration seven, you have live users, and retrieval quality is sliding for reasons no dashboard explains. You discover two model versions have quietly shared one index for months.

Now every fix is a production migration. You re-embed the whole corpus while traffic flows, comparing scores that were never comparable in the first place. The version field and the parallel-index habit would have cost a few hours in week one, and they turn that future emergency into a scheduled, boring background job.

Common questions

We are not changing embedding models soon.

You will. Version tagging costs nothing today and removes the upgrade from your list of future crises.

Doesn't keeping parallel indexes cost storage?

Briefly, and only during transitions. That cost is trivial next to a full rebuild under live traffic.

Version-proof your pipeline in 20 minutes

We set the three fields and the retirement job now, so your next model upgrade is boring instead of dangerous.

Book my technical call →