HATracker: Add a local cache warmup on startup #7213

SungJin1212 · 2026-01-14T09:04:20Z

This PR introduces local cache warmup logic on HA tracker startup that fetches all keys from the KV store and warms the local cache.

Previously, whenever the Distributor started, it suffered from initial cold cache misses. Since the local map was empty, the HATracker treated every incoming request as a new entry. This caused unnecessary CAS operations to the KV store even for existing valid keys.

Which issue(s) this PR fixes:
Fixes #

Checklist

Tests updated
Documentation added
CHANGELOG.md updated - the order of entries should be [CHANGE], [FEATURE], [ENHANCEMENT], [BUGFIX]

friedrichg · 2026-01-15T01:56:01Z

Do you see distributors in clusters with a large number of pairs using more memory after this change? Do we need a flag for this?

SungJin1212 · 2026-01-15T08:01:02Z

@friedrichg
I added a benchmark for syncKVStoreToLocalMap under various key counts.
The results indicate that the overhead is negligible even at a very large scale (10000 keys: about 9 ms and about 18 MB allocated per sync).

goos: darwin
goarch: arm64
pkg: github.com/cortexproject/cortex/pkg/ha
cpu: Apple M4 Max
BenchmarkHATracker_syncKVStoreToLocalMap
BenchmarkHATracker_syncKVStoreToLocalMap/keys=100
BenchmarkHATracker_syncKVStoreToLocalMap/keys=100-14         	   10740	    113366 ns/op	  203197 B/op	    2532 allocs/op
BenchmarkHATracker_syncKVStoreToLocalMap/keys=1000
BenchmarkHATracker_syncKVStoreToLocalMap/keys=1000-14        	    1174	    973137 ns/op	 1834239 B/op	   24289 allocs/op
BenchmarkHATracker_syncKVStoreToLocalMap/keys=10000
BenchmarkHATracker_syncKVStoreToLocalMap/keys=10000-14       	     130	   9065987 ns/op	18199241 B/op	  240955 allocs/op
PASS
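As a quick reading aid for the figures above, the following sketch divides each reported total by the key count. The numbers are copied from the benchmark output; the `result` type and `perKey` helper are made up for this example. Note that B/op counts bytes allocated during one sync, which is not necessarily the memory retained by the local map afterwards.

```go
package main

import "fmt"

// result holds one line of the benchmark output quoted above.
type result struct {
	keys        int
	nsPerOp     float64
	bytesPerOp  float64
	allocsPerOp float64
}

// perKey divides a per-operation total by the number of keys synced.
func perKey(total float64, keys int) float64 {
	return total / float64(keys)
}

func main() {
	results := []result{
		{keys: 100, nsPerOp: 113366, bytesPerOp: 203197, allocsPerOp: 2532},
		{keys: 1000, nsPerOp: 973137, bytesPerOp: 1834239, allocsPerOp: 24289},
		{keys: 10000, nsPerOp: 9065987, bytesPerOp: 18199241, allocsPerOp: 240955},
	}
	for _, r := range results {
		fmt.Printf("keys=%-6d %.0f ns/key  %.0f B/key  %.1f allocs/key\n",
			r.keys,
			perKey(r.nsPerOp, r.keys),
			perKey(r.bytesPerOp, r.keys),
			perKey(r.allocsPerOp, r.keys))
	}
}
```

The per-key cost stays in the range of roughly 1.8 to 2.0 KB and 24 to 25 allocations across all three sizes, so the sync appears to scale about linearly with the number of keys.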

friedrichg

@SungJin1212 Thanks, this is great work.

pkg/ha/ha_tracker.go

yeya24

I am a bit doubtful about how much this cache warmup would help.
At least, we have never seen this issue in production. If we think it is good to have, I would prefer to put it behind a feature flag that is disabled by default.

I am a bit worried that during a big fleet scale-up, distributors would end up sending too many requests to the KV store just to warm up the cache, causing a much bigger impact.

SungJin1212 · 2026-01-16T08:31:53Z

@yeya24
I've put the warmup logic behind a flag so that users can opt in only if they need it.

friedrichg

I think this flag might also prove useful for #7220.

At least it brings us closer.

friedrichg · 2026-01-16T16:56:55Z

@SungJin1212 maybe also mark it as experimental and include the flag in https://github.com/cortexproject/cortex/blob/master/docs/configuration/v1-guarantees.md

SungJin1212 · 2026-01-17T04:23:22Z

@friedrichg
I added it, thanks!

pull-request-size bot added the size/L label Jan 14, 2026

dosubot bot added the component/ha-tracker label Jan 14, 2026

SungJin1212 force-pushed the sync-ha-tracker-on-start branch 4 times, most recently from 3567f6f to 7786687 Compare January 15, 2026 01:51

SungJin1212 requested a review from friedrichg January 15, 2026 01:51

friedrichg approved these changes Jan 15, 2026

dosubot bot added the lgtm label Jan 15, 2026

friedrichg reviewed Jan 15, 2026

pkg/ha/ha_tracker.go (resolved)

friedrichg reviewed Jan 15, 2026

pkg/ha/ha_tracker.go (resolved)

SungJin1212 added 2 commits January 16, 2026 11:14

HATracker: Add a local cache warmup on start

e949878

Signed-off-by: SungJin1212 <[email protected]>

Add benchmark

09cce53

Signed-off-by: SungJin1212 <[email protected]>

yeya24 reviewed Jan 16, 2026

Add flag

0850259

Signed-off-by: SungJin1212 <[email protected]>

SungJin1212 force-pushed the sync-ha-tracker-on-start branch from 2644e14 to 0850259 Compare January 16, 2026 08:23

friedrichg approved these changes Jan 16, 2026

mark it as experimental

4ad3fd6

Signed-off-by: SungJin1212 <[email protected]>
