-
Notifications
You must be signed in to change notification settings - Fork 658
Open
Labels
bugSomething isn't workingSomething isn't working
Description
What is the bug?
Following zone-wise restarts, ingesters are failing to rebalance effectively both across zones and within individual zones. This leads to uneven load distribution, where some pods experience significant overload while others remain underutilized. Though spread_minimization is enabled
spread_minimizing_zones: zone-a,zone-b,zone-c
token_generation_strategy: spread-minimizing
Ingesters memory utilization - without proper spreading

How to reproduce it?
- Deploy Mimir 2.14.2v
- Push 60-65M load
- Wait until ingesters holds 13hrs of data
- Perform restarts on ingesters with one zone at a time
What did you think would happen?
During zone-wise restarts, some ingesters flush their data and come back online faster than others. These early-starting ingesters begin receiving the bulk of incoming data, leading to overutilization, while the ingesters that restart later remain underutilized.
- With spread-minimizing enabled, all ingesters should ideally restart simultaneously to prevent skewed load distribution.
- Even if restart delays occur, the system should rebalance and evenly spread the load across all ingesters once they are back online.
What was your environment?
Kubernetes
Helms
Mimir 2.14.2v
Any additional context to share?
No response
Metadata
Metadata
Assignees
Labels
bugSomething isn't workingSomething isn't working