Report recent tasks updates when master starved #139518

Merged
DaveCTurner merged 5 commits into elastic:main from
DaveCTurner:2025/12/15/MasterService-execution-history
Dec 15, 2025

Conversation

@DaveCTurner
Contributor

Today, if the elected master is unable to clear its queue for too long, we
log the warning `pending task queue has been nonempty for [${DURATION}]`,
but it can be challenging to determine what is keeping it busy like
this. With this commit we add some simple tracking of recent cluster
state updates and a log message to report the updates executed recently.
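As a rough illustration of the approach the description outlines, a bounded, truncating history of recent update descriptions might look like the following minimal sketch. All names here (`RecentTaskHistory`, `record`, `describe`) are hypothetical, not the PR's actual implementation:

```java
import java.util.ArrayDeque;
import java.util.Deque;

// Hypothetical sketch of a bounded history of recent cluster state update
// descriptions; names are illustrative, not the PR's actual code.
final class RecentTaskHistory {
    private final int maxEntries;
    private final Deque<String> entries = new ArrayDeque<>();

    RecentTaskHistory(int maxEntries) {
        this.maxEntries = maxEntries;
    }

    // Record the description of a task that just executed, evicting the
    // oldest entry once the history is full so memory stays bounded.
    synchronized void record(String description) {
        if (entries.size() == maxEntries) {
            entries.removeFirst();
        }
        entries.addLast(description);
    }

    // Render the history oldest-first, truncating once maxChars is reached
    // so a starved-master log line cannot grow without bound.
    synchronized String describe(int maxChars) {
        StringBuilder sb = new StringBuilder();
        for (String entry : entries) {
            if (sb.length() > 0) {
                sb.append(", ");
            }
            if (sb.length() + entry.length() > maxChars) {
                sb.append("...");
                break;
            }
            sb.append(entry);
        }
        return sb.toString();
    }
}
```

The eviction on `record` keeps the window to the most recent updates, which is what matters when diagnosing why the queue has stayed nonempty.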
@DaveCTurner added the >enhancement, :Distributed/Cluster Coordination, Supportability, and v9.3.0 labels on Dec 15, 2025
@elasticsearchmachine added the Team:Distributed Coordination (obsolete) label on Dec 15, 2025
@elasticsearchmachine
Collaborator

Pinging @elastic/es-distributed-coordination (Team:Distributed Coordination)

@elasticsearchmachine
Collaborator

Hi @DaveCTurner, I've created a changelog YAML for you.

@DaveCTurner DaveCTurner requested a review from a team as a code owner December 15, 2025 12:18
maxTaskWaitTime.millis()
);

if (logger.isInfoEnabled()) {
Member

Why in a separate log line and not with the warning above?

Contributor Author

I expect we might want to filter this one out separately (it could be quite long), and I believe we have dashboards looking at the warning, so I didn't want to change it too much either.

Contributor

@bcully left a comment

LGTM

Comment on lines +1212 to +1216
Strings.collectionToDelimitedStringWithLimit(
(Iterable<String>) (() -> Iterators.map(executionHistory.iterator(), ExecutionHistoryEntry::getDescription)),
", ",
MAX_TASK_DESCRIPTION_CHARS,
descriptionBuilder
Contributor

This is nice, thanks!

I expect we'll see a bunch of duplicate lines. We might be able to get deeper history if we collected runs of the same record into a single record + count line?

Contributor Author

Hmm, yes, that's true, though then we would lose the ordering, which I think is going to be more informative in many cases.

I'll proceed with this for now, and we can follow up with a change to report counts grouped by queue name if it turns out it's still needed.

Contributor

Just to be clear, I had in mind to collect runs together in order to keep the ordering, rather than producing only a task/count table, e.g.:

1-20: HIGH unbatched task-queue-1,
21: HIGH unbatched task-queue-2,
22-33: HIGH unbatched task-queue-1,
...

But yes, we can see if that would be helpful later.
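For illustration only, run grouping that preserves ordering along the lines of the example above might look like this sketch; `RunGrouper` and `groupRuns` are hypothetical names, not the code eventually added in the follow-up:

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch of order-preserving run grouping: consecutive equal
// task descriptions collapse into one "start-end: description" entry, as in
// the example above. Not the actual code from the follow-up change.
final class RunGrouper {
    static List<String> groupRuns(List<String> descriptions) {
        List<String> grouped = new ArrayList<>();
        int runStart = 0; // index where the current run of equal entries began
        for (int i = 1; i <= descriptions.size(); i++) {
            boolean runEnded = i == descriptions.size()
                || descriptions.get(i).equals(descriptions.get(runStart)) == false;
            if (runEnded) {
                // Render 1-based positions: a run of length one is "N: desc",
                // a longer run is "M-N: desc".
                String positions = (runStart + 1 == i)
                    ? Integer.toString(i)
                    : (runStart + 1) + "-" + i;
                grouped.add(positions + ": " + descriptions.get(runStart));
                runStart = i;
            }
        }
        return grouped;
    }
}
```

Because only adjacent duplicates are merged, interleaved sequences such as alternating allocate/mark-started tasks would still appear as separate runs, which matches the concern raised below about shard allocation loops.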

Contributor Author

Ah, OK, I see. I opened #139555 to do that. I suspect that in the case of shard allocation it's not that useful, since we'll be going round a loop of different tasks (allocate a shard, then mark the shard as started), but yes, it might be nicer in other cases.

@DaveCTurner DaveCTurner merged commit 082205e into elastic:main Dec 15, 2025
35 checks passed
@DaveCTurner DaveCTurner deleted the 2025/12/15/MasterService-execution-history branch December 15, 2025 17:49
DaveCTurner added a commit to DaveCTurner/elasticsearch that referenced this pull request Dec 15, 2025
Following elastic#139518, this commit groups together consecutive equal entries
in the log to represent the same information more densely.
parkertimmins pushed a commit to parkertimmins/elasticsearch that referenced this pull request Dec 17, 2025
DaveCTurner added a commit that referenced this pull request Jan 8, 2026
Following #139518, this commit groups together consecutive equal entries
in the log to represent the same information more densely.
