Add FailoverHeartbeatTTL to config (#11127)

FailoverHeartbeatTTL is the amount of time to wait after a server leader failure
before considering reallocating client tasks. This TTL should be fairly long as
the new server leader needs to rebuild the entire heartbeat map for the
cluster. In deployments with a small number of machines, the default TTL (5m)
may be unnecessary long. Let's allow operators to configure this value in their
config files.
This commit is contained in:
Matt Mukerjee
2021-10-06 15:48:12 -07:00
committed by GitHub
parent 215bf04bc6
commit 0881b94201
9 changed files with 38 additions and 0 deletions

View File

@@ -273,6 +273,7 @@
"job_gc_threshold": "12h",
"max_heartbeats_per_second": 11,
"min_heartbeat_ttl": "33s",
"failover_heartbeat_ttl": "330s",
"node_gc_threshold": "12h",
"non_voting_server": true,
"num_schedulers": 2,