If a Nomad job is started with a very large number of instances (e.g. 4 billion), the Nomad servers that attempt to schedule it will run out of memory and crash. While it's unlikely that anyone would intentionally schedule a job with 4 billion instances, we have occasionally run into this through bugs in external automation. For example, an automated deployment system running in a test environment had an off-by-one error and deployed a job with count = uint32(-1), causing the Nomad servers for that environment to run out of memory and crash.

To prevent this, this PR introduces a job_max_count Nomad server configuration parameter. job_max_count limits the number of allocs that may be created from a job. The default value is 50000: low enough that a job with the maximum allowed number of allocs will not require much memory on the server, but still much higher than the number of allocs in the largest Nomad job we have ever run.
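As a sketch, a server agent configuration using the new parameter might look like the following. The parameter name and default come from this PR; its exact placement inside the `server` block is an assumption about how the option is wired up:

```hcl
server {
  enabled = true

  # Hypothetical usage of the new option: reject any job whose total
  # count of allocs exceeds this limit. Defaults to 50000 when unset.
  job_max_count = 50000
}
```

Operators who routinely run larger jobs would raise this value, while small test environments (like the one that hit the uint32(-1) bug) could lower it further.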