The node reconciler never took node feasibility into account. When nodes were
excluded from allocation placement because their constraints were not met, the
desired total and desired canary counts were never updated in the reconciler
to account for the exclusion, so deployments could never become successful.
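A minimal sketch of the fix, assuming constraint filtering leaves us with a set of feasible nodes; `adjustForFeasibility` and `feasibleNodes` are illustrative names, while the fields match Nomad's `structs.DeploymentState`:

```go
package sketch

import "github.com/hashicorp/nomad/nomad/structs"

// adjustForFeasibility derives the desired counts from the nodes that
// can actually receive a placement; without this, the deployment waits
// forever on placements that can never happen.
func adjustForFeasibility(dstate *structs.DeploymentState, feasibleNodes map[string]struct{}) {
	dstate.DesiredTotal = len(feasibleNodes)
	if dstate.DesiredCanaries > dstate.DesiredTotal {
		dstate.DesiredCanaries = dstate.DesiredTotal
	}
}
```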
In cases where a system job had the same number of canary allocations deployed
as there were eligible nodes, the scheduler would incorrectly mark the
deployment as complete, as if auto-promotion were set. This edge case
uncovered a bug in the `setDeploymentStatusAndUpdates` method, and since we
round canary node counts up, it may not be such an edge case after all: with a
single eligible node, for example, any nonzero canary percentage rounds up to
one canary, which already covers every eligible node.
---------
Co-authored-by: Tim Gross <tgross@hashicorp.com>
This changeset adds system scheduler tests covering various permutations of the `update`
block. It also fixes a number of bugs discovered in the process.
* Don't create a deployment for an in-flight rollout. If a system job is in the
middle of a rollout prior to upgrading to a version of Nomad with system
deployments, we'll end up creating a system deployment that might never
complete, because previously placed allocs will not be tracked. Check whether
we have existing allocs that should belong to the new deployment and prevent a
deployment from being created in that case.
* Ensure we call `Copy` on `Deployment` to avoid state store corruption (see the sketch after this list).
* Don't limit canary counts by `max_parallel`.
* Never create deployments for `sysbatch` jobs.
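To illustrate the `Copy` fix, a minimal sketch assuming a deployment read from the state store (`updateStatus` is an illustrative name; `Copy` and the status constants are Nomad's):

```go
package sketch

import "github.com/hashicorp/nomad/nomad/structs"

// updateStatus shows the copy-on-write rule: mutate a copy, never the
// shared object handed back by the state store.
func updateStatus(existing *structs.Deployment) *structs.Deployment {
	d := existing.Copy() // mutating `existing` in place would corrupt state store memory
	d.Status = structs.DeploymentStatusRunning
	d.StatusDescription = structs.DeploymentStatusDescriptionRunning
	return d
}
```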
Ref: https://hashicorp.atlassian.net/browse/NMD-761
In the system scheduler, we need to keep track of which nodes were previously
used as "canary nodes" rather than picking canary nodes at random, to handle
previously failed canaries and changes to the number of canaries in the jobspec.
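A hedged sketch of the idea, assuming previously placed canary allocations can be recognized via their `DeploymentStatus.Canary` flag (`previousCanaryNodes` is an illustrative helper, not the actual implementation):

```go
package sketch

import "github.com/hashicorp/nomad/nomad/structs"

// previousCanaryNodes collects the IDs of nodes that already ran a
// canary, so the scheduler can reuse them instead of picking fresh
// canary nodes at random on every evaluation.
func previousCanaryNodes(allocs []*structs.Allocation) map[string]struct{} {
	nodes := make(map[string]struct{})
	for _, alloc := range allocs {
		if alloc.DeploymentStatus != nil && alloc.DeploymentStatus.Canary {
			nodes[alloc.NodeID] = struct{}{}
		}
	}
	return nodes
}
```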
---------
Co-authored-by: Tim Gross <tgross@hashicorp.com>
This changeset adjusts the handling of allocation placement when we're
promoting a deployment, and it corrects the behavior of `isDeploymentComplete`,
which previously would never mark a promoted deployment as complete.
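A hedged sketch of the corrected check, in the spirit of `isDeploymentComplete`; the fields follow Nomad's `structs.DeploymentState`, but the logic is illustrative:

```go
package sketch

import "github.com/hashicorp/nomad/nomad/structs"

// deploymentComplete treats a promoted deployment like any other:
// complete once every task group has reached its desired healthy total.
func deploymentComplete(d *structs.Deployment) bool {
	for _, dstate := range d.TaskGroups {
		if dstate.DesiredCanaries > 0 && !dstate.Promoted {
			return false // canaries placed, but promotion still pending
		}
		if dstate.HealthyAllocs < dstate.DesiredTotal {
			return false
		}
	}
	return true
}
```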
This changeset introduces canary deployments for system jobs.
Canaries work a little differently for system jobs than for service jobs. The
integer in the `update` block of a task group is interpreted as a percentage of
eligible nodes that this task group update should be deployed to, rounded up
to the nearest integer (so, e.g., for 5 eligible nodes and a canary value of
50, we will deploy to 3 nodes).
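A minimal sketch of that calculation (`canaryNodeCount` is an illustrative name):

```go
package sketch

import "math"

// canaryNodeCount interprets the update block's canary value as a
// percentage of eligible nodes, rounded up to the nearest integer.
func canaryNodeCount(canaryPercent, eligibleNodes int) int {
	return int(math.Ceil(float64(canaryPercent) * float64(eligibleNodes) / 100.0))
}
```

For example, `canaryNodeCount(50, 5)` returns 3, and `canaryNodeCount(50, 1)` returns 1, so on a single eligible node the canary already covers every node (the edge case noted above).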
In contrast to service jobs, system job canaries are not tracked, i.e., the
scheduler doesn't need to know which allocations are canaries and which are not,
since a node can run at most one allocation of a given system job. Canary
deployments are marked for promotion, and if promoted, the scheduler simply
performs an update as usual, replacing allocations belonging to the previous
job version and leaving the new ones intact.
This is the initial implementation of deployments for the system and sysbatch
reconciler. It does not support updates or canaries at this point; it simply
provides the necessary plumbing for deployments.
In #26169 we started emitting structured logs from the reconciler. But the node
reconciler results are `AllocTuple` structs rather than counts, so the information
we put in the logs ends up being pointer addresses in hex. Fix this so that
we record the number of allocs in each bucket instead.
Fix another misleading log line while we're here.
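A sketch of the fix, assuming `hashicorp/go-hclog`; the result type and its buckets here are stand-ins for the node reconciler's actual structs:

```go
package sketch

import "github.com/hashicorp/go-hclog"

type allocTuple struct{} // stand-in for the reconciler's AllocTuple

type nodeReconcileResult struct {
	Place, Update, Stop, Ignore []allocTuple
}

// logResults records bucket sizes; logging the slices of structs
// directly is what produced hex pointer addresses in the output.
func logResults(logger hclog.Logger, r *nodeReconcileResult) {
	logger.Debug("node reconciler results",
		"place", len(r.Place),
		"update", len(r.Update),
		"stop", len(r.Stop),
		"ignore", len(r.Ignore),
	)
}
```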
Ref: https://github.com/hashicorp/nomad/pull/26169
While working on property testing in #26216, I discovered we had unreachable
code in the node reconciler. The `diffSystemAllocsForNode` function receives a
set of non-terminal allocations, but then has branches where it assumes the
allocations might be terminal. It's trivially provable that these allocs are
always live, as the system scheduler splits the set of known allocs into live
and terminal sets before passing them into the node reconciler.
Eliminate the unreachable code and improve the variable names so the known
state of the allocs is clearer in the reconciler code.
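For reference, a sketch of that split, using `Allocation.TerminalStatus` from `nomad/structs`:

```go
package sketch

import "github.com/hashicorp/nomad/nomad/structs"

// splitAllocs separates known allocs into live and terminal sets; the
// node reconciler only ever receives the live set, so its branches for
// terminal allocs could never be taken.
func splitAllocs(allocs []*structs.Allocation) (live, terminal []*structs.Allocation) {
	for _, alloc := range allocs {
		if alloc.TerminalStatus() {
			terminal = append(terminal, alloc)
			continue
		}
		live = append(live, alloc)
	}
	return live, terminal
}
```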
Ref: https://github.com/hashicorp/nomad/pull/26216
Both the cluster reconciler and node reconciler emit a debug-level log line with
their results, but these are unstructured multi-line logs that are annoying for
operators to parse. Change these to emit structured key-value pairs like we do
everywhere else.
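An illustrative before/after with `go-hclog` (the message and keys are made up for the example):

```go
package sketch

import (
	"fmt"

	"github.com/hashicorp/go-hclog"
)

func logReconcileResults(logger hclog.Logger, numPlace, numStop int) {
	// Before: one multi-line blob that log pipelines can't parse.
	logger.Debug(fmt.Sprintf("reconcile results:\nplace: %d\nstop: %d", numPlace, numStop))

	// After: structured key-value pairs, like the rest of the codebase.
	logger.Debug("reconcile results", "place", numPlace, "stop", numStop)
}
```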
Ref: https://hashicorp.atlassian.net/browse/NMD-818
Ref: https://go.hashi.co/rfc/nmd-212