mirror of https://github.com/kemko/nomad.git synced 2026-01-05 09:55:44 +03:00

Go to file

Tim Gross e168548341 provide allocrunner hooks with prebuilt taskenv and fix mutation bugs (#25373 )

Some of our allocrunner hooks require a task environment for interpolating values based on the node or allocation. But several of the hooks accept an already-built environment or builder and then keep that in memory. Both of these retain a copy of all the node attributes and allocation metadata, which balloons memory usage until the allocation is GC'd.

While we'd like to look into ways to avoid keeping the allocrunner around entirely (see #25372), for now we can significantly reduce memory usage by creating the task environment on-demand when calling allocrunner methods, rather than persisting it in the allocrunner hooks.

In doing so, we uncover two other bugs:
* The WID manager, the group service hook, and the checks hook have to interpolate services for specific tasks. They mutated a taskenv builder to do so, but each time they mutate the builder, they write to the same environment map. When a group has multiple tasks, it's possible for one task to set an environment variable that would then be interpolated in the service definition for another task if that task did not have that environment variable. Only the service definition interpolation is impacted. This does not leak env vars across running tasks, as each taskrunner has its own builder.

To fix this, we move the `UpdateTask` method off the builder and onto the taskenv as the `WithTask` method. This makes a shallow copy of the taskenv with a deep clone of the environment map used for interpolation, and then overwrites the environment from the task.

* The checks hook interpolates Nomad native service checks only on `Prerun` and not on `Update`. This could cause unexpected deregistration and registration of checks during in-place updates. To fix this, we make sure we interpolate in the `Update` method.

I also bumped into an incorrectly implemented interface in the CSI hook. I've pulled that and some better guardrails out to https://github.com/hashicorp/nomad/pull/25472.

Fixes: https://github.com/hashicorp/nomad/issues/25269
Fixes: https://hashicorp.atlassian.net/browse/NET-12310
Ref: https://github.com/hashicorp/nomad/issues/25372

2025-03-24 12:05:04 -04:00

.changelog

provide allocrunner hooks with prebuilt taskenv and fix mutation bugs (#25373 )

2025-03-24 12:05:04 -04:00

.github

dependabot: update reviewer for website directory (#25498 )

2025-03-24 12:03:02 -04:00

.release

Prepare for next release

2025-03-12 10:37:52 +00:00

.semgrep

build: Update Go to v1.24.1 (#25249 )

2025-03-06 10:33:14 +00:00

.tours

Make number of scheduler workers reloadable (#11593 )

2022-01-06 11:56:13 -05:00

acl

dynamic host volumes: ACL policies (#24356 )

2024-12-19 09:25:53 -05:00

api

docs: oidc client assertions and pkce (#25375 )

2025-03-20 09:14:17 -05:00

HostVolumePlugin interface and two implementations (#24497 )

2024-12-19 09:25:54 -05:00

client

provide allocrunner hooks with prebuilt taskenv and fix mutation bugs (#25373 )

2025-03-24 12:05:04 -04:00

command

test: Calculate agent endpoint scheduler count, not static. (#25473 )

2025-03-21 13:47:53 +00:00

contributing

docs: extend code layout in contributing guides (#25330 )

2025-03-10 11:55:38 -04:00

demo

dynamic host volumes: change env vars, fixup auto-delete (#24943 )

2025-01-27 10:36:53 -06:00

dev

tools: filter Nomad Enterprise tags in pre-push hook (#24452 )

2024-11-13 09:50:43 -05:00

drivers

drivers: set -1 exit code in case executor gets killed (#25453 )

2025-03-20 15:06:39 +01:00

e2e

e2e: fixes node write policy for consul agents (#25418 )

2025-03-17 15:18:30 -04:00

enos

Merge pull request #25479 from hashicorp/NET-11546-enos-same-allocs

2025-03-24 16:03:57 +01:00

helper

consul: Remove legacy token based authentication workflow (#25217 )

2025-03-05 15:38:11 -05:00

integrations

docs: fix Grafana doc breaking link (#18988 )

2023-11-03 14:31:37 +00:00

internal/testing/apitests

oidc: support PKCE and client assertion / private key JWT (#25231 )

2025-03-10 13:32:53 -05:00

jobspec2

Check for nil values when parsing HCL strings (#25294 )

2025-03-06 10:38:33 +01:00

lib

docs: oidc client assertions and pkce (#25375 )

2025-03-20 09:14:17 -05:00

nomad

provide allocrunner hooks with prebuilt taskenv and fix mutation bugs (#25373 )

2025-03-24 12:05:04 -04:00

plugins

remove remote task execution code (#24909 )

2025-01-29 08:08:34 -05:00

scheduler

disconnected: removes deprecated disconnect fields (#25284 )

2025-03-05 14:46:02 -05:00

scripts

build: Update Go to v1.24.1 (#25249 )

2025-03-06 10:33:14 +00:00

terraform

docs: add missing copyright headers in Terraform examples (#20412 )

2024-04-16 15:21:03 -04:00

testutil

vault: Remove legacy token based authentication workflow. (#25155 )

2025-02-28 07:40:02 +00:00

tools

deps: Update tool dependencies. (#25275 )

2025-03-06 11:51:07 +00:00

[ci/cd] Moves our default github action flows to use Node v20 (#25425 )

2025-03-19 11:38:20 -04:00

version

Prepare for next release

2025-03-12 10:37:52 +00:00

website

Fix link rendering in server.default_scheduler_config (#25482 )

2025-03-21 12:50:57 -05:00

.copywrite.hcl

copywrite: fix and add copywrite config enterprise comments. (#19590 )

2024-01-03 08:58:53 +00:00

.git-blame-ignore-revs

add copywrite headers commit to ignore-revs config file (#17037 )

2023-05-01 10:57:43 -04:00

.gitattributes

Remove invalid gitattributes

2018-02-14 14:47:43 -08:00

.gitignore

Make paths in e2e/terraform/ directory relative to the module (#24664 )

2024-12-13 17:33:59 +01:00

.go-version

build: Update Go to v1.24.1 (#25249 )

2025-03-06 10:33:14 +00:00

.golangci.yml

build: Update Go to v1.24.1 (#25249 )

2025-03-06 10:33:14 +00:00

.semgrepignore

build: disable semgrep on structs.go for now

2022-02-01 10:09:49 -06:00

CHANGELOG-unsupported.md

Merge release 1.9.0 files

2024-10-14 07:42:14 +01:00

CHANGELOG.md

docs: update 1.10-beta changelog with major features (#25367 )

2025-03-12 10:58:46 -04:00

CODEOWNERS

Remove web team from CODEOWNERS for content directories (#24946 )

2025-01-27 08:57:58 -05:00

Dockerfile

add LICENSE to release artifacts (#20345 )

2024-04-12 10:57:15 -05:00

GNUmakefile

Prepare for next release

2025-03-11 14:09:02 +00:00

go.mod

chore(deps): bump github.com/golang-jwt/jwt/v5 from 5.2.1 to 5.2.2 (#25490 )

2025-03-24 09:27:48 -04:00

go.sum

chore(deps): bump github.com/golang-jwt/jwt/v5 from 5.2.1 to 5.2.2 (#25490 )

2025-03-24 09:27:48 -04:00

LICENSE

move license to 2024

2023-12-01 12:26:27 -08:00

main_test.go

Update copyright file headers to BUSL-1.1

2023-08-10 17:27:29 -05:00

main.go

deps: Switch from mitchellh/cli to hashicorp/cli (#19321 )

2024-12-19 15:41:11 +00:00

README.md

docs: update all URLs to developer.hashicorp.com (#16247 )

2023-10-24 11:00:11 -04:00

Vagrantfile

dev: make cni, consul, dev, docker, and vault scripts Lima compat. (#16689 )

2023-03-28 16:21:14 +01:00

README.md

Nomad

Nomad is a simple and flexible workload orchestrator to deploy and manage containers (docker, podman), non-containerized applications (executable, Java), and virtual machines (qemu) across on-prem and clouds at scale.

Nomad is supported on Linux, Windows, and macOS. A commercial version of Nomad, Nomad Enterprise, is also available.

Website: https://developer.hashicorp.com/nomad
Tutorials: HashiCorp Developer
Forum: Discuss

Nomad provides several key features:

Deploy Containers and Legacy Applications: Nomad’s flexibility as an orchestrator enables an organization to run containers, legacy, and batch applications together on the same infrastructure. Nomad brings core orchestration benefits to legacy applications without needing to containerize via pluggable task drivers.
Simple & Reliable: Nomad runs as a single binary and is entirely self contained - combining resource management and scheduling into a single system. Nomad does not require any external services for storage or coordination. Nomad automatically handles application, node, and driver failures. Nomad is distributed and resilient, using leader election and state replication to provide high availability in the event of failures.
Device Plugins & GPU Support: Nomad offers built-in support for GPU workloads such as machine learning (ML) and artificial intelligence (AI). Nomad uses device plugins to automatically detect and utilize resources from hardware devices such as GPU, FPGAs, and TPUs.
Federation for Multi-Region, Multi-Cloud: Nomad was designed to support infrastructure at a global scale. Nomad supports federation out-of-the-box and can deploy applications across multiple regions and clouds.
Proven Scalability: Nomad is optimistically concurrent, which increases throughput and reduces latency for workloads. Nomad has been proven to scale to clusters of 10K+ nodes in real-world production environments.
HashiCorp Ecosystem: Nomad integrates seamlessly with Terraform, Consul, Vault for provisioning, service discovery, and secrets management.

Quick Start

Testing

See Developer: Getting Started for instructions on setting up a local Nomad cluster for non-production use.

Optionally, find Terraform manifests for bringing up a development Nomad cluster on a public cloud in the terraform directory.

Production

See Developer: Nomad Reference Architecture for recommended practices and a reference architecture for production deployments.

Documentation

Full, comprehensive documentation is available on the Nomad website: https://developer.hashicorp.com/nomad/docs

Guides are available on HashiCorp Developer.

Roadmap

A timeline of major features expected for the next release or two can be found in the Public Roadmap.

This roadmap is a best guess at any given point, and both release dates and projects in each release are subject to change. Do not take any of these items as commitments, especially ones later than one major release away.

Contributing

See the contributing directory for more developer documentation.

Languages

Go 76.9%

MDX 11%

JavaScript 8.2%

Handlebars 1.7%

HCL 1.4%

Other 0.7%

README.md Unescape Escape

Nomad

Quick Start

Testing

Production

Documentation

Roadmap

Contributing

README.md