nomad

mirror of https://github.com/kemko/nomad.git synced 2026-01-01 16:05:42 +03:00

Author	SHA1	Message	Date
Michael Schurter	ee5059a6a7	docs: revert to labels={"foo.bar": "baz"} style (#26535 ) * docs: revert to labels={"foo.bar": "baz"} style Back in #24074 I thought it was necessary to wrap labels in a list to support quoted keys in hcl2. This... doesn't appear to be true at all? The simpler `labels={...}` syntax appears to work just fine. I updated the docs and a test (and modernized it a bit). I also switched some other examples to the `labels = {}` format from the old `labels{}` format. * copywronged * fmtd	2025-08-20 09:26:42 -07:00
Tim Gross	c8dcd3c2db	docker: clamp CPU shares to minimum of 2 (#26081 ) In #25963 we added normalization of CPU shares for large hosts where the total compute was larger than the maximum CPU shares. But if the result after normalization is less than 2, runc will have an integer overflow. We prevent this in the shared executor for the `exec`/`rawexec` driver by clamping to the safe minimum value. Do this for the `docker` driver as well and add test coverage of it for the shared executor too. Fixes: https://github.com/hashicorp/nomad/issues/26080 Ref: https://github.com/hashicorp/nomad/pull/25963	2025-06-19 13:48:06 -04:00
Conor Mongey	f7096fb9d6	docker: add cgroupns task config (#25927 )	2025-06-11 13:50:44 -04:00
dependabot[bot]	6a35c1b8ea	chore(deps): bump github.com/docker/docker from 28.1.1+incompatible to 28.2.2+incompatible (#25954 ) * chore(deps): bump github.com/docker/docker Bumps [github.com/docker/docker](https://github.com/docker/docker) from 28.1.1+incompatible to 28.2.2+incompatible. - [Release notes](https://github.com/docker/docker/releases) - [Commits](https://github.com/docker/docker/compare/v28.1.1...v28.2.2) --- updated-dependencies: - dependency-name: github.com/docker/docker dependency-version: 28.2.2+incompatible dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com> * deps: containerd/errdefs instead of docker/errdefs moby's errdefs are deprecated as of `f1bb44aeee` and now merely point to containerd's --------- Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: Daniel Bennett <dbennett@hashicorp.com>	2025-06-05 10:26:18 -04:00
Tim Gross	34e96932a1	drivers: normalize CPU shares/weights to fit large hosts (#25963 ) The `resources.cpu` field is scheduled in MHz. On most Linux task drivers, this value is then mapped to a `cpu.share` (cgroups v1) or `cpu.weight` (cgroups v2). But this means on very large hosts where the total compute is greater than the Linux kernel defined maximum CPU shares, you can't set a `resources.cpu` value large enough to consume the entire host. The `cpu.share`/`cpu.weight` value is relative within the parent cgroup's slice, which is owned by Nomad. So we can fix this by re-normalizing the weight on very large hosts such that the maximum `resources.cpu` matches up with largest possible CPU share. This happens in the task driver so that the rest of Nomad doesn't need to be aware of this implementation detail. Note that these functions will result in bad share config if the request is more than the available, but that's supposed to be caught in the scheduler so by not catching it here we intentionally hit the runc error. Fixes: https://hashicorp.atlassian.net/browse/NMD-297 Fixes: https://github.com/hashicorp/nomad/issues/7731 Ref: https://go.hashi.co/rfc/nmd-211	2025-06-03 15:57:40 -04:00
Tim Gross	374e987b9b	metrics: emit cache and rss stats on cgroup v2 (#25751 ) In cgroups v2, a different map of memory stats is available from the kernel than in v1. The Docker API reflects this change. But there are equivalent values in the map for RSS (anonymously mapped memory) and cache (filesystem cache and tmpfs), which the Docker driver is not currently emitting. Fallback to these alternate values when the cgroups v1 values are not available. Include the anonymous mapping in the "measured" allocation stats as "RSS" so that they both show up in allocation metrics. We can do this on both the `docker` driver and the Linux executor for `exec` and `java` drivers. Fixes: https://github.com/hashicorp/nomad/issues/19185 Ref: https://hashicorp.atlassian.net/browse/NMD-437 Ref: https://www.kernel.org/doc/html/latest/admin-guide/cgroup-v2.html#memory-interface-files Ref: https://www.kernel.org/doc/Documentation/cgroup-v1/memory.txt	2025-04-24 12:48:18 -04:00
Tim Gross	c7cb49f205	testing: fix a panic in docker stats collection test (#25747 ) When the context closes, the stats emitter closes its channel. It's possible for the channel to be closed in the stats emitter goroutine before the `select` in the test sees that the context has closed, which can result in a panic in the test when we try to read the empty value off the channel.	2025-04-24 10:41:03 -04:00
Piotr Kazmierczak	3ad0df71a8	docker: correct stat response for rss, cache and swap memory in cgroups v1 (#25741 ) #25138 refactoring accidentally removed some of the memory stats that weren't available as concrete types in containerapi.	2025-04-24 15:17:56 +02:00
Tim Gross	4d7ed88a8d	testing: use Docker Hub registry mirror for additional tests (#25733 ) This image was missed in https://github.com/hashicorp/nomad/pull/25703 and is resulting in rate limited in tests.	2025-04-24 08:50:32 -04:00
Tim Gross	88dc842729	testing: use Docker Hub registry mirror for CI (#25703 ) As of April 1, Docker Hub rate limits tightened. With only 10 pulls/hr/IP, we're likely to encounter test failures. Switch all Docker images getting pulled from this repository to use the HashiCorp managed registry mirror. Note that most of our tests in `drivers/docker` don't pull from the remote registry but load a local image, while others will need to pull from the remote and fetch different images depending on OS/arch. Refactor the definition of test task configuration to make it clear which is which, and de-factor some false sharing of setup functions. Updates the E2E tests to use that registry by configuring the Docker daemon. This required changing out a few container images that we don't have in the registry, but these new images are all smaller. There are a couple of tests that still use explicitly-tagged `docker.io` images or other third-party registries, which have been left in place. Ref: https://hashicorp.atlassian.net/browse/NET-12233 update E2E images to those in the registry mirror fix windows and docklog test build fix stopsignal test mop-up more mop-up	2025-04-18 14:21:49 -04:00
James Rasell	c85c723336	ci: Run core tests groups workflow on amd64 and arm64 runners. (#25695 )	2025-04-17 15:16:29 +01:00
Allison Larson	d1d8945d2e	Add docker plugin config option image_pull_timeout value for default timeout (#25489 ) * Add docker plugin config image_pull_timeout value for default timeout * Add image_pull_timeout docker plugin config to docs * Add changelog	2025-03-24 13:03:14 -07:00
Piotr Kazmierczak	e249a6197f	docker: TestDockerDriver_OOMKilled should now run on cgroups v2 (#25443 ) Docker driver's TestDockerDriver_OOMKilled should run on cgroups v2 now, since we're running docker v27 client library and our runners run docker v26 that contain containerd fix containerd/containerd#6323.	2025-03-19 16:53:37 +01:00
dependabot[bot]	459f95ce3f	chore(deps): bump github.com/docker/docker from 27.4.1+incompatible to 28.0.1+incompatible (#25405 ) Co-authored-by: James Rasell <jrasell@hashicorp.com>	2025-03-18 08:32:37 +00:00
Juana De La Cuesta	5605f9630d	Fix the docker image parser to account for private repos (#24926 ) * fix: fix the docker image parser to account for private repos * style: change the local regex for docker image indentifiers and use docker package instead * func: return early when no repo found on the image name * func: return error if no path found in image * Update drivers/docker/utils.go Co-authored-by: Tim Gross <tgross@hashicorp.com> * Update coordinator.go * Update driver.go * Update network.go --------- Co-authored-by: Tim Gross <tgross@hashicorp.com>	2025-03-04 16:53:20 +01:00
Jorge Marey	25426f0777	fingerprint: add config option to disable dmidecode (#25108 )	2025-02-13 11:20:48 -05:00
Daniel Bennett	91194b3cc2	docker: refactor to handle futures more easily (#24992 ) at least one bug has been created because it's easy to miss a future.set() in pullImageImpl() this pulls future.set() out to PullImage(), the same level where it's created and wait()ed	2025-02-07 12:45:17 -06:00
Daniel Bennett	62ef621582	docker: respect image_pull_timeout (#24991 ) I believe the docker driver stopped respecting image_pull_timeout in Nomad 1.9.0 in `981ca36049` this makes the timeout apply again	2025-02-07 11:36:31 -06:00
Daniel Bennett	3493551c38	docker: surface image pull progress error (#24981 ) set() on the future, so the caller can handle it instead of wait()ing forever and causing the allocation to get stuck "pending"	2025-02-07 10:36:09 -06:00
Michael Smithhisler	47c14ddf28	remove remote task execution code (#24909 )	2025-01-29 08:08:34 -05:00
James Rasell	0726e4cc3e	driver/docker: Fix container CPU stats collection (#24768 ) The recent change to collection via a "one-shot" Docker API call did not update the stream boolean argument. This results in the PreCPUStats values being zero and therefore breaking the CPU calculations which rely on this data. The base fix is to update the passed boolean parameter to match the desired non-streaming behaviour. The non-streaming API call correctly returns the PreCPUStats data which can be seen in the added unit test. The most recent change also modified the behaviour of the collectStats go routine, so that any error encountered results in the routine exiting. In the event this was a transient error, the container will continue to run, however, no stats will be collected until the task is stopped and replaced. This PR reverts the behaviour, so that an error encountered during a stats collection run results in the error being logged but the collection process continuing with a backoff used.	2025-01-07 07:42:31 +00:00
Vincent Ducamps	6469b59a0a	docker: Fix a bug where images with port number and no tags weren't parsed correctly	2025-01-03 11:38:43 +01:00
Juana De La Cuesta	a9e7166b6b	[gh-24339] Move from streaming stats to polling for docker (#24525 ) * fix: dont stream the docker stats, read them one by one * func: add a NewSafeTicker to the herlper functions * style: remove commented code	2024-11-21 17:36:53 +01:00
Seth Hoenig	b539b54c9e	docker: close hijacked write connection when exec ends (#24244 )	2024-10-17 11:41:29 -05:00
Seth Hoenig	b18851617f	docker: close response connection once stdin is exhausted (#24202 )	2024-10-17 11:07:23 -05:00
Piotr Kazmierczak	1ac14f4869	docker: always use API version negotiation when initializing clients (#24237 ) During a refactoring of the docker driver in #23966 we introduced a bug: API version negotiation option was not passed to every new client call.	2024-10-17 15:23:14 +02:00
Tim Gross	d12128c380	docker: use streaming stats collection to correct CPU stats (#24229 ) In #23966 we switched to the official Docker SDK for the `docker` driver. In the process we refactored code around stats collection to use the "one shot" version of stats. Unfortunately this "one shot" stats collection does not include the `PreCPU` stats, which are the stats from the previous read. This breaks the calculation we use to determine CPU ticks, because now we're subtracting 0 from the current value to get the delta. Switch back to using the streaming stats collection. Add a test that fully exercises the `TaskStats` API. Fixes: https://github.com/hashicorp/nomad/issues/24224 Ref: https://hashicorp.atlassian.net/browse/NET-11348	2024-10-17 08:25:59 -04:00
Piotr Kazmierczak	f9cbaaf6c7	docker: fix a bug where auth for private registries wasn't parsed correctly (#24215 ) In #23966 we introduced an official Docker client and did not notice that in contrast to our previous 3rd party client, the official SDK PullOptions object expects a base64 encoded JSON with username and password, instead of username/ password pair.	2024-10-16 22:04:54 +02:00
Tim Gross	e9ba630639	docker: fix script check execution (#24098 ) In #24095 we made a fix for non-streaming exec into Docker tasks for script checks and `change_mode = "script"`, but didn't complete E2E testing. We need to use `ContainerExecAttach` in the new API in order to get stdout/stderr from tasklets, but the previous `ContainerExecStart` call will prevent this from running successfully with an error that the exec has already run. * Ref: [NET-11202 (comment)](https://hashicorp.atlassian.net/browse/NET-11202?focusedCommentId=551618) * This has shipped in Nomad 1.9.0-beta.1 but not production yet. * This should fix the remaining issues in nightly E2E for Docker.	2024-10-01 16:41:38 -04:00
Tim Gross	7a88d5d626	docker: fix non-streaming exec attachment (#24095 ) In ##23966 when we switched to using the official Docker SDK client, this included new API calls for attaching to the "exec objects" created for running processes inside a running Docker task. When we updated the API for the non-streaming cases (script health checks, and `change_mode = "script"`), we used the container ID and not the exec object ID. These IDs aren't identical because you can have multiple exec objects for a given container. This results in errors like "unable to upgrade to tcp, received 404" because the Docker API can't find the exec object with the container ID. * Ref: [NET-11202 (comment)](https://hashicorp.atlassian.net/browse/NET-11202?focusedCommentId=551618) * This has shipped in Nomad 1.9.0-beta.1 but not production yet.	2024-10-01 11:27:13 -04:00
Tim Gross	bf0a65f2d6	docker: reset timer after collecting stats (#24092 ) In ##23966 when we switched to using the official Docker SDK client, we had to rework the stats collection loop for the new client. But we missed resetting the timer on the collection loop, which meant that we'd only collect stats once and then never again. * Ref: [NET-11202 (comment)](https://hashicorp.atlassian.net/browse/NET-11202?focusedCommentId=550814) * This has shipped in Nomad 1.9.0-beta.1 but not production yet.	2024-10-01 08:31:03 -04:00
Tim Gross	154aeb77af	docker: fix bug in waiting for container to exit (#24081 ) In ##23966 when we switched to using the official Docker SDK client, we had more contexts to add because most of the library methods take one. But for some APIs like waiting for a container to exit after we've started it, we never want to close this context, because the operation can outlive the Nomad agent itself.	2024-09-30 08:50:07 -04:00
Piotr Kazmierczak	ec42aa2a1b	docker: use docker errdefs instead of string comparisons when checking errors (#24075 )	2024-09-27 15:32:29 +02:00
Piotr Kazmierczak	981ca36049	docker: use official client instead of fsouza/go-dockerclient (#23966 ) This PR replaces fsouza/go-dockerclient 3rd party docker client library with docker's official SDK. --------- Co-authored-by: Tim Gross <tgross@hashicorp.com> Co-authored-by: Seth Hoenig <shoenig@duck.com>	2024-09-26 18:41:44 +02:00
Seth Hoenig	51215bf102	deps: update to go-set/v3 and refactor to use custom iterators (#23971 ) * deps: update to go-set/v3 * deps: use custom set iterators for looping	2024-09-16 13:40:10 -05:00
Tim Gross	192d70cee7	docker: update infra_image to new registry (#23927 ) The gcr.io container registry is shutting down in March. Update the default `image_image` for Docker's "pause" containers to point to the new location hosted by the k8s project. Fixes: https://github.com/hashicorp/nomad/issues/23911 Ref: https://hashicorp.atlassian.net/browse/NET-10942	2024-09-06 14:34:03 -04:00
Tim Gross	6aa503f2bb	docker: disable cpuset management for non-root clients (#23804 ) Nomad clients manage a cpuset cgroup for each task to reserve or share CPU cores. But Docker owns its own cgroups, and attempting to set a parent cgroup that Nomad manages runs into conflicts with how runc manages cgroups via systemd. Therefore Nomad must run as root in order for cpuset management to ever be compatible with Docker. However, some users running in unsupported configurations felt that the changes we made in Nomad 1.7.0 to ensure Nomad was running correctly represented a regression. This changeset disables cpuset management for non-root Nomad clients. When running Nomad as non-root, the driver will not longer reconcile cpusets with Nomad and `resources.cores` will behave incorrectly (but the driver will still run). Although this is one small step along the way to supporting a rootless Nomad client, running Nomad as non-root is still unsupported. This PR is insufficient by itself to have a secure and properly-working rootless Nomad client. Ref: https://github.com/hashicorp/nomad/issues/18211 Ref: https://github.com/hashicorp/nomad/issues/13669 Ref: https://hashicorp.atlassian.net/browse/NET-10652 Ref: https://github.com/opencontainers/runc/blob/main/docs/systemd.md	2024-08-14 16:44:13 -04:00
Tim Gross	9543e740af	docker: fix delimiter for selinux label for read-only volumes (#23750 ) The Docker driver's `volume` field to specify bind-mounts takes a list of strings that consist of three `:`-delimited fields: source, destination, and options. We append the SELinux label from the plugin configuration as the third field. But when the user has already specified the volume is read-only with `:ro`, we're incorrectly appending the SELinux label with another `:` instead of the required `,`. Combine the options into a single field value before appending them to the bind mounts configuration. Updated the tests to split out Windows behavior (which doesn't accept options) and to ensure the test task has the expected environment for bind mounts. Fixes: https://github.com/hashicorp/nomad/issues/23690	2024-08-08 09:08:01 -04:00
Piotr Kazmierczak	f22ce921cd	docker: adjust capabilities on Windows (#23599 ) Adjusts Docker capabilities per OS, and checks for runtime on Windows. --------- Co-authored-by: Tim Gross <tgross@hashicorp.com>	2024-07-17 09:01:45 +02:00
Piotr Kazmierczak	d5e1515e80	docker: default to hyper-v isolation on Windows (#23452 )	2024-07-01 08:56:43 +02:00
Piotr Kazmierczak	0ece7b5c16	docker: validate that containers do not run as ContainerAdmin on Windows (#23443 ) This enables checks for ContainerAdmin user on docker images on Windows. It's only checked if users run docker with process isolation and not hyper-v, because hyper-v provides its own, proper sandboxing. --------- Co-authored-by: Tim Gross <tgross@hashicorp.com>	2024-06-27 16:22:24 +02:00
Piotr Kazmierczak	830297bcf0	docker: update image in TestDockerDriver_Start_Image_HTTPS (#23309 )	2024-06-12 16:13:39 +02:00
Piotr Kazmierczak	0e8a67f0e1	docker: oom_score_adj support (#23297 )	2024-06-12 10:49:20 +02:00
Tim Gross	71fd5c2474	testing: pull Docker images from mirror (#23190 ) In https://github.com/hashicorp/nomad/pull/17401 we added test helpers that would allow `docker` driver tests to pull from a mirror of the Docker Hub registry. Extend the use of this helper a test that recently hit rate-limiting. Fixes: https://github.com/hashicorp/nomad/issues/23174	2024-06-06 11:21:45 -04:00
Piotr Kazmierczak	307fd590d7	docker: new container_exists_attempts configuration field (#22419 ) This allows users to set a custom value of attempts that will be made to purge an existing (not running) container if one is found during task creation. --------- Co-authored-by: Tim Gross <tgross@hashicorp.com>	2024-05-30 19:22:14 +02:00
Piotr Kazmierczak	bf11e39ac8	docker: add a unit test for "container already exists" error when creating containers (#22238 )	2024-05-30 11:24:28 +02:00
Seth Hoenig	825efc3925	docker: use correct effective cpuset filename on legacy cgroups v1 systems (#20294 )	2024-04-05 08:05:51 -05:00
Yorick Gersie	6124ee8afb	cpuset fixer: use correct cgroup path for updates (#20276 ) * cpuset fixer: use correct cgroup path for updates fixes #20275 * docker: flatten switch statement and add test cases * cl: add cl --------- Co-authored-by: Seth Hoenig <shoenig@duck.com>	2024-04-04 15:54:10 -05:00
Seth Hoenig	05937ab75b	exec2: add client support for unveil filesystem isolation mode (#20115 ) * exec2: add client support for unveil filesystem isolation mode This PR adds support for a new filesystem isolation mode, "Unveil". The mode introduces a "alloc_mounts" directory where tasks have user-owned directory structure which are bind mounts into the real alloc directory structure. This enables a task driver to use landlock (and maybe the real unveil on openbsd one day) to isolate a task to the task owned directory structure, providing sandboxing. * actually create alloc-mounts-dir directory * fix doc strings about alloc mount dir paths	2024-03-13 08:24:17 -05:00
Seth Hoenig	4d83733909	tests: swap testify for test in more places (#20028 ) * tests: swap testify for test in plugins/csi/client_test.go * tests: swap testify for test in testutil/ * tests: swap testify for test in host_test.go * tests: swap testify for test in plugin_test.go * tests: swap testify for test in utils_test.go * tests: swap testify for test in scheduler/ * tests: swap testify for test in parse_test.go * tests: swap testify for test in attribute_test.go * tests: swap testify for test in plugins/drivers/ * tests: swap testify for test in command/ * tests: fixup some test usages * go: run go mod tidy * windows: cpuset test only on linux	2024-02-29 12:11:35 -06:00

1 2 3 4 5 ...

437 Commits