nomad

mirror of https://github.com/kemko/nomad.git synced 2026-01-06 18:35:44 +03:00

Author	SHA1	Message	Date
Tim Gross	b7419bc940	api: only set url field in config if previously unset (#23785 ) In #16872 we added support for configuring the API client with a unix domain socket. In order to set the host correctly, we parse the address before mutating the Address field in the configuration. But this prevents the configuration from being reused across multiple clients, as the next time we parse the address it will no longer be pointing to the socket. This breaks consumers like the autoscaler, which reuse the API config between plugins. Update the `NewClient` constructor to only override the `url` field if it hasn't already been parsed. Include a test demonstrating safe reuse with a unix domain socket. Ref: https://github.com/hashicorp/nomad-autoscaler/issues/944 Ref: https://github.com/hashicorp/nomad-autoscaler/pull/945	2024-08-09 13:28:04 -04:00
Tim Gross	920f4702d6	testing: fix skip comment on `RequireWindows` helper (#23776 )	2024-08-09 09:07:25 -04:00
Tim Gross	ef116b12d5	metrics: add `client.tasks` state metrics (#23773 ) Although we have `client.allocations` metrics to track allocation states on a client, having separate metrics for `client.tasks` will allow operators to identify that there are individual tasks in an unexpected state in an otherwise healthy allocation. Fixes: https://github.com/hashicorp/nomad/issues/23770	2024-08-09 09:02:17 -04:00
VPanteleev-S7	2e5d6192a7	docs: illustrate how to use the obtained token (#19557 ) Currently, the page doesn't explain how to do things as the logged in user.	2024-08-08 15:35:26 -04:00
Kartik Prajapati	3a3e63e2e1	cli: add role update functionality to acl token update (#18532 )	2024-08-08 15:33:36 -04:00
Farbod Ahmadian	bb4c4fbd49	cli: show warning when creating token if policy doesn't exist (#16437 )	2024-08-08 11:04:55 -04:00
Tim Gross	9543e740af	docker: fix delimiter for selinux label for read-only volumes (#23750 ) The Docker driver's `volume` field to specify bind-mounts takes a list of strings that consist of three `:`-delimited fields: source, destination, and options. We append the SELinux label from the plugin configuration as the third field. But when the user has already specified the volume is read-only with `:ro`, we're incorrectly appending the SELinux label with another `:` instead of the required `,`. Combine the options into a single field value before appending them to the bind mounts configuration. Updated the tests to split out Windows behavior (which doesn't accept options) and to ensure the test task has the expected environment for bind mounts. Fixes: https://github.com/hashicorp/nomad/issues/23690	2024-08-08 09:08:01 -04:00
johncooler	3214c2bd62	docs: remove duplicate config option (#23768 )	2024-08-08 08:25:25 -04:00
Tim Gross	bcb0ee3031	identity: prevent missing task identity from panicking server (#23763 ) Tasks have a default identity created during canonicalization. If this default identity is somehow missing, we'll hit panics when trying to create and sign the claims in the plan applier. Fallback to the default identity if it's missing from the task. This changeset will need a different implementation in 1.7.x+ent backports, as the constructor for identities was refactored significantly in #23708. The panic cannot occur in 1.6.x+ent. Fixes: https://github.com/hashicorp/nomad/issues/23758	2024-08-07 15:26:20 -04:00
Daniel Bennett	d131c41943	cni: network.cni job updates should replace allocs (#23764 ) a change to the network{cni{}} block means that the user wants the network config to change, and that only happens during initial alloc setup, so we need to replace the alloc(s) to get fresh network(s) to reconfigure from scratch. e.g. a job plan diff like this ``` +/- Task Group: "g" (1 in-place update) + Network { + CNIConfig { + a: "ayy" } ``` should instead be ``` +/- Task Group: "g" (1 create/destroy update) + Network { + CNIConfig { + a: "ayy" } ```	2024-08-07 12:13:11 -05:00
Aimee Ukasick	20511fa64d	docs: Clarify namespace rules matching criteria. (#23752 ) Clarify how Nomad evaluates policy rules. Fixes: #20118 Jira: https://hashicorp.atlassian.net/browse/CE-695 Related tutorial PR: https://github.com/hashicorp/tutorials/pull/2205	2024-08-07 09:28:38 -04:00
Tim Gross	4a5921cb16	acl: disallow leading `/` on variable paths (#23757 ) The path for a Variable never begins with a leading `/`, because it's stripped off in the API before it ever gets to the state store. The CLI and UI allow the leading `/` for convenience, but this can be misleading when it comes to writing ACL policies. An ACL policy with a path starting with a leading `/` will never match. Update the ACL policy parser so that we prevent an incorrect variable path in the policy. Fixes: https://github.com/hashicorp/nomad/issues/23730	2024-08-07 09:26:18 -04:00
Juana De La Cuesta	218fa82d02	Merge pull request #23756 from hashicorp/backport-versions build: revert change to active versions	2024-08-07 08:59:14 +02:00
Tim Gross	8a774702c7	build: revert change to active versions In #23720 we removed the 1.7.x and 1.6.x release branches from the "active" versions for backports. But we forgot that we needed these backports to remain active for documentation backports because of versioned docs.	2024-08-06 16:33:01 -04:00
Aimee Ukasick	021692eccf	docs: refactor CNI plugin content (#23707 ) - Pulled common content from multiple pages into new partials - Refactored install/index to be OS-based so I could add linux-distro-based instructions to install-consul-cni-plugins.mdx partial. The tab groups on the install/index page do match and change focus as expected. - Moved CNI overview-type content to networking/index - Refactored networking/cni to include install CNI plugins and configuration content (from install/index). - Moved CNI plugins explanation in bridge mode configuration section into bullet points. They had been #### headings, which aren't rendered in the R page TOC. I tried to simplify and format the bullet point content to be easier to scan. Ref: https://hashicorp.atlassian.net/browse/CE-661 Fixes: https://github.com/hashicorp/nomad/issues/23229 Fixes: https://github.com/hashicorp/nomad/issues/23583	2024-08-06 14:47:46 -04:00
Deniz Onur Duzgun	0f7b8698ec	security: fix write symlink escape on the same allocdir path (#23738 ) Resolves symlink escape when unarchiving by removing existing paths within the same allocation directory which can occur by writing a header that points to a symlink that lives outside of the sandbox environment. This exploit requires first compromising the Nomad client agent at the source allocation. Ref: https://hashicorp.atlassian.net/browse/NET-10607 Ref: https://github.com/hashicorp/nomad-enterprise/pull/1725	2024-08-05 16:23:27 -04:00
Tim Gross	b25f1b66ce	resources: allow job authors to configure size of secrets tmpfs (#23696 ) On supported platforms, the secrets directory is a 1MiB tmpfs. But some tasks need larger space for downloading large secrets. This is especially the case for tasks using `templates`, which need extra room to write a temporary file to the secrets directory that gets renamed to the old file atomically. This changeset allows increasing the size of the tmpfs in the `resources` block. Because this is a memory resource, we need to include it in the memory we allocate for scheduling purposes. The task is already prevented from using more memory in the tmpfs than the `resources.memory` field allows, but can bypass that limit by writing to the tmpfs via `template` or `artifact` blocks. Therefore, we need to account for the size of the tmpfs in the allocation resources. Simply adding it to the memory needed when we create the allocation allows it to be accounted for in all downstream consumers, and then we'll subtract that amount from the memory resources just before configuring the task driver. For backwards compatibility, the default value of 1MiB is "free" and ignored by the scheduler. Otherwise we'd be increasing the allocated resources for every existing alloc, which could cause problems across upgrades. If a user explicitly sets `resources.secrets = 1` it will no longer be free. Fixes: https://github.com/hashicorp/nomad/issues/2481 Ref: https://hashicorp.atlassian.net/browse/NET-10070	2024-08-05 16:06:58 -04:00
Tim Gross	e684636aed	cli: add option to return original HCL in `job inspect` command (#23699 ) In 1.6.0 we shipped the ability to review the original HCL in the web UI, but didn't follow-up with an equivalent in the command line. Add a `-hcl` flag to the `job inspect` command. Closes: https://github.com/hashicorp/nomad/issues/6778	2024-08-05 15:35:18 -04:00
Tim Gross	bc50eebebd	workload identity: add support for extra claims config for Vault (#23675 ) Although we encourage users to use Vault roles, sometimes they're going to want to assign policies based on entity and pre-create entities and aliases based on claims. This allows them to use single default role (or at least small number of them) that has a templated policy, but have an escape hatch from that. When defining Vault entities the `user_claim` must be unique. When writing Vault binding rules for use with Nomad workload identities the binding rule won't be able to create a 1:1 mapping because the selector language allows accessing only a single field. The `nomad_job_id` claim isn't sufficient to uniquely identify a job because of namespaces. It's possible to create a JWT auth role with `bound_claims` to avoid this becoming a security problem, but this doesn't allow for correct accounting of user claims. Add support for an `extra_claims` block on the server's `default_identity` blocks for Vault. This allows a cluster administrator to add a custom claim on all allocations. The values for these claims are interpolatable with a limited subset of fields, similar to how we interpolate the task environment. Fixes: https://github.com/hashicorp/nomad/issues/23510 Ref: https://hashicorp.atlassian.net/browse/NET-10372 Ref: https://hashicorp.atlassian.net/browse/NET-10387	2024-08-05 15:01:54 -04:00
Aimee Ukasick	cbacdb2041	DOCS: CE-659 chroot limitations for isolated fork/exec driver (#23739 )	2024-08-05 14:35:54 -04:00
Tim Gross	0b9defe6e4	refactor identity claims constructor to builder (#23708 ) Upcoming work to add extensibility to identity claims for Vault (ref #23510) will require exposing server configuration and more objects from state to the process of creating an `IdentityClaims` struct. Depending on how we inject these parameters into the constructor, we end up creating circular dependencies or a lot more logic in the setup in the plan applier and alloc endpoint. There are three contexts where we call `NewIdentityClaims`: the plan applier (where we only care about the default identity), signing task identities, and signing service identities. Each needs different parameters. So we'll refactor the constructor as a builder with methods that the caller can decide to use (or not) depending on context. I've pulled this work out of #23675 to make it easier to review separately. Ref: https://github.com/hashicorp/nomad/issues/23510 Ref: https://github.com/hashicorp/nomad/pull/23675 Ref: https://hashicorp.atlassian.net/browse/NET-10372 Ref: https://hashicorp.atlassian.net/browse/NET-10387	2024-08-05 14:31:56 -04:00
Tim Gross	9ff7437b06	docs: document `client.alloc_mounts_dir` configuration (#23733 ) In Nomad 1.8.0 we introduced the `alloc_mounts_dir` to support unveil filesystem isolation, but we didn't document the configuration value.	2024-08-05 11:59:47 -04:00
dependabot[bot]	2c8ee29ade	chore(deps): bump github.com/moby/term (#23587 ) Bumps [github.com/moby/term](https://github.com/moby/term) from 0.0.0-20210619224110-3f7ff695adc6 to 0.5.0. - [Commits](https://github.com/moby/term/commits/v0.5.0) --- updated-dependencies: - dependency-name: github.com/moby/term dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2024-08-05 11:47:23 -04:00
dependabot[bot]	1ba16f11ec	chore(deps): bump github.com/containernetworking/cni from 1.1.2 to 1.2.3 (#23701 ) Bumps [github.com/containernetworking/cni](https://github.com/containernetworking/cni) from 1.1.2 to 1.2.3. - [Release notes](https://github.com/containernetworking/cni/releases) - [Commits](https://github.com/containernetworking/cni/compare/v1.1.2...v1.2.3) --- updated-dependencies: - dependency-name: github.com/containernetworking/cni dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2024-08-05 11:45:57 -04:00
dependabot[bot]	8e6ccf38ff	chore(deps): bump github.com/docker/docker (#23731 ) Bumps [github.com/docker/docker](https://github.com/docker/docker) from 27.0.2+incompatible to 27.1.1+incompatible. - [Release notes](https://github.com/docker/docker/releases) - [Commits](https://github.com/docker/docker/compare/v27.0.2...v27.1.1) --- updated-dependencies: - dependency-name: github.com/docker/docker dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2024-08-05 11:41:54 -04:00
Jeanne Angeles Franco	cfce3e56be	Add artifacts manifest (automatically generated) (#23719 )	2024-08-02 09:15:43 -04:00
Tim Gross	0c01b2d4e4	build: remove inactive CE versions from being backport eligible (#23720 ) As of Nomad 1.8.0 LTS we're no longer backporting changes to Nomad CE versions 1.7.x and 1.6.x. We never got around to removing the flags in the file the backport assistant uses to control automated backports.	2024-08-01 14:56:34 -04:00
Tim Gross	9d4686c0df	tls: remove deprecated `prefer_server_cipher_suites` field (#23712 ) The TLS configuration object includes a deprecated `prefer_server_cipher_suites` field. In version of Go prior to 1.17, this property controlled whether a TLS connection would use the cipher suites preferred by the server or by the client. This field is ignored as of 1.17 and, according to the `crypto/tls` docs: "Servers now select the best mutually supported cipher suite based on logic that takes into account inferred client hardware, server hardware, and security." This property has been long-deprecated and leaving it in place may lead to false assumptions about how cipher suites are negotiated in connection to a server. So we want to remove it in Nomad 1.9.0. Fixes: https://github.com/hashicorp/nomad-enterprise/issues/999 Ref: https://hashicorp.atlassian.net/browse/NET-10531	2024-08-01 08:52:05 -04:00
Tim Gross	8c5ae0783d	docs: fix incorrect header size in changelog (#23716 )	2024-08-01 08:50:16 -04:00
Tim Gross	2ee6043cab	tls: support setting min version to TLS1.3 (#23713 ) Nomad already supports TLS1.3, but not as a minimum version configuration. Update our config validation to allow setting `tls_min_version` to 1.3. Update the documentation to match Vault and warn that the `tls_cipher_suites` field is ignored when TLS is 1.3 Fixes: https://github.com/hashicorp/nomad/issues/20131 Ref: https://hashicorp.atlassian.net/browse/NET-10530	2024-08-01 08:46:32 -04:00
Juana De La Cuesta	2e62c37676	Merge pull request #23618 from hashicorp/nj-remove-wrong-changelog fix: Remove the changelog from v1.6.13	2024-08-01 10:04:35 +02:00
Adrian Todorov	b1212bae59	Fix a Sentinel policy template (#23686 ) Fix a multiline comment missing an opening `#`	2024-07-31 14:18:14 -04:00
sbenoistmics	8ce942b5b3	Allow static ports duplication on network host with multiple matches (#23702 )	2024-07-30 11:39:56 -05:00
Tim Gross	c06859e5bc	docs: add note about removing support for older clients in 1.9 (#23695 ) In Nomad 1.6.0 we started sending the node secret with RPCs that previously did not include it. We planned to deprecate the older auth workflow but didn't set a release. Removing the legacy support means that nodes running <1.6.0 will fail to heartbeat. Ref: https://hashicorp.atlassian.net/browse/NET-10009	2024-07-26 13:26:24 -04:00
Phil Renaud	08eab0045a	[ui] Unescape csi/ in plugin GET requests (#23625 ) * Unescape csi/ in plugin GET requests * CSI de-prefixing no longer necessary in mirage mocked endpoint	2024-07-26 13:19:42 -04:00
Tim Gross	d5ca07a247	docs: notices of upcoming deprecations and backports (#23683 ) Add a section to the docs describing planned upcoming deprecations and removals. Also added some missing upgrade guide sections missed during the last release.	2024-07-25 10:20:18 -04:00
James Rasell	6a7eb15590	release: explicitly set Linux service file user and group param. (#23687 ) The default root:root is used as this provides permissions to run both server and client agents. The comment details what changes can be made to operators if needed. When running the service file prior to this change, root:root would be the default.	2024-07-25 15:18:47 +01:00
Tim Gross	92d216f3b8	scaling: fix state store corruption bug for job scaling events (#23673 ) When updating a `JobScalingEvent`, the state store function did not copy the existing object before mutating it. This corrupts the state store because it modifies the leaf node without committing it in a transaction. It can also cause the Nomad server to crash with a "fatal error: concurrent map read and map write" if its `ScalingEvents` map is read via the `ScaleStatus` RPC at the same time as it's being written. This changeset also removes some mostly-unused public methods on the struct that dangerously encourage you to mutate it outside of a copy. Ref: https://hashicorp.atlassian.net/browse/NET-10529	2024-07-24 09:18:03 -04:00
Tim Gross	c8be863bc8	reporting: allow export interval and address to be configurable (#23674 ) The go-census library supports configuration to send metrics to a local development version of the collector. Add "undocumented" configuration options to the `reporting` block allow developers to debug and verify we're sending the data we expect with real Nomad servers and not just unit tests. Ref: https://hashicorp.atlassian.net/browse/NET-10057 Ref: https://github.com/hashicorp/nomad-enterprise/pull/1708	2024-07-24 08:29:59 -04:00
Tim Gross	c280891703	template: allow change_mode script to run after client restart (#23663 ) For templates with `change_mode = "script"`, we set a driver handle in the poststart method, so the template runner can execute the script inside the task. But when the client is restarted and the template contents change during that window, we trigger a change_mode in the prestart method. In that case, the hook will not have the handle and so returns an errror trying to run the change mode. We restore the driver handle before we call any prestart hooks, so we can pass that handle in the constructor whenever it's available. In the normal task start case the handle will be empty but also won't be called. The error messages are also misleading, as there's no capabilities check happening here. Update the error messages to match. Fixes: https://github.com/hashicorp/nomad/issues/15851 Ref: https://hashicorp.atlassian.net/browse/NET-9338	2024-07-24 08:29:39 -04:00
Deniz Onur Duzgun	7a2c70e3f6	deps: bump azidentity to v1.7.0 (#23664 )	2024-07-22 15:03:19 -04:00
Daniel Bennett	10d3f1749b	e2e: test all cni config formats (#23650 )	2024-07-22 10:17:03 -05:00
Tim Gross	0f4014b4a9	docs: external KMS configuration (#23600 ) In #23580 we're implementing support for encrypting Nomad's key material with external KMS providers or Vault Transit. This changeset breaks out the documentation from that PR to keep the review manageable and present it to a wider set of reviewers. Ref: https://hashicorp.atlassian.net/browse/NET-10334 Ref: https://github.com/hashicorp/nomad/issues/14852 Ref: https://github.com/hashicorp/nomad/pull/23580	2024-07-19 15:08:54 -04:00
Tim Gross	a29f9b6fc0	keyring: E2E testing for KMS/rotation (#23601 ) In #23580 we're implementing support for encrypting Nomad's key material with external KMS providers or Vault Transit. This changeset breaks out the E2E infrastructure and testing from that PR to keep the review manageable. Ref: https://hashicorp.atlassian.net/browse/NET-10334 Ref: https://github.com/hashicorp/nomad/issues/14852 Ref: https://github.com/hashicorp/nomad/pull/23580	2024-07-19 13:49:48 -04:00
Tim Gross	2f4353412d	keyring: support prepublishing keys (#23577 ) When a root key is rotated, the servers immediately start signing Workload Identities with the new active key. But workloads may be using those WI tokens to sign into external services, which may not have had time to fetch the new public key and which might try to fetch new keys as needed. Add support for prepublishing keys. Prepublished keys will be visible in the JWKS endpoint but will not be used for signing or encryption until their `PublishTime`. Update the periodic key rotation to prepublish keys at half the `root_key_rotation_threshold` window, and promote prepublished keys to active after the `PublishTime`. This changeset also fixes two bugs in periodic root key rotation and garbage collection, both of which can't be safely fixed without implementing prepublishing: * Periodic root key rotation would never happen because the default `root_key_rotation_threshold` of 720h exceeds the 72h maximum window of the FSM time table. We now compare the `CreateTime` against the wall clock time instead of the time table. (We expect to remove the time table in future work, ref https://github.com/hashicorp/nomad/issues/16359) * Root key garbage collection could GC keys that were used to sign identities. We now wait until `root_key_rotation_threshold` + `root_key_gc_threshold` before GC'ing a key. * When rekeying a root key, the core job did not mark the key as inactive after the rekey was complete. Ref: https://hashicorp.atlassian.net/browse/NET-10398 Ref: https://hashicorp.atlassian.net/browse/NET-10280 Fixes: https://github.com/hashicorp/nomad/issues/19669 Fixes: https://github.com/hashicorp/nomad/issues/23528 Fixes: https://github.com/hashicorp/nomad/issues/19368	2024-07-19 13:29:41 -04:00
Piotr Kazmierczak	78bc8e7843	scheduler: fix TestAllocSet_filterByTainted (#23648 )	2024-07-19 17:41:06 +02:00
Tim Gross	a8ab2d13b4	docs: explain how to use insecure registries with Docker (#23642 ) The documentation for the `SSL` option for the Docker driver is misleading inasmuch as it's both deprecated and non-functional in current versions of Docker. Remove this option from the docs and add a section explaining how to use insecure registries. Fixes: https://github.com/hashicorp/nomad/issues/23616	2024-07-19 11:18:47 -04:00
Phil Renaud	5df6fe6a57	[ui] Pack metadata booleanified and added to the statuses endpoint (#23404 ) * Pack metadata booleanified and added to the statuses endpoint * Simplify job.IsPack declaration	2024-07-19 11:16:43 -04:00
dependabot[bot]	cf6ce224b3	chore(deps): bump github.com/hashicorp/go-checkpoint (#23588 )	2024-07-19 15:13:43 +01:00
James Rasell	de0a86a55a	docs: Fix ACL login API path documentation. (#23624 )	2024-07-19 11:56:15 +01:00

1 2 3 4 5 ...

26059 Commits