nomad

mirror of https://github.com/kemko/nomad.git synced 2026-01-06 02:15:43 +03:00

Author	SHA1	Message	Date
grembo	6f04b91912	Add `disable_file` parameter to job's `vault` stanza (#13343 ) This complements the `env` parameter, so that the operator can author tasks that don't share their Vault token with the workload when using `image` filesystem isolation. As a result, more powerful tokens can be used in a job definition, allowing it to use template stanzas to issue all kinds of secrets (database secrets, Vault tokens with very specific policies, etc.), without sharing that issuing power with the task itself. This is accomplished by creating a directory called `private` within the task's working directory, which shares many properties of the `secrets` directory (tmpfs where possible, not accessible by `nomad alloc fs` or Nomad's web UI), but isn't mounted into/bound to the container. If the `disable_file` parameter is set to `false` (its default), the Vault token is also written to the NOMAD_SECRETS_DIR, so the default behavior is backwards compatible. Even if the operator never changes the default, they will still benefit from the improved behavior of Nomad never reading the token back in from that - potentially altered - location.	2023-06-23 15:15:04 -04:00
Luiz Aoqui	f4c7182873	node pools: apply node pool scheduler configuration (#17598 )	2023-06-21 20:31:50 -04:00
Samantha	7ef1905333	check: Add support for Consul field tls_server_name (#17334 )	2023-06-02 10:19:12 -04:00
Tim Gross	2d059bbf22	node pools: add `node_pool` field to job spec (#17379 ) This changeset only adds the `node_pool` field to the jobspec, and ensures that it gets picked up correctly as a change. Without the rest of the implementation landed yet, the field will be ignored.	2023-06-01 16:08:55 -04:00
Seth Hoenig	9ff1d927d9	docs: fixup example of readiness check (#17296 ) A "readiness" check implies a failing healthcheck will not cause the deployment of a service to stop - i.e. it is only used as a liveness probe in the context of service discoverability. Fix our docs example to reflect that a readiness check is created by setting on_update to "ignore" (as opposed to "ignore_warnings").	2023-05-23 15:29:10 -05:00
Mike Nomitch	3db97acf8a	docs: add documentation on ephemeral disk and logs (#15829 )	2023-05-17 16:58:11 -04:00
Roman Zipp	22f5217b85	docs: remove unneeded brackets from job specification template docs (#17219 )	2023-05-17 16:45:00 -04:00
Tim Gross	2aa3c746c4	logs: fix missing allocation logs after update to Nomad 1.5.4 (#17087 ) When the server restarts for the upgrade, it loads the `structs.Job` from the Raft snapshot/logs. The jobspec has long since been parsed, so none of the guards around the default value are in play. The empty field value for `Enabled` is the zero value, which is false. This doesn't impact any running allocation because we don't replace running allocations when either the client or server restart. But as soon as any allocation gets rescheduled (ex. you drain all your clients during upgrades), it'll be using the `structs.Job` that the server has, which has `Enabled = false`, and logs will not be collected. This changeset fixes the bug by adding a new field `Disabled` which defaults to false (so that the zero value works), and deprecates the old field. Fixes #17076	2023-05-04 16:01:18 -04:00
James Rasell	06e877f26b	docs: update artifact jobspec sshkey example path. (#17077 )	2023-05-04 14:29:36 +01:00
Seth Hoenig	7744caed48	connect: use explicit docker.io prefix in default envoy image names (#17045 ) This PR modifies references to the envoyproxy/envoy docker image to explicitly include the docker.io prefix. This does not affect existing users, but makes things easier for Podman users, who otherwise need to specify the full name because Podman does not default to docker.io	2023-05-02 09:27:48 -05:00
Seth Hoenig	8919997896	docs: add more notes about artifact breaking changes in 1.5.0 (#17005 ) * changelog: note artifact breaking changes for 1.5.0 * docs: add note about environment variables to artifact job spec docs * Update website/content/docs/job-specification/artifact.mdx Co-authored-by: Luiz Aoqui <luiz@hashicorp.com> --------- Co-authored-by: Luiz Aoqui <luiz@hashicorp.com>	2023-04-27 11:41:18 -05:00
Tim Gross	30bc456f03	logs: allow disabling log collection in jobspec (#16962 ) Some Nomad users ship application logs out-of-band via syslog. For these users having `logmon` (and `docker_logger`) running is unnecessary overhead. Allow disabling the logmon and pointing the task's stdout/stderr to /dev/null. This changeset is the first of several incremental improvements to log collection short of full-on logging plugins. The next step will likely be to extend the internal-only task driver configuration so that cluster administrators can turn off log collection for the entire driver. --- Fixes: #11175 Co-authored-by: Thomas Weber <towe75@googlemail.com>	2023-04-24 10:00:27 -04:00
Tim Gross	b8a472d692	ephemeral disk: `migrate` should imply `sticky` (#16826 ) The `ephemeral_disk` block's `migrate` field allows for best-effort migration of the ephemeral disk data to new nodes. The documentation says the `migrate` field is only respected if `sticky=true`, but in fact if client ACLs are not set the data is migrated even if `sticky=false`. The existing behavior when client ACLs are disabled has existed since the early implementation, so "fixing" that case now would silently break backwards compatibility. Additionally, having `migrate` not imply `sticky` seems nonsensical: it suggests that if we place on a new node we migrate the data but if we place on the same node, we throw the data away! Update so that `migrate=true` implies `sticky=true` as follows: * The failure mode when client ACLs are enabled comes from the server not passing along a migration token. Update the server so that the server provides a migration token whenever `migrate=true` and not just when `sticky=true` too. * Update the scheduler so that `migrate` implies `sticky`. * Update the client so that we check for `migrate \|\| sticky` where appropriate. * Refactor the E2E tests to move them off the old framework and make the intention of the test more clear.	2023-04-07 16:33:45 -04:00
Horacio Monsalvo	5957880112	connect: add meta on ConsulSidecarService (#16705 ) Co-authored-by: Sol-Stiep <sol.stiep@southworks.com>	2023-03-30 16:09:28 -04:00
James Rasell	39ec124bb8	docs: detail support for Nomad checks in service block. (#16598 )	2023-03-22 09:27:58 +01:00
Suselz	5309325621	Update csi_plugin.mdx (#16584 ) Co-authored-by: James Rasell <jrasell@users.noreply.github.com>	2023-03-21 16:16:18 +01:00
Michael Schurter	46ae1025bb	docs: dispatch_payload and jobs api docs had some weirdness (#16514 ) * docs: dispatch_payload docs had some weirdness Docs said "Examples" when there was only 1 example. Not sure what the floating "to" in the description was for. * docs: missing a heading level on jobs api docs	2023-03-16 09:42:46 -07:00
Tim Gross	101e5d0225	docs: clarify migration behavior under `nomad alloc stop` (#16468 )	2023-03-14 09:00:29 -04:00
Tim Gross	03d6a8c70a	docs: note that secrets dir is usually mounted `noexec` (#16363 )	2023-03-07 11:57:15 -05:00
Alessio Perugini	365ccf4377	Allow configurable range of Job priorities (#16084 )	2023-02-17 09:23:13 -05:00
Seth Hoenig	7ffb0b1102	docs: remove cores/memory beta label, update driver cpu docs (#16175 ) * docs: remove cores/memory beta label, update driver cpu docs * docs: fixup cr stuff	2023-02-14 14:43:07 -06:00
Charlie Voiselle	29893023f7	Add information about template to interpolation page (#10807 ) * Add information about templating using `env` function to refer to environment variables.	2023-02-10 16:12:11 -05:00
Michael Schurter	eabb47e2d0	Workload Identity, Task API, and Dynamic Node Metadata Docs (#16102 ) * docs: add dynamic node metadata api docs Also update all paths in the client API docs to explicitly state the `/v1/` prefix. We're inconsistent about that, but I think it's better to display the full path than to only show the fragment. If we ever do a `/v2/` whether or not we explicitly state `/v1/` in our docs won't be our greatest concern. * docs: add task-api docs	2023-02-09 16:03:43 -08:00
Bryce Kalow	84ed398e8d	docs: fix outstanding content conformance errors (#16040 )	2023-02-02 15:40:07 -06:00
Tim Gross	ba20138ffd	System and sysbatch jobs always have zero index (#16030 ) Service jobs should have unique allocation Names, derived from the Job.ID. System jobs do not have unique allocation Names because the index is intended to indicated the instance out of a desired count size. Because system jobs do not have an explicit count but the results are based on the targeted nodes, the index is less informative and this was intentionally omitted from the original design. Update docs to make it clear that NOMAD_ALLOC_INDEX is always zero for system/sysbatch jobs Validate that `volume.per_alloc` is incompatible with system/sysbatch jobs. System and sysbatch jobs always have a `NOMAD_ALLOC_INDEX` of 0. So interpolation via `per_alloc` will not work as soon as there's more than one allocation placed. Validate against this on job submission.	2023-02-02 16:18:01 -05:00
Charlie Voiselle	fe4ff5be2a	Add option to expose workload token to task (#15755 ) Add `identity` jobspec block to expose workload identity tokens to tasks. --------- Co-authored-by: Anders <mail@anars.dk> Co-authored-by: Tim Gross <tgross@hashicorp.com> Co-authored-by: Michael Schurter <mschurter@hashicorp.com>	2023-02-02 10:59:14 -08:00
Daniel Bennett	9f583f57f5	Change `job init` default to example`.nomad.hcl` and recommend in docs (#15997 ) recommend .nomad.hcl for job files instead of .nomad (without .hcl) * nomad job init -> example.nomad.hcl * update docs	2023-02-02 11:47:47 -06:00
jmwilkinson	46f3977db2	Allow wildcard datacenters to be specified in job file (#11170 ) Also allows for default value of `datacenters = ["*"]`	2023-02-02 09:57:45 -05:00
Glen Yu	813fd6ed98	docs: removed extra 'end' in one of the code blocks in template stanza documentation (#15963 )	2023-01-31 13:55:10 -05:00
Charlie Voiselle	ca597f7a3e	Fix broken link, typo, style edits. (#15968 )	2023-01-30 15:52:43 -05:00
Sudharshan K S	a5f568a6f3	Corrected a typo (#15942 )	2023-01-30 15:18:18 -05:00
Piotr Kazmierczak	949a6f60c7	renamed stanza to block for consistency with other projects (#15941 )	2023-01-30 15:48:43 +01:00
舍我其谁	69b08bb706	volume: Add the missing option propagation_mode (#15626 )	2023-01-30 09:32:07 -05:00
Dao Thanh Tung	031765bc39	Fix documentation for `meta` block: string replacement in key from `-` to `_` (#15940 ) Signed-off-by: dttung2905 <ttdao.2015@accountancy.smu.edu.sg>	2023-01-30 14:51:04 +01:00
Yorick Gersie	24a575ab80	Allow per_alloc to be used with host volumes (#15780 ) Disallowing per_alloc for host volumes in some cases makes life of a nomad user much harder. When we rely on the NOMAD_ALLOC_INDEX for any configuration that needs to be re-used across restarts we need to make sure allocation placement is consistent. With CSI volumes we can use the `per_alloc` feature but for some reason this is explicitly disabled for host volumes. Ensure host volumes understand the concept of per_alloc	2023-01-26 09:14:47 -05:00
Luiz Aoqui	6b01bbb507	docs: add caveat on dynamic blocks (#15857 )	2023-01-25 15:54:45 -05:00
Ashlee M Boyer	3444ece549	docs: Migrate link formats (#15779 ) * Adding check-legacy-links-format workflow * Adding test-link-rewrites workflow * chore: updates link checker workflow hash * Migrating links to new format Co-authored-by: Kendall Strautman <kendallstrautman@gmail.com>	2023-01-25 09:31:14 -08:00
Ashlee M Boyer	294da1bc41	[docs] Adjusting links for rewrite project (#15810 ) * Adjusting link to page about features * Fixing typo * Replacing old learn links with devdot paths * Removing extra space	2023-01-17 10:55:47 -05:00
Luiz Aoqui	754574ce17	docs: add missing parameter `propagation_mode` to `volume_mount` (#15785 )	2023-01-16 10:18:50 -05:00
Seth Hoenig	4698d8da79	consul/connect: support for proxy upstreams opaque config (#15761 ) This PR adds support for configuring `proxy.upstreams[].config` for Consul Connect upstreams. This is an opaque config value to Nomad - the data is passed directly to Consul and is unknown to Nomad.	2023-01-12 08:20:54 -06:00
Luiz Aoqui	1318477789	scheduler: allow using device ID as attribute (#15455 ) Devices are fingerprinted as groups of similar devices. This prevented specifying specific device by their ID in constraint and affinity rules. This commit introduces the `${device.ids}` attribute that returns a comma separated list of IDs that are part of the device group. Users can then use the set operators to write rules.	2023-01-10 14:28:23 -05:00
Cyrille Colin	f6ebb66c86	Update template.mdx (#15737 ) fix typo issue in variable url : remove unwanted "r"	2023-01-10 10:42:33 +01:00
Luiz Aoqui	b72c79ebb9	docs: networking (#15358 ) Co-authored-by: Charlie Voiselle <464492+angrycub@users.noreply.github.com>	2023-01-06 11:47:10 -05:00
James Rasell	bfcb21a550	docs: clarify shutdown_delay jobspec param and service behaviour. (#15695 )	2023-01-05 16:57:13 +01:00
James Rasell	76e185677b	docs: fix service name interpolation key details. (#15643 )	2023-01-03 10:58:00 +01:00
Michael Schurter	55a5dfc221	docs: clarify rescheduling happens when tasks fail (#15485 )	2022-12-08 12:58:26 -08:00
Seth Hoenig	cfc67c3422	client: sandbox go-getter subprocess with landlock (#15328 ) * client: sandbox go-getter subprocess with landlock This PR re-implements the getter package for artifact downloads as a subprocess. Key changes include On all platforms, run getter as a child process of the Nomad agent. On Linux platforms running as root, run the child process as the nobody user. On supporting Linux kernels, uses landlock for filesystem isolation (via go-landlock). On all platforms, restrict environment variables of the child process to a static set. notably TMP/TEMP now points within the allocation's task directory kernel.landlock attribute is fingerprinted (version number or unavailable) These changes make Nomad client more resilient against a faulty go-getter implementation that may panic, and more secure against bad actors attempting to use artifact downloads as a privilege escalation vector. Adds new e2e/artifact suite for ensuring artifact downloading works. TODO: Windows git test (need to modify the image, etc... followup PR) * landlock: fixup items from cr * cr: fixup tests and go.mod file	2022-12-07 16:02:25 -06:00
Matus Goljer	5bec70723d	Update affinity.mdx (#15168 ) Fix the comment to correspond to the code	2022-11-30 19:01:56 -05:00
Seth Hoenig	106dce9c9f	docs: clarify how to access task meta values in templates (#15212 ) This PR updates template and meta docs pages to give examples of accessing meta values in templates. To do so one must use the environment variable form of the meta key name, which isn't obvious and wasn't yet documented.	2022-11-10 16:11:53 -06:00
twunderlich-grapl	1b5eedc07a	Fix s3 example URLs in the artifacts docs (#15123 ) * Fix s3 URLs so that they work Unfortunately, s3 urls prefixed with https:// do NOT work with the underlying go-getter library. As such, this fixes the examples so that they are working examples that won't cause problems for people reading the docs. See discussion in https://github.com/hashicorp/nomad/issues/1113 circa 2016. * Use s3:// protocol schema for artifact examples Per the discussion in https://github.com/hashicorp/nomad/pull/15123, we're going to use the explicit s3 protocol in the examples since that is the likeliest to work in all scenarios	2022-11-07 14:14:57 -05:00

1 2 3 4 5

204 Commits