nomad

mirror of https://github.com/kemko/nomad.git synced 2026-01-06 18:35:44 +03:00

Author	SHA1	Message	Date
Seth Hoenig	ae6c4c8e3f	deps: purge use of old x/exp packages (#20373 )	2024-04-12 08:29:00 -05:00
Tim Gross	1e50090776	docs: clarify "best effort" for ephemeral disk migration (#20357 ) The docs for ephemeral disk migration use the term "best effort" without outlining the requirements or the cases under which the migration can fail. Update the docs to make it obvious that ephemeral disk migration is subject to data loss. Fixes: https://github.com/hashicorp/nomad/issues/20355	2024-04-11 16:35:22 -04:00
astudentofblake	7b7ed12326	func: Allow custom paths to be added the the getter landlock (#20349 ) * func: Allow custom paths to be added the the getter landlock Fixes: 20315 * fix: slices imports fix: more meaningful examples fix: improve documentation fix: quote error output	2024-04-11 15:17:33 -05:00
dependabot[bot]	5612ab46c3	chore(deps): bump ip from 2.0.0 to 2.0.1 in /ui (#20021 ) Bumps [ip](https://github.com/indutny/node-ip) from 2.0.0 to 2.0.1. - [Commits](https://github.com/indutny/node-ip/compare/v2.0.0...v2.0.1) --- updated-dependencies: - dependency-name: ip dependency-type: indirect ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2024-04-11 15:53:15 -04:00
dependabot[bot]	229e645681	chore(deps): bump follow-redirects from 1.15.5 to 1.15.6 in /ui (#20145 ) Bumps [follow-redirects](https://github.com/follow-redirects/follow-redirects) from 1.15.5 to 1.15.6. - [Release notes](https://github.com/follow-redirects/follow-redirects/releases) - [Commits](https://github.com/follow-redirects/follow-redirects/compare/v1.15.5...v1.15.6) --- updated-dependencies: - dependency-name: follow-redirects dependency-type: indirect ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2024-04-11 15:52:32 -04:00
dependabot[bot]	ea6a112f73	build(deps): bump express from 4.18.2 to 4.19.2 in /ui (#20239 ) Bumps [express](https://github.com/expressjs/express) from 4.18.2 to 4.19.2. - [Release notes](https://github.com/expressjs/express/releases) - [Changelog](https://github.com/expressjs/express/blob/master/History.md) - [Commits](https://github.com/expressjs/express/compare/4.18.2...4.19.2) --- updated-dependencies: - dependency-name: express dependency-type: indirect ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2024-04-11 15:51:13 -04:00
Tim Gross	d56e8ad1aa	WI: ensure Consul hook and WID manager interpolate services (#20344 ) Services can have some of their string fields interpolated. The new Workload Identity flow doesn't interpolate the services before requesting signed identities or using those identities to get Consul tokens. Add support for interpolation to the WID manager and the Consul tokens hook by providing both with a taskenv builder. Add an "interpolate workload" field to the WI handle to allow passing the original workload name to the server so the server can find the correct service to sign. This changeset also makes two related test improvements: * Remove the mock WID manager, which was only used in the Consul hook tests and isn't necessary so long as we provide the real WID manager with the mock signer and never call `Run` on it. It wasn't feasible to exercise the correct behavior without this refactor, as the mocks were bypassing the new code. * Fixed swapped expect-vs-actual assertions on the `consul_hook` tests. Fixes: https://github.com/hashicorp/nomad/issues/20025	2024-04-11 15:40:28 -04:00
dependabot[bot]	6c419a37ee	build(deps): bump tar from 6.2.0 to 6.2.1 in /ui (#20353 ) Bumps [tar](https://github.com/isaacs/node-tar) from 6.2.0 to 6.2.1. - [Release notes](https://github.com/isaacs/node-tar/releases) - [Changelog](https://github.com/isaacs/node-tar/blob/main/CHANGELOG.md) - [Commits](https://github.com/isaacs/node-tar/compare/v6.2.0...v6.2.1) --- updated-dependencies: - dependency-name: tar dependency-type: indirect ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2024-04-11 15:22:09 -04:00
Tim Gross	a13e455c51	deployment watcher: fix goroutine leak when job is purged (#20348 ) The deployment watcher on the leader makes blocking queries to detect when the set of active deployments changes. It takes the resulting list of deployments and adds or removes watchers based on whether the deployment is active. But when a job is purged, the deployment will be deleted. This unblocks the query but the query result only shows the remaining deployments. When the query unblocks, ensure that all active watchers have a corresponding deployment in state. If not, remove the watcher so that the goroutine stops. Fixes: https://github.com/hashicorp/nomad/issues/19988	2024-04-11 14:51:05 -04:00
Gabi	ca22f34373	fix exhausted node metrics reporting in preemption (#20346 )	2024-04-11 14:49:56 -04:00
Tim Gross	8298d39e78	Connect transparent proxy support Add support for Consul Connect transparent proxies Fixes: https://github.com/hashicorp/nomad/issues/10628	2024-04-10 11:00:18 -04:00
Tim Gross	9340c77b12	docs: remove extra indents in tproxy HCL examples	2024-04-10 10:21:32 -04:00
Tim Gross	4fef82e8e2	tproxy: refactor `getPortMapping` The `getPortMapping` method forces callers to handle two different data structures, but only one caller cares about it. We don't want to return a single map or slice because the `cni.PortMapping` object doesn't include a label field that we need for tproxy. Return a new datastructure that closes over both a slice of `cni.PortMapping` and a map of label to index in that slice.	2024-04-10 10:16:13 -04:00
Tim Gross	e2e561da88	tproxy: documentation improvements	2024-04-10 08:55:50 -04:00
James Rasell	a7c56a6563	docs: fix incorrect formatting within ACL policy spec. (#20339 )	2024-04-09 14:46:06 +01:00
Tim Gross	2cf341b761	drain: use authenticated ID as source of drained-by metadata (#20317 ) When a node is set to drain, the state store reads the auth token off the request to record `LastDrain` metadata about the token used to drain the node. This code path in the state store can't correctly handle signed Workload Identity tokens or bearer tokens that may have expired (for example, while restarting a server and applying uncompacted Raft logs). Rather than re-authenticating the request at the time of FSM apply, record the string derived from the authenticated identity as part of the Raft log entry. Fixes: https://github.com/hashicorp/nomad/issues/17471	2024-04-09 09:28:24 -04:00
James Rasell	200b7134f0	docs: ensure namespace ACL policy capabilities are all listed. (#20306 )	2024-04-09 13:57:10 +01:00
Tim Gross	8eaf176868	client: fix IPv6 parsing for `client.servers` block (#20324 ) When the `client.servers` block is parsed, we split the port from the address. This does not correctly handle IPv6 addresses when they are in URL format (wrapped in brackets), which we require to disambiguate the port and address. Fix the parser to correctly split out the port and handle a missing port value for IPv6. Update the documentation to make the URL format requirement clear. Fixes: https://github.com/hashicorp/nomad/issues/20310	2024-04-08 15:06:27 -04:00
Tim Gross	a0cbc1a26a	cli: remove extraneous trailing newline from `nomad fmt` (#20318 ) When `nomad fmt` writes to stdout instead of overwriting a file, the command was using the `UI` output, which appends an extra newline. This results in extra trailing newlines when using `nomad fmt` as part of a pipeline or editor plugin. Update the command to write directly to stdout when in the stdout mode. Fixes: https://github.com/hashicorp/nomad/issues/20307	2024-04-08 13:29:22 -04:00
Phil Renaud	9a20e98d27	[ui] Show re-bound keyboard nav hints instead of their default values (#20235 ) * Rebinds show up as soon as you start rebinding * Hint bind and rebind tests * Orphaned getCommandByPattern method removed	2024-04-08 10:11:23 -04:00
James Rasell	0cbd08ebf2	docs: add Digital Ocean Spaces artifact jobspec example. (#20304 )	2024-04-08 08:15:07 +01:00
Tim Gross	548adb0fd4	tproxy: E2E tests (#20296 ) Add the `consul-cni` plugin to the Linux AMI for E2E, and add a test case that covers the transparent proxy feature. Add test assertions to the Connect tests for upstream reachability Ref: https://github.com/hashicorp/nomad/pull/20175	2024-04-05 14:23:26 -04:00
Tim Gross	8b6d6e48bf	tproxy: job submission hooks (#20244 ) Add a constraint on job submission that requires the `consul-cni` plugin fingerprint whenever transparent proxy is used. Add a validation that the `network.dns` cannot be set when transparent proxy is used, unless the `no_dns` flag is set.	2024-04-05 13:13:15 -04:00
Daniel Bennett	da09778eab	Enable numeric pagination tokens (#20299 ) * enable uint64 pagination tokens, so they can be compared as numbers instead of strings * tokenize job ModifyIndex as uint64, so an new upcoming state index can paginate properly * test require->must	2024-04-05 09:49:41 -05:00
Seth Hoenig	825efc3925	docker: use correct effective cpuset filename on legacy cgroups v1 systems (#20294 )	2024-04-05 08:05:51 -05:00
Tim Gross	2382ab8776	E2E: ensure periodic test can't fail due to cron conflicts (#20300 ) The E2E test for periodic dispatch jobs has a `cron` trigger for once a minute. If the test happens to run at the top of the minute, it's possible for the forced dispatch to run from the test code, then the periodic timer triggers and leaves a running child job. This fails the test because it expects only a single job in the "dead" state. Make it so that the `cron` expression is implausible to run during our test window, and migrate the test off the old framework while we're at it.	2024-04-05 08:45:35 -04:00
Tim Gross	d1f3a72104	tproxy: `transparent_proxy` reference docs (#20241 ) Ref: https://github.com/hashicorp/nomad/pull/20175	2024-04-04 17:01:07 -04:00
Tim Gross	bb062deadc	docs: update service mesh integration docs for transparent proxy (#20251 ) Update the service mesh integration docs to explain how Consul needs to be configured for transparent proxy. Update the walkthrough to assume that `transparent_proxy` mode is the best approach, and move the manually-configured `upstreams` to a separate section for users who don't want to use Consul DNS. Ref: https://github.com/hashicorp/nomad/pull/20175 Ref: https://github.com/hashicorp/nomad/pull/20241	2024-04-04 17:01:07 -04:00
Tim Gross	76009d89af	tproxy: networking hook changes (#20183 ) When `transparent_proxy` block is present and the network mode is `bridge`, use a different CNI configuration that includes the `consul-cni` plugin. Before invoking the CNI plugins, create a Consul SDK `iptables.Config` struct for the allocation. This includes: * Use all the `transparent_proxy` block fields * The reserved ports are added to the inbound exclusion list so the alloc is reachable from outside the mesh * The `expose` blocks and `check` blocks with `expose=true` are added to the inbound exclusion list so health checks work. The `iptables.Config` is then passed as a CNI argument to the `consul-cni` plugin. Ref: https://github.com/hashicorp/nomad/issues/10628	2024-04-04 17:01:07 -04:00
Tim Gross	e8d203e7ce	transparent proxy: add jobspec support (#20144 ) Add a transparent proxy block to the existing Connect sidecar service proxy block. This changeset is plumbing required to support transparent proxy configuration on the client. Ref: https://github.com/hashicorp/nomad/issues/10628	2024-04-04 17:01:07 -04:00
Tim Gross	648daceca1	E2E: skip Vault 1.16.1 for JWT compatibility test (#20301 ) Vault 1.16.1 has a known issue around the JWT auth configuration that will prevent this test from ever passing. Skip testing the JWT code path on 1.16.1. Once 1.16.2 ships it will no longer get skipped. Ref: https://github.com/hashicorp/nomad/issues/20298	2024-04-04 17:00:35 -04:00
Yorick Gersie	6124ee8afb	cpuset fixer: use correct cgroup path for updates (#20276 ) * cpuset fixer: use correct cgroup path for updates fixes #20275 * docker: flatten switch statement and add test cases * cl: add cl --------- Co-authored-by: Seth Hoenig <shoenig@duck.com>	2024-04-04 15:54:10 -05:00
Tim Gross	a71632e3a4	docs: recommendation for maximum number of template dependencies (#20259 )	2024-04-04 11:08:49 -04:00
Julien Castets	9b5eb26c83	doc nomad-autoscaler: add options for pass-through strategy (#20284 )	2024-04-04 10:54:34 -04:00
Tim Gross	c1f020d60f	E2E: refactor Connect tests to use stdlib testing (#20278 ) Migrate our E2E tests for Connect off the old framework in preparation for writing E2E tests for transparent proxy and the updated workload identity workflow. Mark the tests that cover the legacy Consul token submitted workflow. Ref: https://github.com/hashicorp/nomad/pull/20175	2024-04-04 10:48:10 -04:00
James Rasell	fd5a42a6ca	docs: clarify data dir default parameters and default creation. (#20268 )	2024-04-04 09:20:47 +01:00
Tim Gross	78f9f17867	api: add missing `AllocDirStats` field in Go API (#20261 ) The JSON response for the Read Stats client API includes an `AllocDirStats` field. This field is missing in the `api` package, so consumers of the Go API can't use it to read the values we're getting back from the HTTP server. Fixes: https://github.com/hashicorp/nomad/issues/20246	2024-04-03 08:54:05 -04:00
Tim Gross	4ce728afbd	E2E: make `vault.create_from_role` unique per cluster (#20267 ) If a E2E cluster is destroyed after a different one has been created, the role and policy we create in Vault for the cluster will be deleted and Vault-related tests will fail. Note that before 1.9, we should figure out a way to give HCP Vault access to the JWKS endpoint and have a different set of policies, but we'll need to have a role-per-cluster in that case as well. Fixes: https://github.com/hashicorp/nomad-e2e/issues/138 (internal)	2024-04-03 08:45:01 -04:00
Tim Gross	cf25cf5cd5	E2E: use a self-hosted Consul for easier WI testing (#20256 ) Our `consulcompat` tests exercise both the Workload Identity and legacy Consul token workflow, but they are limited to running single node tests. The E2E cluster is network isolated, so using our HCP Consul cluster runs into a problem validating WI tokens because it can't reach the JWKS endpoint. In real production environments, you'd solve this with a CNAME pointing to a public IP pointing to a proxy with a real domain name. But that's logisitcally impractical for our ephemeral nightly cluster. Migrate the HCP Consul to a single-node Consul cluster on AWS EC2 alongside our Nomad cluster. Bootstrap TLS and ACLs in Terraform and ensure all nodes can reach each other. This will allow us to update our Consul tests so they can use Workload Identity, in a separate PR. Ref: #19698	2024-04-02 15:24:51 -04:00
Tim Gross	31f53cec01	structs: fix test for empty DNS configuration (#20233 ) The `DNSConfig.IsZero` method incorrectly returns true if any of the fields are empty, rather than if all of them are empty. The only code path that consumes this method is on the client, where it's used as part of equality checks on the allocation network status to set the priority of allocation updates to the server. Hypothetically, if the network hook modified only the DNS configuration and no task states were emitted, it would be possible to miss an allocation update. In practice this appears to be impossible, but we should fix the bug so that there aren't errors in future consumers.	2024-03-29 10:47:53 -04:00
Seth Hoenig	6ad648bec8	networking: Inject implicit constraints on CNI plugins when using bridge mode (#15473 ) This PR adds a job mutator which injects constraints on the job taskgroups that make use of bridge networking. Creating a bridge network makes use of the CNI plugins: bridge, firewall, host-local, loopback, and portmap. Starting with Nomad 1.5 these plugins are fingerprinted on each node, and as such we can ensure jobs are correctly scheduled only on nodes where they are available, when needed.	2024-03-27 16:11:39 -04:00
Tim Gross	9c2286014f	docs: update Consul compatibility matrix (#20242 ) Version of Nomad and Consul that were known not to be compatible are no longer supported in general. Update the compatibility matrix for Consul to match.	2024-03-27 16:11:14 -04:00
Tim Gross	c3e7b13d54	deps: update consul-template to 0.37.4 to fix resource leak (#20234 ) A Nomad user reported an issue where template runner `View.poll` goroutines were being leaked when using templates with many dependencies. This resource leak was fixed in consul-template 0.37.4. Fixes: https://github.com/hashicorp/nomad/issues/20163	2024-03-27 11:51:34 -04:00
Juana De La Cuesta	c7e7fdfa84	[f-gh-208] Force recreation and redeployment of task if volume label changes (#20074 ) Scheduler: Force recreation and redeployment of task if volume mount labels in the task definitions changes	2024-03-27 11:43:31 +01:00
Seth Hoenig	bd2a809135	subproc: lazy lookup nomad binary in self call (#20231 )	2024-03-26 12:33:06 -05:00
Tim Gross	2fde4a0c93	namespace/node pool: forward RPCs cross-region if ACLs aren't enabled (#20220 ) Although it's not recommended, it's possible to federate regions without ACLs enabled. In this case, ACL-related objects such as namespaces and node pools can be written independently in each region and won't be replicated. If you use commands like `namespace apply` or `node pool delete`, the RPC is supposed to be forwarded to the authoritative region. But when ACLs are disabled, there is no authoritative region and so the RPC will always be applied to the local region even if the `-region` flag is passed. Remove the change to the RPC region for the namespace and node pool write RPC whenver ACLs are disabled, so that forwarding works. Fixes: https://github.com/hashicorp/nomad/issues/20197 Ref: https://github.com/hashicorp/nomad/issues/20128	2024-03-26 10:39:37 -04:00
Seth Hoenig	77889a16fb	exec2: more tweaks to driver harness (#20221 ) Also add an explicit exit code to subproc package for when a child process is instructed to run an unrunnable command (i.e. cannot be found or is not executable) - with the 127 return code folks using bash are familiar with	2024-03-26 08:02:41 -05:00
Tim Gross	a50e6267d0	cli: remove redundant `allocs` profile from `operator debug` (#20219 ) The pprof `allocs` profile is identical to the `heap` profile, just with a different default view. Collecting only one of the two is sufficient to view all of `alloc_objects`, `alloc_space`, `inuse_objects`, and `inuse_space`, and collecting only one means that both views will be of the same profile. Also improve the docstrings on the goroutine profiles explaining what's in each so that it's clear why we might want all of debug=0, debug=1, and debug=2.	2024-03-26 08:19:18 -04:00
Juana De La Cuesta	f2965cad36	[gh-19729] Fix logic for updating terminal allocs on clients with max client disconnect (#20181 ) Only ignore allocs on terminal states that are updated --------- Co-authored-by: Tim Gross <tgross@hashicorp.com>	2024-03-26 10:31:58 +01:00
Phil Renaud	fee242c53d	Namespace added to example test in exec window (#20218 )	2024-03-25 17:02:07 -04:00

1 2 3 4 5 ...

25726 Commits