nomad

mirror of https://github.com/kemko/nomad.git synced 2026-01-01 16:05:42 +03:00

Author	SHA1	Message	Date
James Rasell	df16c96a9f	cli: use same offset when following single or multiple alloc logs. (#18604 )	2023-10-03 08:43:14 +01:00
Kevin Wang	e7b70adc2c	cli: improve `job` and `status` text (#18628 )	2023-10-02 10:31:57 -04:00
Luiz Aoqui	7267be719f	config: apply defaults to extra Consul and Vault (#18623 ) * config: apply defaults to extra Consul and Vault Apply the expected default values when loading additional Consul and Vault cluster configuration. Without these defaults some fields would be left empty. * config: retain pointer of multi Consul and Vault When calling `Copy()` the pointer reference from the `"default"` key of the `Consuls` and `Vaults` maps to the `Consul` and `Vault` field of `Config` was being lost. * test: ensure TestAgent has the right reference to the default Consul config	2023-09-29 17:15:20 -03:00
Tim Gross	aaee3076c2	consul: allow `consul` block in task scope (#18597 ) To support Workload Identity with Consul for templates, we want templates to be able to use the WI created at the task scope (either implicitly or set by the user). But to allow different tasks within a group to be assigned to different clusters as we're doing for Vault, we need to be able to set the `consul` block with its `cluster` field at the task level to override the group.	2023-09-29 15:03:48 -04:00
Luiz Aoqui	a4b29a29cb	vault: add `jwt_backend_path` agent config (#18606 ) Add agent configuration to allow cluster operators to define the path where the JWT auth method backend is mounted.	2023-09-28 18:02:30 -03:00
Tim Gross	5001bf4547	consul: use constant instead of "default" literal (#18611 ) Use the constant `structs.ConsulDefaultCluster` instead of the string literal "default", as we've done for Vault.	2023-09-28 16:50:21 -04:00
Juana De La Cuesta	e0407b4cf2	server: Add reporting configuration to nomad server (#18609 ) * func: add reporting config to server * func: add reporting manager for ce * func: add reporting to quick tests	2023-09-28 22:00:43 +02:00
Luiz Aoqui	fed1992cea	vault: remove `use_identity` agent config (#18592 ) The initial intention behind the `vault.use_identity` configuration was to indicate to Nomad servers that they would need to sign workload identities for allocs with a `vault` block. But in order to support identity renewal, #18262 and #18431 moved the token signing logic to the alloc runner since a new token needs to be signed prior to the TTL expiring. So #18343 implemented `use_identity` as a flag to indicate that the workload identity JWT flow should be used when deriving Vault tokens for tasks. But this configuration value is set on servers so it is not available to clients at the time of token derivation, making its meaning unclear: a job may end up using the identity-based flow even when `use_identity` is `false`. The only reliable signal available to clients at token derivation time is the presence of an `identity` block for Vault, and this is already configured with the `vault.default_identity` configuration block, making `vault.use_identity` redundant. This commit removes the `vault.use_identity` configuration and simplifies the logic on when an implicit Vault identity is injected into tasks.	2023-09-27 17:44:07 -03:00
Luiz Aoqui	868aba57bb	vault: update identity name to start with `vault_` (#18591 ) * vault: update identity name to start with `vault_` In the original proposal, workload identities used to derive Vault tokens were expected to be called just `vault`. But in order to support multiple Vault clusters it is necessary to associate identities with specific Vault cluster configuration. This commit implements a new proposal to have Vault identities named as `vault_<cluster>`.	2023-09-27 15:53:28 -03:00
Luiz Aoqui	19241964a4	config: fix some issues with workload identity and multi Consul and Vault (#18590 ) * config: fix multi consul and vault config parse Capture the loop variable when parsing multiple Consul and Vault configuration blocks so the duration parse function uses the correct field when it's called later on. * client: build Vault client with right config When setting up the multiple Vault clients, the code was always loading the default configuration, resulting in all clients being configured the same way. * config: fix WorkloadIdentityConfig.Copy() method Ensure `WorkloadIdentityConfig.Copy()` does not return the original pointer for the `TTL` field.	2023-09-27 14:41:11 -03:00
Juana De La Cuesta	124272c050	server: Add reporting option to agent (#18572 ) * func: add reporting option to agent * func: add test for merge and fix comments * Update config_ce.go * Update config_ce.go * Update config_ce.go * fix: add reporting config to default configuration and update to use must over require * Update command/agent/config_parse.go Co-authored-by: Luiz Aoqui <luiz@hashicorp.com> * Update nomad/structs/config/reporting.go Co-authored-by: Luiz Aoqui <luiz@hashicorp.com> * Update nomad/structs/config/reporting.go Co-authored-by: Luiz Aoqui <luiz@hashicorp.com> * style: rename license and reporting config * fix: use default function instead of empty struct --------- Co-authored-by: Luiz Aoqui <luiz@hashicorp.com>	2023-09-27 00:11:32 +02:00
Daniel Bennett	fab968a748	csi: document volume expansion (#18573 ) and show Capacity in `volume status` command.	2023-09-26 14:49:15 -05:00
Tim Gross	02a5aab359	consul: provide workload's Consul token to service client (#18559 ) This is a work-in-progress changeset to provide workload-specific Consul tokens that are created by the `consul_hook` and attached to workload registration requests by the `group_service_hook` and `service_hook`. This requires unreleased updates to Consul's `api` package, so this changeset includes a temporary `replace` directive in the go.mod file.	2023-09-26 14:13:29 -04:00
Juana De La Cuesta	72acaf6623	[17449] Introduces a locking mechanism over variables (#18207 ) It includes the work over the state store, the RPC server, the HTTP server, the go API package and the CLI's command. To read more on the actual functionality, refer to the RFCs [NMD-178] Locking with Nomad Variables and [NMD-179] Leader election using locking mechanism for the Autoscaler.	2023-09-21 17:56:33 +02:00
Tim Gross	d7bd47d60f	config: remove `consul.template_identity` in lieu of `task_identity` (#18540 ) The original thinking for Workload Identity integration with Consul and Vault was that we'd allow `template` blocks to specify their own identity. But because the login to Consul/Vault to get tokens happens at the task level, this would involve making the `template` block a new WID watcher on its own rather than using the Consul and Vault hooks we're building at the group/task level. So it doesn't make sense to have separate identities for individual `template` blocks rather than at the level of tasks. Update the agent configuration to rename the `template_identity` to the more accurate `task_identity`, which will be used for any non-service hooks (just `template` today). Update the implicit identities job mutation hook to create the identity we'll need as well.	2023-09-20 15:43:08 -04:00
Tim Gross	fdc6c2151d	vault: select Vault API client by cluster name (#18533 ) Nomad Enterprise will support configuring multiple Vault clients. Instead of having a single Vault client field in the Nomad client, we'll have a function that callers can parameterize by the Vault cluster name that returns the correctly configured Vault API client wrapper.	2023-09-19 14:35:01 -04:00
Gerard Nguyen	1339599185	cli: Add prune flag for nomad server force-leave command (#18463 ) This feature will help operators remove a failed/left node from the Serf layer immediately, without waiting 24 hours for the node to be reaped * Update CLI with prune flag * Update API /v1/agent/force-leave with prune query string parameter * Update CLI and API doc * Add unit test	2023-09-15 08:45:11 -04:00
stswidwinski	bd519dcbf4	Fix for https://github.com/hashicorp/nomad/issues/18493 (#18494 ) Co-authored-by: James Rasell <jrasell@users.noreply.github.com>	2023-09-14 13:35:15 +01:00
hc-github-team-nomad-core	297de953e0	Generate files for 1.6.2 release	2023-09-13 15:41:21 -03:00
Luiz Aoqui	3534307d0d	vault: add `use_identity` and `default_identity` agent configuration and implicit workload identity (#18343 )	2023-09-12 13:53:37 -03:00
Luiz Aoqui	82372fecb8	config: add TTL to agent identity config (#18457 ) Add support for identity token TTL in agent configuration fields such as Consul `service_identity` and `template_identity`. Co-authored-by: Michael Schurter <mschurter@hashicorp.com>	2023-09-12 11:13:09 -03:00
Seth Hoenig	2e1974a574	client: refactor cpuset partitioning (#18371 ) * client: refactor cpuset partitioning This PR updates the way Nomad client manages the split between tasks that make use of resources.cpus vs. resources.cores. Previously, each task was explicitly assigned which CPU cores they were able to run on. Every time a task was started or destroyed, all other tasks' cpusets would need to be updated. This was inefficient and would crush the Linux kernel when a client would try to run ~400 or so tasks. Now, we make use of cgroup hierarchy and cpuset inheritance to efficiently manage cpusets. * cr: tweaks for feedback	2023-09-12 09:11:11 -05:00
James Rasell	d923fc554d	consul/connect: add new fields to Consul Connect upstream block (#18430 ) Co-authored-by: Horacio Monsalvo <horacio.monsalvo@southworks.com>	2023-09-11 16:02:52 +01:00
Michael Schurter	ef24e40b39	identity: support jwt expiration and rotation (#18262 ) Implements expirations and renewals for alternate workload identity tokens.	2023-09-08 14:50:34 -07:00
Tim Gross	3ee6c31241	ACLs: allow/deny/default config for Consul/Vault clusters by namespace (#18425 ) In Nomad Enterprise when multiple Vault/Consul clusters are configured, cluster admins can control access to clusters for jobs via namespace ACLs, similar to how we've done so for node pools. This changeset updates the ACL configuration structs, but doesn't wire them up.	2023-09-08 11:37:20 -04:00
Tim Gross	7cdd592809	jobspec: support `cluster` field for Vault block (#18408 ) This field supports the upcoming ENT-only multiple Vault clusters feature. The job validation and mutation hooks will come in a separate PR. Ref: https://github.com/hashicorp/team-nomad/issues/404	2023-09-07 10:15:28 -04:00
Tim Gross	7863d7bcbb	jobspec: support `cluster` field for Consul and Service blocks (#18409 ) This field supports the upcoming ENT-only multiple Consul clusters feature. The job validation and mutation hooks will come in a separate PR. Ref: https://github.com/hashicorp/team-nomad/issues/404	2023-09-07 09:48:49 -04:00
Luiz Aoqui	7466496608	config: fix identity config for Consul service (#18363 ) Rename the agent configuration for workload identity to `WorkloadIdentityConfig` to make its use more explicit and remove the `ServiceName` field since it is never expected to be defined in a configuration file. Also update the job mutation to inject a service identity following these rules: 1. Don't inject identity if `consul.use_identity` is false. 2. Don't inject identity if `consul.service_identity` is not specified. 3. Don't inject identity if service provider is not `consul`. 4. Set name and service name if the service specifies an identity. 5. Inject `consul.service_identity` if service does not specify an identity.	2023-08-31 11:22:48 -03:00
James Rasell	a9d5beb141	test: use correct parallel test setup func (#18326 )	2023-08-25 13:51:36 +01:00
Piotr Kazmierczak	b430d21a67	agent: add `consul.service_identity` and `consul.template_identity` blocks (#18279 ) This PR introduces updates to the agent config required for workload identity support.	2023-08-24 17:45:34 +02:00
Seth Hoenig	f5b0da1d55	all: swap exp packages for maps, slices (#18311 )	2023-08-23 15:42:13 -05:00
Андрей Неустроев	3e61b3a37d	Add multiple times in periodic jobs (#17858 )	2023-08-22 15:42:31 -04:00
Piotr Kazmierczak	9fa39eb829	jobspec: add nomad_service field and identity block (#18239 ) This PR introduces updates to the jobspec required for workload identity support for services. --------- Co-authored-by: Luiz Aoqui <luiz@hashicorp.com>	2023-08-21 20:07:47 +02:00
Luiz Aoqui	196213c451	jobspec: add `role` to `vault` (#18257 )	2023-08-18 15:29:02 -04:00
Tim Gross	a8bad048b6	config: parsing support for multiple Consul clusters in agent config (#18255 ) Add the plumbing we need to accept multiple Consul clusters in Nomad agent configuration, to support upcoming Nomad Enterprise features. The `consul` blocks are differentiated by a new `name` field, and if the `name` is omitted it becomes the "default" Consul configuration. All blocks with the same name are merged together, as with the existing behavior. As with the `vault` block, we're still using HCL1 for parsing configuration and the `Decode` method doesn't parse multiple blocks differentiated only by a field name without a label. So we've had to add an extra parsing pass, similar to what we've done for HCL1 jobspecs. This also revealed a subtle bug in the `vault` block handling of extra keys when there are multiple `vault` blocks, which I've fixed here. For now, all existing consumers will use the "default" Consul configuration, so there's no user-facing behavior change in this changeset other than the contents of the agent self API. Ref: https://github.com/hashicorp/team-nomad/issues/404	2023-08-18 15:25:16 -04:00
James Rasell	6108f5c4c3	admin: rename _oss files to _ce (#18209 )	2023-08-18 07:47:24 +01:00
Tim Gross	74b796e6d0	config: parsing support for multiple Vault clusters in agent config (#18224 ) Add the plumbing we need to accept multiple Vault clusters in Nomad agent configuration, to support upcoming Nomad Enterprise features. The `vault` blocks are differentiated by a new `name` field, and if the `name` is omitted it becomes the "default" Vault configuration. All blocks with the same name are merged together, as with the existing behavior. Unfortunately we're still using HCL1 for parsing configuration and the `Decode` method doesn't parse multiple blocks differentiated only by a field name without a label. So we've had to add an extra parsing pass, similar to what we've done for HCL1 jobspecs. For now, all existing consumers will use the "default" Vault configuration, so there's no user-facing behavior change in this changeset other than the contents of the agent self API. Ref: https://github.com/hashicorp/team-nomad/issues/404	2023-08-17 14:10:32 -04:00
Tim Gross	f00bff09f1	fix multiple overflow errors in exponential backoff (#18200 ) We use capped exponential backoff in several places in the code when handling failures. The code we've copy-and-pasted all over has a check to see if the backoff is greater than the limit, but this check happens after the bitshift and we always increment the number of attempts. This causes an overflow with a fairly small number of failures (ex. at one place I tested it occurs after only 24 iterations), resulting in a negative backoff which then never recovers. The backoff becomes a tight loop consuming resources and/or DoS'ing a Nomad RPC handler or an external API such as Vault. Note this doesn't occur in places where we cap the number of iterations so the loop breaks (usually to return an error), so long as the number of iterations is reasonable. Introduce a helper with a check on the cap before the bitshift to avoid overflow in all places this can occur. Fixes: #18199 Co-authored-by: stswidwinski <stan.swidwinski@gmail.com>	2023-08-15 14:38:18 -04:00
Michael Schurter	0e22fc1a0b	identity: add support for multiple identities + audiences (#18123 ) Allows for multiple `identity{}` blocks for tasks along with user-specified audiences. This is a building block to allow workload identities to be used with Consul, Vault and 3rd party JWT based auth methods. Expiration is still unimplemented and is necessary for JWTs to be used securely, so that's up next. --------- Co-authored-by: Tim Gross <tgross@hashicorp.com>	2023-08-15 09:11:53 -07:00
Esteban Barrios	65d562b760	config: add configurable content security policy (#18085 )	2023-08-14 14:23:03 -04:00
Seth Hoenig	d9341f0664	update go1.21 (#18184 ) * build: update to go1.21 * go: eliminate helpers in favor of min/max * build: run go mod tidy * build: swap depguard for semgrep * command: fixup broken tls error check on go1.21	2023-08-14 08:43:27 -05:00
hashicorp-copywrite[bot]	a9d61ea3fd	Update copyright file headers to BUSL-1.1	2023-08-10 17:27:29 -05:00
Seth Hoenig	a4cc76bd3e	numa: enable numa topology detection (#18146 ) * client: refactor cgroups management in client * client: fingerprint numa topology * client: plumb numa and cgroups changes to drivers * client: cleanup task resource accounting * client: numa client and config plumbing * lib: add a stack implementation * tools: remove ec2info tool * plugins: fixup testing for cgroups / numa changes * build: update makefile and package tests and cl	2023-08-10 17:05:30 -05:00
Devashish Taneja	472693d642	server: add config to tune job versions retention. #17635 (#17939 )	2023-08-07 14:47:40 -04:00
Abbas Yazdanpanah	388198abef	CLI: make snapshot name required in creating volume snapshots (#17958 ) Co-authored-by: James Rasell <jrasell@users.noreply.github.com>	2023-08-04 10:36:07 +01:00
Luiz Aoqui	768978883d	cli: search all namespaces for node volumes (#17925 ) When looking for CSI volumes to display in the `node status` command the CLI needs to search all namespaces.	2023-08-01 09:55:39 -04:00
Tim Gross	4fb5bf9a16	cli: support wildcard namespace in alloc subcommands (#18095 ) The alloc exec and filesystem/logs commands allow passing the `-job` flag to select a random allocation. If the namespace for the command is set to `*`, the RPC handler doesn't handle this correctly as it's expecting to query for a specific job. Most commands handle this ambiguity by first verifying that only a single object of the type in question exists (ex. a single node or job). Update these commands so that when the `-job` flag is set we first verify there's a single job that matches. This also allows us to extend the functionality to allow for the `-job` flag to support prefix matching. Fixes: #12097	2023-07-31 13:15:15 -04:00
Gerard Nguyen	9e98d694a6	feature: Add new field render_templates on restart block (#18054 ) This feature is necessary when a user wants to explicitly re-render all templates on task restart, e.g. to fetch all new secrets from Vault even if the lease on the existing secrets has not expired.	2023-07-28 11:53:32 -07:00
Luiz Aoqui	ee31916c3b	cli: add help message for `-consul-namespace` (#18081 ) Add missing help entry for the `-consul-namespace` flag in `nomad job run`.	2023-07-28 10:22:59 -04:00
Michael Schurter	d14362ec19	core: add jwks rpc and http api (#18035 ) Add JWKS endpoint to HTTP API for exposing the root public signing keys used for signing workload identity JWTs. Part 1 of N components as part of making workload identities consumable by third party services such as Consul and Vault. Identity attenuation (audience) and expiration (+renewal) are necessary to securely use workload identities with 3rd parties, so this merge does not yet document this endpoint. --------- Co-authored-by: Tim Gross <tgross@hashicorp.com>	2023-07-27 11:27:17 -07:00

3635 Commits