mirror of https://github.com/kemko/nomad.git synced 2026-01-01 16:05:42 +03:00

Files

Michael Schurter 66fbc0f67e identity: default to RS256 for new workload ids (#18882 )

OIDC mandates the support of the RS256 signing algorithm so in order to maximize workload identity's usefulness this change switches from using the EdDSA signing algorithm to RS256.

Old keys will continue to use EdDSA but new keys will use RS256. The EdDSA generation code was left in place because it's fast and cheap and I'm not going to lie I hope we get to use it again.

**Test Updates**

Most of our Variables and Keyring tests had a subtle assumption in them that the keyring would be initialized by the time the test server had elected a leader. ed25519 key generation is so fast that the fact that it was happening asynchronously with server startup didn't seem to cause problems. Sadly rsa key generation is so slow that basically all of these tests failed.

I added a new `testutil.WaitForKeyring` helper to replace `testutil.WaitForLeader` in cases where the keyring must be initialized before the test may continue. However this is mostly used in the `nomad/` package.

In the `api` and `command/agent` packages I decided to switch their helpers to wait for keyring initialization by default. This will slow down tests a bit, but allow those packages to not be as concerned with subtle server readiness details. On my machine rsa key generation takes 63ms, so hopefully the difference isn't significant on CI runners.

**TODO**

- Docs and changelog entries.
- Upgrades - right now upgrades won't get RS256 keys until their root key rotates either manually or after ~30 days.
- Observability - I'm not sure there's a way for operators to see if they're using EdDSA or RS256 unless they inspect a key. The JWKS endpoint can be inspected to see if EdDSA will be used for new identities, but it doesn't technically define which key is active. If upgrades can be fixed to automatically rotate keys, we probably don't need to worry about this.

**Requiem for ed25519**

When workload identities were first implemented we did not immediately consider OIDC compliance. Consul, Vault, and many other third parties support JWT auth methods without full OIDC compliance. For the machine<-->machine use cases workload identity is intended to fulfill, OIDC seemed like a bigger risk than asset.

EdDSA/ed25519 is the signing algorithm we chose for workload identity JWTs because of all these lovely properties:

1. Deterministic keys that can be derived from our preexisting root keys. This was perhaps the biggest factor since we already had a root encryption key around from which we could derive a signing key.
2. Wonderfully compact: 64 byte private key, 32 byte public key, 64 byte signatures. Just glorious.
3. No parameters. No choices of encodings. It's all well-defined by [RFC 8032](https://datatracker.ietf.org/doc/html/rfc8032).
4. Fastest performing signing algorithm! We don't even care that much about the performance of our chosen algorithm, but what a free bonus!
5. Arguably one of the most secure signing algorithms widely available. Not just from a cryptanalysis perspective, but from an API and usage perspective too.

Life was good with ed25519, but sadly it could not last.

[IDPs](https://en.wikipedia.org/wiki/Identity_provider), such as AWS's IAM OIDC Provider, love OIDC. They have OIDC implemented for humans, so why not reuse that OIDC support for machines as well? Since OIDC mandates RS256, many implementations don't bother implementing other signing algorithms (or at least not advertising their support). A quick survey of OIDC Discovery endpoints revealed only 2 out of 10 OIDC providers advertised support for anything other than RS256:

- [PayPal](https://www.paypalobjects.com/.well-known/openid-configuration) supports HS256
- [Yahoo](https://api.login.yahoo.com/.well-known/openid-configuration) supports ES256

RS256 only:

- [GitHub](https://token.actions.githubusercontent.com/.well-known/openid-configuration)
- [GitLab](https://gitlab.com/.well-known/openid-configuration)
- [Google](https://accounts.google.com/.well-known/openid-configuration)
- [Intuit](https://developer.api.intuit.com/.well-known/openid_configuration)
- [Microsoft](https://login.microsoftonline.com/fabrikamb2c.onmicrosoft.com/v2.0/.well-known/openid-configuration)
- [SalesForce](https://login.salesforce.com/.well-known/openid-configuration)
- [SimpleLogin (acquired by ProtonMail)](https://app.simplelogin.io/.well-known/openid-configuration/)
- [TFC](https://app.terraform.io/.well-known/openid-configuration)

2023-10-31 11:25:20 -07:00

acl

deps: upgrade to go-set/v2 (#18638 )

2023-10-05 11:56:17 -05:00

affinities

Update copyright file headers to BUSL-1.1

2023-08-10 17:27:29 -05:00

alloc_logs

Update copyright file headers to BUSL-1.1

2023-08-10 17:27:29 -05:00

artifact

deps: upgrade to go-set/v2 (#18638 )

2023-10-05 11:56:17 -05:00

auth

Update copyright file headers to BUSL-1.1

2023-08-10 17:27:29 -05:00

bin

scripts: fix interpreter for bash (#12549 )

2022-04-12 10:08:21 -04:00

clientstate

Update copyright file headers to BUSL-1.1

2023-08-10 17:27:29 -05:00

connect

docs: update all URLs to developer.hashicorp.com (#16247 )

2023-10-24 11:00:11 -04:00

consul

admin: rename _oss files to _ce (#18209 )

2023-08-18 07:47:24 +01:00

consulcompat

e2e: fix bind name to allow Connect reachability (#18878 )

2023-10-27 10:15:17 -04:00

consultemplate

Update copyright file headers to BUSL-1.1

2023-08-10 17:27:29 -05:00

csi

Update copyright file headers to BUSL-1.1

2023-08-10 17:27:29 -05:00

deployment

Update copyright file headers to BUSL-1.1

2023-08-10 17:27:29 -05:00

disconnectedclients

Update copyright file headers to BUSL-1.1

2023-08-10 17:27:29 -05:00

e2eutil

Update copyright file headers to BUSL-1.1

2023-08-10 17:27:29 -05:00

eval_priority

Update copyright file headers to BUSL-1.1

2023-08-10 17:27:29 -05:00

events

Update copyright file headers to BUSL-1.1

2023-08-10 17:27:29 -05:00

example

Update copyright file headers to BUSL-1.1

2023-08-10 17:27:29 -05:00

execagent

Update copyright file headers to BUSL-1.1

2023-08-10 17:27:29 -05:00

framework

Update copyright file headers to BUSL-1.1

2023-08-10 17:27:29 -05:00

isolation

Update copyright file headers to BUSL-1.1

2023-08-10 17:27:29 -05:00

jobsubmissions

Update copyright file headers to BUSL-1.1

2023-08-10 17:27:29 -05:00

lifecycle

Update copyright file headers to BUSL-1.1

2023-08-10 17:27:29 -05:00

metrics

Update copyright file headers to BUSL-1.1

2023-08-10 17:27:29 -05:00

namespaces

Update copyright file headers to BUSL-1.1

2023-08-10 17:27:29 -05:00

networking

Update copyright file headers to BUSL-1.1

2023-08-10 17:27:29 -05:00

nodedrain

deps: upgrade to go-set/v2 (#18638 )

2023-10-05 11:56:17 -05:00

nomadexec

Update copyright file headers to BUSL-1.1

2023-08-10 17:27:29 -05:00

operator_scheduler

Update copyright file headers to BUSL-1.1

2023-08-10 17:27:29 -05:00

overlap

Update copyright file headers to BUSL-1.1

2023-08-10 17:27:29 -05:00

oversubscription

Update copyright file headers to BUSL-1.1

2023-08-10 17:27:29 -05:00

parameterized

Update copyright file headers to BUSL-1.1

2023-08-10 17:27:29 -05:00

periodic

Update copyright file headers to BUSL-1.1

2023-08-10 17:27:29 -05:00

pledge

Update copyright file headers to BUSL-1.1

2023-08-10 17:27:29 -05:00

podman

Update copyright file headers to BUSL-1.1

2023-08-10 17:27:29 -05:00

quotas

admin: rename _oss files to _ce (#18209 )

2023-08-18 07:47:24 +01:00

remotetasks

Update copyright file headers to BUSL-1.1

2023-08-10 17:27:29 -05:00

rescheduling

Update copyright file headers to BUSL-1.1

2023-08-10 17:27:29 -05:00

scaling

Update copyright file headers to BUSL-1.1

2023-08-10 17:27:29 -05:00

scalingpolicies

Update copyright file headers to BUSL-1.1

2023-08-10 17:27:29 -05:00

scheduler_sysbatch

Update copyright file headers to BUSL-1.1

2023-08-10 17:27:29 -05:00

scheduler_system

Update copyright file headers to BUSL-1.1

2023-08-10 17:27:29 -05:00

servicediscovery

all: swap exp packages for maps, slices (#18311 )

2023-08-23 15:42:13 -05:00

spread

Update copyright file headers to BUSL-1.1

2023-08-10 17:27:29 -05:00

taskevents

Update copyright file headers to BUSL-1.1

2023-08-10 17:27:29 -05:00

terraform

e2e: packer windows from "ECS_Optimized" image (#18453 )

2023-09-11 12:26:32 -05:00

Update copyright file headers to BUSL-1.1

2023-08-10 17:27:29 -05:00

upgrades

Update copyright file headers to BUSL-1.1

2023-08-10 17:27:29 -05:00

deps: upgrade to go-set/v2 (#18638 )

2023-10-05 11:56:17 -05:00

vaultcompat

identity: default to RS256 for new workload ids (#18882 )

2023-10-31 11:25:20 -07:00

vaultsecrets

Update copyright file headers to BUSL-1.1

2023-08-10 17:27:29 -05:00

volumes

docs: update all URLs to developer.hashicorp.com (#16247 )

2023-10-24 11:00:11 -04:00

workload_id

Update copyright file headers to BUSL-1.1

2023-08-10 17:27:29 -05:00

.gitignore

e2e: have TF write-out HCL for CSI volume registration (#7599 )

2020-04-02 12:16:43 -04:00

e2e_test.go

Update copyright file headers to BUSL-1.1

2023-08-10 17:27:29 -05:00

README.md

E2E: move example test to use golangs stdlib test runner (#12383 )

2022-03-25 14:44:16 -04:00

README.md

End to End Tests

This package contains integration tests. Unlike tests alongside Nomad code, these tests expect there to already be a functional Nomad cluster accessible (either on localhost or via the NOMAD_ADDR env var).

See framework/doc.go for how to write tests.

The NOMAD_E2E=1 environment variable must be set for these tests to run.

Provisioning Test Infrastructure on AWS

The terraform/ folder has provisioning code to spin up a Nomad cluster on AWS. You'll need both Terraform and AWS credentials to setup AWS instances on which e2e tests will run. See the README for details. The number of servers and clients is configurable, as is the specific build of Nomad to deploy and the configuration file for each client and server.

Provisioning Local Clusters

To run tests against a local cluster, you'll need to make sure the following environment variables are set:

NOMAD_ADDR should point to one of the Nomad servers
CONSUL_HTTP_ADDR should point to one of the Consul servers
NOMAD_E2E=1

TODO: the scripts in ./bin currently work only with Terraform, it would be nice for us to have a way to deploy Nomad to Vagrant or local clusters.

Running

After completing the provisioning step above, you can set the client environment for NOMAD_ADDR and run the tests as shown below:

# from the ./e2e/terraform directory, set your client environment
# if you haven't already
$(terraform output environment)

cd ..
go test -v ./...

If you want to run a specific suite, you can specify the -suite flag as shown below. Only the suite with a matching Framework.TestSuite.Component will be run, and all others will be skipped.

go test -v -suite=Consul .

If you want to run a specific test, you'll need to regex-escape some of the test's name so that the test runner doesn't skip over framework struct method names in the full name of the tests:

go test -v . -run 'TestE2E/Consul/\*consul\.ScriptChecksE2ETest/TestGroup'
                              ^       ^             ^               ^
                              |       |             |               |
                          Component   |             |           Test func
                                      |             |
                                  Go Package      Struct

We're also in the process of migrating to "stdlib-style" tests that use the standard go testing package without a notion of "suite". You can run these with -run regexes the same way you would any other go test:

go test -v . -run TestExample/TestExample_Simple

I Want To...

...SSH Into One Of The Test Machines

You can use the Terraform output to find the IP address. The keys will in the ./terraform/keys/ directory.

ssh -i keys/nomad-e2e-*.pem ubuntu@${EC2_IP_ADDR}

Run terraform output for IP addresses and details.

...Deploy a Cluster of Mixed Nomad Versions

The variables.tf file describes the nomad_version, and nomad_local_binary variables that can be used for most circumstances. But if you want to deploy mixed Nomad versions, you can provide a list of versions in your terraform.tfvars file.

For example, if you want to provision 3 servers all using Nomad 0.12.1, and 2 Linux clients using 0.12.1 and 0.12.2, you can use the following variables:

# will be used for servers
nomad_version = "0.12.1"

# will override the nomad_version for Linux clients
nomad_version_client_linux = [
    "0.12.1",
    "0.12.2"
]

...Deploy Custom Configuration Files

Set the profile field to "custom" and put the configuration files in ./terraform/config/custom/ as described in the README.

...Deploy More Than 4 Linux Clients

Use the "custom" profile as described above.

...Change the Nomad Version After Provisioning

You can update the nomad_version variable, or simply rebuild the binary you have at the nomad_local_binary path so that Terraform picks up the changes. Then run terraform plan/terraform apply again. This will update Nomad in place, making the minimum amount of changes necessary.