Commit Graph

25925 Commits

Author SHA1 Message Date
Daniel Bennett
c84b4ad67b e2e: add test for task schedule{} (#23382) 2024-06-20 11:18:53 -05:00
Tim Gross
ee48bdd4d9 server: fix panic if heartbeat reset happens for GC'd node (#23383)
When setting up the timer for heartbeat invalidation, there's no control that
allows us to remove that timer when the node is GC'd. If the GC window is narrow
enough, it's possible to GC a node that has a waiting heartbeat timer. In this
case, we hit a bug where querying for the node returns `nil` and this is
incorrectly handled when checking for disconnect/reconnect state. Fix this bug
by correctly handling a `nil` node and allowing the `Node.Update` RPC to fire
normally (which then errors correctly).

Fixes: https://github.com/hashicorp/nomad/issues/23376
Ref: https://hashicorp.atlassian.net/browse/NET-10109
2024-06-20 10:05:44 -04:00
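The nil-handling described in the commit above can be sketched roughly as follows. This is a minimal illustration and not Nomad's actual code: `Node`, `store`, `isDisconnected`, `invalidateHeartbeat`, and `updateStatus` are all hypothetical names. The point is only that the disconnect check tolerates a `nil` node, so the update path still runs and reports the missing node as an ordinary error.

```go
package main

import (
	"errors"
	"fmt"
)

// Node is a hypothetical stand-in for a node record held in server state.
type Node struct {
	ID     string
	Status string
}

// store is a hypothetical in-memory state store keyed by node ID.
type store map[string]*Node

// errNodeNotFound is a hypothetical error returned by the update path
// when the node has already been garbage collected.
var errNodeNotFound = errors.New("node not found")

// isDisconnected checks for nil before reading any field, so a GC'd node
// is simply treated as "not disconnected" instead of causing a panic.
func isDisconnected(n *Node) bool {
	return n != nil && n.Status == "disconnected"
}

// invalidateHeartbeat models what runs when a heartbeat timer fires.
func invalidateHeartbeat(s store, id string) error {
	n := s[id] // nil if the node was GC'd while the timer was waiting
	if isDisconnected(n) {
		return nil
	}
	// Let the normal update run; it returns an error for a missing node.
	return updateStatus(s, id, "down")
}

// updateStatus models the update call, which rejects unknown nodes.
func updateStatus(s store, id, status string) error {
	n, ok := s[id]
	if !ok {
		return fmt.Errorf("%w: %s", errNodeNotFound, id)
	}
	n.Status = status
	return nil
}

func main() {
	s := store{"n1": {ID: "n1", Status: "ready"}}
	fmt.Println(invalidateHeartbeat(s, "n1"))
	fmt.Println(invalidateHeartbeat(s, "gone"))
}
```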
James Rasell
ca97aa55c0 Merge pull request #23397 from hashicorp/post-1.8.1-release
Post 1.8.1 release
2024-06-20 14:30:41 +01:00
James Rasell
d63ad1a6c5 Generate UI assets 2024-06-20 14:13:24 +01:00
James Rasell
6278e6c0cb Merge release 1.8.1 files 2024-06-20 08:02:48 +01:00
James Rasell
26d0a9169c docs: fix typo in alloc exec CLI docs page. (#23392) 2024-06-20 07:50:32 +01:00
hc-github-team-nomad-core
41e0dfd13b Prepare for next release 2024-06-19 15:24:16 +01:00
hc-github-team-nomad-core
9566174e92 Generate files for 1.8.1 release 2024-06-19 15:24:08 +01:00
Phil Renaud
81d4f364b1 [ui] Squashing Ember deprecations (#22424)
* Now that message is a gettable on the task-event model, no longer needs to be attrs-set at serializer

* xstate future-proofing default prop
2024-06-19 09:02:53 -04:00
Daniel Bennett
cfeedd05e8 api: use the task in Allocations.GetPauseState (#23377) 2024-06-18 12:31:12 -05:00
Phil Renaud
b92afa5b4a [ui] Helios page headers added to the administration section (#23366)
* Helios page headers added to the administration section

* data-test-title mirage helper missed on policy page
2024-06-18 12:37:51 -04:00
Phil Renaud
5aad029ddb [ui] Don't show keyboard hints when a user is trying to take a screenshot (#23365)
* Don't show keyboard hints when a user is trying to take a screenshot

* Oh hey metaKey is its own special thing
2024-06-18 12:37:05 -04:00
James Rasell
54115a1b37 github: use larger instance type for prepare-release release step. (#23374) 2024-06-18 14:36:40 +01:00
scoss
7dcb9fcf76 add exec2 and podman to supported driver list for memory-max resource limit (#23364)
* add exec2 and podman to supported driver list

* tweak exec2 naming

Co-authored-by: David Yu <dyu@hashicorp.com>

---------

Co-authored-by: Seth Hoenig <shoenig@duck.com>
Co-authored-by: David Yu <dyu@hashicorp.com>
2024-06-18 08:26:50 -05:00
dependabot[bot]
a80ebe2c15 chore(deps): bump github.com/docker/docker (#20460)
Bumps [github.com/docker/docker](https://github.com/docker/docker) from 26.0.1+incompatible to 26.0.2+incompatible.
- [Release notes](https://github.com/docker/docker/releases)
- [Commits](https://github.com/docker/docker/compare/v26.0.1...v26.0.2)

---
updated-dependencies:
- dependency-name: github.com/docker/docker
  dependency-type: direct:production
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2024-06-17 16:53:29 -04:00
Daniel Bennett
2da38ba9c4 e2e: jobs3 hcl vars differently (#23363)
and include jobspec and vars in registrations
(so they show up in the UI under job Definition)
2024-06-17 13:20:51 -05:00
Daniel Bennett
5a6e3d5ef0 e2e: add Enterprise Option for cluster3.Establish (#23362) 2024-06-17 12:59:37 -05:00
Phil Renaud
8e589a9319 [ui] Add "stopped" as a valid status on jobs index/job detail (#23328)
* Stopped status passed through to the statuses endpoint and observed on job model and steady-state panel

* Status passed to statuses endpoint and test for FE model statuses
2024-06-14 23:33:00 -04:00
Tim Gross
d9a10a6298 deps: update runc to 1.1.13 (#23331)
Update `runc` to 1.1.13 to pick up build support for Go 1.22.4+, in order to
ensure we've resolved errors cloning processes into Linux namespaces for
libcontainer (`exec` driver) with new versions of Go and older but still
supported versions of glibc.

This changeset has two minor quirks:

* Testing shows that the reported issue is already resolved on `main` by
  upgrading to Go 1.22.4 without this dependency bump, at least for glibc 2.31.
  Upgrading the dependency should make sure there isn't another glibc version
  where the problem will still appear.

* This version of `runc` refers to fields in `cilium/ebpf` which are not present
  in more recent versions of that library. So in order to build, we have to
  downgrade `cilium/ebpf`. Fortunately, `runc` is the only consumer of that
  transitive dependency.

Closes: https://github.com/hashicorp/nomad/issues/20212
Ref: https://hashicorp.atlassian.net/browse/NET-10078
2024-06-14 14:38:44 -04:00
David Yu
0cc2ab5ae9 Merge pull request #23322 from hashicorp/david-yu-patch-1
docs: install `consul-cni` manually or via linux packaging
2024-06-14 11:37:46 -07:00
David Yu
36f75c5f3e Update index.mdx 2024-06-14 11:25:23 -07:00
David Yu
b2d29340b6 Update index.mdx
remove LICENSE.txt from unzip
2024-06-14 11:00:49 -07:00
David Yu
be30e130fe Update index.mdx 2024-06-14 10:57:05 -07:00
David Yu
ac2a5a851f Update index.mdx 2024-06-14 10:25:52 -07:00
David Yu
b79d813e7d Update index.mdx 2024-06-14 10:12:34 -07:00
David Yu
dea70a356e Update index.mdx 2024-06-14 10:03:17 -07:00
David Yu
f974381253 Update index.mdx 2024-06-14 09:47:36 -07:00
David Yu
26a30ac908 Update index.mdx 2024-06-14 09:42:56 -07:00
Daniel Bennett
3b9283a2fd task schedule: fix end-of-month validation bug (#23329)
fix erroneous "end cannot be sooner than start" error condition
on the last day of the month
2024-06-14 11:26:35 -05:00
David Yu
947ecd1c77 Update website/content/docs/install/index.mdx
Co-authored-by: Tim Gross <tgross@hashicorp.com>
2024-06-14 09:16:14 -07:00
Phil Renaud
eacf47a484 [ui] Remove simulacral allocation stat in favor of live-updating one (#23306)
* Remove simulacral allocation stat in favor of live-updating one

* Tests remove dependency on job-summary as well
2024-06-14 09:23:31 -04:00
Piotr Kazmierczak
85430be6dd raw_exec: oom_score_adj support (#23308) 2024-06-14 11:36:27 +02:00
David Yu
fe0e76cc3b Update index.mdx 2024-06-13 20:50:57 -07:00
David Yu
5d9d337727 Update index.mdx 2024-06-13 20:37:21 -07:00
David Yu
a08d6f5768 Update index.mdx 2024-06-13 20:10:31 -07:00
David Yu
92af6280e3 Update service-mesh.mdx 2024-06-13 20:09:53 -07:00
David Yu
92a5257d7b Update v1_8_x.mdx 2024-06-13 20:07:51 -07:00
David Yu
331f96f103 Update index.mdx 2024-06-13 20:06:39 -07:00
David Yu
51ff35bef4 docs: install consul-cni via package 2024-06-13 16:41:26 -07:00
David Yu
94bb91ab80 docs - release notes updates (#23312)
Also updated Consul compatibility matrix
2024-06-13 13:46:42 -04:00
Luke Palmer
75874136ac fix cgroup setup for non-default devices (#22518) 2024-06-13 09:27:19 -04:00
Phil Renaud
d34943435d [ui] Rework of node/job attributes/meta using PathTree (#23290)
* Rework of attributes using pathTree

* Pack meta reintroduced and made local

* attributes table test updated for new pathTree syntax

* removed flat import and extended the PathTree type signature to include prefix

* Slightly darken the is-faded text in tables
2024-06-12 14:28:17 -04:00
Tim Gross
ce04fe4a4e acls: reduce permissions of client agent virtual policy (#23304)
Nomad client agents run as privileged processes and require access to much of
the cluster state, secrets, etc. to operate. But we can improve upon this by
tightening up the virtual policy that we use for RPC requests authenticated by the
node secret ID. This changeset removes the `node:read`, `plugin:read`, and
`plugin:list` policies, as well as namespace operations. In return, we add an
`AllowClientOp` check to the RPCs the client uses that would otherwise need
those policies.

Where possible, the update RPCs have also been changed to match on node ID so
that a client can only make the RPC that impacts itself. In future work, we may
be able to downscope further by adding node pool filtering to `AllowClientOp`.

Ref: https://github.com/hashicorp/nomad-enterprise/issues/1528
Ref: https://github.com/hashicorp/nomad-enterprise/pull/1529
Ref: https://hashicorp.atlassian.net/browse/NET-9925
2024-06-12 11:32:22 -04:00
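The two checks described in the commit above (a client-only permission check, plus matching on node ID) might look something like the sketch below. Everything here is an assumption for illustration: `Identity`, `allowClientOp`, `authorizeNodeUpdate`, and `ErrPermissionDenied` are hypothetical names, and the real `AllowClientOp` check in Nomad may have a different shape.

```go
package main

import (
	"errors"
	"fmt"
)

// ErrPermissionDenied is a hypothetical sentinel error for rejected requests.
var ErrPermissionDenied = errors.New("permission denied")

// Identity is a hypothetical, simplified result of authenticating a request.
// ClientNodeID is non-empty only when the caller used a node secret ID.
type Identity struct {
	ClientNodeID string
}

// allowClientOp is a hypothetical stand-in for a client-only permission
// check: it passes only for requests authenticated as a client agent.
func allowClientOp(id Identity) bool {
	return id.ClientNodeID != ""
}

// authorizeNodeUpdate allows an update only when the caller is a client
// agent and the node it wants to update is the caller's own node.
func authorizeNodeUpdate(id Identity, targetNodeID string) error {
	if !allowClientOp(id) {
		return ErrPermissionDenied
	}
	if id.ClientNodeID != targetNodeID {
		return fmt.Errorf("%w: node %q cannot update node %q",
			ErrPermissionDenied, id.ClientNodeID, targetNodeID)
	}
	return nil
}

func main() {
	self := Identity{ClientNodeID: "node-a"}
	fmt.Println(authorizeNodeUpdate(self, "node-a"))
	fmt.Println(authorizeNodeUpdate(self, "node-b"))
}
```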
Piotr Kazmierczak
830297bcf0 docker: update image in TestDockerDriver_Start_Image_HTTPS (#23309) 2024-06-12 16:13:39 +02:00
Piotr Kazmierczak
0e8a67f0e1 docker: oom_score_adj support (#23297) 2024-06-12 10:49:20 +02:00
Matt McQuillan
7f1665d326 Merge pull request #23286 from hashicorp/mmcquillan/jiraworkflow
Adding GHA workflow to sync with Jira
2024-06-11 12:52:57 -04:00
Tim Gross
44078d4786 docs: update configuration docs to include trace-level logging (#23285) 2024-06-11 09:19:52 -04:00
Tim Gross
7d73065066 numa: fix scheduler panic due to topology serialization bug (#23284)
The NUMA topology struct field `NodeIDs` is an `idset.Set`, which has no public
members. As a result, this field is never serialized via msgpack and persisted
in state. When `numa.affinity = "prefer"`, the scheduler dereferences this nil
field and panics the scheduler worker.

Ideally we would fix this by adding a msgpack serialization extension, but
because the field already exists and is just always empty, this breaks RPC wire
compatibility across upgrades. Instead, create a new field that's populated at
the same time we populate the more useful `idset.Set`, and repopulate the set on
demand.

Fixes: https://hashicorp.atlassian.net/browse/NET-9924
2024-06-11 08:55:00 -04:00
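The serialization problem and the workaround described in the commit above can be illustrated with a small sketch. It makes several assumptions: `encoding/json` stands in for msgpack (both skip unexported struct fields), and `Set`, `Topology`, `Nodes`, `GetNodeIDs`, and `roundTrip` are hypothetical names rather than Nomad's real types. In this simplified version the set comes back empty instead of nil, but the remedy has the same shape: a plain exported field travels alongside the set, and the set is rebuilt from it on demand.

```go
package main

import (
	"encoding/json"
	"fmt"
	"sort"
)

// Set is a hypothetical set type with no exported fields, so an encoder
// that only looks at exported fields cannot see its contents.
type Set struct {
	items map[int]struct{}
}

// NewSet builds a Set from the given IDs.
func NewSet(ids ...int) *Set {
	s := &Set{items: make(map[int]struct{})}
	for _, id := range ids {
		s.items[id] = struct{}{}
	}
	return s
}

// Len is safe to call on a nil Set.
func (s *Set) Len() int {
	if s == nil {
		return 0
	}
	return len(s.items)
}

// Slice returns the members in ascending order.
func (s *Set) Slice() []int {
	if s == nil {
		return nil
	}
	out := make([]int, 0, len(s.items))
	for id := range s.items {
		out = append(out, id)
	}
	sort.Ints(out)
	return out
}

// Topology carries the set plus a plain slice that survives encoding.
type Topology struct {
	NodeIDs *Set  // contents are lost when encoded
	Nodes   []int // populated at the same time as NodeIDs
}

// NewTopology populates both fields together.
func NewTopology(ids ...int) *Topology {
	return &Topology{NodeIDs: NewSet(ids...), Nodes: append([]int(nil), ids...)}
}

// GetNodeIDs rebuilds the set from Nodes if it was lost in transit.
func (t *Topology) GetNodeIDs() *Set {
	if t.NodeIDs.Len() == 0 && len(t.Nodes) > 0 {
		t.NodeIDs = NewSet(t.Nodes...)
	}
	return t.NodeIDs
}

// roundTrip encodes and decodes a Topology, as persisting it would.
func roundTrip(t *Topology) (*Topology, error) {
	raw, err := json.Marshal(t)
	if err != nil {
		return nil, err
	}
	out := new(Topology)
	if err := json.Unmarshal(raw, out); err != nil {
		return nil, err
	}
	return out, nil
}

func main() {
	restored, err := roundTrip(NewTopology(0, 1))
	if err != nil {
		panic(err)
	}
	fmt.Println(restored.NodeIDs.Len())        // the set did not survive
	fmt.Println(restored.GetNodeIDs().Slice()) // rebuilt from Nodes
}
```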
Tim Gross
288a048a2e e2e: add prerelease builds to Consul/Vault compatibility tests (#23287)
Update the Consul/Vault build downloader functions so that we include the
current prerelease build (if any) in the E2E compatibility testing we do on each
PR. This will automatically cycle out when the GA build is released, because
that build is "higher" in the sorted set.
2024-06-11 08:54:27 -04:00
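The "cycles out automatically" behavior in the commit above follows from version ordering: a prerelease sorts below the GA build with the same version number. The sketch below shows that ordering under simplifying assumptions. `build`, `parse`, `less`, and `newest` are hypothetical helpers, versions are assumed to look like `X.Y.Z` or `X.Y.Z-tag`, and prerelease tags are compared as plain strings rather than by full semantic-versioning rules.

```go
package main

import (
	"fmt"
	"strconv"
	"strings"
)

// build is a hypothetical parsed version: three numbers plus an optional
// prerelease tag (empty for a GA build).
type build struct {
	raw  string
	nums [3]int
	pre  string
}

// parse accepts versions shaped like "1.19.0" or "1.19.0-rc1".
func parse(raw string) (build, error) {
	b := build{raw: raw}
	core, pre, _ := strings.Cut(raw, "-")
	b.pre = pre
	parts := strings.Split(core, ".")
	if len(parts) != 3 {
		return b, fmt.Errorf("invalid version %q", raw)
	}
	for i, p := range parts {
		n, err := strconv.Atoi(p)
		if err != nil {
			return b, fmt.Errorf("invalid version %q: %w", raw, err)
		}
		b.nums[i] = n
	}
	return b, nil
}

// less reports whether a sorts before b. With equal numbers, a prerelease
// sorts before the GA build; two prereleases compare by tag (simplified).
func less(a, b build) bool {
	for i := range a.nums {
		if a.nums[i] != b.nums[i] {
			return a.nums[i] < b.nums[i]
		}
	}
	if a.pre == "" || b.pre == "" {
		return a.pre != "" && b.pre == ""
	}
	return a.pre < b.pre
}

// newest returns the highest version in the list.
func newest(raws []string) (string, error) {
	if len(raws) == 0 {
		return "", fmt.Errorf("no builds given")
	}
	best, err := parse(raws[0])
	if err != nil {
		return "", err
	}
	for _, r := range raws[1:] {
		b, err := parse(r)
		if err != nil {
			return "", err
		}
		if less(best, b) {
			best = b
		}
	}
	return best.raw, nil
}

func main() {
	// Before GA, the prerelease is the highest build.
	fmt.Println(newest([]string{"1.18.2", "1.19.0-rc1"}))
	// Once GA ships, it sorts above the prerelease and replaces it.
	fmt.Println(newest([]string{"1.18.2", "1.19.0-rc1", "1.19.0"}))
}
```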
Tim Gross
61608e43cb test: move NUMA platform scan out of testing global (#23289)
The `testing.go` test helpers file for the driver manager initializes the NUMA
scan as a package-global variable. This causes it to be pulled in even in
production builds, so even running commands like `nomad version` will cause the
NUMA scan to happen. Move the scan into the test helper setup.
2024-06-11 08:52:51 -04:00
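The cost of a package-global initializer, as described in the commit above, comes from Go running package-level variable initializers at program start for every binary that imports the package. One way to avoid that is to defer the work until a helper is actually called. The sketch below uses `sync.Once` for this, which is an assumption on my part and not necessarily how the Nomad change is written; `scanTopology`, `testTopology`, and the `scans` counter are hypothetical.

```go
package main

import (
	"fmt"
	"sync"
)

// scans counts how often the (hypothetical) expensive scan has run.
var scans int

// scanTopology stands in for an expensive platform scan.
func scanTopology() string {
	scans++
	return "topology"
}

// Eager form, shown only as a comment: a package-level initializer like
//
//	var topology = scanTopology()
//
// runs at program start in every binary that imports the package.

var (
	once     sync.Once
	topology string
)

// testTopology runs the scan at most once, and only when a caller
// (such as a test helper) actually asks for the result.
func testTopology() string {
	once.Do(func() { topology = scanTopology() })
	return topology
}

func main() {
	fmt.Println("scans so far:", scans)
	fmt.Println(testTopology(), testTopology())
	fmt.Println("scans after two calls:", scans)
}
```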