Commit Graph

15446 Commits

Author SHA1 Message Date
Lang Martin
ab3e6259d0 executor_unix and _windows stub getAllPids ByScanning 2019-07-17 17:34:06 -04:00
Lang Martin
1a9c598fc2 executor_universal_linux getAllPids chooses cgroup when available 2019-07-17 17:33:55 -04:00
Lang Martin
3834616691 executor use e.getAllPids() 2019-07-17 17:33:11 -04:00
Lang Martin
d3ef456bd7 resource_container_linux new getAllPidsByCgroup 2019-07-17 17:31:36 -04:00
Lang Martin
412997f566 pid_collector getAllPids -> getAllPidsByScanning 2019-07-17 17:31:20 -04:00
Buck Doyle
87d716ed0b UI: Add page titles (#5924)
This uses ember-page-title to add dynamic page titles throughout the
route hierarchy. When there’s more than one region, the current
current region is added before the final entry of “- Nomad”.
2019-07-17 15:02:58 -05:00
Chris Baker
0965379f27 Merge pull request #5870 from hashicorp/b-nmd-1529-alloc-stop-missing-header
api: return X-Nomad-Index header on allocation stop
2019-07-17 13:25:17 -04:00
Michael Schurter
bcfb39da83 Merge pull request #5791 from hashicorp/b-plan-snapshotindex
nomad: include snapshot index when submitting plans
2019-07-17 09:25:00 -07:00
Mahmood Ali
b352af92ca Merge pull request #5948 from hashicorp/b-stats-recover-plugin-shutdown
Collect driver stats when driver plugins are restarted
2019-07-17 12:14:56 +08:00
Mahmood Ali
15caf5cf8c Merge pull request #5837 from hashicorp/b-consul-restore-sync-2
Avoid de-registering slowly restored services
2019-07-17 12:02:24 +08:00
Mahmood Ali
66bef39dd5 log unrecoverable errors 2019-07-17 11:01:59 +07:00
Mahmood Ali
121c97475d address review feedback 2019-07-17 10:43:13 +07:00
Lang Martin
cacf79e6ac Merge pull request #5960 from shvar/master
take NodeID from url in api for node eligibility
2019-07-16 16:09:12 -04:00
Yishan Lin
457c3f8ce5 Add interoperability support line to Nomad Downloads documentation page.
Added line around interoperability to Downloads page.
2019-07-16 10:51:22 -07:00
Yishan Lin
143448614b Added line around interoperability to Nomad Downloads page. 2019-07-15 14:11:11 -07:00
Buck Doyle
596b5aaf7e UI: Add copy button for client/allocation UUIDs (#5926)
The button shows a success icon and tooltip on click, and resets
after two seconds.
2019-07-15 12:14:32 -05:00
Preetha
2d5aa6de68 Merge pull request #5938 from RenaudWasTaken/master
Updated the TensorRT demo to use the official NVIDIA image
2019-07-15 11:30:31 -05:00
Preetha
c59fde4b3e Merge pull request #5952 from cneira/jail-task-driver
Added Community task driver for FreeBSD jails
2019-07-15 11:12:25 -05:00
Eli Shvartsman
b2d40e7ef7 take NodeID from url in api for node eligibility 2019-07-15 18:34:53 +03:00
Mahmood Ali
6844736d4b sort changelog entries 2019-07-15 10:56:47 +08:00
Mahmood Ali
52ffcbe37d changelog GH-5954 2019-07-15 10:55:31 +08:00
Mahmood Ali
e129c41ab2 Merge pull request #5954 from hashicorp/b-fix-streaming-rpc-tls
rpc: use tls wrapped connection for streaming rpc
2019-07-13 07:29:48 +08:00
cneira
290ddea792 fixup 2019-07-12 17:08:23 -04:00
cneira
bcf3b8eefb Merge branch 'jail-task-driver' of https://github.com/cneira/nomad into jail-task-driver 2019-07-12 16:52:22 -04:00
cneira
8181d59c7b fixup 2019-07-12 16:52:19 -04:00
Mahmood Ali
0a58242d55 update changelog 2019-07-13 00:47:43 +08:00
Carlos Neira
4bdea17705 Update jail-task-driver.html.md 2019-07-12 11:45:56 -04:00
Carlos Neira
111d615099 Fixed LXC reference 2019-07-12 11:27:47 -04:00
Mahmood Ali
b0d98d17b5 rpc: use tls wrapped connection for streaming rpc
This ensures that server-to-server streaming RPC calls use the tls
wrapped connections.

Prior to this, `streamingRpcImpl` function uses tls for setting header
and invoking the rpc method, but returns unwrapped tls connection.
Thus, streaming writes fail with tls errors.

This tls streaming bug existed since 0.8.0[1], but PR #5654[2]
exacerbated it in 0.9.2.  Prior to PR #5654, nomad client used to
shuffle servers at every heartbeat -- `servers.Manager.setServers`[3]
always shuffled servers and was called by heartbeat code[4].  Shuffling
servers meant that a nomad client would heartbeat and establish a
connection against all nomad servers eventually.  When handling
streaming RPC calls, nomad servers used these local connection to
communicate directly to the client.  The server-to-server forwarding
logic was left mostly unexercised.

PR #5654 means that a nomad client may connect to a single server only
and caused the server-to-server forward streaming RPC code to get
exercised more and unearthed the problem.

[1] https://github.com/hashicorp/nomad/blob/v0.8.0/nomad/rpc.go#L501-L515
[2] https://github.com/hashicorp/nomad/pull/5654
[3] https://github.com/hashicorp/nomad/blob/v0.9.1/client/servers/manager.go#L198-L216
[4] https://github.com/hashicorp/nomad/blob/v0.9.1/client/client.go#L1603
2019-07-12 14:41:44 +08:00
Mahmood Ali
a6604f8926 rpc: add positive tests for server streaming RPC 2019-07-12 14:32:52 +08:00
Omar Khawaja
88c03d0888 TF config enable services (#5947)
* enable vault, consul, and nomad services to make them persistent after reboot

* update AMI
2019-07-11 22:36:58 +02:00
cneira
38018d7a8c Added Community task driver for FreeBSD jails 2019-07-11 13:43:16 -04:00
Preetha
c031f666c9 Merge pull request #5912 from hashicorp/f-systemd-nofile
systemd: set a high but non-infinite fd limit
2019-07-11 12:31:12 -05:00
Mahmood Ali
bbf8f90ecb client/taskrunner: fix stats stats retry logic
Previously, if a channel is closed, we retry the Stats call.  But, if that call
fails, we go in a backoff loop without calling Stats ever again.

Here, we use a utility function for calling driverHandle.Stats call that retries
as one expects.

I aimed to preserve the logging formats but made small improvements as I saw fit.
2019-07-11 13:58:07 +08:00
Mahmood Ali
d8a43b2066 Signal plugin shutdown for driver.TaskStats
The driver plugin stub client must call `grpcutils.HandleGrpcErr` to handle plugin
shutdown similar to other functions.  This ensures that TaskStats returns
`ErrPluginShutdown` when plugin shutdown.
2019-07-11 13:57:35 +08:00
Lang Martin
8da0b91287 Merge pull request #5784 from hashicorp/b-batch-node-dereg
batch node deregistration
2019-07-10 14:24:54 -04:00
Lang Martin
17f49512ee Changelog 2019-07-10 13:56:57 -04:00
Lang Martin
e0edc7d93e node_endpoint preserve both messages as rpcs and in raft 2019-07-10 13:56:20 -04:00
Lang Martin
5eac719133 core_sched add compat comment for later removal 2019-07-10 13:56:20 -04:00
Lang Martin
675f750038 structs drop deprecation warning, revert unnecessary comment change 2019-07-10 13:56:20 -04:00
Lang Martin
9bdf35ad4d NodeDeregisterBatch -> NodeBatchDeregister match JobBatch pattern 2019-07-10 13:56:20 -04:00
Lang Martin
3ad8a2498c state_store error if called without node_ids 2019-07-10 13:56:20 -04:00
Lang Martin
838460e70c fsm new NodeDeregisterBatchRequestType sorted at the end of the case 2019-07-10 13:56:20 -04:00
Lang Martin
0deb209d99 checklist NodeDeregisterBatchRequestType must go at the end 2019-07-10 13:56:20 -04:00
Lang Martin
f5632ba98a structs NodeDeregisterBatchRequestType must go at the end 2019-07-10 13:56:20 -04:00
Lang Martin
c20c232eab fsm label batch_deregister_node metrics explicitly
Co-Authored-By: Mahmood Ali <mahmood@notnoop.com>
2019-07-10 13:56:20 -04:00
Lang Martin
c915bf08a0 new file: contributing/checklist-rpc-endpoint.md 2019-07-10 13:56:20 -04:00
Lang Martin
4d11c32e19 core_sched use the new rpc names 2019-07-10 13:56:20 -04:00
Lang Martin
9c42d03366 fsm support new NodeDeregisterBatchRequest 2019-07-10 13:56:20 -04:00
Lang Martin
23fdadfd95 node endpoint support new NodeDeregisterBatchRequest 2019-07-10 13:56:19 -04:00