2622 Commits

Author SHA1 Message Date
Alex Dadgar
80b434d467 Merge pull request #3411 from cheeseprocedure/f-qemu-graceful-shutdown
Qemu driver: graceful shutdown feature
2017-11-03 16:41:34 -07:00
Michael Schurter
e93d625d44 Remove noisy log line
Didn't mean to commit this
2017-11-03 16:00:30 -07:00
Matt Mercer
f734d842f5 Qemu driver: clean up logging; fail unsupported features on Windows 2017-11-03 15:40:20 -07:00
Alex Dadgar
a94bab6491 fix spelling mistake 2017-11-03 15:04:59 -07:00
Alex Dadgar
11184c7514 Merge pull request #3459 from multani/docker-oom-notification
docker: log that a container has been killed by the OOM killer
2017-11-03 13:24:03 -07:00
Matt Mercer
66f9840dd0 Qemu driver: tweaks in response to PR feedback
Remove attribute for long qemu monitor path; misc cleanup; update tests
2017-11-03 11:28:56 -07:00
Preetha Appan
b3631f3d32 Remove event GenericSource, and address other code review comments. Also added deprecation info in comments. 2017-11-03 10:10:06 -05:00
Preetha Appan
d63e693679 Move logic for determining event display message to task_runner, added two new fields DisplayMessage and Details. 2017-11-03 09:13:01 -05:00
Alex Dadgar
c15f49ae8d Alloc Runner doesn't panic on restoration. 2017-11-02 16:14:13 -07:00
Alex Dadgar
52598bff7e Merge pull request #3493 from hashicorp/f-remove-atlas
Remove Atlas and Scada from codebase
2017-11-02 16:00:44 -07:00
Michael Schurter
4bca2cd669 Merge pull request #3490 from hashicorp/f-gc-logging
Make unable-to-gc log level adaptive
2017-11-02 14:32:40 -07:00
Diptanu Choudhury
5d36408475 Added the node_id as a tag 2017-11-02 13:29:10 -07:00
Alex Dadgar
53dbc4f127 remove atlas 2017-11-02 11:27:21 -07:00
Michael Schurter
cb3a03c829 Make unable-to-gc log level adaptive
WARNing when someone has over 50 non-terminal allocs was just too
confusing.

Tested manually with `gc_max_allocs = 10` and bumping a job from `count
= 19` to `count = 21`:

```
2017/11/02 17:54:21.076132 [INFO] client.gc: garbage collection due to number of allocations (19) is over the limit (10) skipped because no terminal allocations
...
2017/11/02 17:54:48.634529 [WARN] client.gc: garbage collection due to number of allocations (21) is over the limit (10) skipped because no terminal allocations
```
2017-11-02 10:57:42 -07:00
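
The adaptive log level described in the commit above can be sketched roughly as follows. This is an illustration only, not Nomad's code: `gcSkipLogLevel` is a hypothetical helper, and the "twice the limit" threshold is an assumption chosen because it fits the two log lines in the commit message (19 allocations logged at INFO, 21 at WARN, with `gc_max_allocs = 10`). The real threshold may be different.

```go
package main

import "fmt"

// gcSkipLogLevel is a hypothetical helper, not part of Nomad.
// Assumption: a skipped GC is logged at INFO until the number of live
// allocations reaches twice the configured limit, and at WARN after that.
// The factor of two is a guess that matches the commit's log excerpt.
func gcSkipLogLevel(liveAllocs, maxAllocs int) string {
	if liveAllocs >= 2*maxAllocs {
		return "WARN"
	}
	return "INFO"
}

func main() {
	const maxAllocs = 10 // mirrors gc_max_allocs = 10 from the manual test
	for _, n := range []int{19, 21} {
		fmt.Printf("[%s] client.gc: garbage collection due to number of allocations (%d) is over the limit (%d) skipped because no terminal allocations\n",
			gcSkipLogLevel(n, maxAllocs), n, maxAllocs)
	}
}
```

Picking the level from how far over the limit the client is keeps the message visible without alarming operators who are only slightly above `gc_max_allocs`.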
Diptanu Choudhury
103ff5526e Added support for tagged metrics 2017-11-02 10:07:57 -07:00
Diptanu Choudhury
9593e12672 Incrementing the start counter when we are actually starting a container 2017-11-02 09:51:20 -07:00
Diptanu Choudhury
0bade76fd5 Recording counter for dead allocs properly 2017-11-02 09:51:20 -07:00
Diptanu Choudhury
45583d757e Added metrics to track task/alloc start/restarts/dead events 2017-11-02 09:51:20 -07:00
Matt Mercer
185658507f Qemu driver: defer cleanup sooner 2017-11-01 17:37:43 -07:00
Matt Mercer
15d7565931 Qemu driver: clean up test logging; retry integration test for longer 2017-11-01 17:21:56 -07:00
Matt Mercer
60030d89d1 Use strings.Replace() instead of custom function 2017-11-01 15:31:35 -07:00
Matt Mercer
2924bada55 Qemu driver: basic testing of graceful shutdown feature 2017-11-01 15:31:30 -07:00
Matt Mercer
1ff97035f0 Qemu driver: include PIDs in log output 2017-11-01 15:31:24 -07:00
Matt Mercer
200a12cbcc Qemu driver: ensure proper cleanup of resources 2017-11-01 15:31:20 -07:00
Matt Mercer
22f390d75a Qemu driver: minor logging fixes 2017-11-01 15:31:14 -07:00
Matt Mercer
3f6fdfcb9b Standardize driver.qemu logging prefix 2017-11-01 15:30:44 -07:00
Matt Mercer
00e3cc869d Qemu driver: add graceful shutdown feature 2017-11-01 15:30:36 -07:00
Michael Schurter
ec43315e13 Fix regression by returning error on unknown alloc 2017-11-01 15:16:38 -05:00
Michael Schurter
a3ea8302bf Fix race in test 2017-11-01 15:16:38 -05:00
Michael Schurter
fb3a780b7a Trigger GCs after alloc changes
GC much more aggressively by triggering GCs when allocations become
terminal as well as after new allocations are added.
2017-11-01 15:16:38 -05:00
Michael Schurter
9c1e595e2e Fix GC'd alloc tracking
The Client.allocs map now contains all AllocRunners again, not just
un-GC'd AllocRunners. Client.allocs is only pruned when the server GCs
allocs.

Also stops logging "marked for GC" twice.
2017-11-01 15:16:38 -05:00
Alex Dadgar
af7e1413a5 fix test 2017-10-30 12:35:31 -07:00
Alex Dadgar
05bb446323 Node access is done using locked Node copy
Fixes https://github.com/hashicorp/nomad/issues/3454

Reliably reproduced the data race before by having a fingerprinter
change the node's attributes every millisecond and syncing at the same
rate. With fix, did not ever panic.
2017-10-27 13:27:24 -07:00
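
The idea of handing out a locked copy of the node can be sketched as below. This is a simplified illustration, not the code from the commit: the `Node` and `Client` types here are stand-ins with hypothetical fields, and `SetAttribute` is an assumed helper that plays the role of a fingerprinter updating attributes.

```go
package main

import (
	"fmt"
	"sync"
)

// Node is a simplified stand-in for the client's node record.
type Node struct {
	ID         string
	Attributes map[string]string
}

// Copy returns a deep copy so callers never share the Attributes map.
func (n *Node) Copy() *Node {
	c := &Node{ID: n.ID, Attributes: make(map[string]string, len(n.Attributes))}
	for k, v := range n.Attributes {
		c.Attributes[k] = v
	}
	return c
}

// Client guards its node with a read/write lock (hypothetical layout).
type Client struct {
	mu   sync.RWMutex
	node *Node
}

// Node copies the node while holding the read lock, so readers work on a
// private snapshot instead of the live, mutable value.
func (c *Client) Node() *Node {
	c.mu.RLock()
	defer c.mu.RUnlock()
	return c.node.Copy()
}

// SetAttribute mutates the live node under the write lock.
func (c *Client) SetAttribute(k, v string) {
	c.mu.Lock()
	defer c.mu.Unlock()
	c.node.Attributes[k] = v
}

func main() {
	c := &Client{node: &Node{ID: "n1", Attributes: map[string]string{}}}
	var wg sync.WaitGroup
	wg.Add(2)
	go func() { // writer, standing in for a fingerprinter
		defer wg.Done()
		for i := 0; i < 1000; i++ {
			c.SetAttribute("k", fmt.Sprint(i))
		}
	}()
	go func() { // reader, standing in for the periodic sync
		defer wg.Done()
		for i := 0; i < 1000; i++ {
			_ = c.Node().Attributes["k"]
		}
	}()
	wg.Wait()
	fmt.Println(c.Node().Attributes["k"])
}
```

Running a sketch like this with `go run -race` is one way to check that concurrent reads and writes no longer touch the same map.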
Jonathan Ballet
15114e448e docker: changed OOM killed error message 2017-10-27 20:30:52 +02:00
Jonathan Ballet
e5a790753d docker: log that a container has been killed by the OOM killer
Fix: #2203 (at least for Docker tasks)
2017-10-27 18:05:27 +02:00
Alex Dadgar
f5549a6eb9 go style vars 2017-10-25 10:49:34 -07:00
Alex Dadgar
366e91276f fix two flaky tests 2017-10-23 18:15:52 -07:00
Alex Dadgar
8accabcd87 move to consul freeport implementation 2017-10-23 16:51:40 -07:00
Alex Dadgar
e1b1465081 Standardize retrieving a free port into a helper package 2017-10-23 16:48:20 -07:00
Alex Dadgar
3be474b811 don't double parallel 2017-10-23 16:48:06 -07:00
Alex Dadgar
41773c3fdf respond to comments 2017-10-23 15:50:27 -07:00
Alex Dadgar
5b4e31bb5b Skip if no docker 2017-10-19 16:55:10 -07:00
Alex Dadgar
8de487c241 fix flaky java tests 2017-10-19 16:49:57 -07:00
Alex Dadgar
e175c56082 Undo darwin user setting 2017-10-19 16:49:57 -07:00
Alex Dadgar
651159077b Run as user on mac 2017-10-19 16:49:57 -07:00
Alex Dadgar
838e7cbac9 sudo docker works 2017-10-19 16:49:57 -07:00
Alex Dadgar
039c1f84c6 docker tests 2017-10-19 16:49:57 -07:00
Michael Schurter
4a6255b6c6 Add logging around moby/moby#32648 bug 2017-10-18 10:44:03 -07:00
Michael Schurter
87b86230c4 Properly fail rkt fingerprinting on old versions 2017-10-16 13:58:58 -07:00
Michael Schurter
dbc3a3baaa Squelch repeated rkt version warnings 2017-10-16 12:09:47 -07:00