Omar Khawaja
3817ce3dd5
editing monitoring.html ( #4754 )
2018-10-04 18:40:13 -04:00
Omar Khawaja
d836fa06cf
editing lb guide ( #4753 )
2018-10-04 18:26:51 -04:00
Alex Dadgar
343e06c60f
Merge pull request #4638 from oleksii-shyman/nvidia-plugin
...
WIP :: Nvidia Plugin
2018-10-04 15:24:36 -07:00
oleksii.shyman
a7e04f1520
Introduce nvidia-plugin reserve
...
- added reserve functionality that returns OCI compliant env variables
specifying GPU IDs to be injected inside the container
2018-10-04 14:55:34 -07:00
Omar Khawaja
b8c3a1d02d
Monitoring and Alerting Guide with Prometheus [WIP] ( #4706 )
...
* add prometheus configuration guide
* fixing sub navigation issue
* Add detail to Next Steps
* add alerting component to guide
* update
* change docker image name and shorten job templates
* re-arrange to fix broken links
2018-10-04 17:15:10 -04:00
Omar Khawaja
dd3601979e
Load Balancing with Fabio Guide ( #4445 )
...
* add load-balancing guide
* restructure load balancing section
* defining consul lb strategies inline and giving fabio its own bullet point
* update docker image name and shorten job template
* changing system scheduler link to relative link and moving load balancing navigation link right to right above Web UI
2018-10-04 16:18:52 -04:00
oleksii.shyman
9c8c67e948
Introduce Nvidia-plugin stats
...
- created go-nvml wrapper for stats
- added stats feature to nvidia-plugin
2018-10-03 15:12:05 -07:00
oleksii.shyman
63f4fbf273
Introduce nvidia-plugin fingerprinting
...
- created go-nvml wrapper for fingerprinting
- added fingerprinting feature to nvidia-plugin
2018-10-03 15:11:56 -07:00
Alex Dadgar
95d9286ad1
changelog
2018-09-26 14:53:15 -07:00
Alex Dadgar
025c5d4455
Merge pull request #4723 from hashicorp/b-autopilot-cli
...
Fix autopilot set enable custom upgrades flag
2018-09-25 13:53:52 -07:00
Alex Dadgar
9688161a54
Fix autopilot set enable custom upgrades flag
2018-09-25 13:49:35 -07:00
Alex Dadgar
589e67202b
Merge pull request #4720 from hashicorp/b-jet-fixes
...
Series of scheduler fixes / debugging enhancements
2018-09-25 13:25:11 -07:00
Alex Dadgar
088f51a330
skip e2e/vault if integration isn't set
2018-09-25 11:29:09 -07:00
Alex Dadgar
bcb1a67015
Merge pull request #4712 from hashicorp/b-failed-trigger-reason
...
Add a missing eval trigger reason
2018-09-25 10:50:16 -07:00
Alex Dadgar
b3e85557f0
fix logging
2018-09-25 10:49:55 -07:00
Preetha Appan
47e22f6b7c
Add failed follow up to the list of allowed eval trigger reasons
...
needs unit test
2018-09-25 10:49:55 -07:00
Preetha Appan
e9c7dc1286
Added logging around nacked evals in the scheduler worker
2018-09-25 10:49:02 -07:00
Alex Dadgar
454c1d0e84
Merge pull request #4717 from barda999/master
...
changed ${nomad.class} to ${node.class}
2018-09-24 16:51:27 -07:00
barda999
c09cb9f08d
changed ${nomad.class} to ${node.class}
...
I guess that was an unintentional mistake
2018-09-24 16:48:06 -07:00
Alex Dadgar
086b1266c6
Merge pull request #4698 from hashicorp/t-vault-matrix
...
Vault test matrix
2018-09-24 16:34:35 -07:00
Alex Dadgar
668a90102f
proper variable capture
2018-09-24 16:34:15 -07:00
Alex Dadgar
029a7f617e
Merge pull request #4716 from hashicorp/f-no-reuse-triggerby
...
Unique TriggerBy for blocked evals
2018-09-24 16:08:31 -07:00
Alex Dadgar
302a6940af
Merge branch 'b-plan' into b-jet-fixes
2018-09-24 16:07:29 -07:00
Alex Dadgar
b8ec297263
Merge pull request #4709 from hashicorp/b-deployments
...
Fix deployment watcher index usage
2018-09-24 16:05:02 -07:00
Alex Dadgar
ed53038e04
Unique TriggerBy for blocked evals
...
Give blocked evals a unique triggerby reason to make debugging a chain
of evaluations easier.
2018-09-24 14:47:49 -07:00
Alex Dadgar
4c40d62f68
test allocs fit
2018-09-24 13:59:01 -07:00
Alex Dadgar
06920ee46c
Better comment on snapshotindex
2018-09-24 13:53:43 -07:00
Alex Dadgar
82889c432e
Denormalize jobs in plan and ignore resources of terminal allocs
...
Denormalize jobs in AppendAllocs:
AppendAlloc was originally only ever called for inplace upgrades and new
allocations. Both these code paths would remove the job from the
allocation. Now we use this to also add fields such as FollowupEvalID
which did not normalize the job. This is only a performance enhancement.
Ignore terminal allocs:
Failed allocations are annotated with the followup Eval ID when one is
created to replace the failed allocation. However, in the plan applier,
when we check if allocations fit, these terminal allocations were not
filtered. This could result in the plan being rejected if the node would
be overcommited if the terminal allocations resources were considered.
2018-09-24 13:53:43 -07:00
Alex Dadgar
9d4ff89eaf
Fix other instances of blocking queries
2018-09-24 13:52:39 -07:00
Preetha Appan
1a9c18f9df
update changelog
2018-09-24 11:19:51 -05:00
Preetha
21f7198835
Merge pull request #4702 from hashicorp/b-non-voter-boostrap
...
Do not bootstrap with non voters
2018-09-24 11:14:36 -05:00
Alex Dadgar
f7822161b3
always handle failed allocation
2018-09-21 15:13:54 -07:00
Alex Dadgar
34e8b2f264
Fix deployment watcher index usage
...
Fixes three issues:
1. Retrieving the latest evaluation index was not properly selecting the
greatest index. This would undermine checks we had to reduce the number
of evaluations created when the latest eval index was greater than any
alloc change
2. Fix an issue where the blocking query code was using the incorrect
index such that the index was higher than necassary.
3. Special case handling of blocked evaluation since the create/snapshot
index is no particularly useful since they can be reblocked.
2018-09-21 13:59:11 -07:00
Alex Dadgar
61b5eccb97
do not bootstrap with non voters
2018-09-19 17:17:39 -07:00
Alex Dadgar
2332293036
Merge pull request #4693 from Chaosteil/patch-1
...
Update federation.md command
2018-09-19 11:00:46 -07:00
Alex Dadgar
f7f5da204d
build nomad in e2e tests
2018-09-19 10:38:20 -07:00
Alex Dadgar
575e193332
vendor vault api for backwards compatibility
2018-09-19 10:23:18 -07:00
Alex Dadgar
ebe6fe208e
run in matrix
2018-09-19 10:21:57 -07:00
Alex Dadgar
7acb3ca2ee
vet
2018-09-19 10:18:10 -07:00
Alex Dadgar
67ab8eff07
test automation
2018-09-19 10:18:10 -07:00
Alex Dadgar
0ee63c43ea
add a vault test matrix
2018-09-19 10:18:10 -07:00
Alex Dadgar
58775f0782
fix rpc test
2018-09-19 10:17:54 -07:00
Alex Dadgar
aadb09aa16
fix panic
2018-09-18 13:02:03 -07:00
Dominykas Djačenko
0ad8a985ca
Update federation.md command
...
This fixes the documentation to use the most recent syntax for `nomad server join`
2018-09-18 12:58:42 -07:00
Alex Dadgar
acb263cbf8
Merge pull request #4692 from hashicorp/f-plugin-singleton
...
Singleton plugin loader
2018-09-18 10:48:59 -07:00
Alex Dadgar
0917b49871
fix documentation of reattach and use testlog
2018-09-18 10:48:37 -07:00
Alex Dadgar
6aaaa396f5
singleton wrapper
2018-09-18 10:08:46 -07:00
Alex Dadgar
3a26035dfe
Merge pull request #4686 from hashicorp/f-logger-deps
...
Use StandardLogger for Raft/Serf/Memberlist/Yamux
2018-09-17 15:36:43 -07:00
Alex Dadgar
58c889aa94
yamux
2018-09-17 14:22:40 -07:00
Alex Dadgar
5760da6939
vendor yamux
2018-09-17 13:58:51 -07:00