Alex Dadgar
1468985035
Remove generated structs
2018-04-12 16:35:31 -07:00
Alex Dadgar
8a80a5f7c5
Version bump and generated files
2018-04-12 16:21:50 -07:00
Alex Dadgar
e0171acbdd
Move where attribute for driver detection is set
2018-04-12 15:50:25 -07:00
Chelsea Holland Komlo
228a2319c2
delete driver name from only health check attributes
2018-04-12 18:24:41 -04:00
Alex Dadgar
d0605c5229
Fix tests
2018-04-12 14:29:30 -07:00
Alex Dadgar
f45b51a138
Driver health detection cleanups
...
This PR does:
1. Health message based on detection has format "Driver XXX detected"
and "Driver XXX not detected"
2. Set initial health description based on detection status and don't
wait for the first health check.
3. Combine updating attributes on the node, fingerprint and health
checking update for drivers into a single callback.
4. Condensed driver info in `node status` only shows detected drivers
and makes the output less wide by removing spaces.
2018-04-12 12:46:40 -07:00
Charlie Voiselle
c728a2feb8
Changed "til" to "until"
...
Should be "till" or "until"; chose "until" because it is unambiguous as to meaning.
2018-04-11 12:36:28 -05:00
Chelsea Komlo
d32a4822fd
Merge pull request #4111 from hashicorp/b-undetected-set-health-to-false
...
Immediately set driver health status to false when driver moves to undetected
2018-04-10 18:30:31 -04:00
Chelsea Holland Komlo
a40750e596
update comment for when the fingerprinter sets health status
2018-04-10 16:53:00 -04:00
Chelsea Holland Komlo
46ec4633fe
fingerprinter should set health check status if health check is not periodic
2018-04-10 15:29:51 -04:00
Chelsea Holland Komlo
9eaa1e7c9e
add setters for access to the fingerprint manager's node
...
refactor extracting driver info
2018-04-10 15:29:51 -04:00
Chelsea Holland Komlo
49e12b1ad2
guard against overwriting health status
2018-04-10 15:29:51 -04:00
Chelsea Holland Komlo
d3f0d05ac8
immediately set healthy to false when driver moves to undetected
2018-04-10 15:29:51 -04:00
Alex Dadgar
586717bb15
Fix client uptime metric missing client prefix
2018-04-10 10:39:36 -07:00
Seth Vargo
ba6111e2a4
Set user-agent when talking to GCE metadata
2018-04-10 10:36:46 -04:00
Chelsea Komlo
4444a3309e
Merge pull request #4109 from hashicorp/f-shorten-docker-health-timeout
...
Shorten docker health timeout
2018-04-09 15:38:39 -04:00
Chelsea Holland Komlo
c6cd78db59
only initialize docker clients if they are nil
2018-04-09 14:13:07 -04:00
Chelsea Holland Komlo
4c1c88a91c
refactoring simplification from code review
2018-04-09 10:34:17 -04:00
Chelsea Holland Komlo
af8fc4f62c
only run health check if driver moves from undetected to detected
2018-04-09 10:10:43 -04:00
Alex Dadgar
98a403a5a6
Start rebalance after discovering new servers
2018-04-05 15:41:59 -07:00
Alex Dadgar
b2ae8b73ef
Merge pull request #4106 from hashicorp/b-servers
...
Improved Client handling of failed RPCs
2018-04-05 13:48:50 -07:00
Alex Dadgar
9ce59c5828
more jitter
2018-04-05 13:48:33 -07:00
Chelsea Holland Komlo
d251199432
group similar functions; update comments
...
health check timeout should be 1 minute
2018-04-05 16:19:02 -04:00
Chelsea Holland Komlo
dee4fc4555
remove do once block when creating a new docker client
...
only set cached connections upon no error
2018-04-05 16:19:02 -04:00
Chelsea Holland Komlo
45d09d1ef9
use client with shorter timeouts for health checks
2018-04-05 16:19:02 -04:00
Chelsea Holland Komlo
9092439107
refactor docker clients method to be able to extend to creating new clients
2018-04-05 16:19:02 -04:00
Alex Dadgar
c86ad8fa32
Handle no leader and faster retries near limit
...
Handle the ErrNoLeader case and apply slower retries. Also, when we have
missed the heartbeat, retry aggressively, backing off after we have
missed for more than 30 seconds.
2018-04-05 11:22:47 -07:00
Alex Dadgar
12a8655dbd
Scale heartbeat retrying based on remaining heartbeat time
2018-04-05 10:58:13 -07:00
Alex Dadgar
80c380b456
Fire retry only when consul discovers new servers
2018-04-05 10:40:17 -07:00
Preetha
ff006877de
Merge pull request #4101 from hashicorp/b-rescheduling-edge-fixes
...
Fixes edge cases around timing/task finish time being set more than once
2018-04-04 16:18:21 -05:00
Preetha Appan
e81886d588
remove outdated commented out test code
2018-04-04 15:03:24 -05:00
Preetha Appan
8b6143f272
Remove old comment
2018-04-04 15:01:48 -05:00
Preetha Appan
7fa7655ebe
Moves setting finishedAt to the right place and adds two unit tests.
2018-04-04 14:38:15 -05:00
Alex Dadgar
46e6d70435
Spelling error
2018-04-03 18:30:01 -07:00
Alex Dadgar
12ec2e3b60
RPC Retry Watcher
2018-04-03 18:05:28 -07:00
Preetha Appan
d8e975510a
Add comment
2018-04-03 19:49:03 -05:00
Alex Dadgar
ca3b13e4c0
randomize servers
2018-04-03 17:46:13 -07:00
Preetha Appan
aa4a0cff50
Fixes edge cases around timing and task finish time being set more than once
2018-04-03 16:34:59 -05:00
Alex Dadgar
16ec4481e3
Improve Vault error handling
2018-04-03 14:29:22 -07:00
Alex Dadgar
1a66631eff
remove generated files
2018-03-30 16:52:49 -07:00
Alex Dadgar
702a3be41e
Generated files
2018-03-30 16:14:40 -07:00
Michael Schurter
2ee0426985
test: don't rely on alloc runner update count
...
We were incorrectly relying on the count of alloc updates in a number of
tests. Since alloc updates are async, their number is non-deterministic
and largely meaningless.
This should fix quite a few flaky tests in Travis and prevent future
mistaken assumptions in tests.
2018-03-30 09:34:33 -07:00
Michael Schurter
53a504c69c
Merge pull request #4069 from hashicorp/f-hashealth
...
add HasHealth helper for nil checks
2018-03-29 17:03:20 -07:00
Alex Dadgar
357a10bcf4
Always capture the finish time
2018-03-29 11:27:22 -07:00
Michael Schurter
d09b0b62ba
add HasHealth helper for nil checks
...
We performed the DeploymentStatus nil checks a couple of different ways, so
hopefully this helper will consolidate them and make it more clear what
the code is doing.
2018-03-29 09:29:19 -07:00
Chelsea Komlo
00b358553d
Merge pull request #4065 from hashicorp/emit-node-event-on-first-health-change
...
Emit first node event after initialization on health status change
2018-03-29 11:23:25 -04:00
Chelsea Holland Komlo
aeb744d930
add clarifying comment
2018-03-29 10:58:39 -04:00
Michael Schurter
35f42b1fca
Merge pull request #4059 from hashicorp/b-drain-health-svc-only
...
only service allocs should have health watched
2018-03-28 16:49:22 -07:00
Michael Schurter
12dd17affe
only service allocs should have health watched
2018-03-28 16:20:11 -07:00
Chelsea Holland Komlo
dff03f6a91
emit first node event
2018-03-28 17:26:53 -04:00