472 Commits

Author SHA1 Message Date
Chelsea Holland Komlo
3ebecb676c fix up comments 2018-04-17 11:53:08 -04:00
Alex Dadgar
cc3f264a35 Cleanup 2018-04-16 15:48:34 -07:00
Alex Dadgar
3cf87e0dc8 Copy the config given to the alloc runner 2018-04-16 15:45:52 -07:00
Alex Dadgar
89fa9a1e10 Fix copying drivers 2018-04-16 15:45:51 -07:00
Alex Dadgar
9929019b9a Operate on copy 2018-04-16 15:45:49 -07:00
Alex Dadgar
e0171acbdd Move where attribute for driver detection is set 2018-04-12 15:50:25 -07:00
Alex Dadgar
f45b51a138 Driver health detection cleanups
This PR does:

1. Health message based on detection has format "Driver XXX detected"
and "Driver XXX not detected"
2. Set initial health description based on detection status and don't
wait for the first health check.
3. Combine updating attributes on the node, fingerprint and health
checking update for drivers into a single call back.
4. Condensed driver info in `node status` only shows detected drivers
and make the output less wide by removing spaces.
2018-04-12 12:46:40 -07:00
Alex Dadgar
586717bb15 Fix client uptime metric missing client prefix 2018-04-10 10:39:36 -07:00
Alex Dadgar
98a403a5a6 Start rebalance after discovering new servers 2018-04-05 15:41:59 -07:00
Alex Dadgar
9ce59c5828 more jitter 2018-04-05 13:48:33 -07:00
Alex Dadgar
c86ad8fa32 Handle no leader and faster retries near limit
Handle the ErrNoLeader case and apply slower retries. Also when we have
missed the heartbeat retry aggressively, backing off after we have
missed for more than 30 seconds.
2018-04-05 11:22:47 -07:00
Alex Dadgar
12a8655dbd Scale heartbeat retrying based on remaining heartbeat time 2018-04-05 10:58:13 -07:00
Alex Dadgar
80c380b456 Fire retry only when consul discovers new servers 2018-04-05 10:40:17 -07:00
Alex Dadgar
46e6d70435 Spelling error 2018-04-03 18:30:01 -07:00
Alex Dadgar
12ec2e3b60 RPC Retry Watcher 2018-04-03 18:05:28 -07:00
Alex Dadgar
16ec4481e3 Improve Vault error handling 2018-04-03 14:29:22 -07:00
Chelsea Holland Komlo
aeb744d930 add clarifying comment 2018-03-29 10:58:39 -04:00
Chelsea Holland Komlo
dff03f6a91 emit first node event 2018-03-28 17:26:53 -04:00
Chelsea Holland Komlo
cdfeac13a1 specify driver health messages 2018-03-28 11:35:21 -04:00
Chelsea Holland Komlo
041786360e use time.Time for node events for compatibility 2018-03-27 15:43:57 -04:00
Chelsea Holland Komlo
9c143301de fix issue when updating node events 2018-03-21 15:15:26 -04:00
Chelsea Holland Komlo
06a306e460 improve comments; update watchDriver 2018-03-21 15:15:26 -04:00
Chelsea Holland Komlo
7b2ed01472 remove unused function 2018-03-21 15:15:26 -04:00
Chelsea Holland Komlo
8597da0c5d simplify logic
bump log level
2018-03-21 15:15:26 -04:00
Chelsea Holland Komlo
115d54cb19 fix up health check logic comparison; add node events to client driver checks 2018-03-21 15:15:26 -04:00
Chelsea Holland Komlo
8aefd294e1 Code review feedback 2018-03-21 15:15:26 -04:00
Alex Dadgar
3ad03d9498 notes from walk through 2018-03-21 15:15:26 -04:00
Chelsea Holland Komlo
8a0ed4e5ff improve tests 2018-03-21 15:15:26 -04:00
Chelsea Holland Komlo
fd25db9e1d updating comments; locking concurrent node access 2018-03-21 15:15:26 -04:00
Chelsea Holland Komlo
9dfb5c6b46 go style; update comments 2018-03-21 15:15:25 -04:00
Chelsea Holland Komlo
9ec5a93bc1 fix scheduler driver name; create node structs file 2018-03-21 15:15:25 -04:00
Chelsea Holland Komlo
a340baddf1 allow nomad to schedule based on the status of a client driver health check
Slight updates for go style
2018-03-21 15:15:25 -04:00
Chelsea Holland Komlo
1570972cb3 add concept of health checks to fingerprinters and nodes
fix up feedback from code review

add driver info for all drivers to node
2018-03-21 15:15:25 -04:00
Preetha Appan
dc183ad209 Fix spelling in comment 2018-03-14 15:54:25 -05:00
Alex Dadgar
4252ffe71c fix clearing node events 2018-03-14 09:48:59 -07:00
Chelsea Komlo
9cb3a02b0f Merge pull request #3945 from hashicorp/f-add-node-events
Add node events
2018-03-14 08:42:55 -04:00
Preetha
c4389cd61a Merge pull request #3968 from hashicorp/f-nicer-vault-error
Make server side error messages from vault more clearer
2018-03-13 20:49:39 -05:00
Alex Dadgar
8ba5ba33de small cleanup 2018-03-13 18:08:22 -07:00
Chelsea Holland Komlo
319f80907c code review feedback 2018-03-13 18:08:21 -07:00
Chelsea Holland Komlo
6eb9911caf code review feedback 2018-03-13 18:08:21 -07:00
Chelsea Holland Komlo
9ef6ad031a fix up go check warnings 2018-03-13 18:08:21 -07:00
Chelsea Holland Komlo
a72dcbfcdd add client side emitting of node events
Changelog
2018-03-13 18:08:21 -07:00
Preetha Appan
d83ad728db Address some code review comments 2018-03-13 18:19:16 -05:00
Preetha Appan
f4a4685d84 Return the err from server correctly 2018-03-13 18:10:14 -05:00
Preetha Appan
71e4061e0e Remove error wrapping and make vault connection server side errors clearer. 2018-03-13 17:09:03 -05:00
Alex Dadgar
42e9fe1b57 Merge pull request #3890 from hashicorp/b-heartbeat
Heartbeat improvements and handling failures during establishing leadership
2018-03-12 14:41:59 -07:00
Josh Soref
0afd7f8d56 spelling: transition 2018-03-11 19:06:05 +00:00
Josh Soref
c297d03315 spelling: thresholds 2018-03-11 19:03:47 +00:00
Josh Soref
c10723cd5a spelling: server 2018-03-11 18:55:30 +00:00
Josh Soref
25eb073964 spelling: registry 2018-03-11 18:41:13 +00:00