mirror of https://github.com/kemko/nomad.git synced 2026-01-03 00:45:43 +03:00

Go to file

Michael Schurter e6fd2583fa client: always wait 200ms before sending updates

Always wait 200ms before calling the Node.UpdateAlloc RPC to send
allocation updates to servers.

Prior to this change we only reset the update ticker when an error was
encountered. This meant the 200ms ticker was running while the RPC was
being performed. If the RPC was slow due to network latency or server
load and took >=200ms, the ticker would tick during the RPC.

Then on the next loop only the select would randomly choose between the
two viable cases: receive an update or fire the RPC again.

If the RPC case won it would immediately loop again due to there being
no updates to send.

When the update chan receive is selected a single update is added to the
slice. The odds are then 50/50 that the subsequent loop will send the
single update instead of receiving any more updates.

This could cause a couple of problems:

1. Since only a small number of updates are sent, the chan buffer may
   fill, applying backpressure, and slowing down other client
   operations.
2. The small number of updates sent may already be stale and not
   represent the current state of the allocation locally.

A risk here is that it's hard to reason about how this will interact
with the 50ms batches on servers when the servers under load.

A further improvement would be to completely remove the alloc update
chan and instead use a mutex to build a map of alloc updates. I wanted
to test the lowest risk possible change on loaded servers first before
making more drastic changes.

2020-11-25 11:36:51 -08:00

.circleci

proto: Switch to using buf (#9308 )

2020-11-17 07:01:48 -08:00

.github

remove local SECURITY.md in favor of org-wide policy

2020-08-24 15:41:28 -07:00

.netlify

Remove most Netlify configuration (#6194 )

2019-08-22 15:54:23 -05:00

acl

added new policy capabilities for recommendations API

2020-10-28 14:32:16 +00:00

api

Merge pull request #9352 from hashicorp/f-artifact-headers

2020-11-13 14:04:27 -06:00

client

client: always wait 200ms before sending updates

2020-11-25 11:36:51 -08:00

command

command: remove -namespace from help options when not applicable

2020-11-19 16:28:39 -05:00

contributing

docs: add contributor docs for issue labels (#8723 )

2020-08-24 10:19:57 -04:00

demo

hclfmt digitalocean demo to pass linting (#9353 )

2020-11-13 14:16:15 -05:00

dev

build: use hashicorp hclfmt

2020-05-24 18:31:57 -05:00

devices/gpu/nvidia

nvidia: support disabling the nvidia plugin (#8353 )

2020-07-21 10:11:16 -04:00

dist

dist: make README consistent with service unit (#7648 )

2020-04-08 09:32:03 -04:00

drivers

Merge pull request #8291 from shishir-a412ed/cpusets

2020-11-11 17:13:27 -05:00

e2e

e2e: test template path interpolation

2020-11-18 10:48:58 -08:00

helper

Send events to EventSinks (#9171 )

2020-10-26 17:27:54 -04:00

integrations

spelling: registrations

2018-03-11 18:40:53 +00:00

internal/testing/apitests

tests: non-CAS should be updated

2020-06-26 10:48:33 -04:00

jobspec

added new policy capabilities for recommendations API

2020-10-28 14:32:16 +00:00

jobspec2

appease deadcode linter

2020-11-12 11:44:49 -05:00

lib

deps: Switch to Go modules for dependency management

2020-06-02 14:30:36 -05:00

nomad

nomad: try to avoid slice resizing when batching

2020-11-24 09:14:00 -08:00

plugins

proto: Switch to using buf (#9308 )

2020-11-17 07:01:48 -08:00

scheduler

scheduler: enable upgrade path for bridge network finger print

2020-11-13 14:17:01 -06:00

scripts

proto: Switch to using buf (#9308 )

2020-11-17 07:01:48 -08:00

terraform

Use latest AMI for Ubuntu Xenial based on search (#9076 )

2020-10-14 11:01:54 -04:00

testutil

nomad operator debug - add client node filtering arguments (#9331 )

2020-11-12 11:25:28 -05:00

tools

proto: Switch to using buf (#9308 )

2020-11-17 07:01:48 -08:00

ui/csi: fix links to volume IDs (#9355 )

2020-11-13 15:44:34 -05:00

vendor

vendor: sync api/tasks for poststop hook

2020-11-16 11:28:02 -05:00

version

s/0.13/1.0/g

2020-10-14 15:17:47 -07:00

website

Merge pull request #9407 from hashicorp/docs-0129-backports

2020-11-20 09:09:47 -08:00

.gitattributes

Remove invalid gitattributes

2018-02-14 14:47:43 -08:00

.gitignore

ignore vagrant directory even if symlinked (#8114 )

2020-06-04 10:24:15 -04:00

.golangci.yml

chore: Switch from gometalinter to golangci-lint

2019-12-05 18:58:13 -06:00

build_linux_arm.go

Fix 32bit arm build

2017-02-09 11:22:17 -08:00

CHANGELOG.md

docs: add 0.12.9, 0.11.8, and 0.10.9 to changelog

2020-11-19 14:23:42 -08:00

GNUmakefile

proto: Switch to using buf (#9308 )

2020-11-17 07:01:48 -08:00

go.mod

Api/event stream payload values (#9277 )

2020-11-05 13:04:18 -05:00

go.sum

Api/event stream payload values (#9277 )

2020-11-05 13:04:18 -05:00

LICENSE

Initial commit

2015-06-01 12:21:00 +02:00

main_test.go

Adding initial skeleton

2015-06-01 13:46:21 +02:00

main.go

add helper commands for debugging state

2020-08-31 08:45:59 -04:00

README.md

doc: Simplify "Contributing" section of README (#9378 )

2020-11-17 11:20:38 -08:00

Vagrantfile

proto: Switch to using buf (#9308 )

2020-11-17 07:01:48 -08:00

README.md

Nomad

Overview

Nomad is an easy-to-use, flexible, and performant workload orchestrator that deploys:

Nomad enables developers to use declarative infrastructure-as-code for deploying their applications (jobs). Nomad uses bin packing to efficiently schedule jobs and optimize for resource utilization. Nomad is supported on macOS, Windows, and Linux.

Nomad is widely adopted and used in production by PagerDuty, CloudFlare, Roblox, Pandora, and more.

Deploy Containers and Legacy Applications: Nomad’s flexibility as an orchestrator enables an organization to run containers, legacy, and batch applications together on the same infrastructure. Nomad brings core orchestration benefits to legacy applications without needing to containerize via pluggable task drivers.
Simple & Reliable: Nomad runs as a single binary and is entirely self contained - combining resource management and scheduling into a single system. Nomad does not require any external services for storage or coordination. Nomad automatically handles application, node, and driver failures. Nomad is distributed and resilient, using leader election and state replication to provide high availability in the event of failures.
Device Plugins & GPU Support: Nomad offers built-in support for GPU workloads such as machine learning (ML) and artificial intelligence (AI). Nomad uses device plugins to automatically detect and utilize resources from hardware devices such as GPU, FPGAs, and TPUs.
Federation for Multi-Region, Multi-Cloud: Nomad was designed to support infrastructure at a global scale. Nomad supports federation out-of-the-box and can deploy applications across multiple regions and clouds.
Proven Scalability: Nomad is optimistically concurrent, which increases throughput and reduces latency for workloads. Nomad has been proven to scale to clusters of 10K+ nodes in real-world production environments.
HashiCorp Ecosystem: Nomad integrates seamlessly with Terraform, Consul, Vault for provisioning, service discovery, and secrets management.

Getting Started

Get started with Nomad quickly in a sandbox environment on the public cloud or on your computer.

These methods are not meant for production.

Documentation & Guides

Documentation is available on the Nomad website here. Guides are available on HashiCorp Learn website here.

Resources

Website
- www.nomadproject.io
Mailing List
- Google Groups
Gitter
- Nomad Chat Room

Who Uses Nomad

Roblox
- How Roblox built a platform for 100 million players with Nomad (2020)
- How Roblox runs a platform for 70 million gamers on Nomad (2019)
Cloudflare
- How We Use HashiCorp Nomad (2020)
BetterHelp
- How the world's largest online therapy provider runs on Nomad (2020)
Navi Capital
- How Nomad powers a $1B hedge fund in Brazil (2020)
Trivago
- Maybe You Don’t Need Kubernetes (2019)
- Nomad - Our Experiences and Best Practices (2019)
Reaktor
- Nomad: Kubernetes, but without the complexity (2019)
Pandora
- How Pandora Uses Nomad (2019)
CircleCI
- How CircleCI Processes 4.5 Million Builds Per Month (2019)
- Security & Scheduling are Not Your Core Competencies (2018)
Q2
- Q2’s Nomad Use and Overview (2019)
Citadel
- End-to-End Production Nomad at Citadel (2017)
- Extreme Scaling with HashiCorp Nomad & Consul (2016)
Deluxe Entertainment
- How Deluxe Uses the Complete HashiStack for Video Production (2018)
Jet.com (Walmart)
- Driving down costs at Jet.com with HashiCorp Nomad (2017)
PagerDuty
- PagerDuty’s Nomadic Journey (2017)
SAP Ariba
- HashiCorp Nomad @ SAP Ariba (2018)
Target
- Nomad at Target: Scaling Microservices Across Public and Private Clouds (2018)
- Playing with Nomad from HashiCorp (2017)
Oscar Health
- Scalable CI at Oscar Health with Nomad and Docker (2018)
eBay
- HashiStack at eBay: A Fully Containerized Platform Based on Infrastructure as Code (2018)
Dutch National Police
- Going Cloud-Native at the Dutch National Police (2018)
N26
- Tech at N26 - The Bank in the Cloud (2018)
Elsevier
- Eslevier’s Container Framework with Nomad, Terraform, and Consul (2017)
Graymeta
- Backend Batch Processing At Scale with Nomad (2017)
NIH NCBI
- NCBI’s Legacy Migration to Hybrid Cloud with Consul & Nomad (2018)
imgix
- Cluster Schedulers & Why We Chose Nomad Over Kubernetes (2017)

...and more!

Contributing

See the contributing directory for more developer documentation.

Developing with Vagrant

A development environment is supplied via Vagrant to make getting started easier.

Install Vagrant
Install Virtualbox

Bring up the Vagrant project

$ git clone https://github.com/hashicorp/nomad.git
$ cd nomad
$ vagrant up

The virtual machine will launch, and a provisioning script will install the needed dependencies within the VM.

Developing without Vagrant

Install Go 1.15.5+ (Note: gcc-go is not supported)

Clone this repo

$ git clone https://github.com/hashicorp/nomad.git
$ cd nomad

Bootstrap your environment
```
$ make bootstrap
```
(Optionally) Set a higher ulimit, as Nomad creates many file handles during normal operations
```
$ [ "$(ulimit -n)" -lt 1024 ] && ulimit -n 1024
```
Verify you can run tests
```
$ make test
```

Running a development build

Compile a development binary (see the UI README to include the web UI in the binary)
```
$ make dev
# find the built binary at ./bin/nomad
```
Start the agent in dev mode
```
$ sudo bin/nomad agent -dev
```
(Optionally) Run Consul to enable service discovery and health checks
1. Download Consul
2. Start Consul in dev mode
```
$ consul agent -dev
```

Compiling Protobufs

If in the course of your development you change a Protobuf file (those ending in .proto), you'll need to recompile the protos.

Install Buf
Compile Protobufs
```
$ make proto
```

Building the Web UI

See the UI README for instructions.

Create a release binary

To create a release binary:

$ make prerelease
$ make release
$ ls ./pkg

This will generate all the static assets, compile Nomad for multiple platforms and place the resulting binaries into the ./pkg directory.

API Compatibility

Only the api/ and plugins/ packages are intended to be imported by other projects. The root Nomad module does not follow semver and is not intended to be imported directly by other projects.

Languages

Go 76.9%

MDX 11%

JavaScript 8.2%

Handlebars 1.7%

HCL 1.4%

Other 0.7%

README.md Unescape Escape

Nomad

Overview

Getting Started

Documentation & Guides

Resources

Who Uses Nomad

Contributing

Developing with Vagrant

Developing without Vagrant

Running a development build

Compiling Protobufs

Building the Web UI

Create a release binary

API Compatibility

README.md