puppet code for managing Mozilla's Snakepit cluster
- puppet
  - future: worker: set environment for proxy
  - future: manage users on hosts
  - future: head: proxy installation & configuration
- package configuration
  - test updating instances
    - e.g. start with 11-4 and upgrade to 11-5
- test infrastructure
  - circleci: get apt caching working again
  - circleci: cache bolt modules
Snakepit (the scheduler, https://github.com/mozilla/snakepit) only gives jobs write access to their job directory, the user's directory, and any group directories the user is a member of. A shared read-only space is also available.
/data
  /ro
    /shared (contents of mlchead:/snakepit/shared/)
  /rw
    /group-GROUP (any groups you're in, contents of mlchead:/snakepit/groups/GROUP)
    /home (user dir, mlchead:/snakepit/home/USER)
    /pit (job dir, mlchead:/snakepit/pits/ID)
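A quick way to sanity-check the layout from inside a running job (illustrative only; the paths follow the tree above):

# writable locations for this job
touch /data/rw/pit/scratch-file
touch /data/rw/home/scratch-file
# read-only space: this should fail with a permission or read-only error
touch /data/ro/shared/should-fail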
Slurm doesn't do any access control. If the slurm unix user can write to a directory, every job will be able to write to it.
/data
  /ro (contents of mlchead:/snakepit/shared)
  /rw (contents of mlchead:/data/rw)
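In practice that means any job can write wherever the slurm user can, for example (assuming srun is available on a login node):

# every job shares the slurm user's permissions on /data/rw
srun --pty bash -c 'touch /data/rw/written-by-any-job && ls -l /data/rw/written-by-any-job'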
I had hoped to use bolt for the masterless convergence, but it doesn't provide debug output the way `puppet apply` does, and its `--noop` usage isn't obvious.
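For reference, this is the kind of invocation I wanted (a sketch; the manifest path is an assumption about this repo's layout):

# dry-run with full debug output, which bolt doesn't surface as readily
sudo puppet apply --noop --debug --hiera_config ./hiera.yaml manifests/site.pp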
test-kitchen is the best place to start. vagrant is easier for rapid iteration and for testing the provisioner script.
vagrant testing is lower level than test-kitchen and allows more realistic testing of the provisioning and convergence process.
vagrant up worker
# ssh into the 'head' or 'worker' instance
vagrant ssh worker
# once in the vagrant node
cd /vagrant
# puppet apply convergence
#
# set role
echo slurm_worker | sudo tee /etc/puppet_role
# uses main branch
sudo /vagrant/provisioner/converge_worker.sh
# override for testing
sudo PUPPET_REPO=https://github.com/aerickson/snakepit_puppet.git PUPPET_BRANCH=work_1 /vagrant/provisioner/converge_worker.sh
# create fake vault.yaml file
sudo touch /etc/puppetlabs/environments/production/data/secrets/vault.yaml
# re-run the converge script now that vault.yaml is in place
sudo /vagrant/provisioner/converge_worker.sh
# for head, the process is similar
echo slurm_head | sudo tee /etc/puppet_role
sudo /vagrant/provisioner/converge_head.sh
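For rapid iteration, re-running the converge script in place is usually enough; to start over from a clean node (machine names as above):

# from the host machine, outside the vagrant node
vagrant destroy -f worker
vagrant up worker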
# bolt convergence (alternative method)
#
# run one of the following
# to converge a worker node
sudo bolt plan run roles::worker_converge hosts=localhost --verbose --log-level debug
# to converge the host as a head node
sudo bolt plan run roles::head_converge hosts=localhost --verbose --log-level debug
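If a plan name doesn't resolve, bolt can list the plans it sees in this repo and show a plan's parameters:

bolt plan show
bolt plan show roles::worker_converge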
test-kitchen automates the testing of roles and integrates inspec tests for verification.
test-kitchen is used for testing in CI, so the test-kitchen worker configuration does a partial converge for speed (see modules/roles/manifests/slurm_worker_post.pp).
# initial setup
brew install puppet-bolt # or equivalent
bundle install
bolt module install # install 3rd party modules to .modules
# converge head and worker roles
bundle exec kitchen converge
# you can also specify a specific kitchen target to converge/verify/etc
# bundle exec kitchen converge worker
# run integration tests
bundle exec kitchen verify
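When finished, the standard kitchen lifecycle commands apply:

# tear the instances down, or run a full create/converge/verify/destroy cycle
bundle exec kitchen destroy
bundle exec kitchen test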
Part of the challenge of using NVIDIA cards and CUDA in a container is that the versions of the software on the bare metal and the container need to be in sync (https://docs.nvidia.com/datacenter/cloud-native/container-toolkit/concepts.html#background).
NVIDIA has a solution (NVIDIA Container Toolkit) that allows the versions to not match exactly, but it requires newer NVIDIA GPUs (Kepler and newer, https://docs.nvidia.com/datacenter/cloud-native/container-toolkit/install-guide.html#platform-requirements). Snakepit's GPUs are from a previous generation, so we can't use this solution.
We solve this in our slurm environment by using singularity's GPU mode (https://sylabs.io/guides/3.5/user-guide/gpu.html).
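As a rough sketch of what that mode looks like (the image tag here is illustrative, not what our jobs actually use), singularity's --nv flag bind-mounts the host's NVIDIA driver libraries into the container:

# --nv injects the host driver libraries so the container's CUDA userspace
# can talk to the host GPU
singularity exec --nv docker://nvidia/cuda:11.5.0-base-ubuntu20.04 nvidia-smi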
NVIDIA's recommended installation process is to install the `cuda` or `cuda-11-5` metapackages (https://docs.nvidia.com/cuda/cuda-installation-guide-linux/index.html). These metapackages only loosely specify the versions to use and float to the latest published packages. This causes issues when trying to keep hosts in sync (if they're installed or created at different points in time).
$ apt-cache show cuda-11-5
...
Depends: cuda-runtime-11-5 (>= 11.5.0), cuda-toolkit-11-5 (>= 11.5.0), cuda-demo-suite-11-5 (>= 11.5.50)
...
To remedy this problem, we find out which packages the metapackage installs. Once the packages have been identified, we craft an install script that installs all of the constituent packages (those with cuda or nvidia in the name; the full dependency list is very large). The install script is then used to install the required packages on the bare-metal hosts and in any containers.
Full BOMs (`dpkg --list` output) of the before and after state are also captured.
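A rough sketch of the idea (the rake tasks below do the real work; the awk filter here is an approximation):

# capture a BOM before, enumerate the pinned cuda/nvidia packages the
# metapackage would install, then capture a BOM after installing them
dpkg --list > bom_before.txt
apt-get install --dry-run cuda-11-5 \
  | awk '/^Inst/ { gsub(/[()]/, "", $3); print $2 "=" $3 }' \
  | grep -E 'cuda|nvidia' > cuda_packages.txt
# ... install the packages listed in cuda_packages.txt ...
dpkg --list > bom_after.txt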
These steps install a lot of packages, so caching will speed things up dramatically. Any caching web proxy on port 8123 will work; I like polipo (available in Homebrew).
#!/usr/bin/env bash
set -e
mkdir -p /tmp/cache/polipo
polipo -c polipo.conf
# polipo.conf contents
diskCacheRoot=/tmp/cache/polipo
logLevel=4
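To actually use the proxy from inside a vagrant node, point apt at the host (10.0.2.2 is the default VirtualBox NAT address of the host machine; adjust for your setup):

echo 'Acquire::http::Proxy "http://10.0.2.2:8123";' | sudo tee /etc/apt/apt.conf.d/01proxy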
rake pkg_config_create
# inspect the output in
# modules/moz_slurm/create_package_configuration/boms/ and paste it into
# modules/moz_slurm/testing_package_configs/install_packages.sh
rake pkg_config_test
# things should complete without errors and
# `nvidia-smi` should be present (but won't work yet).
puppet lookup --hiera_config ./hiera.yaml --explain slurm::cluster_name
- puppet module used for slurm
- NVIDIA's best practices for SLURM deployment
In the test-kitchen head node, run /usr/sbin/create-munge-key, overwrite the existing key, and then base64 it. In a Python interpreter:
import base64

with open("/etc/munge/munge.key", "rb") as key_file:
    print(base64.b64encode(key_file.read()).decode())
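An equivalent shell one-liner, assuming GNU coreutils base64 is available on the node:

sudo base64 -w0 /etc/munge/munge.key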