Right now the Lighthouse is running on a $4/month DigitalOcean droplet. If you'd like to be added, please let us know and share your public SSH key.
The lighthouse is primarily responsible for keeping track of all nodes in the swarm.
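
To make "keeping track of all nodes" concrete, here is a minimal sketch of the bookkeeping involved. It is not Nebula's actual implementation: the `NodeRegistry` class, its method names, the 30-second timeout, and the example addresses are all hypothetical, chosen only to illustrate mapping a node's overlay IP to the public endpoint it was last seen at.

```python
import time
from dataclasses import dataclass, field


@dataclass
class NodeRegistry:
    """Hypothetical in-memory node registry (illustration only, not Nebula code)."""

    timeout_s: float = 30.0  # assumed: a node is considered gone after this long
    # overlay IP -> (public endpoint, time of last heartbeat)
    _last_seen: dict = field(default_factory=dict)

    def heartbeat(self, overlay_ip, public_endpoint, now=None):
        """Record that a node checked in from the given public endpoint."""
        now = time.monotonic() if now is None else now
        self._last_seen[overlay_ip] = (public_endpoint, now)

    def lookup(self, overlay_ip, now=None):
        """Return the node's last known endpoint, or None if unknown or stale."""
        now = time.monotonic() if now is None else now
        entry = self._last_seen.get(overlay_ip)
        if entry is None:
            return None
        endpoint, seen = entry
        if now - seen > self.timeout_s:
            return None
        return endpoint

    def live_nodes(self, now=None):
        """Return the sorted overlay IPs of nodes seen within the timeout."""
        now = time.monotonic() if now is None else now
        return sorted(
            ip
            for ip, (_, seen) in self._last_seen.items()
            if now - seen <= self.timeout_s
        )
```

A client would call `heartbeat` periodically, and any node wanting to reach a peer would ask `lookup` for that peer's current endpoint.
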
The idea of the setup is that a user:
- Clicks a button to request a client
- Receives the client from us
- Runs the client, which connects them to the swarm

The pieces involved are:
- A Lighthouse on DigitalOcean
- Nebula VPN for swarm communication
- Clients, currently shell scripts but soon to be Docker containers
- A fault-tolerant PyTorch job that is responsible for the actual training
- A public dashboard for sharing results
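
One common way to make a training job fault tolerant is periodic checkpointing with resume on restart. The sketch below shows that pattern in plain Python so it runs without PyTorch; it is an assumption about how such a job could work, not the project's actual training code. The function names, the JSON checkpoint format, and the `fail_at` parameter (which simulates a node dying) are all hypothetical. In a real PyTorch job the saved state would typically be the model and optimizer `state_dict()`s written with `torch.save`.

```python
import json
import os
import tempfile


def save_checkpoint(path, state):
    """Write to a temp file, then rename, so a crash never leaves a partial file."""
    directory = os.path.dirname(os.path.abspath(path))
    fd, tmp_path = tempfile.mkstemp(dir=directory)
    with os.fdopen(fd, "w") as f:
        json.dump(state, f)
    os.replace(tmp_path, path)  # atomic replacement of the previous checkpoint


def load_checkpoint(path):
    """Return the saved state, or a fresh initial state if no checkpoint exists."""
    if not os.path.exists(path):
        return {"step": 0, "weight": 0.0}
    with open(path) as f:
        return json.load(f)


def train(path, total_steps, checkpoint_every=10, fail_at=None):
    """Resume from the last checkpoint and train up to total_steps."""
    state = load_checkpoint(path)
    while state["step"] < total_steps:
        if fail_at is not None and state["step"] == fail_at:
            raise RuntimeError("simulated node failure")
        state["weight"] += 0.5  # stand-in for a real optimizer step
        state["step"] += 1
        if state["step"] % checkpoint_every == 0:
            save_checkpoint(path, state)
    save_checkpoint(path, state)
    return state
```

If a run dies partway through, calling `train` again with the same path picks up from the most recent checkpoint; only the steps since that checkpoint are repeated.
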