The RabbitMQ GPU scheduler is a system that enables deep learning model experiments to be centrally managed through message brokers.
- Improve experimental productivity by managing multiple GPU servers through message brokers
- No message is lost even if the consumer or producer shuts down
- Prioritizing messages (model experiments)
- Send message processing logs and notification of errors to Discord Channel
apt-get update
apt-get install rabbitmq-server
service rabbitmq-server start
rabbitmq-plugins enable rabbitmq_management
rabbitmqctl add_user admin 'password'
rabbitmqctl set_user_tags admin administrator
rabbitmqctl set_permissions -p / admin ".*" ".*" ".*"
rabbitmqctl delete_user guest
Create an account to use for the broker connection and delete the default guest account.
pip install pika
pip install python-dotenv
python -B producer.py -c config.yaml -p 1
-c : files to send
-p : message priority (default=0)
pip install pika
pip install python-dotenv
# In addition, libraries required for worker or model train
python -B consumer.py