-
Notifications
You must be signed in to change notification settings - Fork 9
New issue
Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? # to your account
Turn your computer into lxplus with a single command #20
Comments
Hi, I've been very interested in getting a lxplus like environment on top of cern/slc6-base. My start was this: https://github.com/lukasheinrich/hepsw-docker/tree/master/lxpluslike image at: https://hub.docker.com/r/lukasheinrich/lxplus-like/ basically I tried to snapshot what yum has installed on lxplus and tried to install the same. The same steps could be either done on top of hepsw/cvmfs-lhcb (I think @sbinet retrired the binet/* images) or on top of plain scl6-base with the option to mount cvmfs at runtime via
So I would love to see this explored more. Maybe some people have good contacts to LXPLUS people? (@pherterich ? ) |
Yeah, maybe we can come up with some default images people might want to use and write some instructions on how they can be used to do interesting things.
Wouldn't this require you to set up |
yes this will require a cvmfs setup on the host. But the advantage is that you don't need to run the container in privileged mode. Also, the docker image itself doesn't need to know anything about cvmfs which keeps the images more lightweight. |
btw this we use exactly this /cvmfs mounting + custom lightweight docker images for RECAST. On top of the /cvmfs mount people can also request to have a /afs mounted (I'm working on getting /eos to work). Finally some workflow steps need GRID authentication, so for those we mount a path |
I have been fiddling with |
I think it's a good idea to start from |
👍 on starting with @lukasheinrich: Do you mean this RECAST? http://recast.perimeterinstitute.ca/ |
I just found https://github.com/hepsw/docks |
@ibab yes that's the right website but it's the old version and we (@cranmer, me, and a couple of other people) have been making good progress on docker-based workflows for analysis (i.e. you can have separate docker images for each workflow step). If you have LHCb Analysis Code, it would be great to try this out! @adavidzh I also have a similar repository as @sbinet which I would like to merge into the hepsw organization that @sbinet setup, but I haven't gotten to it yet. Check this out: |
@lukasheinrich: Sounds good! Some people at LHCb are experimenting with automated analysis workflows (like producing all plots starting from some initial dataset). |
that would be great. Can you put me in touch with them? |
Do you have access to the lhcb-collaborative-working@cern.ch egroup? |
no it seems closed to LHCb members only. By the way, I have been in touch with @anaderi on including LHCb into RECAST. Maybe we can work form there? I would also be happy to present a short overview of what we already have in our infrastructure to the group (in case you have regular meetings) |
@lukasheinrich maybe you can discuss on one of the next collaborative-working meetings? |
that would be great. I'll include you in an email thread that we have had for a while, maybe we can discuss this further there. |
Looks to me that LHCb is having a lot of good fun. |
hey @adavidzh, somewhat similar to ATLAS, but we're getting there. We are in touch with a couple of CMS people as well (Ken Bloom, Mike Hildreth..) so maybe we can get something going. It seems like I will be giving a talk next Monday for the LHCb group, maybe you can get one-time access as well? PS: i'll include you in the same email thread as the others |
Getting back to docker for analysis: we should have a lesson on this! Showing people how easy it is to get going and have a |
Well, in CMS whatever is in cvmfs can also be installed in standalone (I got a working |
for
so, the basic one with just this is detailed in a CHEP paper: https://inspirehep.net/record/1413180?ln=en just to give an idea on the sizes of these images:
and for the
( AFAIK, there is no tool to extract the meaningful set of files under a |
So my experience in ATLAS is that we can get away with a reasonable image sizes at least for our analysis software by installing everything within Docker without /cvmfs access. This is easier built purely on top of ROOT or using a bare bones Gaudi/Athena based release. For running actual reconstruction type jobs, I only tried by mounting /cvmfs at runtime. That works well, but @betatim is right that it creates mutability. One reference I found for cvmfs versioning is here: see slide 7. It seems like CVMFS might have something like a On a Mac, mounting /cvmfs is a bit trickier. The docker-machine runs with user-permissions, while the /cvmfs mountpoint is root-owned, which does not work well together. To get around, I create a user-owned mount ponit in my home directory, manually mount cvmfs on that and then bind Cheers, |
For CMS, by chaining the RPM installation in a single Is there a better (leaner, closer to |
@adavidzh feel free to try this is on top of slc6-base and has most yum packages instaleld that LXPLUS has. Though this is certainly not lean :-/ |
@adavidzh indeed, the containers I was playing around with not optimal as their size can get up to 19 GB. However, if you need CMSSW on a computer that does not have network all the time, That said, |
@lukasheinrich: starting from 11 GB is not promising ;) (the CMS software comes in at ~6 GB). |
isn't http://docker.cern.ch/howtopr supposed to tackle this size-on-the-hub issue? |
I did not know CERN had their own registry. That it good news. |
I guess there is a somewhat irreduciple compromise that we have to make. if you want to have something LXPLUS like, meaining to be able to get stuff from cmvfs and expect it to work, you somehow depend on having system libraries installed that these cvmfs software depends on (think: getting a ROOT release, which depends on libX11 etc being installed, etc). Of course it would be a bit nicer if we could get meaningful images that only have a subset of those installed to keep the image size down. But then this will only work with a subset of cvmfs software that this image was tailored to. |
@kreczko yes, they are working on this. Also they are preparing a Google Container Enging-like service that you will able to control with Docker Swarm and Kubernetes. But this is still in a early development phase. |
@sbinet docker.cern.ch is really cool. Thanks! |
@adavidzh interesting, so CMS maintains its own list of low-level system libraries it needs (stuff I had to deal with e.g. include various openssl libs etc)? |
Hi everyone, I just gave a short presentation to the ATLAS analysis software group, and got some useful input / suggestions. One thing that has been traditionally hard to do from e.g. a Mac or Windows was to get access to the Grid. One solution is of course to have cvmfs installed, bind this to docker (analogously to the discussion above), and setup the Grid middleware from there. But to have a nicer encapsulation, I managed to get a minimal image on top of The example for now is ATLAS (panda is the ATLAS job submission interface), but the first couple of steps in the Dockerfile should be generalizable to other experiments https://github.com/lukasheinrich/asg-docker/blob/master/preasg-base/Dockerfile with that I can nicely submit Grid jobs from a Mac (just bind-mount a directory that has your certificate and key in it) Maybe people from the other expts could try getting their Grid layers to work? Cheers, |
Sooooo, what's the situation with this lesson? |
Do you already have someone for May 14th? |
Nothing scheduled. Also, I think the Friday is 13 May. :) |
Yes, Friday the 13th 😨 |
Sooooo, shall we schedule this? :D |
Let's wait a bit to see if someone wants to help. |
Ok. :) |
hi @ibab, yes I would definitely be interested in helping out. should we try to skype or something in the next week or so? Cheers, |
Yes, next week is good! |
Hey @ibab, @lukasheinrich: Just confirming that we're on for this Friday. Please let me know! Thanks. :) |
Date: TBD
Time: TBD
Location: TBD
Vidyo for remote participants (just click on the link to proceed): CERN_OpenScience
Session format: Work-Along
Lesson guide: @ibab, @lukasheinrich
Lesson materials: Coming…
Slides: Coming…
If you haven't done so yet, introduce yourself (#26) and list your project(s) (#24).
Original post:
Using docker, we can easily get access to tools available on cvmfs by running:
(Example for lhcb)
This allows you to easily use and work on your collaboration's software on your laptop, workstation or server.
This could also allow you to make workflows that depend on custom physics software reproducible.
(Just include a Dockerfile or script that calls docker in your repo)
Would someone be interested in learning more about this in one of the study groups?
Edit: Changed
sbinet
→hepsw
The text was updated successfully, but these errors were encountered: