dockstore-cgpwxs
provides a complete multi threaded WXS analysis for SNV and INDEL variants with
associated annotation of VCF files. This has been packaged specifically for use with the
Dockstore.org framework.
Master | Develop |
---|---|
- Supported input formats
- Usable Cores
- Other uses
- Verifying your deployment
- Development environment
- Release process
- LICENCE
Required input files are
Note: cram and csi/crai will not work via Dockstore/CWL yet but direct exec of scripts/ds-cgpwxs.pl
will allow use of these.
- Tumour BAM/CRAM file (plus bai/csi/crai + bas)
- Normal BAM/CRAM file (plus bai/csi/crai + bas)
- Core reference archive (e.g. core_ref_GRCh37d5.tar.gz)
- WXS reference archive (e.g. SNV_INDEL_ref_GRCh37d5.tar.gz)
- VAGrENT (annotation) reference archive (e.g. VAGrENT_ref_GRCh37d5_ensembl_75.tar.gz)
Inputs 1&2 are expected to have been mapped using dockstore-cgpmap
The data linked in the 'examples' area is a tiny section of chr21 from the cell line COLO-829. This will generate a small number of INDEL and SNV variants. All the SNV variants do not pass filtering (several only fail as not in a gene footprint, 'F009').
Please check the Wiki then raise an issue if you require additional information on how to generate your own reference files. Much of this information is available on the individual algorithm pages.
When running outside of a docker container you can set the number of CPUs via:
export CPU=N
-cores|-c
option ofds-cgpwxs.pl
If not set detects available cores on system.
All of the tools installed as part of dockstore-cgpmap and the above packages are available for direct use.
export CGPWXS_VER=X.X.X
docker pull quay.io/wtsicgp/dockstore-cgpwxs:$CGPWXS_VER
docker --rm -ti [--volume ...] quay.io/wtsicgp/dockstore-cgpwxs:$CGPWXS_VER bash
The resulting docker container has been tested with Singularity. The command to exec is:
ds-cgpwxs.pl -h
Expected use would be along the lines of:
export CGPWXS_VER=X.X.X
singularity pull docker://quay.io/wtsicgp/dockstore-cgpwxs:$CGPWXS_VER
singularity exec\
--workdir /.../workspace \
--home /.../workspace:/home \
--bind /.../ref/human:/var/spool/ref:ro \
--bind /.../example_data/cgpwxs:/var/spool/data:ro \
dockstore-cgpwxs-${CGPWXS_VER}.simg \
ds-cgpwxs.pl \
-reference /var/spool/ref/core_ref_GRCh37d5.tar.gz \
-annot /var/spool/ref/VAGrENT_ref_GRCh37d5_ensembl_75.tar.gz \
-snv_indel /var/spool/ref/SNV_INDEL_ref_GRCh37d5-fragment.tar.gz \
-tumour /var/spool/data/COLO-829_ex.cram \
-tidx /var/spool/data/COLO-829_ex.cram.crai \
-normal /var/spool/data/COLO-829-BL_ex.cram \
-nidx /var/spool/data/COLO-829-BL_ex.cram.crai \
-exclude NC_007605,hs37d5,GL%
For a system automatically attaching all local mount points (not default singularity behaviour)
you need not specify any exec
params (workdir, home, bind) but you should specify the -outdir
option for ds-cgpwxs.pl
to prevent data being written to your home directory.
By default results are written to the home directory of the container so ensure you bind
a large volume and set the -home
variable. As indicated above the location can be overridden
via the options of ds-cgpwxs.pl
The examples/
tree contains test json files populated with data that can be used to verify the
tool. More details can be found here.
This project uses git pre-commit hooks. Please enable them to prevent inappropriate large files being included. Any pull request found not to have adhered to this will be rejected and the branch will need to be manually cleaned to keep the repo size down.
Activate the hooks with
git config core.hooksPath git-hooks
This project is maintained using HubFlow.
- Make appropriate changes
- Build image locally
- Run all example inputs and verify any changes are acceptable
- Update
expected/
tree with output of last step - Bump version in
Dockerfile
andDockstore.cwl
- Push changes
- Check state on Travis
- Generate the release (add notes to GitHub)
- Confirm that image has been built on quay.io
- Update the [dockstore][dockstore-cgpwgs] entry, see their docs.
Copyright (c) 2016-2018 Genome Research Ltd.
Author: CASM/Cancer IT <cgphelp@sanger.ac.uk>
This file is part of dockstore-cgpwxs.
dockstore-cgpwxs is free software: you can redistribute it and/or modify it under
the terms of the GNU Affero General Public License as published by the Free
Software Foundation; either version 3 of the License, or (at your option) any
later version.
This program is distributed in the hope that it will be useful, but WITHOUT
ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS
FOR A PARTICULAR PURPOSE. See the GNU Affero General Public License for more
details.
You should have received a copy of the GNU Affero General Public License
along with this program. If not, see <http://www.gnu.org/licenses/>.
1. The usage of a range of years within a copyright statement contained within
this distribution should be interpreted as being equivalent to a list of years
including the first and last year specified and all consecutive years between
them. For example, a copyright statement that reads ‘Copyright (c) 2005, 2007-
2009, 2011-2012’ should be interpreted as being identical to a statement that
reads ‘Copyright (c) 2005, 2007, 2008, 2009, 2011, 2012’ and a copyright
statement that reads ‘Copyright (c) 2005-2012’ should be interpreted as being
identical to a statement that reads ‘Copyright (c) 2005, 2006, 2007, 2008,
2009, 2010, 2011, 2012’."