QBD

QBD is a system to Query By Data.

Find all patients with mutations in a set of genes {X}.

Annotate the variants based on their frequency in a control population, likely impact on function, etc.

Return the list of patients and iPSC lines

(I imagine this would be a search box on the website, and expect it would be very valuable for people looking for iPSC lines -- it's related to your example I.3)
Run all the coding mutations through some or all of the following tools:
- SIFT
- PolyPhen
- GERP++
- Condel
- CADD
- fathmm
- MutationTaster
- MutationAssessor
- GESPA
- REVEL
(This list was stolen from a review: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5433009/ )

Generate a table of the results suitable for further processing
a. Find all non-coding variants in each patient that overlap with that patient's ATAC-Seq peaks

b. Generate a consensus set of ATAC-Seq peaks and find the non-coding variants for all patients.
a. Use output from filters like those in cases 2 and 3 to generate a table of patients vs. variants.

b. Cluster patients by similarity in variants

c. Run dimensionality reduction algorithms on this table

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
.dockerignore		.dockerignore
.gitignore		.gitignore
Dockerfile		Dockerfile
README.md		README.md
Requirements.ipynb		Requirements.ipynb
TODO.md		TODO.md
Test.ipynb		Test.ipynb
again.sh		again.sh
connect.sh		connect.sh
test.sh		test.sh
test2.sh		test2.sh

Provide feedback