Skip to content
New issue

Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? # to your account

[WIP] explore all overlapping genomes, not just minimum metagenome cover #124

Open
wants to merge 21 commits into
base: latest
Choose a base branch
from

Conversation

ctb
Copy link
Member

@ctb ctb commented Dec 21, 2021

This PR explores the prefetch results (containing all overlapping genomes), as a way of beginning to evaluate strain specificity of the minimum metagenome cover.

Specifically,

  • adds a target map_prefetch that maps the sample reads to all overlapping genomes, not just the minimum set;
  • adds variant calling and SNP estimation in order to look at variation rates (incl ANI) between the metagenome and its various reference genomes.

relevant: #18

Next things to do -


variant calling on gather results

The SNP stuff is already pretty illuminating -- as expected, most but not all of the mock community members have very few SNPs, while the real communities ...do.

podar mock

snps-SRR606249

zymo mock

snps-SRR12324253

human gut real

snps-p8808mo11

oil well real

snps-SRR1976948

@ctb ctb changed the title [WIP] explore prefetch results [WIP] explore all overlapping genomes, not just minimum metagenome cover Jan 1, 2022
@ctb ctb changed the base branch from latest to add/vcf February 15, 2022 15:02
Base automatically changed from add/vcf to latest February 16, 2022 12:26
ctb added a commit that referenced this pull request Feb 16, 2022
* steal VCF code from #124

* fix n_snps calculation
# for free to join this conversation on GitHub. Already have an account? # to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant