Skip to content
New issue

Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? # to your account

OTU sequence #17

Open
ZarulHanifah opened this issue Sep 5, 2017 · 3 comments
Open

OTU sequence #17

ZarulHanifah opened this issue Sep 5, 2017 · 3 comments

Comments

@ZarulHanifah
Copy link

Hey NINJA-OPS developers,

Thank you for this wonderful software. It is very quick and simple.

May I ask, does NINJA actually produce an OTU representative fasta file? Because I need to construct a phylogenetic tree based on the OTU sequences (for phylogenetic calculation stuff in QIIME)

@GabeAl
Copy link
Owner

GabeAl commented Sep 5, 2017 via email

@ZarulHanifah
Copy link
Author

Thank you very much for the prompt reply!

Now I see; so I am familiar with QIIME, but only for the downstream analysis part (For my previous work, I used UParse for OTU clustering). I only noticed just now that the OTU ID was named from the aligned GreenGenes reference sequences as well.

From print_qiime_config.py, all the default files have been listed out. But I didnt see any reference tree. Should I extract the reference OTUs found in my samples and make a tree, or is there a reference tree that I should be using? If there is a GreenGene reference tree, can you share it here? That would be awesome.

Also, I side-question, if I am planning to do open-reference OTU picking, I assume that I just have to run NINJA, then extract the sequences from ninja_fail.txt and do de-novo OTU picking from there?

Thank you so much :)

@GabeAl
Copy link
Owner

GabeAl commented Sep 7, 2017

Sure, here's the tree! 👍

97_otus.zip

Yes, that's one reasonable way to do closed ref. If there are too many failures, I'd lower the id to 95% or so (the -s or --similarity option in NINJA-OPS) and that'll hopefully pare down the failures to a small enough number for reasonable de novo on them. Be sure to merge the OTU tables using QIIME's command (one mistake I've seen is to "cat" the files together directly).

A more complicated way would be to start the open_ref pipeline with some other aligner, but replace the first stage with NINJA-OPS. The open-ref pipeline in QIIME does some funky things with resampling and re-aligning to the random subsample, so it's a little different than pure de novo. I think if you have the computational resources, pure de novo on the failures might be as good or better in some cases... my conjecture.

Cheerio!

# for free to join this conversation on GitHub. Already have an account? # to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants