Skip to content
New issue

Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? # to your account

Cidder secondary clustering #5

Closed
jsgounot opened this issue Jul 8, 2024 · 3 comments
Closed

Cidder secondary clustering #5

jsgounot opened this issue Jul 8, 2024 · 3 comments

Comments

@jsgounot
Copy link

jsgounot commented Jul 8, 2024

Hi,

thanks for your tools. I was wondering if you could implement a second clustering approach for Cidder as well, similar to the --determine-clusters option on Skder.

Thanks!
JS

@raufs
Copy link
Owner

raufs commented Jul 8, 2024

Thank you for the suggestion! Yes, of course, should be straightforward, will work on it for the next release.

Rauf

@raufs
Copy link
Owner

raufs commented Jul 12, 2024

@jsgounot Sorry for the delay, NCBI's GenBank database has been down this week and so was waiting on it to be back up to further test, but seems to work well on the test dataset and have added both a secondary clustering to find the best matching reference genome(s) for a focal genome based both on protein cluster containment (-n) or alternatively via skani ANI (-ns).

Also, I shifted the arguments a little with this latest release.

Version just released and should be on bioconda later today.

@jsgounot
Copy link
Author

Hey @raufs, thanks for the update! I'll try this out soon.

@raufs raufs closed this as completed Jul 13, 2024
# for free to join this conversation on GitHub. Already have an account? # to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants