Skip to content
New issue

Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? # to your account

Wrong identity values in summary #506

Closed
EtienneBucher opened this issue Dec 1, 2020 · 5 comments
Closed

Wrong identity values in summary #506

EtienneBucher opened this issue Dec 1, 2020 · 5 comments

Comments

@EtienneBucher
Copy link

I have the following instance of Sequencesever running:
SequenceServer 2.0.0.rc7 using BLASTN 2.10.1+
Stangely enough, perfect hits never reach 100% in the summary table of hits:

Query coverage (%) | Total score | E value | Identity (%)

  1. | rice-Chr2 | 100 | 34779 | 0 | 79

However in the alignment I get the right values:
a. Score: 7479.86 (8294), E value: 0, Identity: 4147/4147 (100%), Gaps: 0/4147 (0%), Strand: + / -

I have the same issue with at least two different databases.
Any idea what could be wrong in my setup?

Cheers,
Etienne

@yeban
Copy link
Collaborator

yeban commented Dec 1, 2020

Hi Etiene. The table shows average identity of all matches (alignments) to a database sequence. The value in table would be 100% if there is only one match to the database sequence with identity of 100%. Does that help?

@EtienneBucher
Copy link
Author

Thank you for your quick reply!
Is that the expected behaviour? For me it is quite confusing to see these results when I blast against complete genomes. This may hide an excellent hit on a chromosome due to many other bad hits on the same chromosome.
Is there a way to have all the hits listed separately? Meaning that in the table I could have several Chr1 hits listed (e.g. a,b,c) to more easily navigate to the hits that may be of interest?

@yeban
Copy link
Collaborator

yeban commented Dec 4, 2020

Is that the expected behaviour? For me it is quite confusing to see these results when I blast against complete genomes. This may hide an excellent hit on a chromosome due to many other bad hits on the same chromosome.

I think that's fair. Let me see how we can best fix that in the next week or so.

Is there a way to have all the hits listed separately? Meaning that in the table I could have several Chr1 hits listed (e.g. a,b,c) to more easily navigate to the hits that may be of interest?

No. Would this still be important to you if the table showed the identity of the best hit? Just trying to understand.

@yeban
Copy link
Collaborator

yeban commented Jan 11, 2021

Sorry, I got wrapped up working on my thesis. I am going to need a 3-4 more weeks before I can resume work on sequenceserver. Thanks for being patient.

@yannickwurm
Copy link
Member

Hi just to follow-up - I agree that we should revert to the NCBI default of showing the %id for the top HSP for that database sequence. (this is for example, because we wouldn't want the score of the best hit to be pulled down by a diverged homolog)
This way the info we show for %id is also consistent with the info we show for evalue and bitscore.

@yeban yeban closed this as completed in 30ae761 May 26, 2021
# for free to join this conversation on GitHub. Already have an account? # to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants