Wrong identity values in summary #506

EtienneBucher · 2020-12-01T12:42:55Z

I have the following instance of SequenceServer running:
SequenceServer 2.0.0.rc7 using BLASTN 2.10.1+
Strangely enough, perfect hits never reach 100% in the summary table of hits:

| | Query coverage (%) | Total score | E value | Identity (%) |
|---|---|---|---|---|
| rice-Chr2 | 100 | 34779 | 0 | 79 |

However in the alignment I get the right values:
a. Score: 7479.86 (8294), E value: 0, Identity: 4147/4147 (100%), Gaps: 0/4147 (0%), Strand: + / -

I have the same issue with at least two different databases.
Any idea what could be wrong in my setup?

Cheers,
Etienne

yeban · 2020-12-01T13:28:02Z

Hi Etienne. The table shows the average identity of all matches (alignments) to a database sequence. The value in the table would be 100% only if there were a single match to the database sequence and that match had 100% identity. Does that help?
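To illustrate how averaging can mask a perfect alignment, here is a minimal sketch. It is not SequenceServer's actual code: the names `hsps`, `percent_identity`, and `average_identity` are hypothetical, the averaging is assumed to be an unweighted mean of per-alignment percent identities, and only the first alignment (4147/4147) comes from the report above. The other two alignments are made up.

```python
from statistics import mean

# Hypothetical alignments (HSPs) against one database sequence.
# Only the first entry (4147/4147) is taken from the report; the rest are invented.
hsps = [
    {"identities": 4147, "length": 4147},
    {"identities": 300, "length": 400},
    {"identities": 120, "length": 200},
]


def percent_identity(hsp):
    """Percent identity of a single alignment."""
    return 100.0 * hsp["identities"] / hsp["length"]


def average_identity(hsps):
    """Assumption: unweighted mean of the per-alignment percent identities."""
    return mean(percent_identity(h) for h in hsps)


summary_identity = round(average_identity(hsps))
print(summary_identity)  # well below 100 despite one perfect alignment
```

Under these assumptions, the perfect 4147 bp alignment is averaged together with two weaker ones, so the summary value drops well below 100%.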

EtienneBucher · 2020-12-01T21:03:29Z

Thank you for your quick reply!
Is that the expected behaviour? For me it is quite confusing to see these results when I blast against complete genomes. This may hide an excellent hit on a chromosome due to many other bad hits on the same chromosome.
Is there a way to have all the hits listed separately? Meaning that in the table I could have several Chr1 hits listed (e.g. a,b,c) to more easily navigate to the hits that may be of interest?

yeban · 2020-12-04T15:24:29Z

> Is that the expected behaviour? For me it is quite confusing to see these results when I blast against complete genomes. This may hide an excellent hit on a chromosome due to many other bad hits on the same chromosome.

I think that's fair. Let me see how we can best fix that in the next week or so.

> Is there a way to have all the hits listed separately? Meaning that in the table I could have several Chr1 hits listed (e.g. a,b,c) to more easily navigate to the hits that may be of interest?

No. Would this still be important to you if the table showed the identity of the best hit? Just trying to understand.

yeban · 2021-01-11T09:43:00Z

Sorry, I got wrapped up working on my thesis. I am going to need 3-4 more weeks before I can resume work on sequenceserver. Thanks for being patient.

yannickwurm · 2021-05-24T12:35:36Z

Hi, just to follow up: I agree that we should revert to the NCBI default of showing the %id of the top HSP for that database sequence (for example, because we wouldn't want the score of the best hit to be pulled down by a diverged homolog).
This way the info we show for %id is also consistent with the info we show for evalue and bitscore.
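A rough sketch of that behaviour, under stated assumptions: the `Hsp` class, `top_hsp`, and `summary_row` are hypothetical names, the top HSP is assumed to be the one with the highest bit score, and the second alignment is invented for illustration. This is not the code from the fixing commit.

```python
from dataclasses import dataclass


@dataclass
class Hsp:
    bit_score: float
    evalue: float
    identities: int
    length: int


def top_hsp(hsps):
    """Assumption: the top HSP is the one with the highest bit score."""
    return max(hsps, key=lambda h: h.bit_score)


def summary_row(hsps):
    """Take E value, bit score and %id from the same (top) HSP."""
    best = top_hsp(hsps)
    return {
        "evalue": best.evalue,
        "bit_score": best.bit_score,
        "identity_pct": round(100.0 * best.identities / best.length),
    }


hsps = [
    # Values from the alignment quoted in the report.
    Hsp(bit_score=7479.86, evalue=0.0, identities=4147, length=4147),
    # Hypothetical diverged homolog on the same database sequence.
    Hsp(bit_score=210.5, evalue=1e-50, identities=300, length=400),
]

row = summary_row(hsps)
print(row["identity_pct"])
```

Because all three summary values come from the same alignment, a weaker second alignment no longer affects the identity shown for the database sequence.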

yeban closed this as completed in 30ae761 May 26, 2021
