epitopeprediction using mhcflurry vs running directly mhcflurry #263

gianfilippo · 2025-01-28T19:19:07Z

Hi,

I ran your pipeline with MHCflurry as the predictor, and I also ran MHCflurry directly, on the same data.
For the direct MHCflurry run I used mhcflurry-predict, with alleles from the hlatyping Nextflow pipeline (RNA) and a FASTA file from PrecisionProDB (generated from my VCFs) as input.

How do you generate peptides from variants (or proteins)?

I am not very familiar with this kind of analysis, and I am trying to understand all the steps involved.

Thanks

gianfilippo · 2025-01-28T23:26:22Z

UPDATE:
I reran the Nextflow pipeline, this time using the .pergeno.protein_changed.fa files from PrecisionProDB as input instead of the VCF files. The FASTA files contain about 200 sequences each on average.
This way the input data is the same for the Nextflow pipeline and for MHCflurry, and the tool-specific threshold is also the same, 500.
I see about one order of magnitude more predictions with epitopeprediction than with MHCflurry in 6 of my 8 samples.
For two of the samples there are no predictions.
From what I can see, starting with protein FASTA files as input produces an extra folder, generated_peptides, with peptide predictions. The number of predicted peptides there is very large, which is consistent with the number of final predictions.

I am a bit puzzled at this point, since if I start with VCFs, I end up with 2000-3000 predictions per sample, while if I start with about 200 changed proteins, I end up with about 40000-50000 predictions.
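One possible explanation for a gap like this, sketched under assumptions that are not confirmed anywhere in this thread: if protein FASTA input is cut into every overlapping peptide of each length, while VCF input only yields peptides that span a variant position, the protein route produces far more candidates. The function names, the peptide lengths 8 to 11, and the toy sequence below are all hypothetical and are not taken from the pipeline.

```python
def sliding_peptides(protein, lengths=(8, 9, 10, 11)):
    """Return every overlapping peptide of the given lengths (assumed protein-input behaviour)."""
    return [
        protein[i:i + k]
        for k in lengths
        for i in range(len(protein) - k + 1)
    ]


def variant_peptides(protein, variant_pos, lengths=(8, 9, 10, 11)):
    """Return only peptides covering the 0-based variant position (assumed VCF-input behaviour)."""
    peptides = []
    for k in lengths:
        # First and last start index whose window still contains variant_pos.
        start_min = max(0, variant_pos - k + 1)
        start_max = min(variant_pos, len(protein) - k)
        for i in range(start_min, start_max + 1):
            peptides.append(protein[i:i + k])
    return peptides


# Toy protein; real proteins are much longer, so the gap grows with length.
toy_protein = "ACDEFGHIKLMNPQRSTVWY" * 10
all_windows = sliding_peptides(toy_protein)
around_variant = variant_peptides(toy_protein, variant_pos=100)
print(len(all_windows), len(around_variant))
```

Under these assumptions the number of variant-spanning peptides stays fixed per variant, while the number of sliding-window peptides grows with protein length, so a few hundred full-length proteins can easily yield many more candidates than a VCF.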

Could you please help me understand what I am missing?

Thanks

jonasscheid · 2025-02-03T07:54:33Z

Hi!
Apologies for the late response. I'm not very familiar with PrecisionProDB, but in the epitopeprediction pipeline you can also write out in silico mutated proteins based on VCF files by adding the flag --fasta_output. I think that is the easiest way to compare against the output of PrecisionProDB. Let me know if I can assist further.
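A minimal sketch of such a comparison, assuming both tools write plain protein FASTA files. The helper names are hypothetical, and sequences are compared by content rather than by header, since the two tools may label records differently.

```python
def read_fasta(lines):
    """Parse FASTA text lines into a dict mapping record ID to sequence."""
    records = {}
    header = None
    chunks = []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if header is not None:
                records[header] = "".join(chunks)
            # Use the first whitespace-delimited token of the header as the ID.
            header = line[1:].split()[0]
            chunks = []
        else:
            chunks.append(line)
    if header is not None:
        records[header] = "".join(chunks)
    return records


def compare_sequences(records_a, records_b):
    """Count sequences shared between two FASTA sets and unique to each."""
    seqs_a = set(records_a.values())
    seqs_b = set(records_b.values())
    return {
        "shared": len(seqs_a & seqs_b),
        "only_a": len(seqs_a - seqs_b),
        "only_b": len(seqs_b - seqs_a),
    }
```

To use it, open each file and pass the file handle to `read_fasta`, for example the FASTA written by the pipeline and the `.pergeno.protein_changed.fa` file from PrecisionProDB, then pass both results to `compare_sequences`. A large `only_a` or `only_b` count would suggest the two tools are not producing the same mutated proteins.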
