Skip to content

a simple python script that takes in a FASTA formatted MSA and, given a set of 'reference' sequences, deletes any sequences that contribute to insertions given a fractional abundance

Notifications You must be signed in to change notification settings

Zheludev/MSAcleaner

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 

Repository files navigation

MSAcleaner

a simple python script that takes in a FASTA formatted MSA and, given a set of 'reference' sequences, deletes any sequences that contribute to insertions given a fractional abundance

the output is then an un-aligned FASTA file ready for re-alignment

example

python MSAcleaner.py -i input.msa -ref reference.seqIDs -fxn 0.01 -o output.aa

to do:

I think the fractional abundance counter needs checking, -fxn 0 seems to work, and low -fxn values are ok, but something goes wrong when there are more sequences? I think its to do with rounding errors

About

a simple python script that takes in a FASTA formatted MSA and, given a set of 'reference' sequences, deletes any sequences that contribute to insertions given a fractional abundance

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages