This workflow uses assembled bacterial genome fasta files (but can be any fasta file) and executes the following steps:
- Genomic annotation
- Bakta to predict CDS and small proteins (sORF)
- Integron identification
- IntegronFinder2 to identify CALIN elements, In0 elements, and complete integrons
- Plasmid gene identification
- Plasmidfinder to identify and typing plasmid sequences
- Inserted sequence (IS) detection
- ISEScan to detect IS elements
- Aggregating outputs into a single JSON file
- ToolDistillator to extract and aggregate information from different tool outputs to JSON parsable files
- Assembled bacterial genome in fasta format.
- Genomic annotation:
- genome annotation in tabular, gff and several other formats
- annotation plot
- nucleotide and protein sequences identified
- summary of genomic identified elements
- Integron identification:
- integron identification in tabular format and a summary
- Plasmid gene identification:
- plasmid gene identified and associated blast hits
- Inserted Element (IS) detection:
- IS element list in tabular format
- is hits in fasta format
- ORF hits in protein and nucleotide fasta format
- IS annotation gff format
- Aggregating outputs:
- JSON file with information about the outputs of Bakta, IntegronFinder2, Plasmidfinder, ISEScan