Bacterial genome annotation workflow (v1.0)

This workflow uses assembled bacterial genome fasta files (but can be any fasta file) and executes the following steps:

Genomic annotation
- Bakta to predict CDS and small proteins (sORF)
Integron identification
- IntegronFinder2 to identify CALIN elements, In0 elements, and complete integrons
Plasmid gene identification
- Plasmidfinder to identify and typing plasmid sequences
Inserted sequence (IS) detection
- ISEScan to detect IS elements
Aggregating outputs into a single JSON file
- ToolDistillator to extract and aggregate information from different tool outputs to JSON parsable files

Inputs

Genomic annotation:
- genome annotation in tabular, gff and several other formats
- annotation plot
- nucleotide and protein sequences identified
- summary of genomic identified elements
Integron identification:
- integron identification in tabular format and a summary
Plasmid gene identification:
- plasmid gene identified and associated blast hits
Inserted Element (IS) detection:
- IS element list in tabular format
- is hits in fasta format
- ORF hits in protein and nucleotide fasta format
- IS annotation gff format
Aggregating outputs:
- JSON file with information about the outputs of Bakta, IntegronFinder2, Plasmidfinder, ISEScan

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
.github/workflows		.github/workflows
.dockstore.yml		.dockstore.yml
CHANGELOG.md		CHANGELOG.md
README.md		README.md
bacterial_genome_annotation-tests.yml		bacterial_genome_annotation-tests.yml
bacterial_genome_annotation.ga		bacterial_genome_annotation.ga
ro-crate-metadata.json		ro-crate-metadata.json