-
Notifications
You must be signed in to change notification settings - Fork 8
akutils format_database
Usage (order is important!!):
akutils format_database <input_fasta> <input_taxonomy> <input_primers> <read_length> <output_directory> <input_phylogeny>
This script will take an input reference fasta, an associated taxonomy file already formatted for use with QIIME, and a pair of primers (degenerate OK) and produces four pairs of fasta and taxonomy files as outputs:
- read1 (for use with read 1 only)
- read2 (for use with read 2 only)
- amplicon (for use with joined data)
- composite (combines the other three outputs for maximum completeness)
Tree file:
If a phylogeny is supplied (tree file), an associated tree will be filtered for each pair of taxonomy and fasta files.
Primers file:
<input_primers> must be formatted for Primer Prospector and contain no more than two primers. Example:
515f GTGCCAGCMGCCGCGGTAA
806r GGACTACHVGGGTWTCTAAT
Notes: Primer suffixes (f and r) are essential. Primer sequences are supplied 5->3 prime. Database fasta must be correctly oriented with respect to primer direction. Input files can have only ONE "." character immediately preceeding the file extension or this workflow will fail.
Example:
akutils format_database greengenes_97repset.fasta greengenes_97tax.txt 515-806.txt 150 16S_v4_db
The example will take representative sequences and associated taxonomy file for greengenes97 and use the primer file 515-806.txt to produce a set of database files for use with v4 amplicons generated with 515f and 806r in paired end 2x150 mode. Output will be placed in a directory called 16S_v4_db.
Citations:
Please cite Primer Prospector if you find this utility useful.
PrimerProspector: de novo design and taxonomic analysis of PCR primers (2011). William A. Walters, J. Gregory Caporaso, Christian L. Lauber, Donna Berg-Lyons, Noah Fierer, and Rob Knight. Bioinformatics 27(8):1159-1161.
Script wiki pages:
akutils
akutils align_and_tree
akutils check
akutils check_result
akutils configure
akutils core diversity
akutils format_database
akutils join_paired_reads
akutils phix_filtering
akutils pick_otus
akutils primer_file
akutils primer_list
akutils print_config
akutils strip_primers
akutils test
akutils test_result
akutils update
ancomR.sh
biom-summarize_folder.sh
biomtotxt.sh
concatenate_fastqs.sh
convert_table_for_ancomR.sh
fastq_data.sh
fasta_length_histogram.sh
fastq_length_histogram.sh
filter_fasta_by_length.sh
filter_fastq_by_length.sh
filter_observations_by_sample.sh
indicator_species.sh
ITSx_parallel.sh
mapcats.sh
preprocess_otus_for_ghost-tree.sh
slurm_builder.sh
synthetic_index.sh
two-way_permanova.sh
txttobiom.sh
unwrap_fasta.sh
Tutorial pages:
akutils tutorial
ITS analysis
slurm usage
Running akutils on monsoon
Example: 2x300, 2 loci
akutils wiki home:
akutils wiki home
akutils home page:
akutils home page
akutils github page:
akutils github page