Overview

The TheiaProk workflows are for the assembly, quality assessment, and characterization of bacterial genomes. There are currently four TheiaProk workflows designed to accommodate different kinds of input data:

  1. Illumina paired-end sequencing (TheiaProk_Illumina_PE)
  2. Illumina single-end sequencing (TheiaProk_Illumina_SE)
  3. ONT sequencing (TheiaProk_ONT)
  4. Genome assemblies (TheiaProk_FASTA)
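
As a quick illustration of how the four workflows line up with input data, here is a minimal Python sketch. The workflow names are taken from the list above; the input-type labels and the `choose_workflow` helper are hypothetical and are not part of TheiaProk or Terra.

```python
# Hypothetical lookup: the workflow names come from the list above, while the
# input-type keys are illustrative labels, not official identifiers.
WORKFLOW_BY_INPUT = {
    "illumina_paired_end": "TheiaProk_Illumina_PE",
    "illumina_single_end": "TheiaProk_Illumina_SE",
    "ont": "TheiaProk_ONT",
    "assembly": "TheiaProk_FASTA",
}


def choose_workflow(input_type: str) -> str:
    """Return the TheiaProk workflow name for a given input-type label."""
    key = input_type.strip().lower()
    if key not in WORKFLOW_BY_INPUT:
        known = ", ".join(sorted(WORKFLOW_BY_INPUT))
        raise ValueError(f"Unknown input type {input_type!r}; expected one of: {known}")
    return WORKFLOW_BY_INPUT[key]


if __name__ == "__main__":
    for label in WORKFLOW_BY_INPUT:
        print(f"{label} -> {choose_workflow(label)}")
```

Only the workflow names on the right-hand side are drawn from this page; the actual selection is made when importing and configuring a workflow.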

In the TheiaProk Illumina and ONT workflows, all input reads are first processed by “core tasks” that perform read trimming and assembly appropriate to the input data type. TheiaProk workflows then launch default genome characterization modules for quality assessment, species identification, antimicrobial resistance gene detection, sequence typing, and more. For certain identified taxa, “taxa-specific sub-workflows” are activated automatically and carry out additional characterization steps specific to that taxon. When setting up each workflow, users may also enable “optional tasks” as additions or alternatives to the tasks run by default.

Inputs

Core tasks (performed for all taxa)

Taxon-specific tasks

Outputs
