Page Contents
Workflows available
Assembly_Fetch
Augur
Concatenate_Column_Content
Core_Gene_SNP
Freyja Wastewater Analysis
Kraken2
kSNP3
Lyve_SET
MashTree_FASTA
Mercury_Prep_N_Batch
Pangolin Update
Rasusa
Snippy_Streamline
Snippy_Tree
Snippy_Variants
SRA_Fetch
Terra_2_GISAID
Terra_2_NCBI
TheiaCoV Genomic Characterization
TheiaEuk
TheiaMeta
TheiaProk Workflow Series
TheiaValidate
Usher_PHB
VADR_Update
Zip_Column_Content
Overview
Rasusa functions to randomly downsample the number of raw reads to a user-defined threshold.
π Use Cases:
- to reduce computing resources when samples end up with drastically more data than needed to perform analyses
- to perform limit of detection (LOD) studies to identify appropriate minimum coverage thresholds required to perform downstream analyses
π§ Desired size may be specified by inputting any one of the following:
- coverage (e.g. 20X)
- number of bases (e.g. β5mβ for 5 megabases)
- number of reads (e.g. 100000 total reads)
- fraction of reads (e.g. 0.5 samples half the reads)
Inputs
Input all String values (other than when selecting dropdown option Strings) in quotations, e.g. β5mβ
Required Inputs
Optional Inputs
Outputs
π‘ Donβt Forget! Remember to use the subsampled reads in downstream analyses with this.read1_subsampled
and this.read2_subsampled
inputs.
β
Verify reads were successfully subsampled before downstream analyses by comparing read file size/s to the original read file size/s
View file sizes by clicking on the read file listed in the Terra data table and looking at the file size
Terra Outputs
References
βοΈ [email protected] | X (formerly Twitter) | LinkedIn | π Website