Page Contents

Workflows available

Concatenate_Column_Content

Freyja Wastewater Analysis

Mercury_Prep_N_Batch

Pangolin Update

Snippy_Streamline

Snippy_Variants

TheiaCoV Genomic Characterization

TheiaProk Workflow Series

Zip_Column_Content

Overview

Rasusa functions to randomly downsample the number of raw reads to a user-defined threshold.

📋 Use Cases:

to reduce computing resources when samples end up with drastically more data than needed to perform analyses
to perform limit of detection (LOD) studies to identify appropriate minimum coverage thresholds required to perform downstream analyses

🔧 Desired size may be specified by inputting any one of the following:

coverage (e.g. 20X)
number of bases (e.g. ”5m” for 5 megabases)
number of reads (e.g. 100000 total reads)
fraction of reads (e.g. 0.5 samples half the reads)

Inputs

Input all String values (other than when selecting dropdown option Strings) in quotations, e.g. “5m”

Required Inputs

Optional Inputs

Outputs

💡 Don’t Forget! Remember to use the subsampled reads in downstream analyses with this.read1_subsampled and this.read2_subsampled inputs.

✅ Verify reads were successfully subsampled before downstream analyses by comparing read file size/s to the original read file size/s

View file sizes by clicking on the read file listed in the Terra data table and looking at the file size

Terra Outputs

References

✉️ [email protected] | X (formerly Twitter) | LinkedIn | 🌐 Website