The Pipeline

From raw reads to publication-ready results.

Your FASTQ files pass through five configurable phases, each producing detailed reports, interactive visualizations, and downloadable outputs you can use directly in your research papers.

Step 01

Quality Control

fastp

Raw FASTQ files are assessed and trimmed with fastp. Low-quality bases, adapter sequences, and poly-G/X tails are removed. You can tune quality thresholds, minimum read length, and sliding-window parameters.

Outputs

Per-base quality scores
Read length distribution
Adapter content report
Passed/failed filter stats
Trimmed FASTQ files

Configurable parameters

Quality phred threshold
Minimum read length
Sliding window trimming
Poly-G / Poly-X trimming
Custom adapter sequences

Step 02

Preprocessing

fastp + custom filters

Cleaned reads are prepared for quantification. Paired-end files are validated, orphaned reads are handled, and files are organized into the structure downstream tools expect. This step ensures data integrity before alignment.

Outputs

Validated read pairs
Clean file manifests
Read count summaries

Configurable parameters

Pair validation mode
Orphan handling strategy

Step 03

Quantification

Salmon / STAR

Trimmed reads are quasi-mapped against a reference transcriptome using Salmon, or aligned with STAR for genome-level quantification. Gene-level expression counts are produced for every sample in your project.

Outputs

Gene-level counts (quant.sf)
Mapping rate per sample
Number of genes detected
Alignment statistics

Configurable parameters

Reference transcriptome
Quantification tool (Salmon / STAR)
Library type
Index build parameters

Step 04

Differential Expression

DESeq2

Expression counts from all samples are aggregated, normalized, and tested for differential expression using DESeq2. You assign samples to experimental groups (e.g. Control vs Treatment), and DESeq2 identifies genes with statistically significant expression changes.

Outputs

Results table (log2FC, p-adj, baseMean)
Volcano plot
MA plot
Heatmap of top DE genes
VST-normalized counts matrix
PCA plot

Configurable parameters

Reference group (Control)
Significance threshold (p-adj)
Log2 fold-change cutoff
Shrinkage estimator

Step 05

Pathway Analysis

GSEA / GO Enrichment

Differentially expressed gene lists are tested against curated pathway databases (MSigDB, Gene Ontology) using GSEA. The result is a map of which biological pathways and processes are activated or suppressed in your experiment.

Outputs

Enrichment score plots
GO enrichment table
Activation matrix heatmap
Leading edge gene lists
Downloadable results CSV

Configurable parameters

Gene set database
Permutation type
Minimum / maximum set size
Scoring metric

What you get at the end

After running the full pipeline, your project contains a complete, versioned record of every step, from the raw QC report through to pathway enrichment maps. Here's what you can view, customize, and download:

Volcano plots

MA plots

Heatmaps

PCA plots

Enrichment score curves

GO enrichment tables

Activation matrices

VST count matrices

Results CSVs

QC summary reports

Mapping statistics

Leading edge gene lists

Built for real research workflows

Beyond the core pipeline, NotchBio gives you the tools to manage, compare, and share your work.

Versioned pipeline runs

Every run records its exact parameters. Re-run with tweaked settings and compare results side by side without spreadsheet tracking.

Interactive visualizations

Volcano plots, MA plots, heatmaps, PCA, and enrichment maps are fully interactive. Zoom, hover, filter, and export figures directly from the browser.

Publication-ready exports

Download results as CSV, TSV, or high-resolution images. Every table and plot is formatted for direct inclusion in papers and supplementary materials.

Cell line & group management

Organize samples into experimental groups, assign cell lines, and track metadata. All downstream analysis respects your grouping automatically.

Re-run variations

Try different quality thresholds, quantification tools, or DE parameters. Each variation is stored as a separate version and nothing is overwritten.

SRA / GEO imports

Import public datasets from NCBI SRA, SRP, or GSE accession IDs with one click. Samples are downloaded and ready for analysis in minutes.

Start analyzing in minutes

Upload your FASTQ files or import from SRA, configure your parameters, and launch the pipeline. No server setup, no command line.

Get started free View pricing