Analysis Type: Whole Genome Analysis

Whole Genome Sequencing Analysis Software

VarSeq carries whole genome sequencing data from FASTQ to a signed clinical report: genome-scale variant analysis, structural and copy number detection, and non-coding interpretation in one validated platform.

Population Scale Ready
Sentieon DNAscope
BYOC & Air-Gapped Deployment

A Scalable, Unified Pipeline from FASTQ to Clinical Report

WGS interpretation software must handle more than just variant calling. Our unified platform integrates high-performance secondary analysis with clinical-grade tertiary analysis, delivering a seamless transition from raw data to actionable insights.

01

High-Performance Secondary Analysis

Powered by Sentieon's optimized implementation of GATK models, achieve mathematically equivalent results with massive throughput gains. Sentieon DNAscope provides improved accuracy through enhanced active region detection, powerful local assembly, and pre-trained machine learning models for both short-read and long-read sequencers.

Sentieon Details
02

Multi-Threaded WGS Analysis

Import and process genome-scale VCFs with multi-threaded performance. VarSeq handles the unique data scale of WGS without the performance bottlenecks found in legacy tools.

Analysis Workflow
03

Clinician-Ready Reporting

One-click generation of customizable clinical reports. Standardize your results with ACMG/AMP scoring, literature links, and lab-specific branding.

Reporting Details
WGS Filter Chain
4,892,140
Zygosity (Current) is (Heterozygous, Homozygous)
3,241,087
Genotype Qualities (GQ) (Current) ≥ 30
2,987,412
gnomAD AF < 0.01 OR Missing
128,491
Coding Impact OR CI-SpliceAI ≥ 0.5
2,847
Warehouse Cohort AF < 0.005
614
ClinVar & ACMG Auto-Classification
Benign412
Likely Benign156
VUS38
Likely Pathogenic1
Pathogenic2
Final variants
3

Full-Spectrum Genomic Analysis

WGS analysis requires more than just SNV calling. VarSeq provides a unified environment for detecting and interpreting the full range of variation at genome scale. For a deep dive into how WGS works, what variant types it detects, and how it compares to exome sequencing, see our whole genome sequencing guide.

SV & CNV Interpretation

Automate the interpretation of complex structural variants and copy number changes. Apply ACMG/ClinGen guidelines to ensure consistent classification across whole genome datasets.

CNV Interpretation

Pharmacogenomics (PGx)

Automate star-allele calling and drug-gene interpretation. Leverage CPIC guidelines to deliver actionable pharmacogenomic insights directly from WGS data.

PGx Capabilities

Short & Long-Read Support

Analyze data from any sequencing technology. VarSeq provides a unified platform for both short-read and long-read WGS, resolving complex regions with ease.

Long-Read Solutions
VSWarehouse
Enterprise Dashboard
Online
Genomes
102,847
+3,241 this month
Reports
28,419
+891 this month
ClinVar Changes
12
Flagged for review
Assessment Catalogs
WGS — Rare Disease24,891 variantsActive
Population Screening8,214 variantsActive
PGx Diplotypes648 assessmentsActive
5 Workspaces · 42 Active Users
SSO Enabled RBAC Active

Optimized for Population Scale & Enterprise Security

Managing genome-scale data requires an enterprise-grade infrastructure. VSWarehouse centralizes your variant assessments, clinical reports, and longitudinal data into a secure, searchable repository — whether deploying on-premise, in a private cloud (BYOC), or air-gapped environment.

  • Population-Aware Interpretation

    Generate cohort-level allele frequencies from thousands of genomes to inform clinical classification.

  • VSPipeline Automation

    Full automation from FASTQ to report, including CNV computation and cohort management.

  • Flexible Deployment

    Deploy anywhere to meet compliance requirements: cloud, on-prem, or hybrid.

Explore VSWarehouse

"Whole genome sequencing is emerging as a first-tier diagnostic test for rare genetic diseases."

Austin-Tse et al., Medical Genome Initiative

npj Genomic Medicine, 2022

Genome Scale Insights & Webcasts

Bioinformatics experts demonstrate advanced genome analysis and population scale interpretation.

Featured Articles

All Genome Articles

On-Demand Webcasts

View All Genome Webcasts

Frequently Asked Questions

Whole genome sequencing analysis software: variant scope, data scale, non-coding interpretation, and population-scale programs.

What is whole genome sequencing analysis software?

Whole genome sequencing analysis software refers to the computational tools used to process, interpret, and report findings from WGS data. The analysis pipeline operates in two stages. Secondary analysis software handles the transformation of raw sequencing reads (FASTQ files) into variant calls: alignment, duplicate marking, base quality recalibration, and variant calling across SNVs, indels, CNVs, and structural variants. Tertiary analysis software then annotates those variants against clinical databases, filters to clinically relevant candidates, applies ACMG or AMP classification criteria, and generates the signed clinical report. At genome scale, typically 4 to 5 million raw variant calls per sample, the tertiary analysis layer is where the diagnostic work happens, and it requires purpose-built clinical software rather than research-grade tools adapted for clinical use.

How does WGS analysis differ from exome analysis at the software level?

The core annotation, filtering, and classification steps are similar, but WGS creates three distinct software challenges that exome analysis does not. First, data scale: a whole genome produces 4 to 5 million variant calls versus 25,000 to 40,000 for an exome, requiring multi-threaded processing and efficient database indexing to remain performant. Second, variant scope: WGS captures the full spectrum of genomic variation including non-coding variants, repeat expansions, mitochondrial variants, and structural variants that most exome capture kits miss, so the software must handle all of these in a single analysis session. Third, storage and management: WGS files are 10 to 30x larger than exome files, making enterprise data management infrastructure (variant assessment catalogs, longitudinal tracking, cohort frequency databases) a prerequisite rather than an optional add-on.

What variant types can be detected and interpreted from WGS data?

WGS supports detection and interpretation of the full spectrum of genomic variation: single nucleotide variants (SNVs), small insertions and deletions (indels), copy number variants (CNVs) including whole-chromosome aneuploidies and sub-microscopic events, structural variants (SVs) and gene fusions, repeat expansions in tandem repeat loci, mitochondrial DNA variants, and non-coding regulatory variants. This breadth is the primary clinical advantage of WGS over targeted panels and exome sequencing. Cases that remain unresolved after panel or exome testing often carry the causative variant in a region that was not sequenced or not interpretable with the smaller assay. VarSeq handles all of these variant classes within a single analysis session, without requiring separate tools for structural variants or CNVs.

How does the software handle non-coding variant interpretation?

Non-coding variant interpretation is one of the most challenging aspects of clinical WGS analysis. The majority of the genome is non-coding, and the functional evidence for specific non-coding variants is far less developed than for coding variants. VarSeq addresses this through CI-SpliceAI integration (a deep learning model that predicts the impact of variants on splicing, including deep intronic variants that standard consequence predictors miss), alongside regulatory region annotations, conservation scores, and literature database integration via Mastermind (Genomenon). Variants in known regulatory elements, promoter regions, and established non-coding disease loci can be flagged and prioritized. Full non-coding interpretation remains an area of active development across the field; VarSeq provides the annotation infrastructure while the clinical evidence base continues to mature.

What does clinical-grade WGS analysis software require beyond research tools?

Research-grade WGS tools are designed for exploratory analysis, where flexibility and sensitivity take priority over reproducibility and auditability. Clinical-grade software has four additional requirements. Determinism: the same input must produce identical output every run to support CLIA validation. Annotation currency: clinical databases (ClinVar, gnomAD, ClinGen) update continuously, and the software must track which database version was used for every analysis to support result traceability. Audit trail: every variant assessment, filter configuration, and classification decision must be permanently logged for CAP inspection. And quality management: software deployed in an accredited clinical laboratory should be developed under a certified QMS with controlled release processes. VarSeq is developed under an ISO 13485-certified QMS; VarSeq Dx is CE marked under IVDR 2017/746.

Can VarSeq handle population-scale WGS programs?

Yes. VSWarehouse is designed for enterprise and population-scale WGS programs, storing variant assessments, diplotype classifications, and clinical reports longitudinally across tens of thousands of genomes. For national genomic programs and health system WGS initiatives, VSWarehouse supports cohort-level allele frequency generation (building an internal population frequency database from the program's own genomes), automated reclassification monitoring when external databases update classifications of previously assessed variants, and LIMS/EHR integration for structured result delivery. VSPipeline automation handles hands-off processing from FASTQ to report for high-throughput programs where manual case initiation is not operationally viable. Deployment options include on-premises, private cloud (BYOC), and air-gapped configurations to meet the data governance requirements of population-scale programs.

How does VarSeq support long-read WGS data?

VarSeq supports both short-read (Illumina) and long-read (PacBio HiFi, Oxford Nanopore) WGS data through the same tertiary analysis framework. Long-read sequencing resolves regions that are systematically difficult for short-read platforms: complex structural variants, tandem repeats, segmental duplications, and highly homologous gene families like SMN1/SMN2 and CYP2D6. Sentieon supports long-read alignment and variant calling, with output feeding directly into VarSeq for annotation and clinical interpretation. For programs where short-read WGS leaves a diagnostic gap, particularly in rare disease cases with negative short-read results, long-read WGS with the same downstream interpretation platform eliminates the need to rebuild the analysis workflow for the new assay type.

Master the Full Spectrum of Genomic Variation

Unlock the potential of whole genome sequencing for your laboratory with VarSeq's enterprise-scale genomic analysis software.

WGS Scale
Non-Coding Interpretation
ISO 13485 Certified QMS