VarSeq carries whole genome sequencing data from FASTQ to a signed clinical report: genome-scale variant analysis, structural and copy number detection, and non-coding interpretation in one validated platform.
WGS interpretation software must handle more than just variant calling. Our unified platform integrates high-performance secondary analysis with clinical-grade tertiary analysis, delivering a seamless transition from raw data to actionable insights.
Powered by Sentieon's optimized implementation of GATK models, achieve mathematically equivalent results with massive throughput gains. Sentieon DNAscope provides improved accuracy through enhanced active region detection, powerful local assembly, and pre-trained machine learning models for both short-read and long-read sequencers.
Sentieon DetailsImport and process genome-scale VCFs with multi-threaded performance. VarSeq handles the unique data scale of WGS without the performance bottlenecks found in legacy tools.
Analysis WorkflowOne-click generation of customizable clinical reports. Standardize your results with ACMG/AMP scoring, literature links, and lab-specific branding.
Reporting DetailsWGS analysis requires more than just SNV calling. VarSeq provides a unified environment for detecting and interpreting the full range of variation at genome scale. For a deep dive into how WGS works, what variant types it detects, and how it compares to exome sequencing, see our whole genome sequencing guide.
Automate the interpretation of complex structural variants and copy number changes. Apply ACMG/ClinGen guidelines to ensure consistent classification across whole genome datasets.
CNV InterpretationAutomate star-allele calling and drug-gene interpretation. Leverage CPIC guidelines to deliver actionable pharmacogenomic insights directly from WGS data.
PGx CapabilitiesAnalyze data from any sequencing technology. VarSeq provides a unified platform for both short-read and long-read WGS, resolving complex regions with ease.
Long-Read SolutionsManaging genome-scale data requires an enterprise-grade infrastructure. VSWarehouse centralizes your variant assessments, clinical reports, and longitudinal data into a secure, searchable repository — whether deploying on-premise, in a private cloud (BYOC), or air-gapped environment.
Generate cohort-level allele frequencies from thousands of genomes to inform clinical classification.
Full automation from FASTQ to report, including CNV computation and cohort management.
Deploy anywhere to meet compliance requirements: cloud, on-prem, or hybrid.
npj Genomic Medicine, 2022
WGS captures the full spectrum of variation — coding, non-coding, and structural — to diagnose cases that remain unresolved after exome or panel testing.
Whole genome sequencing reveals the complete landscape of somatic variation including structural rearrangements, gene fusions, and non-coding driver mutations.
Enterprise-scale WGS programs require automated pipelines, population-level cohort management, and robust infrastructure for high-throughput genome interpretation.
VarSeq provides a comprehensive ecosystem for whole genome sequencing in clinical and research laboratories.
Integrated clinical-grade CNV calling from genome read depths. Resolve large-scale structural events.
Resolve complex SVs and tricky genes with PacBio HiFi and Oxford Nanopore long-read sequencing.
High-performance visualization of genome-scale BAM and VCF files with interactive annotation overlays.
VSPipeline automation from FASTQ to clinical report for hands-off high-throughput WGS processing.
Bioinformatics experts demonstrate advanced genome analysis and population scale interpretation.
Whole genome sequencing analysis software: variant scope, data scale, non-coding interpretation, and population-scale programs.
Whole genome sequencing analysis software refers to the computational tools used to process, interpret, and report findings from WGS data. The analysis pipeline operates in two stages. Secondary analysis software handles the transformation of raw sequencing reads (FASTQ files) into variant calls: alignment, duplicate marking, base quality recalibration, and variant calling across SNVs, indels, CNVs, and structural variants. Tertiary analysis software then annotates those variants against clinical databases, filters to clinically relevant candidates, applies ACMG or AMP classification criteria, and generates the signed clinical report. At genome scale, typically 4 to 5 million raw variant calls per sample, the tertiary analysis layer is where the diagnostic work happens, and it requires purpose-built clinical software rather than research-grade tools adapted for clinical use.
The core annotation, filtering, and classification steps are similar, but WGS creates three distinct software challenges that exome analysis does not. First, data scale: a whole genome produces 4 to 5 million variant calls versus 25,000 to 40,000 for an exome, requiring multi-threaded processing and efficient database indexing to remain performant. Second, variant scope: WGS captures the full spectrum of genomic variation including non-coding variants, repeat expansions, mitochondrial variants, and structural variants that most exome capture kits miss, so the software must handle all of these in a single analysis session. Third, storage and management: WGS files are 10 to 30x larger than exome files, making enterprise data management infrastructure (variant assessment catalogs, longitudinal tracking, cohort frequency databases) a prerequisite rather than an optional add-on.
WGS supports detection and interpretation of the full spectrum of genomic variation: single nucleotide variants (SNVs), small insertions and deletions (indels), copy number variants (CNVs) including whole-chromosome aneuploidies and sub-microscopic events, structural variants (SVs) and gene fusions, repeat expansions in tandem repeat loci, mitochondrial DNA variants, and non-coding regulatory variants. This breadth is the primary clinical advantage of WGS over targeted panels and exome sequencing. Cases that remain unresolved after panel or exome testing often carry the causative variant in a region that was not sequenced or not interpretable with the smaller assay. VarSeq handles all of these variant classes within a single analysis session, without requiring separate tools for structural variants or CNVs.
Non-coding variant interpretation is one of the most challenging aspects of clinical WGS analysis. The majority of the genome is non-coding, and the functional evidence for specific non-coding variants is far less developed than for coding variants. VarSeq addresses this through CI-SpliceAI integration (a deep learning model that predicts the impact of variants on splicing, including deep intronic variants that standard consequence predictors miss), alongside regulatory region annotations, conservation scores, and literature database integration via Mastermind (Genomenon). Variants in known regulatory elements, promoter regions, and established non-coding disease loci can be flagged and prioritized. Full non-coding interpretation remains an area of active development across the field; VarSeq provides the annotation infrastructure while the clinical evidence base continues to mature.
Research-grade WGS tools are designed for exploratory analysis, where flexibility and sensitivity take priority over reproducibility and auditability. Clinical-grade software has four additional requirements. Determinism: the same input must produce identical output every run to support CLIA validation. Annotation currency: clinical databases (ClinVar, gnomAD, ClinGen) update continuously, and the software must track which database version was used for every analysis to support result traceability. Audit trail: every variant assessment, filter configuration, and classification decision must be permanently logged for CAP inspection. And quality management: software deployed in an accredited clinical laboratory should be developed under a certified QMS with controlled release processes. VarSeq is developed under an ISO 13485-certified QMS; VarSeq Dx is CE marked under IVDR 2017/746.
Yes. VSWarehouse is designed for enterprise and population-scale WGS programs, storing variant assessments, diplotype classifications, and clinical reports longitudinally across tens of thousands of genomes. For national genomic programs and health system WGS initiatives, VSWarehouse supports cohort-level allele frequency generation (building an internal population frequency database from the program's own genomes), automated reclassification monitoring when external databases update classifications of previously assessed variants, and LIMS/EHR integration for structured result delivery. VSPipeline automation handles hands-off processing from FASTQ to report for high-throughput programs where manual case initiation is not operationally viable. Deployment options include on-premises, private cloud (BYOC), and air-gapped configurations to meet the data governance requirements of population-scale programs.
VarSeq supports both short-read (Illumina) and long-read (PacBio HiFi, Oxford Nanopore) WGS data through the same tertiary analysis framework. Long-read sequencing resolves regions that are systematically difficult for short-read platforms: complex structural variants, tandem repeats, segmental duplications, and highly homologous gene families like SMN1/SMN2 and CYP2D6. Sentieon supports long-read alignment and variant calling, with output feeding directly into VarSeq for annotation and clinical interpretation. For programs where short-read WGS leaves a diagnostic gap, particularly in rare disease cases with negative short-read results, long-read WGS with the same downstream interpretation platform eliminates the need to rebuild the analysis workflow for the new assay type.
Unlock the potential of whole genome sequencing for your laboratory with VarSeq's enterprise-scale genomic analysis software.