Sentieon provides high-computing efficiency, fast turnaround time, and 100% consistency for secondary analysis NGS. Sentieon DNAscope delivers improved accuracy through enhanced active region detection, powerful local assembly of reads, and pre-trained machine learning models for both short-read and long-read sequencers — bundled directly with Golden Helix tertiary analysis.
Secondary analysis NGS is the critical computational phase that transforms raw sequencer output into structured variant data. It receives millions of short DNA reads in FASTQ format and produces a catalog of genetic differences relative to a reference genome, typically in VCF format.
Achieving high-throughput in clinical genomics requires variant calling software that balances speed, accuracy, and reproducibility. Legacy pipelines often face bottlenecks in processing whole-genome and whole-exome datasets, leading to extended turnaround times and hardware scaling challenges.
Golden Helix integrates Sentieon as a high-performance DNA sequencing pipeline engine, providing alignment, deduplication, and variant calling in a single optimized solution. By implementing GATK and MuTect2 mathematical models in low-level C and assembly language, Sentieon delivers 10X-50X performance gains while maintaining mathematical equivalence. Sentieon DNAscope goes further with pre-trained machine learning models that improve accuracy through enhanced active region detection and more powerful local assembly of reads for both short-read and long-read sequencers.
Modern genomic research demands a high-performance variant calling approach that can scale to national-level genome projects. Sentieon achieves this through optimized software implementation of the GATK Best Practices workflows.
Sentieon provides the most efficient secondary analysis solution for high-throughput environments where processing speed and consistency are critical.
The software should maximize CPU utilization to deliver 10X-50X improvements in core-hours over standard Java-based pipelines.
Deterministic results with no downsampling in high-coverage regions ensure consistent variant calls for clinical delivery.
Support for joint calling on 100,000+ samples without intermediate file merging, enabling massive cohort studies.
Unified support for DNAseq (germline) and TNseq (tumor-normal somatic) variant calling in one engine.
The secondary analysis should integrate directly with tertiary analysis for a seamless "FASTQ to Report" workflow.
Deploy anywhere: on-premises workstations, centralized servers, or air-gapped secure networks.
Golden Helix provides a complete bioinformatics pipeline designed to receive data directly from the sequencer and carry it through to clinical reporting. Sentieon processes the raw FASTQ data into a VCF, which is then imported into VarSeq for filtering and annotation.
Achieve 10X faster FASTQ-to-VCF and up to 50X faster BAM-to-VCF processing compared to standard BWA-GATK pipelines.
Sentieon implements mathematically equivalent GATK models for proven accuracy. DNAscope goes further with ML-optimized calling that improves sensitivity and specificity for both short-read and long-read data.
Integrate Sentieon into VSPipeline for full automation from raw sequencing reads to clinician-ready PDF reports.

"Sentieon provides the upstream secondary analysis that feeds directly into VarSeq's tertiary interpretation engine."
Map sequencing reads to the reference genome using highly efficient BWA-MEM or BWA-MEM2 algorithms.
Identify and mark PCR artifacts to ensure variant calls are based on unique biological fragments.
Apply Bayesian models (DNAseq) or ML-optimized calling (DNAscope) to identify SNVs and Indels with deterministic, 100% consistent results.
Produce standard VCF files ready for clinical-grade tertiary analysis and interpretation in VarSeq.
Sentieon is designed for massive datasets, supporting joint calling on 100,000+ samples without intermediate file merging.

Complete solution for germline SNV and Indel detection. DNAseq provides mathematically equivalent GATK results at 10X speed, while DNAscope uses ML models for even higher accuracy on both short and long reads.
Variant InterpretationTumor-normal pair somatic variant detection matching MuTect and MuTect2 mathematics with significantly improved efficiency.
Somatic SolutionsScale to national genome projects with joint calling for 100,000+ samples. Deterministic results ensure data consistency across cohorts.
Whole Genome AnalysisYes. Sentieon's DNAseq implements the exact same mathematical models as Broad Institute's Best Practice Workflows, achieving mathematically equivalent results with massive throughput gains. DNAscope goes beyond GATK equivalence by incorporating pre-trained machine learning models for improved accuracy on both short-read and long-read data.
No. Sentieon is a pure software solution optimized for standard CPU architectures. It delivers 10X-50X performance gains without requiring specialized hardware like GPUs or FPGAs.
Unlike some bioinformatics tools that may produce slightly different results across multiple runs due to multi-threading randomness, Sentieon ensures that the exact same input always produces the exact same output.
Yes. Sentieon is highly modular and can be integrated into VSPipeline or custom automation scripts, enabling a hands-off workflow from FASTQ generation to clinical reporting.
Learn how to optimize your NGS pipeline for speed and accuracy with Sentieon and VarSeq.
Join high-throughput clinical labs and research centers worldwide using Sentieon for high-performance variant calling.