Golden Helix · Clinical Genomics Guide
WES vs WGS
Choosing the Right Sequencing Strategy
A clinical comparison of whole exome and whole genome sequencing. Diagnostic yield, cost, variant detection, and the criteria that determine which is the right choice for your lab or your patient.
Introduction
There is no universal right answer.
There is the right answer for this case.
Whole exome sequencing and whole genome sequencing are the two most powerful tools in clinical genomics. Both use next generation sequencing. Both can identify the genetic cause of rare disease. For years labs and clinicians have debated which one to order.
The honest answer is that it depends on the patient population, the clinical indication, the variants you need to detect, the infrastructure your lab has in place, and increasingly, the cost-per-diagnosis rather than the cost-per-test. This guide gives lab directors, bioinformaticians, and ordering clinicians the framework to make that decision, grounded in current evidence rather than vendor positioning.
Definitions
What Each Test Actually Is
Whole exome sequencing (WES)
WES sequences the protein-coding regions of all ~20,000 genes in the human genome, collectively the exome. About 60 million base pairs, roughly 1 to 2% of the total genome, but containing an estimated 85% of known disease-causing variants. A capture step uses synthetic probes that hybridize to exonic regions, pulling them out of the DNA library before the run. This enrichment focuses read depth on the clinically relevant regions, typically achieving 80 to 100x mean coverage on a standard run.
Whole genome sequencing (WGS)
WGS sequences all approximately 3.2 billion base pairs of the human genome: coding and non-coding, exonic and intronic, regulatory and repetitive. No capture step. The entire genome is sequenced directly. WGS typically achieves 30 to 40x mean coverage on a standard clinical run, with higher depths (100x+) used for somatic tumor analysis or mosaic variant detection.
Side by Side
Key Differences at a Glance
The headline differences, before we get into where each one belongs clinically.
| Dimension | Whole Exome Sequencing | Whole Genome Sequencing |
|---|---|---|
| Genomic coverage | ~1–2% (exons only) | ~99% (whole genome) |
| Base pairs sequenced | ~60 million | ~3.2 billion |
| Typical mean coverage | 80–100x | 30–40x |
| Raw data per sample | ~10–20 GB | ~100–200 GB |
| Typical clinical cost | $1,000–$3,000 | $2,000–$5,000 |
| Turnaround time | Faster (less data) | Slower (more data) |
| SNV / Indel detection | Strong in exons | Strong genome-wide |
| CNV detection | Limited | Strong |
| Structural variants | Limited | Strong |
| Non-coding variants | No | Yes |
| Repeat expansion detection | Limited | Better (best with long-read) |
| Mitochondrial DNA | Partial | Full |
| Capture / GC bias | Yes (capture step introduces bias) | None (no capture step) |
| Established clinical use | Extensive | Growing rapidly |
The Evidence
Diagnostic Yield: What Studies Show
The central clinical question is not which technology is more comprehensive in theory, WGS obviously covers more of the genome. The question is whether that additional coverage translates into more diagnoses, and for which patients. The evidence has shifted decisively in recent years.
A landmark 2023 study in the New England Journal of Medicine sequenced the genomes of 822 families with suspected rare monogenic disease who had not received a diagnosis from prior testing. WGS achieved a molecular diagnosis in 29.3% of previously undiagnosed families: a substantial yield from a population already filtered through prior testing, including (in many cases) WES.
A 2015 study in PNAS (Belkadi et al.) comparing WES and WGS head-to-head on the same individuals found that WGS detected ~650 high-quality coding SNVs per sample that WES missed: roughly 3% of coding variants. These are variants in regions where WES probes do not capture efficiently, particularly low-complexity sequences, high-GC regions, and exon-intron boundaries.
A 2023 systematic review and meta-analysis in PLOS ONE found that WGS had a consistently higher diagnostic yield than WES for pediatric patients with suspected genetic disorders, diagnosing 15 to 35% of WES-negative families across multiple studies.
The practical implication: if a patient has had WES with no diagnosis, WGS should be considered the next step, not another panel and not a WES repeat. The incremental diagnostic yield is real and clinically meaningful.
Where WGS gains its diagnostic advantage
- Non-coding regulatory variants. Promoters, enhancers, and splice regulatory regions that affect expression without altering the protein-coding sequence. WES misses these entirely.
- Deep intronic variants causing aberrant splicing. Variants hundreds of base pairs from an exon that create cryptic splice sites. Invisible to WES; need WGS or RNA sequencing.
- Structural variants and large CNVs. WES capture disrupts the read-pair architecture that SV callers depend on. WGS provides significantly better detection of inversions, translocations, and large insertions.
- Incomplete exome capture. Every WES kit has regions of the exome it captures poorly: high-GC exons, first and last exons, regions with homology to other loci. Gaps vary by capture kit.
- Repeat expansions. Huntington disease, fragile X, spinocerebellar ataxias. Poorly detected by short-read WES. WGS improves detection. Long-read WGS is the gold standard.
Decision Logic
When to Choose WES
Despite WGS's diagnostic advantages, WES remains the appropriate choice in many clinical scenarios. The decision is not always "WGS if you can afford it."
- 01
Clinical indication maps cleanly to coding variants
For well-characterized Mendelian conditions where the mutational spectrum is dominated by coding SNVs and indels (BRCA1/2 hereditary cancer, CFTR-related cystic fibrosis), WES provides sufficient coverage at lower cost and faster turnaround.
- 02
Payer coverage supports WES but not WGS
WGS reimbursement remains inconsistent. In many systems WES is the covered option for rare disease diagnosis. Ordering WGS when payer coverage is absent shifts cost to the patient, a clinical and ethical consideration.
- 03
Lab infrastructure is not validated for WGS-scale data
WGS generates 10x more raw data than WES. If storage, compute, and pipeline are not validated for WGS scale, WES is the pragmatic choice until infrastructure is ready. Running WGS through a WES-validated pipeline is a validation failure waiting to happen.
- 04
Turnaround time is critical and rapid WGS is not available
Rapid WGS for critically ill neonates is increasingly available but not at every institution. Where WGS TAT cannot meet clinical needs, WES may be the right interim approach.
- 05
Trio analysis on a constrained budget
Sequencing proband plus both parents dramatically improves diagnostic yield over proband-only WES by enabling de novo variant detection. A trio WES at $6,000–$9,000 total may outperform proband-only WGS at $3,000–$5,000 for many indications.
- 01
Capture bias
The hybridization capture step introduces GC-content bias. High-GC exons, including many disease-relevant promoter-proximal exons, are systematically underrepresented. This creates consistent coverage gaps that vary by kit vendor.
- 02
Missing non-coding disease
An estimated 15 to 25% of Mendelian disease diagnoses in WES-negative patients are eventually explained by non-coding variants, most of which WGS would detect.
- 03
SV blindspot
WES has poor sensitivity for copy number variants below ~50 kb and essentially no sensitivity for balanced structural variants. Conditions with a significant SV mutational mechanism are at risk of false negatives.
- 04
Inter-kit variability
Different WES capture kits cover different exonic regions. A negative WES result on one platform may not mean the variant is undetectable. It may mean the variant falls in a gap specific to that kit.
Decision Logic
When to Choose WGS
WGS is no longer a research-only technology. It is entering clinical production at labs globally and is increasingly supported by professional society guidelines as a first- or second-line test for specific indications.
- 01
Patient is WES-negative with strong clinical suspicion
The highest-yield application of WGS. A 2023 NEJM study demonstrated 29% diagnostic yield in this population. If a patient has had WES, gene panels, and chromosomal microarray without a diagnosis, WGS is the rational next step.
- 02
Critically ill neonate requiring rapid diagnosis
Rapid WGS programs returning results in 24 to 72 hours have demonstrated clinical utility in NICUs and PICUs, where a molecular diagnosis can directly change management within a time-sensitive window. RCTs show rapid WGS changes management in 20 to 40% of cases.
- 03
Structural variant etiology is suspected
For conditions where large deletions, duplications, or rearrangements are a primary mechanism (many neurodevelopmental disorders), WGS's superior SV detection makes it preferred over WES plus chromosomal microarray.
- 04
A comprehensive first-line approach is feasible
When WGS cost and infrastructure are not limiting, WGS-first eliminates stepwise testing and the delays it introduces. The cost-per-diagnosis argument increasingly supports this approach for undiagnosed rare disease programs.
- 05
Mitochondrial disease is on the differential
WGS provides full mitochondrial genome coverage at high depth (mtDNA is present in hundreds of copies per cell). WES mtDNA coverage is partial and inconsistent.
- 06
Non-coding or regulatory variants are clinically suspected
For genes where non-coding pathogenic variants are documented (LDLR promoter variants in familial hypercholesterolemia, HBB regulatory variants in thalassemia, deep intronic variants in CFTR), WGS provides the only NGS-based route to detection.
Economics
The Cost-Per-Diagnosis Argument
The common framing of WES vs WGS as a cost comparison ("WES is cheaper, therefore WES first") is increasingly outdated. The relevant metric is not cost-per-test. It is cost-per-diagnosis.
A 2022 health economic analysis in Genetics in Medicine (Lavelle et al.) modeled cost-effectiveness of WES vs WGS for children with rare undiagnosed conditions. While WGS has a higher upfront cost, its higher diagnostic yield, particularly in WES-negative patients, results in a lower cost-per-diagnosis when accounting for the avoided cost of subsequent testing, repeat consultations, and empirical treatment of undiagnosed patients.
The stepwise approach (panel first, then WES, then WGS) was the most expensive path per diagnosis in the model, despite having the lowest cost at each individual step.
Operations
Operational Implications for the Lab
The WES vs WGS decision is not just a clinical genetics question. It is an infrastructure and operations question that directly affects the bioinformatics pipeline, storage architecture, and interpretation workflow.
Bioinformatics pipeline
WGS requires a pipeline validated for whole-genome scale. Not simply a matter of pointing a WES pipeline at a larger input file. Specific differences:
- Variant callers must be configured for genome-wide operation, not just exon-targeted regions. A caller tuned for high-depth WES may perform differently at 30x WGS depth.
- SV and CNV callers must be included. A WGS pipeline that only calls SNVs and indels is discarding a significant fraction of WGS's diagnostic value.
- Non-coding annotation needs additional sources and filtering strategies. Without appropriate non-coding databases and effect predictors, WGS generates enormous noise in non-coding regions.
- Compute and storage requirements are 5 to 10x higher. WGS needs significantly more RAM, CPU, and storage at every stage.
Interpretation workflow
A standard WES tertiary analysis might produce 200 to 500 candidate variants after filtering. WGS on the same patient with non-coding regions included can produce several thousand candidates without appropriate filtering strategies. Labs transitioning to WGS must invest in non-coding variant filtering frameworks, phenotype-driven prioritization, and expanded ACMG/AMP evidence gathering for non-coding variation. VarSeq handles both WES and WGS tertiary analysis within a single unified platform.
Storage at scale
| Metric | WES | WGS |
|---|---|---|
| Raw FASTQ per sample | ~10 GB | ~100–200 GB |
| BAM file per sample | ~8–12 GB | ~80–120 GB |
| VCF per sample | ~100 MB | ~500 MB – 1 GB |
| Storage per 100 samples/week | ~2–3 TB/year | ~20–30 TB/year |
A lab running 100 WGS cases per week needs 1 to 1.5 petabytes of raw data per year if retaining FASTQs. Most institutions adopt tiered retention: VCFs and clinical reports indefinitely, BAMs for 2 to 5 years, FASTQs compressed or discarded after a defined validation period.
Common Questions
Frequently Asked Questions
Is WGS better than WES?
Is WES the same as WGS?
Can WES be rejected by insurance?
What is the diagnostic yield of WES vs WGS?
Should I use hg19 or hg38?
How much more storage does WGS need than WES?
Keep Reading
Related Resources
Go deeper on either test, or on how a VCF becomes a clinical report regardless of which test produced it.
Handle Both WES and WGS in One Platform
VarSeq supports tertiary analysis for targeted panels, whole exome, and whole genome sequencing within a unified workflow, with variant filtering, ACMG-guided interpretation, and clinical reporting that scales from gene panels to whole genomes without requiring separate pipelines.