Golden Helix · Clinical Genomics Guide
Whole Exome Sequencing (WES)
What It Is, How It Works, When to Order It
A clinical guide to WES for clinicians considering the test, families researching it, and lab professionals running or evaluating a WES program. What it detects, what it does not, and where it fits in the rare disease workup.
Introduction
One test, across every protein-coding gene
in the human genome.
When a patient presents with a complex unexplained condition (developmental delay, unusual birth defects, a constellation of symptoms that does not fit a known syndrome), a physician needs a test broad enough to look across all 20,000 human genes at once. Whole exome sequencing is that test.
In the last decade WES has moved from a research tool available only in specialized academic centers to a clinical standard ordered by pediatricians, neurologists, and geneticists around the world. It has ended diagnostic odysseys that lasted years. It has identified treatment-relevant variants in patients who had already undergone extensive testing without answers. And it has fundamentally changed the economics of rare disease diagnosis by replacing sequential single-gene testing with a single comprehensive assay.
Definition
What Is Whole Exome Sequencing?
Whole exome sequencing (WES) is a genetic test that sequences all of the protein-coding regions of the human genome, collectively the exome. About 60 million base pairs spread across ~20,000 genes. That is roughly 1 to 2% of the total genome, but contains an estimated 85% of known disease-causing genetic variants.
Why the exome, not the whole genome?
The human genome contains ~3.2 billion base pairs, but most do not directly encode proteins. The protein-coding portions, the exons, are the segments cells actively read to produce the proteins that carry out virtually every biological function. Most variants causing Mendelian disease, disrupting enzyme function, altering structural proteins, or affecting signaling do so by changing exon sequence.
By focusing sequencing on the exome rather than the entire genome, WES captures the regions most likely to harbor disease-causing variants at a cost and data volume significantly lower than WGS. A standard clinical WES run produces ~10–20 GB of data per sample, roughly one-tenth the WGS data volume, while identifying the vast majority of clinically actionable variants.
What is an exon?
Exons are not contiguous. Each gene is made of multiple exons separated by non-coding introns. When a gene is expressed, the cell transcribes the entire gene (exons and introns together) and then splices out the introns to produce a mature mRNA consisting only of exons. That mRNA is translated into a protein. A variant in an exon that changes an amino acid, introduces a premature stop codon, or disrupts a splice site at the exon-intron boundary can alter or abolish protein function. Those are the variants WES is designed to detect.
The Pipeline
From Sample to Report
WES follows the same general NGS workflow as other sequencing assays, with one critical addition: the exome capture step. Understanding the full pipeline helps clinicians interpret results and helps labs make informed decisions about test design.
- 01
Sample collection & DNA extraction
Peripheral blood is most common; saliva or buccal swabs work when blood is not feasible. Tissue is used for specific cases. DNA quality matters: degraded or low-quantity input produces shorter fragments, uneven capture, and reduced coverage in some exonic regions.
- 02
Library preparation
DNA is fragmented to ~150–400 bp and adapters are ligated. The adapters carry index sequences for sample multiplexing on the same flow cell.
- 03
Exome capture (enrichment)
The step that distinguishes WES from WGS. Synthetic probes complementary to exonic sequences hybridize and pull exon-containing fragments out of the library. Non-exonic fragments are washed away. Capture kits differ (Agilent SureSelect, Illumina Exome, IDT xGen, Twist Biosciences) and the choice matters for which regions are reliably covered.
- 04
Sequencing
Captured library is loaded onto a sequencer, usually Illumina, with 150 bp paired-end reads. Because sequencing is focused on the enriched fraction, WES typically achieves 80–100x mean coverage on targeted regions, vs 30–40x for WGS. Output: a FASTQ file with billions of reads and per-base Phred quality scores.
- 05
Secondary analysis: alignment & variant calling
Reads map to GRCh38 using BWA-MEM, producing a BAM file. PCR duplicates are removed, base qualities recalibrated. GATK HaplotypeCaller calls germline SNVs and small indels. CNV detection needs separate algorithms analyzing read-depth across the exome. Output: a VCF.
- 06
Tertiary analysis: annotation, filtering, classification, reporting
The diagnostic work. Annotation with ClinVar, gnomAD, OMIM, in silico predictors, SpliceAI. Filtering on population frequency and functional consequence. Phenotype-driven prioritization with HPO terms. ACMG/AMP classification across 28 criteria. Final clinical report. See the tertiary analysis guide for depth on this stage.
Capabilities
What WES Detects (and What It Misses)
Understanding both sides is essential for appropriate test ordering and accurate result interpretation. Being explicit about WES limitations protects patients from false reassurance and guides appropriate follow-up.
Detected
Single Nucleotide Variants
Single base changes in exonic sequence: missense (amino acid change), nonsense (premature stop), synonymous. WES detects germline SNVs in exonic regions with high sensitivity at adequate coverage.
Detected
Small Insertions & Deletions
Indels of 1 to 50 bp. Frameshift indels not divisible by three typically cause loss of function. Sensitivity decreases for larger indels and those in homopolymer runs.
Detected
Splice-Site Variants
Variants at canonical splice donor and acceptor sites (GT/AG at exon-intron boundaries) are captured because probes typically extend slightly beyond exon edges. Deep intronic splice variants more than 10–20 bp from the exon are not detected.
Detected (limited)
Exon-Level Copy Number Variants
WES detects single-exon or multi-exon deletions and duplications through read-depth analysis. Sensitivity is lower than chromosomal microarray for large CNVs, and limited to regions covered by the exome capture.
Not detected
Non-Coding Variants
Variants in promoters, enhancers, deep introns, and other regulatory regions are not captured. 15 to 25% of WES-negative rare disease patients eventually receive a diagnosis through WGS or RNA sequencing, many from non-coding variants.
Not detected
Large Structural Variants & Repeat Expansions
Balanced inversions, translocations, and complex rearrangements are not reliably detected. Repeat expansions causing Huntington, fragile X, and myotonic dystrophy are missed entirely. Mitochondrial genome coverage is partial.
Test Ordering
When to Order Whole Exome Sequencing
WES is not appropriate for every genetic testing scenario. It is most powerful when the clinical question is broad, the differential diagnosis spans many genes, and prior targeted testing has been unrevealing or is impractical.
- 01
Undiagnosed rare disease with suspected genetic etiology
The core use case. Patients with complex, unexplained phenotypes that do not fit a recognizable syndrome, particularly those with normal chromosomal microarray, panel testing, and metabolic workup.
- 02
Developmental delay & intellectual disability
The ACMG and AAP recommend exome or genome sequencing as a first-line or early-line test for children with global developmental delay or intellectual disability of unknown cause (Manickam et al., 2021 ACMG clinical guideline).
- 03
Unexplained epilepsy
The NSGC, endorsed by the American Epilepsy Society, recommends exome or genome sequencing for all individuals with unexplained epilepsy. Genetic etiology is identified in 30 to 50% of cases, and the genetic cause directly affects treatment selection in many.
- 04
Multiple congenital anomalies
Two or more structural birth defects of different organ systems that suggest a syndromic genetic condition.
- 05
Genetically heterogeneous conditions
Conditions where dozens or hundreds of genes can cause the same presentation: cardiomyopathy, skeletal dysplasia, primary immunodeficiency. Efficiently addressed by WES rather than sequential single-gene testing.
- 06
Prior panel testing has been non-diagnostic
When a targeted gene panel appropriate for the phenotype returned negative, WES is the rational next step before considering WGS.
Family-Based Testing
Trio Exome Sequencing
One of the most powerful applications of WES is trio sequencing: sequencing the affected patient (proband) together with both biological parents simultaneously.
Trio WES dramatically improves diagnostic yield by enabling de novo variant detection. De novo variants are present in the child but absent from both parents, meaning they arose as new mutations, not inherited ones. De novo variants are a major cause of severe early-onset genetic conditions including autism spectrum disorder, severe intellectual disability, and many epileptic encephalopathies.
Without parental sequencing, a de novo variant looks like any other rare heterozygous variant. It must be evaluated on functional evidence alone. With parental sequencing, a de novo variant in a gene strongly linked to the child's phenotype carries substantially more evidence for pathogenicity, often meeting ACMG criteria for Pathogenic or Likely Pathogenic outright.
Trio WES also enables
- Compound heterozygosity confirmation: two variants in the same gene, one on each chromosome, that together cause autosomal recessive disease. Parental data confirms each variant is inherited from a different parent.
- X-linked inheritance confirmation: distinguishing maternal carrier inheritance vs de novo arising in an affected male, which changes recurrence risk counseling.
- Reduced VUS rate: parental segregation data helps reclassify variants of uncertain significance.
Studies consistently show trio WES achieves 10 to 15 percentage points higher diagnostic yield than proband-only WES for pediatric rare disease, a clinically significant difference that justifies the additional sequencing cost in most cases.
Evidence
Diagnostic Yield by Indication
WES diagnostic yield varies by clinical indication, patient selection, and whether trio or proband-only sequencing is performed. The most reliable figures come from large clinical cohort studies.
| Clinical Indication | Proband-Only WES | Trio WES |
|---|---|---|
| Undiagnosed rare disease (general) | 20–30% | 35–50% |
| Intellectual disability / developmental delay | 25–35% | 40–55% |
| Epilepsy | 20–45% | 35–50% |
| Multiple congenital anomalies | 30–40% | 45–60% |
| Autism spectrum disorder | 10–15% | 20–30% |
| Prior panel-negative patients | 15–25% | 25–40% |
Individual yield varies based on how well the phenotype is characterized, the specificity of the indication, and the lab's analytical approach. Highly specific phenotypes with strong genotype-phenotype correlations yield more diagnoses than broad presentations. A negative WES does not rule out a genetic cause: it rules out the causes detectable by WES. About 10 to 15% of WES-negative patients eventually receive a diagnosis through WGS, RNA sequencing, or re-analysis of existing WES data as new gene-disease relationships are established.
Economics
Cost and Insurance Coverage
Clinical WES pricing in the United States varies by laboratory and testing configuration. These figures reflect typical clinical laboratory pricing inclusive of variant interpretation. Research-grade WES from commercial providers costs considerably less but excludes clinical reporting and CLIA-certified interpretation.
| Configuration | Typical US Clinical Cost | CPT Codes |
|---|---|---|
| Proband-only WES | $1,000 – $2,500 | CPT 81415 |
| Duo WES (proband + one parent) | $1,500 – $3,500 | CPT 81415 + 81416 |
| Trio WES (proband + both parents) | $2,000 – $5,000 | CPT 81415 + 81416 ×2 |
| WES re-analysis | Varies | CPT 81417 |
Insurance coverage for clinical WES has expanded substantially. Medicare covers WES for specific indications, primarily pediatric patients with suspected genetic conditions; coverage is governed by Local Coverage Determinations that vary by MAC region. Most commercial payers cover WES for children with unexplained developmental delay, intellectual disability, epilepsy, or multiple congenital anomalies when ordered by an appropriate specialist and prior testing has been non-diagnostic. Medical necessity documentation is typically required.
Common Questions
Frequently Asked Questions
What does whole exome sequencing test for?
How accurate is whole exome sequencing?
How long does whole exome sequencing take?
What is the difference between WES and a gene panel?
Does insurance cover whole exome sequencing?
What happens if WES comes back negative?
Can adults have whole exome sequencing?
Keep Reading
Related Resources
Build a Clinical WES Program
VarSeq provides end-to-end exome analysis for clinical labs: from VCF import and automated annotation to ACMG-guided variant classification and signed clinical reports. Trio analysis, CNV detection, and phenotype-driven prioritization are built in.