VarSeq Updated with CNV Annotations, CNV PhoRank and Region Assessment Catalogs

· Gabe Rudy · News, Events, & Announcements

VarSeq Updated with CNV Annotations, CNV PhoRank, and Region Assessment Catalogs

This year we have released multiple advances to support CNV analysis in VarSeq, expanding our target region based VS-CNV caller to handle exomes and low-depth genomes as well as additional supporting algorithms like calling Loss of Heterozygosity regions.

To top this off, VarSeq 1.4.7 has been shipped with many expanded capabilities to support the analysis, cataloging, and reporting of the CNVs called in VarSeq. These include a new set of annotation algorithms and expanded annotation catalogs designed specifically for CNV and other region-based data, as well as updates to the Assessment Catalog to store CNVs, regions, and per-sample information.

Annotations on Every Table

VarSeq 1.4.7 now supports annotation of virtually every table in VarSeq. This includes annotation of coverage regions, LoH regions, and CNVs, as well as support for computed fields and counts per gene on all tables. Now, annotating one of these secondary tables is as simple as clicking the ‘Add’ button and selecting the appropriate annotation choice from the Secondary Tables menu.

VarSeq Updated

We have added a wide array of CNV annotation tracks. These include 1kG CNVs and Large Variants, which provides frequency information about CNVs, ClinVar and ClinGen, which provide clinical assessment and interpretation information, Genomic Super Dups, which includes regions that are highly homologous to other repetitive regions of the genome, and many others.

CNVs can also be annotated against our gene tracks, allowing our users to easily identify the set of genes and specific exons of those genes that overlap a given CNV event. For very large events that may cover thousands of genes, we limit the detailed per-transcript analysis to those that overlap either of the end-points of the CNV event.

Also, for understanding the function and clinical relevance of CNV events, we have the ability to annotate against our licensed OMIM gene and phenotype annotations as well as the ClinGen Haploinsufficiency and Triplosensitivy gene and region annotations.

Filters on Every Table

Previously, the only way to filter CNV tables was through VarSeq’s per-table queries, which works fine for one or two filters, but, for more complex CNV workflows including filters over several annotations, we want to use the same powerful filter chain used for our variant tables. With VarSeq 1.4.7, complex filter chains can be applied to any table in a project.

VarSeq Updated

In the context of CNVs, filtering based on annotations allows users to reduce false positives and filter out common CNVs. Using this functionality, you can quickly examine a shortlist of clinically relevant CNVs.

PhoRank Your CNVs

To prioritize and rank the CNVs that have a high likelihood of impacting the observed symptoms, we have added CNV support for our phenotype driven gene ranking tool, PhoRank.

CNV PhoRank ranks CNVs and genes based on their relevance to user-specified phenotypes as defined by the GO and HPO biomedical ontologies. CNV PhoRank reports the names, scores, and ontology paths for the five highest-ranking genes overlapping a given CNV, along with the sum and maximum scores overall overlapping genes. By filtering and sorting on these scores, users can narrow the search space, prioritizing CNVs based on their genes’ relevance to an individual’s phenotypes.

VarSeq Updated

Assessment Catalogs for Regions, CNVs, and Variants Too

Assessment Catalogs provide an interface to track variants across projects. Each stored variant can be annotated with custom information input through a form. Projects can use these custom assessment catalogs as annotation sources and variants can be filtered based on the stored values.

In VarSeq 1.4.7, assessment catalogs can now store CNV and Region information in addition to variants. CNVs in an assessment catalog are indexed on genomic position and CNV type, while Regions are simply indexed on position. However, custom keys can also be added to a catalog, allowing multiple records to exist for a given Variant, CNV, or Region. For example, you could have one assessment per sample, disease, or test assay.

Enhanced Exports

We have greatly enhanced the export capability in VarSeq 1.4.7. Both CNV and Region tables can now be exported as annotation tracks or exported directly to an assessment catalog. Also, the display of the genomic region field can now be customized, allowing Chromosome, Start, and Stop to be displayed and exported as separate fields.

Of course, it is impossible to cover all of the improvements to VarSeq in a single, short blog post. If you want to know more, check out our Release Notes for a full list of the many improvements and new features. Finally, we want to thank our many engaged users for their feedback and suggestions as we continue the development of VarSeq as a clinical interpretation suite.

Ready to try VarSeq? Start a free trial with Golden Helix today.

Leave a comment

Gabe Rudy

About Gabe Rudy

Gabe Rudy is the Vice President of Product and Engineering at Golden Helix, where for over two decades he has led the development of clinically validated software solutions that power precision medicine worldwide. Under his leadership, Golden Helix has delivered a suite of best-in-class tools for genomic analysis, including CNV calling, pharmacogenomics, carrier screening, and somatic variant interpretation. These solutions are designed for flexible deployment across on-premises, private cloud, and managed cloud environments, and are used by organizations ranging from small diagnostic teams to large clinical laboratories and even national-scale genomic initiatives. With a background in Computer Science and graduate work in compiler optimization and high-performance computing, Gabe brings a unique blend of software architecture expertise and deep domain knowledge in genomics. Since 2006, he directed product strategy and engineering at Golden Helix, ensuring the company stays at the forefront of innovation while maintaining the highest standards of usability, scalability, and quality. Gabe is an active participant in the genomics community, regularly presenting on topics such as NGS best practices, variant interpretation workflows, and the integration of AI into clinical diagnostics. His work has supported thousands of labs across the globe in the adoption of robust, intuitive, and clinically actionable bioinformatics workflows. Based in Bozeman, Montana, Gabe balances his passion for advancing precision medicine with family life and a love for the outdoors.

View all posts by Gabe Rudy →