cnvOffSeq: detecting intergenic copy number variation using off-target exome sequencing data.
cnvOffSeq: detecting intergenic copy number variation using off-target exome sequencing data.
Motivation: Exome sequencing technologies have transformed the field of Mendelian genetics and allowed for efficient detection of genomic variants in protein-coding regions. The target enrichment process that is intrinsic to exome sequencing is inherently imperfect, generating large amounts of unintended off-target sequence. Off-target data are characterized by very low and highly heterogeneous coverage and are usually discarded by exome analysis pipelines. We posit that off-target read depth is a rich, but overlooked, source of information that could be mined to detect intergenic copy number variation (CNV). We propose cnvOffseq, a novel normalization framework for off-target read depth that is based on local adaptive singular value decomposition (SVD). This method is designed to address the heterogeneity of the underlying data and allows for accurate and precise CNV detection and genotyping in off-target regions.
Results: cnvOffSeq was benchmarked on whole-exome sequencing samples from the 1000 Genomes Project. In a set of 104 gold standard intergenic deletions, our method achieved a sensitivity of 57.5% and a specificity of 99.2%, while maintaining a low FDR of 5%. For gold standard deletions longer than 5 kb, cnvOffSeq achieves a sensitivity of 90.4% without increasing the FDR. cnvOffSeq outperforms both whole-genome and whole-exome CNV detection methods considerably and is shown to offer a substantial improvement over naïve local SVD.
i639–i645
Bellos, E
719c8ef8-c89d-4231-810a-867dd59d31dc
LJ, Coin
1ae78de8-a10c-4be2-ba3a-e601dce37b9d
1 September 2014
Bellos, E
719c8ef8-c89d-4231-810a-867dd59d31dc
LJ, Coin
1ae78de8-a10c-4be2-ba3a-e601dce37b9d
Bellos, E and LJ, Coin
(2014)
cnvOffSeq: detecting intergenic copy number variation using off-target exome sequencing data.
Bioinformatics, 30 (17), .
(doi:10.1093/bioinformatics/btu475).
Abstract
Motivation: Exome sequencing technologies have transformed the field of Mendelian genetics and allowed for efficient detection of genomic variants in protein-coding regions. The target enrichment process that is intrinsic to exome sequencing is inherently imperfect, generating large amounts of unintended off-target sequence. Off-target data are characterized by very low and highly heterogeneous coverage and are usually discarded by exome analysis pipelines. We posit that off-target read depth is a rich, but overlooked, source of information that could be mined to detect intergenic copy number variation (CNV). We propose cnvOffseq, a novel normalization framework for off-target read depth that is based on local adaptive singular value decomposition (SVD). This method is designed to address the heterogeneity of the underlying data and allows for accurate and precise CNV detection and genotyping in off-target regions.
Results: cnvOffSeq was benchmarked on whole-exome sequencing samples from the 1000 Genomes Project. In a set of 104 gold standard intergenic deletions, our method achieved a sensitivity of 57.5% and a specificity of 99.2%, while maintaining a low FDR of 5%. For gold standard deletions longer than 5 kb, cnvOffSeq achieves a sensitivity of 90.4% without increasing the FDR. cnvOffSeq outperforms both whole-genome and whole-exome CNV detection methods considerably and is shown to offer a substantial improvement over naïve local SVD.
This record has no associated files available for download.
More information
Published date: 1 September 2014
Identifiers
Local EPrints ID: 500669
URI: http://eprints.soton.ac.uk/id/eprint/500669
ISSN: 1367-4803
PURE UUID: 0c6b4d28-9cb7-4bc2-9e66-14c4d958b84a
Catalogue record
Date deposited: 08 May 2025 17:07
Last modified: 09 May 2025 02:14
Export record
Altmetrics
Contributors
Author:
E Bellos
Author:
Coin LJ
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics