cnvCapSeq: detecting copy number variation in long-range targeted resequencing data.
cnvCapSeq: detecting copy number variation in long-range targeted resequencing data.
Targeted resequencing technologies have allowed for efficient and cost-effective detection of genomic variants in specific regions of interest. Although capture sequencing has been primarily used for investigating single nucleotide variants and indels, it has the potential to elucidate a broader spectrum of genetic variation, including copy number variants (CNVs). Various methods exist for detecting CNV in whole-genome and exome sequencing datasets. However, no algorithms have been specifically designed for contiguous target sequencing, despite its increasing importance in clinical and research applications. We have developed cnvCapSeq, a novel method for accurate and sensitive CNV discovery and genotyping in long-range targeted resequencing. cnvCapSeq was benchmarked using a simulated contiguous capture sequencing dataset comprising 21 genomic loci of various lengths. cnvCapSeq was shown to outperform the best existing exome CNV method by a wide margin both in terms of sensitivity (92.0 versus 48.3%) and specificity (99.8 versus 70.5%). We also applied cnvCapSeq to a real capture sequencing cohort comprising a contiguous 358 kb region that contains the Complement Factor H gene cluster. In this dataset, cnvCapSeq identified 41 samples with CNV, including two with duplications, with a genotyping accuracy of 99%, as ascertained by quantitative real-time PCR.
Bellos, Evangelos
719c8ef8-c89d-4231-810a-867dd59d31dc
Kumar, Vikrant
63752f94-626e-41cb-bf30-7b379e9a3413
Lin, Clarabelle
1e4bfea9-ebf3-4fc7-bfaa-d68037abc67e
Maggi, Jordi
776d472f-e45d-4dd9-9538-5eef320dcd4c
Phua, Zai Yang
c26cd932-0bc4-434d-8fa9-d0bfbfab75a9
Cheng, Ching Yu
a4b291fb-8738-4871-9ff1-46b884e1928e
Cheung, Chui Ming Gemmy
5ea55fb6-2341-44b2-ae58-be2393c9b097
Hibberd, Martin L.
146ffeef-e88a-4f8e-a6ef-b6c0a91d16f7
Wong, Tien Yin
ee4cdad4-b55a-4a61-8825-0cede177438b
Coin, Lachlan J.M.
2fabf43c-c1fb-497b-8184-c347362b4872
Davila, Sonia
d848060e-8d80-4abd-8f7b-6d75674b1f85
16 September 2014
Bellos, Evangelos
719c8ef8-c89d-4231-810a-867dd59d31dc
Kumar, Vikrant
63752f94-626e-41cb-bf30-7b379e9a3413
Lin, Clarabelle
1e4bfea9-ebf3-4fc7-bfaa-d68037abc67e
Maggi, Jordi
776d472f-e45d-4dd9-9538-5eef320dcd4c
Phua, Zai Yang
c26cd932-0bc4-434d-8fa9-d0bfbfab75a9
Cheng, Ching Yu
a4b291fb-8738-4871-9ff1-46b884e1928e
Cheung, Chui Ming Gemmy
5ea55fb6-2341-44b2-ae58-be2393c9b097
Hibberd, Martin L.
146ffeef-e88a-4f8e-a6ef-b6c0a91d16f7
Wong, Tien Yin
ee4cdad4-b55a-4a61-8825-0cede177438b
Coin, Lachlan J.M.
2fabf43c-c1fb-497b-8184-c347362b4872
Davila, Sonia
d848060e-8d80-4abd-8f7b-6d75674b1f85
Bellos, Evangelos, Kumar, Vikrant, Lin, Clarabelle, Maggi, Jordi, Phua, Zai Yang, Cheng, Ching Yu, Cheung, Chui Ming Gemmy, Hibberd, Martin L., Wong, Tien Yin, Coin, Lachlan J.M. and Davila, Sonia
(2014)
cnvCapSeq: detecting copy number variation in long-range targeted resequencing data.
Nucleic Acids Research, 42 (20), [e158].
(doi:10.1093/nar/gku849).
Abstract
Targeted resequencing technologies have allowed for efficient and cost-effective detection of genomic variants in specific regions of interest. Although capture sequencing has been primarily used for investigating single nucleotide variants and indels, it has the potential to elucidate a broader spectrum of genetic variation, including copy number variants (CNVs). Various methods exist for detecting CNV in whole-genome and exome sequencing datasets. However, no algorithms have been specifically designed for contiguous target sequencing, despite its increasing importance in clinical and research applications. We have developed cnvCapSeq, a novel method for accurate and sensitive CNV discovery and genotyping in long-range targeted resequencing. cnvCapSeq was benchmarked using a simulated contiguous capture sequencing dataset comprising 21 genomic loci of various lengths. cnvCapSeq was shown to outperform the best existing exome CNV method by a wide margin both in terms of sensitivity (92.0 versus 48.3%) and specificity (99.8 versus 70.5%). We also applied cnvCapSeq to a real capture sequencing cohort comprising a contiguous 358 kb region that contains the Complement Factor H gene cluster. In this dataset, cnvCapSeq identified 41 samples with CNV, including two with duplications, with a genotyping accuracy of 99%, as ascertained by quantitative real-time PCR.
Text
gku849
- Version of Record
More information
Accepted/In Press date: 5 September 2014
Published date: 16 September 2014
Identifiers
Local EPrints ID: 500458
URI: http://eprints.soton.ac.uk/id/eprint/500458
ISSN: 0305-1048
PURE UUID: 73a2ccba-e38e-4385-ac96-973f4d84258d
Catalogue record
Date deposited: 30 Apr 2025 16:44
Last modified: 22 Aug 2025 02:45
Export record
Altmetrics
Contributors
Author:
Evangelos Bellos
Author:
Vikrant Kumar
Author:
Clarabelle Lin
Author:
Jordi Maggi
Author:
Zai Yang Phua
Author:
Ching Yu Cheng
Author:
Chui Ming Gemmy Cheung
Author:
Martin L. Hibberd
Author:
Tien Yin Wong
Author:
Lachlan J.M. Coin
Author:
Sonia Davila
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics