Does haplotype diversity predict power for association mapping of disease susceptibility?
Does haplotype diversity predict power for association mapping of disease susceptibility?
Many recent studies have established that haplotype diversity in a small region may not be greatly diminished when the number of markers is reduced to a smaller set of haplotype-tagging single-nucleotide polymorphisms (SNPs) that identify the most common haplotypes. These studies are motivated by the assumption that retention of haplotype diversity assures retention of power for mapping disease susceptibility by allelic association. Using two bodies of real data, three proposed measures of diversity, and regression-based methods for association mapping, we found no scenario for which this assumption was tenable. We compared the chi-square for composite likelihood and the maximum chi-square for single SNPs in diplotypes, excluding the marker designated as causal. All haplotype-tagging methods conserve haplotype diversity by selecting common SNPs. When the causal marker has a range of allele frequencies as in real data, chi-square decreases faster than under random selection as the haplotype-tagging set diminishes. Selecting SNPs by maximizing haplotype diversity is inefficient when their frequency is much different from the unknown frequency of the causal variant. Loss of power is minimized when the difference between minor allele frequencies of the causal SNP and a closely associated marker SNP is small, which is unlikely in ignorance of the frequency of the causal SNP unless dense markers are used. Therefore retention of haplotype diversity in simulations that do not mirror genomic allele frequencies has no relevance to power for association mapping. TagSNPs that are assigned to bins instead of haplotype blocks also lose power compared with random SNPs. This evidence favours a multi-stage design in which both models and density change adaptively.
design, genetics, research support, methods, genetic predisposition to disease, disease, haplotypes, non-u.s.gov't, single nucleotide, polymorphism, models, disease susceptibility, humans, genetic, gene frequency, human
157-164
Zhang, Weihua
1a759991-f2d4-4324-b8e2-c5b4c2b527d6
Collins, Andrew
7daa83eb-0b21-43b2-af1a-e38fb36e2a64
Morton, Newton E.
c668e2be-074a-4a0a-a2ca-e8f51830ebb7
July 2004
Zhang, Weihua
1a759991-f2d4-4324-b8e2-c5b4c2b527d6
Collins, Andrew
7daa83eb-0b21-43b2-af1a-e38fb36e2a64
Morton, Newton E.
c668e2be-074a-4a0a-a2ca-e8f51830ebb7
Zhang, Weihua, Collins, Andrew and Morton, Newton E.
(2004)
Does haplotype diversity predict power for association mapping of disease susceptibility?
Human Genetics, 115 (2), .
(doi:10.1007/s00439-004-1122-x).
Abstract
Many recent studies have established that haplotype diversity in a small region may not be greatly diminished when the number of markers is reduced to a smaller set of haplotype-tagging single-nucleotide polymorphisms (SNPs) that identify the most common haplotypes. These studies are motivated by the assumption that retention of haplotype diversity assures retention of power for mapping disease susceptibility by allelic association. Using two bodies of real data, three proposed measures of diversity, and regression-based methods for association mapping, we found no scenario for which this assumption was tenable. We compared the chi-square for composite likelihood and the maximum chi-square for single SNPs in diplotypes, excluding the marker designated as causal. All haplotype-tagging methods conserve haplotype diversity by selecting common SNPs. When the causal marker has a range of allele frequencies as in real data, chi-square decreases faster than under random selection as the haplotype-tagging set diminishes. Selecting SNPs by maximizing haplotype diversity is inefficient when their frequency is much different from the unknown frequency of the causal variant. Loss of power is minimized when the difference between minor allele frequencies of the causal SNP and a closely associated marker SNP is small, which is unlikely in ignorance of the frequency of the causal SNP unless dense markers are used. Therefore retention of haplotype diversity in simulations that do not mirror genomic allele frequencies has no relevance to power for association mapping. TagSNPs that are assigned to bins instead of haplotype blocks also lose power compared with random SNPs. This evidence favours a multi-stage design in which both models and density change adaptively.
This record has no associated files available for download.
More information
Published date: July 2004
Keywords:
design, genetics, research support, methods, genetic predisposition to disease, disease, haplotypes, non-u.s.gov't, single nucleotide, polymorphism, models, disease susceptibility, humans, genetic, gene frequency, human
Identifiers
Local EPrints ID: 25068
URI: http://eprints.soton.ac.uk/id/eprint/25068
ISSN: 0340-6717
PURE UUID: 01308319-f9e9-4027-99a3-dd258c4f0c96
Catalogue record
Date deposited: 05 Apr 2006
Last modified: 16 Mar 2024 02:42
Export record
Altmetrics
Contributors
Author:
Weihua Zhang
Author:
Newton E. Morton
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics