Carcinogenesis Advance Access originally published online on November 25, 2005
Carcinogenesis 2006 27(5):1030-1037; doi:10.1093/carcin/bgi284
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Concordance of multiple analytical approaches demonstrates a complex relationship between DNA repair gene SNPs, smoking and bladder cancer susceptibility
1 Department of Community and Family Medicine, Section of Biostatistics and Epidemiology, 2 Department of Pathology, 3 Department of Genetics, Computational Genetics Laboratory, Dartmouth Medical School, Lebanon, NH 03756, USA, 4 Department of Environmental Health and 5 Department of Genetics and Complex Diseases, Harvard School of Public Health, 665 Huntington Avenue, Boston, MA 02115, USA
* To whom correspondence should be addressed Email: Angeline.Andrew{at}dartmouth.edu
| Abstract |
|---|
|
|
|---|
Study results of single nucleotide polymorphisms (SNPs) and cancer susceptibility are often conflicting, possibly because of the analytic challenges of testing for multiple genetic and environmental risk factors using traditional analytic tools. We investigated the relationship between DNA repair gene SNPs, smoking, and bladder cancer susceptibility in 355 cases and 559 controls enrolled in a population-based study of bladder cancer in the US. Our multifaceted analytical approach included logistic regression, multifactor dimensionality reduction, and hierarchical interaction graphs for the analysis of genegene and geneenvironment interactions followed by linkage disequilibrium and haplotype analysis. Overall, we did not find an association between any single DNA repair gene SNP and bladder cancer risk. We did find a marginally significant elevated risk of the XPD codon 751 homozygote variant among never smokers [adjusted odds ratio (OR) 2.5, 95% confidence interval (CI) 1.06.2]. In addition, the XRCC1 194 variant allele was associated with a reduced bladder cancer risk among heavy smokers [adjusted OR 0.4, 95% CI 0.20.9)]. The best predictors of bladder cancer included the XPD codon 751 and 312 SNPs along with smoking. Interpretation of this multifactor model revealed that the relationship between the XPD SNPs and bladder cancer is mostly non-additive while the effect of smoking is mostly additive. Since the two XPD SNPs are in significant linkage disequilibrium (D' = 0.52, P = 0.0001), we estimated XPD haplotypes. Individuals with variant XPD haplotypes were more susceptible to bladder cancer [e.g. adjusted OR 2.5, 95% CI 1.73.6] and the effect was magnified when smoking was considered. These results support the hypothesis that common polymorphisms in DNA repair genes modify bladder cancer risk and emphasize the need for a multifaceted statistical approach to identify genegene and geneenvironment interactions.
Abbreviations: BER, base excision repair; CI, confidence interval; CVC, cross-validation consistency; DSB, double strand break; MDR, multifactor dimensionality reduction; NER, nucleotide excision repair; OR, odds ratio; SNP, single nucleotide polymorphism
| Introduction |
|---|
|
|
|---|
Cancer is a multifactorial disease that results from complex interactions between many genetic and environmental factors (1). This is particularly true for the sporadic forms of cancer that, in contrast to familial cancer syndromes, tend to be common in the population. As a result, it is generally believed that there will not be single genes or single environmental factors (i.e. silver bullets) that have large effects on disease susceptibility. Rather, each risk factor is likely to contribute to cancer susceptibility through a combination of nonadditive and additive interactions with other risk factors. This complex genetic architecture is consistent with other common diseases such as cardiovascular disease (2). In fact, it has been suggested that non-additive interactions are a ubiquitous component of the genetic architecture of many common human diseases (3). Given these complexities, a successful research strategy for identifying risk factors for common human cancers must consider combinations of genetic variations and environmental exposures. We describe here a large casecontrol study of bladder cancer susceptibility that specifically evaluates genegene and geneenvironment interactions using a multifaceted analytical approach that combines traditional statistical methods with novel computational algorithms.
In 2004, an estimated 60 240 people in the US were diagnosed with bladder cancer and 12 710 died of the disease (4). In the USA, bladder cancer incidence ranks fourth among men, and tenth among women. Occupational exposure to chemicals such as 2-naphthylamine and benzidine, or exposure to 4-aminobiphenyl, and aromatic amines through tobacco smoke, plays a significant role in initiation of bladder cancer. Bladder cancer risk is up to 4-fold higher among cigarette smokers compared with non-smokers (5). Casecontrol studies provide evidence of a familial predisposition to bladder cancer (68) indicating that some susceptibility factors may be heritable.
One such heritable factor is DNA repair polymorphisms that increase susceptibility to DNA damage resulting from bladder carcinogens [reviewed in (9)]. Results of our previous study indicated a 40% reduction in risk of bladder cancer among those with at least one XRCC1 399 variant allele compared with those with one or two wild-type alleles (10). However, to date, epidemiology studies of bladder cancer risk in relation to these polymorphisms are either conflicting (i.e. for XPD, XRCC1 and XRCC3 (1118), rare (i.e. for XPC PAT) (19), or non-existent (i.e. for APE1). Also some studies raise the possibility of genegene interaction between polymorphisms, i.e. between XRCC1 194 and XRCC3 241, XRCC1 399/XRCC1 194 and XPD 751/XPD 312 for bladder (18) and lung cancers (20,21). Available data are largely based on hospital-based studies, many of insufficient size to evaluate potential geneexposure interactions. Additionally, differential findings could be related to population admixture. Differences could also be due to the presence of genegene and geneenvironment interactions that are not well understood due to the analytic challenges of testing for multiple genetic and environmental risk factors using traditional analytical tools (22).
To expand on our prior XRCC1 results and clarify the role of polymorphisms in DNA repair genes in bladder cancer susceptibility, we examined multiple single nucleotide polymorphisms (SNPs) in the base excision repair (BER), nucleotide excision repair (NER) and double strand break (DSB) repair pathways in a population-based study of 355 bladder cancer cases and 559 controls from New Hampshire. We specifically evaluated the presence of genegene and geneenvironment effect modification using both traditional and novel analytic approaches.
| Materials and methods |
|---|
|
|
|---|
Study group
We identified all cases of bladder cancer diagnosed among New Hampshire residents, ages 2574 years, from July 1, 1994 to June 30, 1998 from the State Cancer Registry. Within 15 days of diagnosis, the state mandated rapid reporting system requires submission of an initial report of cancer, and a definitive report within 120 days. To be eligible for the study, subjects were required to have a listed telephone number and speak English. We sought physician consent before contacting eligible bladder cancer patients. We interviewed a total n = 459 bladder cancer cases, which was 85% of the cases confirmed to be eligible for the study. Non-participants included (n = 10) whose physician denied patient contact, (n = 63) were reported as deceased by a household member or physician, (n = 3) no answer after 40 attempts distributed over day, evenings and weekends, (n = 75) declined participation and (n = 8) were too ill to take part. A standardized histopathology review was conducted by the study pathologist, and from this review we excluded eleven subjects who were initially reported to the cancer registry as bladder cancer.
All controls were selected from population lists. Controls <65 years of age were selected using population lists obtained from the New Hampshire Department of Transportation. The file contains the names and addresses of those holding a valid driver's license for the state of New Hampshire. Controls 65 years of age and older were chosen from data files provided by the Centers for Medicare & Medicaid Services (CMS) of New Hampshire. The method of control selection used in our study has been successfully employed in other casecontrol studies conducted in the region [e.g. (23)]. For efficiency, we shared a control group with a study of non-melanoma skin cancer conducted covering an overlapping diagnostic period of July 1, 1993 to June 30, 1995 (23). We selected additional controls for bladder cancer cases diagnosed from July 1, 1995 to June 30, 1997 frequency matched to these cases on age (2534, 3544, 4554, 5564, 6569, 7074 years) and gender. Controls were randomly assigned a reference date from among the diagnosis dates of the case group to whom they were matched. We interviewed a total n = 665 controls (the total shared control group and additional controls), which was 70% of the controls confirmed to be eligible for the study. Of the potential participants, (n = 18) were reported as deceased by a member of the household, (n = 17) no answer after 40 attempts distributed over day, evenings and weekends, (n = 261) declined, (n = 29) were mentally incompetent or too ill to take part.
Personal interview
Informed consent was obtained from each participant and all procedures and study materials were approved by the Committee for the Protection of Human Subjects at Dartmouth College. Consenting participants underwent a detailed in-person interview, usually at their home. Questions covered sociodemographic information (including level of education), lifestyle factors such as use of tobacco (including frequency, duration and intensity of smoking), family history of cancer and medical history prior to the diagnosis date of the bladder cancer cases or reference date assigned to controls. Recruitment procedures for both the shared controls from the non-melanoma skin cancer and additional controls were identical and ongoing concomitantly with the case interviews. Casecontrol status and the main objectives of the study were not disclosed to the interviewers. To ensure consistent quality of the study interviewer, interviews were tape recorded with the consent of the participants and routinely monitored by the interviewer supervisor. To assess comparability of cases and controls, we asked subjects if they currently held a driver's license or a Medicare enrolment card.
Genotyping
DNA was isolated from peripheral circulating blood lymphocyte specimens harvested at the time of interview using Qiagen genomic DNA extraction kits (QIAGEN, Valencia, CA). We chose to examine DNA repair genes with polymorphisms that have previously been examined in relation to bladder cancer (XRCC1, XRCC3, XPD, XPC) as well as other pathway members that physically interact with these genes (APE1). Genotyping for non-synonymous SNPs XRCC3 C/T at position 241, APE1 T/G at position 148, XPD G/A at position 312 and A/C at position 751, XRCC1 C/T at position 194 was performed by Qiagen Genomics using their SNP mass-tagging system. For XRCC1 G/A at position 399 and XPC PAT /+, genotyping was performed by PCRRFLP as described previously (10). Of the 1113 participating cases and controls, genotyping was performed on DNA isolated from blood on 914 (82%). For quality control purposes, laboratory personnel were blinded to casecontrol status. These assays achieved >95% accuracy as assessed using and negative and positive quality controls (including every 10th sample as a masked duplicate). Data were missing on 103 individuals for XRCC1 194, 70 for XRCC1 399, 2 for XRCC3, 111 for XPD 312, 53 for XPD 751, 131 for XPC and 3 for APE1.
Statistical analysis
The goal of the statistical analysis was to assess the relationship between DNA repair gene SNPs, smoking and bladder cancer susceptibility. To assess the independent main effects of each SNP, we conducted logistic regression analyses for individuals with one or two variant alleles in comparison to those homozygous wild-type for each individual SNP. Assessment of genegene and geneenvironment interactions was carried out using both logistic regression and Multifactor Dimensionality Reduction (MDR). In addition, we employed a third method that uses information theory to build interaction graphs for confirming, visualizing and interpreting genegene and geneenvironment interactions identified using logistic regression and MDR. Finally, we carried out linkage disequilibrium and haplotype analyses to assess the allelic effects of predictive SNPs. Each method is described in detail below.
We computed the odds ratio (OR) for the joint effects of gene pairs using individuals who are homozygous wild-type at both loci as the referent group and evaluated interactions between bladder cancer risk factors [gender, smoking variables [(e.g. never, <35 pack-years,
35 pack-years)], and genotype by including interaction terms in a logistic regression model. The pack-year cut-point was chosen based on the median number of pack-years overall. Statistical significances of the interactions were assessed using likelihood ratio tests comparing the models with and without interaction terms.
The nonparametric MDR approach was selected to complement logistic regression for the analysis of genegene and geneenvironment interactions. We briefly describe MDR here. The details of MDR are described elsewhere (2427) and reviewed by (28). MDR is a data reduction (i.e. constructive induction) approach that seeks to identify combinations of multilocus genotypes and discrete environmental factors that are associated with either high risk or low risk of disease. Thus, MDR defines a single variable that incorporates information from several loci and/or environmental factors that can be divided into high risk and low risk combinations. This new variable can be evaluated for its ability to classify and predict outcome risk status using cross-validation and permutation testing. With n-fold cross-validation, the data are divided into n equal size pieces. An MDR model is fit using (n 1)/n of the data (i.e. the training set) and then evaluated for its generalizability on the remaining 1/n of the data (i.e. the testing set). The fitness or value of an MDR model is assessed by estimating accuracy in the training set and the testing set. Accuracy is a function of the percentage of true positives (TP), true negatives (TN), false positives (FP), and false negatives (FN) and is defined as (TP + TN)/(TP + TN + FP + FN). This process is repeated for all n pieces of the data and the n testing accuracies are averaged to provide an estimate of predictive ability or generalizability. We also estimate the degree to which the same best model is discovered across n divisions of the data. This is referred to as the cross-validation consistency or CVC (24,29). A CVC of n in n-fold cross-validation is optimal. Here, we selected the best MDR model as the one with the maximum testing accuracy. A testing accuracy of 0.5 is expected under the null hypothesis. Statistical significance is determined using permutation testing. Here, the casecontrol labels are randomized m times and the entire MDR model fitting procedure repeated on each randomized dataset to determine the expected distribution of testing accuracies under the null hypothesis. It is the combination of cross-validation and permutation testing that reduces the chances of making a type I error due to multiple testing (30,31). In this study, we used 10-fold cross-validation and 1000-fold permutation testing. MDR results were considered statistically significant at the 0.05 level. The MDR software is open-source and freely available from http://www.epistasis.org.
A third approach based on information theory was used to confirm, visualize and interpret the results obtained by logistic regression and MDR. Jakulin and Bratko (32) have provided a metric for determining the gain in information about a class variable (e.g. casecontrol status) from merging two variables together over that provided by the variables independently (32,33). This measure of information gain allows us to gauge the benefit of considering two (or more) attributes as one unit. While the concept of information gain is not new (34), its application to the study of variable interactions has been the focus of several recent studies (32,33,35). Consider two variables, A and B, and a class label C. Let H(X) be the Shannon entropy [see (36)] of X. The information gain (IG) of A, B and C can be written as (1) and defined in terms of Shannon entropy (2 and 3).
![]() | (1) |
![]() | (2) |
![]() | (3) |
These measures of entropy are particularly useful for building interaction graphs that facilitate the interpretation of the relationship between variables. Interaction graphs are comprised of a node for each variable with pairwise connections between them. The percentage of entropy removed (i.e. information gain) by each variable is visualized for each node. The percentage of entropy removed for each pairwise Cartesian product of variables is visualized for each connection. Thus, the independent main effects of each SNP, for example, can be quickly compared to the interaction effect. Additive and non-additive interactions can be quickly assessed and used to interpret MDR models that consist of distributions of cases and controls for each genotype combination. A positive entropy (plotted in green) indicates interaction while a negative entropy (plotted in red) indicates redundancy. Interaction entropy analysis was performed using the Orange software package (37). Since the MDR and interaction entropy analysis tools do not permit missing values, missing values were imputed 10 independent times using S-plus and analyses were performed using each of the 10 datasets. The results reported were consistent across all 10 datasets.
Interaction dendrograms are also a useful way to visualize interaction (32). Here, hierarchical clustering is used to build a dendrogram that places strongly interacting attributes, as determined by interaction entropy, close together at the leaves of the tree. Jakulin and Bratko (32) define the following dissimilarity measure, D(A, B), that is used by a hierarchical clustering algorithm to build a dendrogram. The value of 1000 is used as an upper bound to scale the dendrograms.
![]() |
For the two genes with more than one SNP: XPD and XRCC1, we assessed linkage disequilibrium in homozygotes using a chi-square test and inferred haplotypes using PHASE 2.0. This Bayesian method reconstructs the haplotype using Markov chain Monte Carlo techniques by statistically inferring the phase at linked loci from the genotype (38). PHASE is reported to show lower error rates than either the maximum likelihood [expectation maximization algorithm], or the parsimony method (Clark algorithm) (39). Analyses were stratified by age (<50,
50), sex and smoking status (ever, never), (never, former, current) or the median smoking intensity (never, <35 pack-years,
35 pack- years). We further analyzed the association between genotype and tumor invasiveness (non-invasive versus invasive tumors) using logistic regression, excluding in-situ tumors.
| Results |
|---|
|
|
|---|
The study population contained more men than women, and the age distribution was comparable among cases and controls among both sexes (Table I). The majority of the study population was Caucasian (Table I), representing the ethnic make-up of the New Hampshire population. The prevalence of smoking was higher among the cases, as was a first-degree family history of bladder cancer (Table I). The variant allele frequencies for the study population were BER: APE1 148 (0.475), XRCC1 194 (0.07); DSB: XRCC3 241 (0.385); NER: XPD 751 (0.37), XPD 312 (0.35), XPC PAT (0.42).
|
We began by evaluating the independent effects of each DNA repair SNP on bladder cancer susceptibility using logistic regression. We did not observe that the main effects of the BER polymorphisms at APE1 148 and XRCC1 194 were related to bladder cancer risk. Among heavy smokers, however, XRCC1 194 was associated with a significantly reduced risk of bladder cancer [XRCC1 194 heterozygote adjusted OR 0.4, 95% confidence interval (CI) 0.20.9]. APE1 148 also conferred a slightly reduced risk in the heavy smoking group. Overall the ORs for bladder cancer were not related to the DSB polymorphism XRCC3 241, nor was there evidence of a genesmoking interaction (Table II). In the NER pathway, the XPD 751, XPD 312 or XPC PAT variant genotypes were not associated with an increased bladder cancer risk overall; however, we did observe an elevated risk of the XPD 751 variant among never smokers that was marginally statistically significant [adjusted OR 2.5, 95% CI 1.06.2] with a significant genesmoking interaction (P = 0.04) (Table II). Among individuals who smoked
35 pack-years, homozygous variants for XPD 312 had a slightly higher bladder cancer risk [adjusted OR 1.8, 95% CI 0.93.9] compared to XPD 312 homozygous wild-types (Table II). XPC PAT, and XPD 751 variant alleles conferred a slightly reduced risk in heavy smokers, however this effect was not statistically significant (Table II).
|
DNA repair is a complex process involving the cooperation of multiple enzymes in pathways that respond to damage induced by endogenous or exogenous agents, such as tobacco. Therefore, we also evaluated the bladder cancer risk associated with genetic variation in more than one gene and smoking, as described. In the MDR analysis (Table III), pack-years of smoking was the strongest single-factor for predicting bladder cancer risk (average testing accuracy = 0.63, CVC = 10/10). The combination of XPD 751 and XPD 312 was the best two-factor model, with a testing accuracy of 0.65 and a CVC of 8.7/10 (P = 0.001). The three-factor model added pack-years of smoking to XPD 751 and XPD 312 for the most accurate (0.66) model that remained highly consistent in the cross validation (8.6/10) (Table III). All of the four-factor models included XPD 751, XPD 312, and pack-years of smoking. XPC PAT was the most common fourth factor across the 10 datasets, however the addition of this factor decreased the testing accuracy (0.65) and CVC (4.5/10).
|
After identifying the high risk combinations of factors using MDR, we applied interaction entropy algorithms to facilitate interpretation of the relationship between the variables. As shown in the hierarchical interaction graphs in Figure 1, we found small percentages of the entropy in casecontrol status explained by XPD 751 (0.1%), or XPD 312 (0.15%) considered independently, but a large percentage of entropy explained by the interaction between these two loci (5.74%) (Figure 1A). Pack-years of smoking had a large independent effect (3.78%), however we did not detect a substantial non-additive interaction between smoking and the XPD SNPs considering them individually (Figure 1A), or as a single genotype combination (Figure 1B).
|
Likewise, the interaction dendrogram (Figure 2) placed XPD 751 and XPD 312 on the same branch. Their position in the diagram indicates that this is the strongest interaction. Pack-years of smoking is located on a different branch than the XPD SNPs, supporting the evidence from the interaction entropy graphs that show that there is not a strong relationship between these factors (Figure 2).
|
We then fit logistic regression models for the independent and joint effects of the XPD polymorphisms in models adjusted for age, gender, and smoking. Compared with individuals who were wild-type at both loci, bladder cancer risk was elevated in individuals who were XPD variant at the 751 locus only [adjusted OR 3.6, 95% CI 2.26.3] or XPD variant at the 312 locus only [adjusted OR 5.2, 95% CI 3.09.0], but was not as high for variants at both loci (genegene interaction P < 0.0001). We re-applied interaction entropy algorithms using the XPD 751, XPD 312 genotype combination. The XPD SNPs explained 4.13% of the entropy in casecontrol status and did not indicate an interaction with pack-years of smoking (Figure 1B). A chi-square test indicated that these two XPD loci were in linkage disequilibrium (P < 0.0001, D = 0.12, D' = 0.52), (while XRCC1 399 and XRCC1 194 were not). Because of the linkage disequilibrium, we analyzed the XPD haplotypes estimated by PHASE in relation to bladder cancer risk using logistic regression with adjustment for age, gender and smoking (shown in Table IV). As in the joint SNP analysis, we found an increased risk for haplotypes with a variant allele at one loci [XPD 312 G/751 C, frequency 0.07, adjusted OR 1.7, 95% CI 1.22.4; XPD 312 A/751 A, frequency 0.05, adjusted OR 2.5, 95% CI 1.73.6]. Bladder cancer risk was consistently elevated for individuals with the low frequency haplotypes regardless of smoking status. Among the heavy smokers, bladder cancer risk was associated with a 4-fold bladder cancer risk among those with the XPD 312 A/751 A haplotype [adjusted OR 4.4, 95% CI 2.28.8].
|
Risk estimates were not altered significantly when analyses were stratified by level of tumor invasiveness (e.g. invasive versus non-invasive), by type of cancer (e.g. transitional cell versus other), by gender, or restricted to white ethnicity.
| Discussion |
|---|
|
|
|---|
The current study demonstrates a comprehensive analytical strategy for investigating risk factors for diseases with a complex genetic architecture. We investigated the hypothesis that prevalent SNPs in DNA repair genes modify genetic susceptibility to bladder cancer. We used a multifaceted analytical approach that combines traditional statistical methods with novel computational algorithms to evaluate genegene and geneenvironment interactions. The relationship between DNA repair polymorphisms and cancer risk may be particularly complex because the effects of genetic variation in the repair process may depend on the presence of a DNA lesion (e.g. geneenvironment interaction) or the presence or absence of polymorphisms in other genes in the same or a different pathway. Thus, we suspect that some of the conflicts between the results of previous studies might be due to uncharacterized genegene or geneenvironment interactions. We addressed this issue by evaluating multiple SNPs in the NER, BER and DSB repair pathways and observed variant allele frequencies that were consistent with those reported in the literature (9,11,40,41). We further assessed the association between genotype, genotype combinations and haplotype with smoking status and bladder cancer risk using multiple traditional and novel statistical approaches.
As more and more studies evaluate risk associated with multiple genes and environmental factors, it has become clear that traditional logistic regression analysis approaches are not adequate for modeling complex multi-factor interactions (42). For this reason, we utilized the recently developed MDR and interaction entropy strategies to assess and interpret potential interactions. This approach improves statistical power to efficiently identify potential genegene and geneenvironment interactions. The results of these novel algorithms were consistent with our logistic regression analysis for the two-way interaction models. We attempted to test three way interactions to replicate our findings from the MDR analysis in logistic regression; however, the model failed to converge due to the small number of individuals in some cells. Thus, our experience highlights the need for alternative, more powerful methods. Out of all of the possible two-factor combinations tested, MDR analysis selected XPD 751 and XPD 312 as the best two predictors of bladder cancer risk. The three-factor model including XPD 751, XPD 312, and pack-years of smoking was the strongest model overall since it had the highest level of testing accuracy and showed good CVC (Table III). Adding other factors (e.g. the four-factor model) lowered the CVC, reduced the testing accuracy and raised the P-value. The MDR three-factor model indicated that XPD 751, XPD 312, and pack-years of smoking are a high risk combination of factors, but did not specify whether or not there is a synergistic relationship. The interaction entropy and interaction dendogram analyses (Figures 1 and 2) helped us interpret the nature of the interactions in these multifactor models, and revealed that the relationship between the XPD SNPs and bladder cancer is mostly non-additive while the effect of smoking is mostly additive.
XPD is an enzyme in the NER pathway that removes certain DNA crosslinks, UV photolesions, and bulky chemical adducts (43). Non-synonymous SNPs in the XPD gene result in the substitution of glutamine in place of lysine at position 751 and asparagine for aspartic acid at position 312. As reported previously (44), we found that the two SNPs were in linkage disequilibrium, with a higher frequency of XPD 312 Asp, XPD 751 Lys and as such, also examined the risk associated with XPD haplotypes using the PHASE estimation software. Prior, smaller hospital-based studies produced inconsistent results and to date have not examined the bladder cancer risk associated with XPD haplotype (13,15). As in our study, interactions have previously been observed for XPD 312 and 751 in relation to lung cancer risk, and several studies found that the risk of lung cancer associated with the variant allele was higher among non-smokers than among smokers (9,44). In lymphoblastoid cell lines, double variants had an enhanced apoptotic response to UV-induced damage, possibly explaining our findings of an increased risk among those with a variant allele in either XPD 312 or XPD 751 but not for those variant at both loci (20,45). We also observed elevated bladder cancer risk for individuals with the low frequency haplotypes and the bladder cancer ORs for XPD haplotypes also did not vary dramatically by smoking status (Table IV). Thus, future, larger studies of XPD haplotype using more SNPs may be informative.
We also looked at SNPs in other repair pathways, including BER and DSB repair. XRCC1 mediates interactions with its BER partners, including APE1, and thereby modulates enzymatic activity throughout the pathway (46). A common amino acid substitution (Arg to Gln) occurs in the BRCT1 domain at codon 399, a region involved in binding polyADPribose polymerase (PARP) and APE1 (46). The relatively uncommon non-synonymous SNP in XRCC1 at position 194 was unrelated to bladder cancer risk overall in our study and others. We observed a lower risk among heavy smokers, similar to what was reported for breast cancer risk (17,47), but with wide confidence intervals. APE1 148, XRCC1 194, XPC PAT and XPD 751 variants also conferred a slightly reduced risk in heavy smokers; however, with the exception of the XRCC1 194 heterozygotes, these effects were not statistically significant.
DSBs may result from replication errors or the action of exogenous agents (9). XRCC3 is required for stabilization of the RAD51 complex in repair of DSBs and cross-links, and for maintaining chromosome stability during cell division (45,48). Results of independent analyses of the DSB pathway XRCC3 241 polymorphism and bladder cancer risk have been inconsistent (1315,18) and we did not observe an increased risk of bladder cancer associated with this polymorphism in our study population, although with limited statistical power. It is possible that XRCC3 has another biologic function that is modified by the codon 241 amino acid substitution or 241 may be in linkage disequilibrium with another causal polymorphism. Larger studies of multiple SNPs and haplotypes are needed.
Compensatory activity between different DNA repair proteins probably exists (9). Indeed, we observed only a few individuals that were completely wild type for all four of the common DNA repair genes with polymorphisms studied (6 controls and 3 cases). This emphasizes the importance of considering the implications of genetic variation in multiple genes simultaneously. Investigating multiple SNPs also necessitates careful consideration of multiple comparisons issues, a benefit of using the MDR and interaction entropy approaches. Our study involved a number of comparisons, and associations arising out of chance must be considered as a possible explanation for statistically significant results. Further, some of the differences in observed associations across studies may be due to population stratification. The population of New Hampshire is relatively homogeneous and primarily Caucasian, thus, the likelihood of extensive population stratification in our study is generally lower than in more ethnically diverse locations. Restriction to self-reported Caucasian race did not affect our results (data not shown). Future studies employing similar analytic strategies may help to elucidate the impact of population stratification.
In summary, our data indicate that there are potentially important effects of common variations in individual DNA repair genes, particularly in XPD on bladder cancer risk and that this risk may be modified by exposure history. We demonstrate the application of a multifaceted analytical approach that produces concordant results and emphasizes the utility of novel bioinformatic analysis tools that make the investigation of genegene and geneenvironment interactions feasible in a study of reasonable size. Our findings highlight the importance of considering the genetic susceptibility of individuals to complex diseases such as cancer using data on multiple polymorphisms along with a spectrum of potential carcinogenic exposures.
| Acknowledgments |
|---|
We would like to thank all members of the New Hampshire Health Study team for making this project possible. This publication was funded in part by National Institute of Health grant numbers CA099500, CA82354, CA57494, from the National Cancer Institute (NCI) and grants ES00002, 5 P42 ES05947, RR018787, and ES07373 from the National Institute of Environmental Health Sciences (NIEHS), NIH. Additional support for Dr. Andrew was kindly provided through a fellowship from the American Society of Preventive Oncology and the Cancer Research Foundation of America. Its contents are solely the responsibility of the authors and do not necessarily represent the official views of the NIEHS, NIH, NCI, ASPO, or CRFA.
Conflict of Interest Statement: None declared.
| References |
|---|
|
|
|---|
- Pharoah,P.D., Dunning,A.M., Ponder,B.A. and Easton,D.F. (2004) Association studies for finding cancer-susceptibility genetic variants. Nat. Rev. Cancer, 4, 850860.[CrossRef][Web of Science][Medline]
- Sing,C.F., Stengard,J.H. and Kardia,S.L. (2003) Genes, environment, and cardiovascular disease. Arterioscler. Thromb. Vasc. Biol., 23, 11901196.
[Abstract/Free Full Text] - Moore,J.H. (2003) The ubiquitous nature of epistasis in determining susceptibility to common human diseases. Hum. Hered., 56, 7382.[CrossRef][Web of Science][Medline]
- Jemal,A., Tiwari,R.C., Murray,T., Ghafoor,A., Samuels,A., Ward,E., Feuer,E.J. and Thun,M.J. (2004) Cancer statistics, 2004. CA Cancer J. Clin., 54, 829.
[Abstract/Free Full Text] - Silverman,D.T., Morrison,A.S. and Devesa,S.S. (1996) Bladder cancer. In Schottenfeld D. and Fraumeni,J.F. (eds), Cancer Epidemiology and Prevention. Oxford University Press, New York, pp. 11561179.
- Cartwright,R.A. (1979) Genetic association with bladder cancer. Br. Med. J., 2, 798.
[Free Full Text] - Sullivan,J.W. (1982) Epidemiologic survey of bladder cancer in greater New Orleans. J. Urol., 128, 281283.[Web of Science][Medline]
- Kantor,A.F., Hartge,P., Hoover,R.N. and Fraumeni,J.F.Jr. (1985) Familial and environmental interactions in bladder cancer risk. Int. J. Cancer, 35, 703706.[CrossRef][Web of Science][Medline]
- Goode,E.L., Ulrich,C.M. and Potter,J.D. (2002) Polymorphisms in DNA repair genes and associations with cancer risk. Cancer Epidemiol. Biomarkers Prev., 11, 15131530.
[Abstract/Free Full Text] - Kelsey,K.T., Park,S., Nelson,H.H. and Karagas,M.R. (2004) A population-based casecontrol study of the XRCC1 Arg399Gln polymorphism and susceptibility to bladder cancer. Cancer Epidemiol. Biomarkers Prev., 13, 13371341.
[Abstract/Free Full Text] - Stern,M.C., Johnson,L.R., Bell,D.A. and Taylor,J.A. (2002) XPD codon 751 polymorphism, metabolism genes, smoking, and bladder cancer risk. Cancer Epidemiol. Biomarkers Prev., 11, 10041011.
[Abstract/Free Full Text] - Schabath,M.B., Delclos,G.L., Grossman,H.B., Wang,Y., Lerner,S.P., Chamberlain,R.M., Spitz,M.R. and Wu,X. (2005) Polymorphisms in XPD exons 10 and 23 and bladder cancer risk. Cancer Epidemiol. Biomarkers Prev., 14, 878884.
[Abstract/Free Full Text] - Matullo,G., Guarrera,S., Carturan,S., Peluso,M., Malaveille,C., Davico,L., Piazza,A. and Vineis,P. (2001) DNA repair gene polymorphisms, bulky DNA adducts in white blood cells and bladder cancer in a casecontrol study. Int. J. Cancer, 92, 562567.[CrossRef][Web of Science][Medline]
- Sanyal,S., Festa,F., Sakano,S., Zhang,Z., Steineck,G., Norming,U., Wijkstrom,H., Larsson,P., Kumar,R. and Hemminki,K. (2004) Polymorphisms in DNA repair and metabolic genes in bladder cancer. Carcinogenesis, 25, 729734.
[Abstract/Free Full Text] - Shen,M., Hung,R.J., Brennan,P., Malaveille,C., Donato,F., Placidi,D., Carta,A., Hautefeuille,A., Boffetta,P. and Porru,S. (2003) Polymorphisms of the DNA repair genes XRCC1, XRCC3, XPD, interaction with environmental exposures, and bladder cancer risk in a casecontrol study in northern Italy. Cancer Epidemiol. Biomarkers Prev., 12, 12341240.
[Abstract/Free Full Text] - Matullo,G., Palli,D., Peluso,M. et al. (2001) XRCC1, XRCC3, XPD gene polymorphisms, smoking and 32PDNA adducts in a sample of healthy subjects. Carcinogenesis, 22, 14371445.
[Abstract/Free Full Text] - Stern,M.C., Umbach,D.M., van Gils,C.H., Lunn,R.M. and Taylor,J.A. (2001) DNA repair gene XRCC1 polymorphisms, smoking, and bladder cancer risk. Cancer Epidemiol. Biomarkers Prev., 10, 125131.
[Abstract/Free Full Text] - Stern,M.C., Umbach,D.M., Lunn,R.M. and Taylor,J.A. (2002) DNA repair gene XRCC3 codon 241 polymorphism, its interaction with smoking and XRCC1 polymorphisms, and bladder cancer risk. Cancer Epidemiol. Biomarkers Prev., 11, 939943.
[Abstract/Free Full Text] - Sak,S.C., Barrett,J.H., Paul,A.B., Bishop,D.T. and Kiltie,A.E. (2005) The polyAT, intronic IVS11-6 and Lys939Gln XPC polymorphisms are not associated with transitional cell carcinoma of the bladder. Br. J. Cancer, 92, 22622265.[CrossRef][Web of Science][Medline]
- Zhou,W., Liu,G., Miller,D.P., Thurston,S.W., Xu,L.L., Wain,J.C., Lynch,T.J., Su,L. and Christiani,D.C. (2003) Polymorphisms in the DNA repair genes XRCC1 and ERCC2, smoking, and lung cancer risk. Cancer Epidemiol. Biomarkers Prev., 12, 359365.
[Abstract/Free Full Text] - Chen,S., Tang,D., Xue,K., Xu,L., Ma,G., Hsu,Y. and Cho,S.S. (2002) DNA repair gene XRCC1 and XPD polymorphisms and risk of lung cancer in a Chinese population. Carcinogenesis, 23, 13211325.
[Abstract/Free Full Text] - Thornton-Wells,T.A., Moore,J.H. and Haines,J.L. (2004) Genetics, statistics and human disease: analytical retooling for complexity. Trends Genet., 20, 640647.[CrossRef][Web of Science][Medline]
- Karagas,M.R., Tosteson,T.D., Blum,J., Morris,J.S., Baron,J.A. and Klaue,B. (1998) Design of an epidemiologic study of drinking water arsenic exposure and skin and bladder cancer risk in a U.S. population. 106 (suppl. 4), 10471050.
- Ritchie,M.D., Hahn,L.W., Roodi,N., Bailey,L.R., Dupont,W.D., Parl,F.F. and Moore,J.H. (2001) Multifactor-dimensionality reduction reveals high-order interactions among estrogen-metabolism genes in sporadic breast cancer. Am. J. Hum. Genet., 69, 138147.[CrossRef][Web of Science][Medline]
- Ritchie,M.D., Hahn,L.W. and Moore,J.H. (2003) Power of multifactor dimensionality reduction for detecting genegene interactions in the presence of genotyping error, missing data, phenocopy, and genetic heterogeneity. Genet. Epidemiol., 24, 150157.[CrossRef][Web of Science][Medline]
- Hahn,L.W., Ritchie,M.D. and Moore,J.H. (2003) Multifactor dimensionality reduction software for detecting genegene and geneenvironment interactions. Bioinformatics, 19, 376382.
[Abstract/Free Full Text] - Hahn,L.W. and Moore,J.H. (2004) Ideal discrimination of discrete clinical endpoints using multilocus genotypes. In Silico Biol., 4, 183194.[Medline]
- Moore,J.H. (2004) Computational analysis of genegene interactions using multifactor dimensionality reduction. Expert. Rev. Mol. Diagn., 4, 795803.[CrossRef][Web of Science][Medline]
- Moore,J.H. (2003) Cross-validation consistency for the assessment of genetic programming results in microarray studies. LNCS, 2611, 99106.
- Coffey,C.S., Hebert,P.R., Ritchie,M.D., Krumholz,H.M., Gaziano,J.M., Ridker,P.M., Brown,N.J., Vaughan,D.E. and Moore,J.H. (2004) An application of conditional logistic regression and multifactor dimensionality reduction for detecting genegene interactions on risk of myocardial infarction: the importance of model validation. BMC Bioinformatics, 5, 4959.[CrossRef][Medline]
- Coffey,C.S., Hebert,P.R., Krumholz,H.M., Morgan,T.M., Williams,S.M. and Moore,J.H. (2004) Reporting of model validation procedures in human studies of genetic interactions. Nutrition, 20, 6973.[CrossRef][Web of Science][Medline]
- Jakulin,A. and Bratko,I. (2003) Analyzing attribute dependencies. In Lavrac,N., Gamberger,D., Blockeel,H. and Todorovski,L. (eds) PKDD 2003, Vol. 2838, Springer-Verlag, Cavtat, Croatia, pp. 229240.
- Jakulin,A., Bratko,I., Smrke,D., Demsar,J. and Zupan,B. (2003) Attribute interactions in medical data analysis. In Proceedings of the 9th Conference on Artificial Intelligence in Medicine in Europe (AIME 2003), Protaras, Cyprus, October 1822. Lecture Notes in Artificial Intelligence, Vol. 2780, Springer, pp. 229238.
- McGill,W.J. (1954) Multivariate information transmission. Psychometrika, 19, 97116.[CrossRef][Web of Science]
- Jakulin,A. and Bratko,I. (2004) Testing the significance of attribute interactions. In Proceedings of the 21st International Conference on Machine Learning, Banff, Canada, July 48, 2004. Omnipress, Madison, WI.
- Pierce,J.R. (1980) An Introduction to Information TheorySymbols, Signals and Noise. Dover Publications, New York.
- Demsar,J. and Zupan,B. (2004) Orange: From Experimental Machine Learning to Interactive Data Mining. White Paper, Faculty of Computer and Information Science, University of Ljublijana, Ljublijana, Slovenia, July 48, 2004. July 48, 2004. Omnipress, Madison, WI.
- Wilke,R.A., Reif,D.M. and Moore,J.H. (2005) Combinational pharmacogenetics. Nat. Rev. Drug Discov., 4, 911918[CrossRef][Web of Science][Medline]
- Stephens,M., Smith,N.J. and Donnelly,P. (2001) A new statistical method for haplotype reconstruction from population data. Am. J. Hum. Genet., 68, 978989.[CrossRef][Web of Science][Medline]
- Stephens,M. and Donnelly,P. (2003) A comparison of bayesian methods for haplotype reconstruction from population genotype data. Am. J. Hum. Genet., 73, 11621169.[CrossRef][Web of Science][Medline]
- Qiao,Y., Spitz,M.R., Guo,Z., Hadeyati,M., Grossman,L., Kraemer,K.H. and Wei,Q. (2002) Rapid assessment of repair of ultraviolet DNA damage with a modified host-cell reactivation assay using a luciferase reporter gene and correlation with polymorphisms of DNA repair genes in normal human lymphocytes. Mutat. Res., 509, 165174.[Web of Science][Medline]
- Hadi,M.Z., Coleman,M.A., Fidelis,K., Mohrenweiser,H.W. and Wilson,D.M.3rd (2000) Functional characterization of Ape1 variants identified in the human population. Nucleic Acids Res., 28, 38713879.
[Abstract/Free Full Text] - Moore,J.H. and Williams,S.M. (2002) New strategies for identifying genegene interactions in hypertension. Ann. Med., 34, 8895.[CrossRef][Web of Science][Medline]
- Cleaver,J.E. (2000) Common pathways for ultraviolet skin carcinogenesis in the repair and replication defective groups of xeroderma pigmentosum. J. Dermatol. Sci., 23, 111.[CrossRef][Web of Science][Medline]
- Butkiewicz,D., Rusin,M., Enewold,L., Shields,P.G., Chorazy,M. and Harris,C.C. (2001) Genetic polymorphisms in DNA repair genes and risk of lung cancer. Carcinogenesis, 22, 593597.
[Abstract/Free Full Text] - Ronen,A. and Glickman,B.W. (2001) Human DNA repair genes. Environ. Mol. Mutagen., 37, 241283.[CrossRef][Web of Science][Medline]
- Marsin,S., Vidal,A.E., Sossou,M., Murcia,J.M., Le Page,F., Boiteux,S., De Murcia,G. and Radicella,J.P. (2003) Role of XRCC1 in the coordination and stimulation of oxidative DNA damage repair initiated by the DNA glycosylase hOGG1. J. Biol. Chem., 278, 4406844074.
[Abstract/Free Full Text] - Han,J., Hankinson,S.E., De Vivo,I., Spiegelman,D., Tamimi,R.M., Mohrenweiser,H.W., Colditz,G.A. and Hunter,D.J. (2003) A prospective study of XRCC1 haplotypes and their interaction with plasma carotenoids on breast cancer risk. Cancer Res., 63, 85368541.
[Abstract/Free Full Text] - Bishop,D.K., Ear,U., Bhattacharyya,A., Calderone,C., Beckett,M., Weichselbaum,R.R. and Shinohara,A. (1998) Xrcc3 is required for assembly of Rad51 complexes in vivo. J. Biol. Chem., 273, 2148221488.
[Abstract/Free Full Text]
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
M. C. Stern, J. Lin, J. D. Figueroa, K. T. Kelsey, A. E. Kiltie, J.-M. Yuan, G. Matullo, T. Fletcher, S. Benhamou, J. A. Taylor, et al. Polymorphisms in DNA Repair Genes, Smoking, and Bladder Cancer Risk: Findings from the International Consortium of Bladder Cancer Cancer Res., September 1, 2009; 69(17): 6857 - 6864. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Michiels, A. Laplanche, T. Boulet, P. Dessen, B. Guillonneau, A. Mejean, F. Desgrandchamps, M. Lathrop, A. Sarasin, and S. Benhamou Genetic polymorphisms in 85 DNA repair genes and bladder cancer risk Carcinogenesis, May 1, 2009; 30(5): 763 - 768. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Lao, W. Gu, and Q. Huang A meta-analysis on XRCC1 R399Q and R194W polymorphisms, smoking and bladder cancer risk Mutagenesis, November 1, 2008; 23(6): 523 - 532. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. L. Milne, R. Fagerholm, H. Nevanlinna, and J. Benitez The importance of replication in gene-gene interaction studies: multifactor dimensionality reduction applied to a two-stage breast cancer case-control study Carcinogenesis, June 1, 2008; 29(6): 1215 - 1218. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Wang, D. Chang, F.-l. Hu, H. Sui, B. Han, D.-d. li, and Y.-s. Zhao DNA Repair Gene XPD Polymorphisms and Cancer Risk: A Meta-analysis Based on 56 Case-Control Studies Cancer Epidemiol. Biomarkers Prev., March 1, 2008; 17(3): 507 - 517. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Chen, A. M. Kamat, M. Huang, H.B. Grossman, C. P. Dinney, S. P. Lerner, X. Wu, and J. Gu High-order interactions among genetic polymorphisms in nucleotide excision repair pathway genes and smoking in modulating bladder cancer risk Carcinogenesis, October 1, 2007; 28(10): 2160 - 2165. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. D. Figueroa, N. Malats, N. Rothman, F. X. Real, D. Silverman, M. Kogevinas, S. Chanock, M. Yeager, R. Welch, M. Dosemeci, et al. Evaluation of genetic variation in the double-strand break repair pathway and bladder cancer risk Carcinogenesis, August 1, 2007; 28(8): 1788 - 1793. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Rothman, M. Garcia-Closas, and D. W Hein Commentary: Reflections on G. M. Lower and colleagues' 1979 study associating slow acetylator phenotype with urinary bladder cancer: meta-analysis, historical refinements of the hypothesis, and lessons learned Int. J. Epidemiol., February 1, 2007; 36(1): 23 - 28. [Full Text] [PDF] |
||||
![]() |
M. Huang, C. P. Dinney, X. Lin, J. Lin, H. B. Grossman, and X. Wu High-Order Interactions among Genetic Variants in DNA Base Excision Repair Pathway Genes and Smoking in Bladder Cancer Susceptibility Cancer Epidemiol. Biomarkers Prev., January 1, 2007; 16(1): 84 - 91. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||










