Translate this page into:
Association of CDKAL1, CDKN2A/B & HHEX gene polymorphisms with type 2 diabetes mellitus in the population of Hyderabad, India
Reprint requests: Dr B. Mohan Reddy, Molecular Anthropology Group, Biological Anthropology Unit, Indian Statistical Institute, Street # 8, Habsiguda, Hyderabad 500 007, Telangana, India e-mail: bmrisi@gmail.com; bmr@isical.ac.in
-
Received: ,
This is an open access article distributed under the terms of the Creative Commons Attribution-NonCommercial-ShareAlike 3.0 License, which allows others to remix, tweak, and build upon the work non-commercially, as long as the author is credited and the new creations are licensed under the identical terms.
This article was originally published by Medknow Publications & Media Pvt Ltd and was migrated to Scientific Scholar after the change of Publisher.
Abstract
Background & objectives:
The genome-wide association studies (GWAS) have shown an association of type 2 diabetes mellitus (T2DM) with several novel genes. We report here the findings on the pattern of genetic association of three genes (CDKAL1, CDKN2A/B and HHEX) with T2DM in the population of Hyderabad, south India.
Methods:
A sample of 1379 individuals (758 T2DM cases and 621 controls) from Hyderabad, India, were genotyped for five single nucleotide polymorphisms (SNPs) of CDKAL1 (rs7754840, rs7756992) CDKN2A/B (rs10811661) and HHEX (rs1111875, rs7923837) genes on Sequenom Mass Array platform.
Results:
The risk allele frequencies of the CDKAL1 and CDKN2A/B SNPs were relatively higher in cases than in the controls and the logistic regression analysis yielded significant odds ratios suggesting that the variant alleles conferred risk for developing T2DM in this population. The HHEX gene did not show either allelic or genotypic association with T2DM. The multivariate logistic regression analysis with reference to both alleles and genotypes of CDKAL1 SNPs showed significant association, suggesting an important role for this gene in the T2DM pathophysiology.
Interpretation & conclusions:
A significant association was seen of all the three SNPs of CDKAL1 and CDKN2A/B genes with T2DM but none of the two SNPs of HHEX. Further studies are required to cross-validate our findings in a relatively larger sample. It is also necessary to explore other SNPs of HHEX gene to unequivocally establish the pattern of association of this gene with T2DM in this population.
Keywords
Allele frequency
genetic association
HHEX gene
single nucleotide polymorphism (SNP)
type 2 diabetes mellitus (T2DM)
Type 2 diabetes mellitus (T2DM) is a complex disease with both genetic and environmental factors leading to various complications such as coronary heart disease, diabetic retinopathy, neuropathy and nephropathy. It has emerged as an epidemic affecting millions of people throughout the world. With rapid increase in urbanization coupled with sedentary occupations and lack of physical activity, there is an escalating prevalence of T2DM in the developing countries like India1. The typical characteristic feature of ‘Asian Indian Phenotype’1, triggered by the environmental factors, makes Indians more predisposed to develop T2DM. On the other hand, India with its unique population structure and a number of endogamous caste and tribal populations, presents enormous ethnic, linguistic and genetic heterogeneity of its population in different geographic regions. Despite this heterogeneity and high prevalence of T2DM in India and given the large number of genetic variants that have been hitherto discovered through the candidate gene approach and GWAS (genome-wide association study), only a few validation studies have been carried out2-6. Many more populations from diverse regions and ethnic backgrounds need to be studied to depict representative susceptible/protective genetic profile associated with T2DM that might be different for this region when compared to the Caucasian/Western populations.
In India, Andhra Pradesh (AP) has been reported to have a relatively higher prevalence of T2DM than in other States in south India, Hyderabad with a prevalence of 16.6 per cent1. Inspite of this, except for a couple of our studies789 on the association of TCF7L2, IGF2BP2, SLC30A8, IRS-1, CAPN10 and PPARG gene polymorphisms with T2DM in the population of Hyderabad, there was no other genetic study pertaining to this problem. The GWAS studies have revealed association of a number of novel genes with T2DM10111213141516. The single nucleotide polymorphisms (SNPs) were subsequently replicated in different populations with strong association with T2DM and the meta-analysis conducted for each of these genes also confirmed the risk for developing T2DM17181920. Studies also showed a significant association of CDKAL1 [cyclin-dependent kinase 5 (CDK5) regulatory subunit associated protein-1-like1] with small decreases in insulin response to a glucose load1214. The CDKN2A and CDKN2B (cyclin dependent kinase 4 inhibitor 2A/B) are known as tumour suppressor genes and highly expressed in the pancreatic islets and play an important role in the beta cell function and regeneration. Graurp et al21 showed that these genes confer impaired glucose homeostasis and pancreatic β-cell dysfunction. On the other hand, HHEX (haemopoetically expressed homeobox) encodes a transcription factor which is involved in Wnt signaling pathway, a fundamental pathway required for cell growth and development10. HHEX regulates cell proliferation and tissue specification underlying vascular and hepatic differentiation, while the gene IDE (insulin degrading enzyme) is involved in the function of pancreas. The SNPs in the HHEX-IDE locus were found to be associated with decreased pancreatic beta cell function22. We report here the results of analysis of five SNPs from the three genes: rs7754840 and rs7756992 (CDKAL1), rs10811661 (CDKN2A/B) and rs1111875 and rs7923837 (HHEX).
Material & Methods
A total of 758 T2DM cases (443 males and 315 females) and 621 controls (395 males and 226 females) were included during the year 2010-2011. The T2DM patients were recruited from J.P Endocrine Center, Hyderabad, based on their medical record and adhering to American Diabetes Association (ADA, 2010)23 criteria that specifies fasting plasma glucose (FPG) ≥ 126 mg/dl or 2 h plasma glucose (PPG) ≥ 200 mg/dl or random plasma glucose (random blood sugar) (RBG) ≥ 200 mg/dl to be characterized as diabetic. The age of the patients ranged between 31-80 yr. We conducted free diabetes camps for the employees of different organizations in Hyderabad and individuals with ≥ 40 yr of age were tested for random blood sugar (where fasting and 2 h plasma glucose were not feasible) and those having blood sugar <140 mg/dl and with no positive family background for T2DM were recruited as controls. The background information and the anthropometric measurements collected on both cases and controls have been described elsewhere789. Given that the patients of the clinics as well as the employees of different organizations from where the controls were drawn constituted similar ethnic and linguistic backgrounds, broad matching for ethnicity of the cases and controls was ensured. Blood sample (5 ml) was drawn from all patients and controls after obtaining written informed consent. The study protocol was approved by the ethics committee of Indian Statistical Institute, Kolkata.
Genomic DNA was extracted using the phenol-chloroform method24 and the isolated DNA was subsequently quantified using NanoDrop™ 2000 (Thermo Scientific, Wilmington, Delaware, USA). The SNPs of the respective genes CDKAL1 (rs7754840, rs7756992), CDKN2A/B (rs10811661) and HHEX (rs1111875, rs7923837) were genotyped as part of a select panel of 15 SNPs of 9 prominent T2DM genes on Sequenom Massarray platform (Sequenom. Inc, San Diego, CA, USA)at The Centre for Genomic Application (TCGA), Delhi. Due to resource constraints, all GWAS identified SNPs could not be genotyped, and given the limited choice SNPs that were most significantly associated and more often replicated were genotyped. The assays were performed according to the manufacture's specifications25 and quality control (QC) procedure as described elsewhere7.
Statistical analysis: Most of the statistical analyses were performed with the help of SPSS statistical software (version 18.0, IBM SPSS, Chicago, IL, USA). Data for continuous variables were expressed as mean±SD, median and range. Student's t test was used to determine the significance of difference in the means of the continuous variables between cases and controls. Chi-square test was used to assess the significance of heterogeneity of allele and the genotype frequencies between cases and controls. Logistic regression analysis was done under the assumption of general model, i.e. By taking the most frequent genotype as reference and body mass index (BMI) and waist-hip ratio (WHR) as covariates. The risk was estimated based on the significance of odds ratio (OR) with 95% C.I. Obtained. Multivariate logistic regression was also carried out using R PROGRAM (version 2.15.2; R Foundation, http://www.r-project.org/). Post-hoc power of the study was estimated using G*Power (version 3.1) Universität Kiel, Keil Germany) and Quanto (version 1.2.4, Kiel University of South California, Los Angeles, CA, USA) softwares. The Hardy-Weinberg equilibrium was estimated by the χ2 test using Pypop (version 0.7.0, Thomson Laboratories, Department of Integrative Biology, University of California, Berkeley, CA, USA) and Haploview software (version 4.1, Broad Institute of MIT and Harvard, Cambridge, USA) was used to estimate linkage disequilibrium (LD).
Results
The details on the clinical profile of the subjects have been published elsewhere78. The patients showed significantly higher values for BMI, waist-hip ratio and random blood glucose (P<0.001) compared to controls.
Allele and genotypic association of different SNPs: The allele and genotype frequencies of the five SNPs are shown in Tables I and II. All these SNPs were found to be in Hardy-Weinberg equilibrium (Table I). The frequency of the risk alleles of CDKAL1 SNPs, rs7754840 (C) and rs7756992 (G) and T allele of rs10811661 of CDKN2A/B SNP were relatively higher in T2DM patients than in the controls and the logistic regression analysis yielded significant odds ratios suggesting that the variant alleles conferred risk for developing T2DM. The frequencies of the heterozygotes (GC, AG) and homozygote genotypes (CC, GG) of the two SNPs of CDKAL1 gene were found to be higher in patients than in the controls (P<0.003), suggesting risk confering nature. Although a similar pattern of association was observed in the homozygote genotype frequency (TT) of CDKN2A/B gene, the heterozygote (CT) was found to be higher in frequency in the controls. In the logistic regression analysis of the genotypes the heterozygote (GC) and homozygote (CC) genotypes of CDKAL1 SNP showed significant odds ratio (P<0.01) and the observed association remained significant even after adjusting for covariates. While the genotype frequency of CDKN2A/B gene showed a marginal significance, it was not significant in the logistic regression analysis, with or without covariates. On the other hand, the two SNPs of HHEX gene (rs1111875, rs7923837) showed similar allele as well as the genotype frequencies among the patients and controls, suggesting lack of evidence for its association with T2DM.


Combined role of these genes: Results of multivariate logistic regression analysis: The linkage disequilibrium (LD) plot for different pairs of the five SNPs of the three genes considered in this study (Figure) suggested strong LD between the two SNPs (rs7754840, rs7756992) of CDKAL1 and HHEX (rs1111875, rs7923837), respectively (CDKAL1: D’=0.90, r2=0.73; HHEX: D’=0.87, r2=0.70) and there was no LD between the other pairs of SNPs. Similar results were obtained when LD plots were constructed for cases and controls separately (results not presented). Therefore, one of the two SNPs with greater significance from each of the two above genes were considered, along with rs1081661 of CDKN2A/B in the multivariate logistic regression. The allele and genotype data were analysed separately to investigate the combined role of these genes in the T2DM aetiology. The odds ratio was obtained using the most frequent genotype/allele as the reference. Allele based multivariate logistic regression suggested significant odds ratio for minor allele (G) of rs7756992 from CDKAL1, which was consistent with the individual SNP analysis. This SNP also showed similar pattern of association in genotypic multivariate logistic regression (P=0.009), which remained significant even after bonferroni correction for multiple testing, suggesting the independent risk conferring role of this gene in the manifestation of T2DM, after adjusting for confounding nature of effects of the other genes, if any (Table III). On the other hand, in allele based multivariate logistic regression analysis, rs10811661 of CDKN2A/B did not show significant association. However, in case of genotype wise multivariate logistic regression analysis, rs10811661 showed consistent results with individual SNP analysis. Although the minor allele G of rs1111875 (HHEX gene) also showed significant association with T2DM in the multivariate context (with protective role against developing T2DM), it was not significant after bonferroni correction for multiple testing. A similar, albeit non-significant pattern was observed in the genotype based logistic regression, both in the context of multivariate and individual SNP analyses.

- Linkage disequilibrium plot for the five SNPs of HHEX, CDKN2A/B and CDKAL1 genes. D’ values mentioned in the LD plot. LD is seen only between SNPs of the same gene, not across the genes.

Power of study: The post-hoc power of the study was calculated using G power for the SNPs of the CDKAL1, CDKN2A/B and HHEX genes. The three SNPs- rs7754840, rs7756992 and rs10811661 of the first two genes with significant odds ratio of 1.36, 1.35 and 1.28 yielded statistical power (1-β error probability) of 99, 99 and 97 per cent, respectively. Considering the non-significant association of SNPs of HHEX gene, we tried to estimate if our study had the adequate statistical power to detect the true association if existent in the population. We estimated power of study to detect true association by considering the significant odds ratio obtained for a north Indian sample by Chauhan et al4 for one of the two SNPs (rs1111875) of HHEX gene as reference. Taking the log of odds ratio (log 1.27 = 0.104) from the above study as the effect size and given our sample size of 1379, the power of 97 per cent was obtained for our study (P=0.05), which implied that our study was sufficiently powered to detect true association of HHEX SNPs if existent in our population. We further estimated minimum effect size necessary to detect association with 90 per cent power, which turned out to be 0.087 for the two SNPs of HHEX gene. The observed effect size was virtually zero (-0.01), for both the SNPs (rs1111875 and rs7923837) of HHEX gene, ruling out the possibility of any association of these SNPs with T2DM in this population. Alternatively, based on prevalence rate of T2DM in Hyderabad city1, power of study was estimated using Quanto software. Given the estimated prevalence of 16 per cent for T2DM1, minor allele frequency of 0.27 and 0.30 for the two SNPs (rs7754840, rs7756992) of CDKAL1 gene and with odds ratios of 1.36 and 1.35, we obtained power of about 95 and 99 per cent, respectively, which were qualitatively similar to the results obtained by using G power. This implied that given the high prevalence of T2DM in our population and with the sample size of 1379 our study was sufficiently powered to detect any true association of these genes with T2DM.
Comparison with earlier studies among Indian populations: Despite unique Indian population structure with large number of endogamous groups and enormous ethnic, linguistic and geographic heterogeneity, very few studies have hitherto focused on replication of earlier findings of genetic association of T2DM2345. The risk allele frequencies of all the SNPs considered in our study were compared with other Indian studies (Table IV). A significant allelic heterogeneity was observed for the CDKAL1 and CDKN2A/B genes among the few Indian populations studied, primarily because of the differences in the allele frequency between the northern and southern populations. Because of this heterogeneity in allele frequency and given that the data were available for only a couple of other Indian populations further analysis could not be performed.

Discussion
A significant association of all the three SNPs of CDKAL1 and CDKN2A/B genes was observed with T2DM, as observed earlier among the European populations10111213141516. However, neither of the two SNPs of HHEX gene showed significant association. The logistic regression analysis of the alleles of CDKAL1 and CDKN2A/B gene yielded significant odds, conferring risk towards developing T2DM. The biological significance of the observed genetic association can be explained by the already established role of CDKAL1 in insulin secretion, which acts through impaired pancreatic beta cell function26 and of CDKN2A/B in the beta cell function and regeneration. The non-significant association of HHEX gene in our population could be due to differential genetic predisposition of Europeans and Indian populations towards T2DM besides the fact that the Indian subcontinent itself is characterized by vast genetic differences between the regional populations27. In support to this, Cia et al20 conducted a sub-grouped meta-analysis by ethnicity and observed a non-significant association of the HHEX gene with T2DM in the Indian population, suggesting that ethnicity was the main cause of the heterogeneity among the studies. With reference to other Indian studies, our results on CDKAL1 gene were consistent with Chennai study3, whereas one of the northern Indian studies did not show significant association of rs7754840 with T2DM2. Although another study constituting samples from northern and western India4 showed strong association of CDKAL1 with T2DM, the single SNP studied was different from those of the present study. The rs10811661 of CDKN2A/B also showed strong association with T2DM in our population as was the case with north and western Indian studies45. However, the variant near HHEX gene (rs1111875) was not significantly associated with T2DM in our study similar to other Indian populations studied23. The other SNP of this gene (rs7923837) was also not associated with T2DM in our population, although it was found significantly associated in a study from Chennai3. This may suggest that the T2DM susceptible genes vary with the ethnic background. This draws support from an Indian GWAS study which provides evidence for possible genetic heterogeneity between Indo-Europeans and Dravidian ethnic groups within India22. Overall, there is an inconsistency in the pattern of association of these genes with T2DM among Indian populations. Therefore, it is important to explore a number of geographically and ethnically heterogenous populations within India, not only to validate already known variants but also to see if any new genetic variants that might be unique to Indian populations can be found, before one can project genetic susceptibility/protective profile that would be representative of the Indian population in general.
Despite large sample size and sufficient statistical power of the study, we could not find significant association of HHEX gene with T2DM in our population. In spite of the biological significance of HHEX, it does not seem to play a significant role in the manifestation of T2DM in this Indian population. This can be explained by the possibility of these SNPs not being causal variant for T2DM in this population and/or probably not tagged to the same causal variants as in the case of Western populations. Further studies are required to cross-validate our findings in relatively larger samples as well as to explore novel SNPs of HHEX gene that might be associated with T2DM in this Indian population.
Conflicts of Interest: None.
References
- Epidemiology of type 2 diabetes: Indian scenario. Indian J Med Res. 2007;125:217-30.
- [Google Scholar]
- Impact of nine common type 2 diabetes risk polymorphisms in Asian Indian Sikhs: PPARG2 (Pro12Ala), IGF2BP2, TCF7L2 and FTO variants confer a significant risk. BMC Med Genet. 2008;9:59.
- [Google Scholar]
- Replication of recently described type 2 diabetes gene variants in a South Indian population. Metabolism. 2010;59:1760-6.
- [Google Scholar]
- Impact of common variants of PPARG, KCNJ11, TCF7L2, SLC30A8, HHEX, CDKN2A, IGF2BP2, and CDKAL1 on the risk of type 2 diabetes in 5,164 Indians. Diabetes. 2010;59:2068-74.
- [Google Scholar]
- Association of common variants of CDKN2A/2B rs10811661 (C/T) and WFS1 rs6446482 (C/G) to type 2 diabetes mellitus in the Indian population of Eastern Uttar Pradesh. J Diabetes Metab. 2012;3:1-5.
- [Google Scholar]
- Genetic etiology of type 2 diabetes mellitus: A review. Int J Diabetes Dev Ctries. 2011;31:51-64.
- [Google Scholar]
- Association of TCF7L2 gene polymorphisms with T2DM in the population of Hyderabad, India. PLoS One. 2013;8:e60212.5.
- [Google Scholar]
- No detectable association of IGF2BP2 and SLC30A8 genes with type 2 diabetes in the population of Hyderabad, India. Meta Gene. 2013;1:15-23.
- [Google Scholar]
- Association of IRS-1, CAPN10 and PPARG gene polymorphisms with T2DM in the high risk population of Hyderabad, India. J Diabetes. 2014;6:564-73.
- [Google Scholar]
- A genome-wide association study identifies novel risk loci for type 2 diabetes. Nature. 2007;445:881-5.
- [Google Scholar]
- Replication of genome-wide association signals in UK samples reveals risk loci for type 2 diabetes. Science. 2007;316:1336-41.
- [Google Scholar]
- A variant in CDKAL1 influences insulin response and risk of type 2 diabetes. Nat Genet. 2007;39:770-5.
- [Google Scholar]
- A genome-wide association study of type 2 diabetes in finns detects multiple susceptibility variants. Science. 2007;316:1341-5.
- [Google Scholar]
- Genome-wide association analysis identifies loci for type 2 diabetes and triglyceride levels. Science. 2007;316:1331-6.
- [Google Scholar]
- Adiposity-related heterogeneity in patterns of type 2 diabetes susceptibility observed in genome-wide association data. Diabetes. 2009;58:505-10.
- [Google Scholar]
- Confirmation of multiple risk loci and genetic impacts by a genome-wide association study of type 2 diabetes in the Japanese Population. Diabetes. 2009;58:1690-9.
- [Google Scholar]
- Association between type 2 diabetes and CDKN2A/B: a meta-analysis study. Mol Biol Rep. 2012;39:1609-16.
- [Google Scholar]
- Hematopoietically-expressed homeobox gene three widely-evaluated polymorphisms and risk for diabetes: a meta-analysis. PLoS One. 2012;7:e49917.
- [Google Scholar]
- Meta-analysis of the effect of HHEX gene polymorphism on the risk of type 2 diabetes. Mutagenesis. 2011;26:309-14.
- [Google Scholar]
- Studies of association of variants near the HHEX, CDKN2A/B, and IGF2BP2 genes with type 2 diabetes and impaired insulin release in 10,705 Danish subjects: validation and extension of genome-wide association studies. Diabetes. 2007;56:3105-11.
- [Google Scholar]
- Genome-wide association study for type 2 diabetes in Indians identifies a new susceptibility locus at 2q[21] Diabetes. 2013;62:977-86.
- [Google Scholar]
- American Diabetes Association (ADA). Diagnosis and classification of Diabetes mellitus. Diabetes Care. 2010;33(Suppl 1):62-9.
- [Google Scholar]
- Molecular cloning: a laboratory manual. New York: Cold Spring Harbor Laboratory Press; 1989.
- SNP genotyping using the Sequenom Massarray iPLEX Platform. Curr Protoc Hum Genet. 2009;60 Chapter 2: Unit 212
- [Google Scholar]
- Common variants of the novel type 2 diabetes genes CDKAL1 and HHEX/IDE are associated with decreased pancreatic beta-cell function. Diabetes. 2007;56:3101-4.
- [Google Scholar]