> top > projects > testtesttest > docs > PMC:9949264 > spans > 48683-49385

PMC:9949264 / 48683-49385 JSONTXT

Genome-wide Association Study Meta-analysis of Blood Pressure Traits and Hypertension in Sub-Saharan African Populations: An AWI-Gen Study Abstract Most hypertension-related genome-wide association studies (GWAS) focus on non-African populations, despite hypertension (a major risk factor for cardiovascular disease) being highly prevalent in Africa. The AWI-Gen study GWAS meta-analysis for blood pressure-related traits (systolic and diastolic blood pressure, pulse pressure, mean-arterial pressure and hypertension) from three sub-Saharan African geographic regions (N=10,775), identified two genome-wide significant signals (p<5E-08): systolic blood pressure near P2RY1 (rs77846204; intergenic variant, p=4.25E-08) and pulse pressure near Linc01256 (rs80141533; intergenic variant, p=4.25E-08). No genome-wide signals were detected for the AWI-Gen GWAS meta-analysis with previous African-ancestry GWASs (UK Biobank (African), Uganda Genome Resource). Suggestive signals (p<5E-06) were observed for all traits, with 29 displaying pleiotropic effects and several replicating known associations. Polygenic risk scores developed from studies on different ancestries had limited transferability, with multi-ancestry models providing better prediction. This study provides insights into the genetics and physiology of blood pressure variation in African populations. Introduction Hypertension (HTN) is a major risk factor for cardiovascular diseases (CVD) such as coronary heart disease, heart valve diseases, atrial fibrillation, aortic syndromes, cerebral stroke and renal failure1,2 Between 1990 and 2019, hypertension prevalence almost doubled for adults (aged 30–79 years) and affected 1.25 billion people living in low- and middle-income countries3. This increase is attributed to population growth and ageing, and is predicted to increase to 1.56 billion people by 20254,5 in addition, HTN is a leading risk factor for premature deaths and disability worldwide6,7, accounting for 17.9 million deaths in 20188. it is present in ~22% of the global population, with the highest prevalence observed in Africa (27%), particularly in urban communities and in older people9. HTN prevalence and awareness differs between and within the sub-Saharan African countries10. There is a paucity of data on the prevalence, treatment and control of HTN in many African countries and therefore its contribution to related conditions, such as hypertrophic cardiomyopathy, is not fully understood11,12. Major research focuses on genetic associations with HTN, due to its high prevalence and the fact that it doubles the risk for CVD8,13. Familial studies have shown HTN associations amongst immediate family members, with genetic factors explaining approximately 30–50% of blood pressure (BP) variation amongst individuals14,15. However, these studies have limitations in identifying genetic variants responsible for increased risk of developing HTN. Genome-wide association studies (GWASs) have explained only a modest proportion of the genetic heritability (3–6%) for blood pressure (BP) and HTN16. The GWAS Catalog17 18 includes data from the first BP/HTN case-control studies conducted in 2007 for HTN19 and for BP as a quantitative trait20. The GWAS Catalog currently includes 7,982 genetic associations with BP based on 380 studies and 586 associations with HTN based on 120 studies (https://www.ebi.ac.uk/gwas, accessed 17 November 2022). Early GWAS studies outlined the complexity of studying BP-related traits and emphasised the importance of large sample sizes to enable the detection of genetic associations19,20. Large-scale GWAS discovery meta-analyses have shown significant genetic associations with BP and HTN16,21,22. The largest BP GWAS to date by Evangelou, et al.22 included over 1 million individuals of European ancestry from the UK Biobank (UKBB) and the International Consortium of Blood Pressure (ICBP), identifying 535 BP-related loci. Only a small number of GWAS for genetic associations with BP and HTN have been performed on the African continent. Despite HTN being highly prevalent in Africa23, most studies have focused on the European populations16. Studies based on African-ancestry populations comprise mainly of African American (AA) populations24–28, with the first GWAS for HTN in AA conducted in 2009 by Adeyemo, et al.29. Hendry, et al.30 studied a black South African population (n = 1947 with ~700 women who are also present in our study) with samples genotyped using the Metabochip (~200,000 single nucleotide polymorphisms (SNPs) previously associated with cardiometabolic traits). They found genetic associations with systolic and diastolic BP in genes of interest (NOS1AP, MYRF and POC1B) and in some intergenic regions (DACH1/LOC440145 and INTS10/LPL)30. African populations have high genetic diversity, allele frequency differences and low linkage disequilibrium (LD) when compared to other populations31 and therefore GWASs from sub-Saharan Africa have the potential to discover novel BP-related SNPs. However, it is important to recognise and adjust for extensive population structure across different African regions32,33 in genetic association studies and to use genotyping arrays such as the Human Heredity and Health in Africa (H3Africa) SNP array, that is enriched for common genetic variants in African populations34. In this study, the sub-Saharan African cohort of older adult participants, referred to as the Africa Wits-INDEPTH partnership for Genomic Studies (AWI-Gen)35,36, was used and DNA samples genotyped with the H3Africa SNP array. The aim of the study was to identify genetic associations with four continuous BP-related traits (systolic BP (SBP), diastolic BP (DBP), pulse pressure (PP) and mean-arterial pressure (MAP)) and one categorical trait (HTN), in three sub-Saharan African regions represented in the AWI-Gen study. To boost power, the findings were meta-analyzed with other studies the included African or African-ancestry participants. Fine mapping and genetic risk score analysis and transferability were also assessed. Results Participants in the AWI-Gen cohort had a mean age (SD) of 51.8 (8.2) years, with slightly more women (54.7%). The average BMI of the cohort was 25.1 (6.7) kg/m2 (defined as overweight (body mass index (BMI) between 25.0 to 29.9 kg/m2). The average resting heat rate was within the normal range (<100 beats per minute). The majority of study participants fell within the normal to pre-HTN BP category (126.9/83mmHg; 3818 HTN cases, 6918 HTN controls), with most self-reported as not using anti-hypertension medication (AHM) (76.8%) (Table 1). Among individuals identified as having HTN, more had stage 1 (17.2%), were not using AHM (16.3%), didn’t have parents with HTN (12.7%) and were un-aware of their HTN status and not controlling for HTN (11.5%) (Table S 2). The discovery GWASs for the five BP traits (SBP DBP, HTN, PP, MAP) was conducted on 10,700 sub-Saharan African participants with 13,976,041 SNPs. For each BP-related trait, quality control (QC) was performed and adjustments were made for the use of AHM (Figure S 1). The power calculation revealed that the current study has at least 80 % power to detect an effect size beta of ~0.60 for SNPs with MAF>0.10 (Figure S 3). Genetic associations with BP traits Association studies were performed in two stages: Stage 1 – meta-analysis of the GWAS for the three geographic regions represented in the AWI-Gen cohort (N=10,775); Stage 2 – Meta analysis of Stage 1 with GWAS from other studies on African and African-ancestry populations (UKBB African-ancestry (UKBBa, N=3,058) and Uganda Genome Resource (UGR, N=6,400)) (Figure 2). Genetic associations with each of the five BP traits are shown, using Miami plots and genomic inflation by the QQ-plots, for both Stage 1 and 2. There was no indication of genomic inflation, since the genomic inflation factor (GIF), lambda (λ), was <1.05 for all five BP traits, indicating adequate control for population sub-structure (Figure S 2). Independent GWASs for each AWI-Gen region (East, West, and South) were also conducted (Figure S 4). When comparing Stage 1 and Stage 2 GWASs for SBP and DBP (Figure 2), the signals differed by region. Prior to the Stage 1 meta-analyses, genome-wide (GW) associations for 41 SNPs with 9 displaying pleiotropic effects at GW significance were found in the three independent AWI-Gen regions (Table S 3). Thus, 12 signals each for East (2 displaying pleiotropic) and South (4 pleiotropic), with 5 and 11 SNPs reaching suggestive significance (p<5E-06) respectively, were observed. For West Africa 14 signals (2 pleiotropic) with only six reaching suggestive significance (Table S 4) were identified. Due to regional differences, the mega-analyses (a single GWAS for the entire AWI-Gen study for each trait) when compared to the AWI-Gen meta-analyses gave different GW associations with different associations identified across regions (Figure S 4). To account for the diverse signals driven by each independent region (regional differences), a meta-analysis of the three independent region GWASs was done using Han-Eskin random effects (RE2) model (Stage 1). Stage 1 GWAS Suggestive associated genomic regions (p<1E-06) from the Stage 1 discovery GWAS (identified in FUMA), are shown in Table 2. Across the five traits, 129 independent genomic regions were identified (136 independent SNPs with 130 lead SNPs), with 29 genomic region signals associating with more than one BP-trait (see bold SNPs in Table S 4), illustrating pleiotropic effects. When comparing the Stage 1 GWAS by region, suggestive signals (p<5E-04) differed across the East, West and South African regions (Table S 4). The GW significance threshold (p<5E-08) was reached for SBP with rs77846204 (imputed intergenic variant in RP11–38P22.2, p=4.95E-08) (Table 2), driven by the West (p=4.16E-07) and East (p=3.24E-04) AWI-Gen regions (Table S 4). This signal displayed pleiotropic effects with DBP (p=1.66E-06) and MAP (p=1.51E-07) (see bold SNPs in Table S 4) and had a high allele frequency in previous studies for all ancestries (>0.8, Ancestries: African, Admixed American, East Asian, European, African Americans). GW significance was also reached for PP with rs115808349 (imputed intergenic variant in ELL2P2, p=1.76E-08), driven by the East AWI-Gen region (p=2.25E-05) (Table S 4) and had low allele frequency for all ancestries, with African being the highest (0.05). Several suggestive independent genomic regions (Table 2) were observed across the five BP-related traits (40 SBP, 25 DBP, 21 HTN, 33 PP, and 31 MAP). The strongest signals (lowest p-values) for traits that reached only suggestive significance were: DBP, rs6494981 (intergenic TMCO5B, p=9.40E-08), which was also a suggestive signal for MAP (p=1.59E-06, displaying pleiotropic effects); HTN, rs113112741 (intronic MAML3, p=7.12E-08); MAP, rs73315125 (intergenic FDPSP6, p=8.02E-08), which was also a suggestive signal for SBP (p=6.35E-08) and DBP (p=1.13E-06, displaying pleiotropic effects). Stage 2 GWAS The number of SNPs increased from 13,952,382 (Stage 1) to 14,845,228 for the Stage 2 GWAS. No GW associations were detected in the Stage 2 analysis for any of the traits. Stage 2 GWAS suggestive independent genomic regions (p<1E-06), identified in FUMA, are shown in Table 3 with 40 independent genomic regions (17 SBP, 23 DBP (42 independent SNPs with 40 lead SNPs)). Most of these signals were driven by the Uganda Genome Resource (UGR) dataset (p<5E-04, Table S 5). Only one SBP (rs17428471) and five DBP (rs114007149, rs141245590, rs474277, rs617549, rs556594) signals were also identified in the Stage 1 GWAS, reaching suggestive significance. The signals with the lowest p-values for traits that reached only suggestive significance were: SBP, rs115702999 (ncRNA_exonic HECW2:AC020571.3, p=2.77E-07); DBP, rs6009081 (intronic PPARA, p=5.75E-07). Pleiotropic effects were not observed. Replication of Stage 1 and 2 GWAS outcomes Exact replication was conducted by comparing GW SNPs (p<5E-08) with suggestive SNPs (p<5E-04) from this study (Stage 1 and Stage 2) with GW SNPs from previous BP GWASs17,22,37,38. No replication of the two GW SNPs (p < 5E-08) for SBP (rs77846204, beta=17.7, p=4.95E-08) and PP (rs115808349, beta=32.4, p=1.76E-08) were found when compared against suggestive SNPs of the previous BP GWASs (p < 5E-04). Both GW signals were novel to this study. This study identified 25,338 SNPs (12,385 pleiotropic SNPs) across the five BP-related traits for Stage 1 (SBP, DBP, HTN, PP, MAP) and 10,632 SNPs (532 pleiotropic SNPs) for Stage 2 (SBP, DBP) that met replication suggestive significance (p<5E-04), with only 217 SNPs in both Stages. Replication of only 596 GW significant SNPs within 132 identified genomic regions (p<5E-08) from previous studies were found (503 Stage 1, 115 Stage 2, 23 both Stages) when compared against suggestive SNPs (p<5E-04) of the Stage 1 and Stage 2 GWASs (Table 4). Replication for each previous study is reported i.e. GWAS Catalogue17, Warren, et al.38 and Evangelou, et al.22 (Table S 6). Several SNPs illustrating pleiotropic effects were identified, with most replicated SNPs from European-ancestry studies. Three replicated SNPs (Stage 2) were from trans-ethnic studies that included African ancestry participants and displaying pleiotropic effects i.e. rs982148939(SBP, DBP), rs1742847125,40 (SBP,DBP,PP, BP), and rs89151141,42 (SBP, DBP, HTN, MAP). Fine-mapping and functional analysis Fine-mapping, to identify potential causal variants, for GW SNPs (p<5E-8) and/or top signal SNPs are shown as regional plots in Figure 3 and Figure S 5 respectively. Potential causal variants that met GW significance were identified for SBP (rs77846204, p=4.95E-08) and PP (rs115808348, p=1.76E-08), for the Stage 1 GWAS (Figure 3). The top signal SNPs (lowest p-value) for BP traits that only met suggestive significance for both the Stage 1 and 2 GWASs (p<5E-06) can be seen in Figure S 5, with none replicating (Figure S 5). For SBP the regional plot around the P2RY1 region for rs77846204 (chr3, p=4.95E-08, intergenic RP11–38P22.2) showed a very narrow peak (Figure 3), with an allele frequency of 0.81 for African populations (Table S 4). This SNP was found to be closely associated with SBP (p = 2.00E-08) and PP (p = 4.00E-11) for rs161792 (distance = 0.40MB) and with PP only for rs325729 (p=9.00E-09, distance = −0.44MB) and rs146975914 (p=5.00E-09, distance = −0.32MB) in other studies39,43,44 (Table S 7). It was also associated with other CVD linked traits such as HDL cholesterol, lung function/post bronchodilator (FEV1, associated with lung function), liver function tests and type 2 diabetes. It was also closely related to prostate cancer, baldness/hair colour, white cell counts and mood-related traits, within 1MB flanking region. For PP the regional plot around the Linc01256 region for rs115808349 (chr4, intergenic ELL2P2), showed two narrow peaks within the same region (Figure 3), since it also consisted of rs62317311 (chr 4, ncRNA_intronic RP11–789C2.1), that only reached suggestive significance (p=8.92E-07). This SNP had the highest allele frequency (0.05) for African population, with extremely low allele frequencies (<0.004) found in non-African populations (Table S 4). This SNP was found to be closely associated with resistance to AHM in HTN for rs1908127 (p=2.00E-06, distance = 0.15MB) (Table S 7). This SNP was found to be closely associated with traits such as total PHF-tau (SNP × SNP interaction), protein quantitative trait loci (liver) and mood-related traits within 1MB flanking region. Annotation of variants with GW significance (p<5E-08) for SBP (rs77846204) and PP (rs115808349), showed that these were intergenic (Table S 8). None of the SNPs were potentially deleterious (CADD score <12.37), with the exception of rs116166107 for PP (CADD = 12.72). Genetic positions for SBP (rs77846204) and PP (rs11580834) had a RDB score of 6 and 7 respectively. Functional mapping of position, eQTL (matched cis-eQTL SNPs) and chromatin interaction (i.e. 3D DNA–DNA interactions) are reported in Table S 9. PRS Polygenic risk scores (PRS), developed from three ancestry (African, European and multi-ancestry) GWASs (discovery) were applied to the individuals in AWI-Gen cohort (target, N=10,676) for SBP and DBP (shown in Figure 4). All PRSs developed from the different ancestries, show an increase in predicting higher BP levels as the quantile score increases (Figure 4a). The highest change in effect size (mm/Hg) was observed in the model from the multi-ancestry population, whereas the lowest change was observed in the African-ancestry PRS derived from the UKBBa dataset for both SBP and DBP. The variance explained i.e. R2 (%), was highest for the multi-ancestry PRS (0.22% for SBP and 0.36% for DBP) and lowest for the African-ancestry PRSs for both SBP (for UKBBa: 0.07%) and DBP (for UGR: 0.04%) (Figure 4b). The PRS was significant (p0.02), low minor allele frequency (MAF) (<0.01), and extreme deviation from Hardy Weinberg Equilibrium proportions (HWE) (<0.0005) were excluded; (2) Samples with high genotype missingness (>0.01) and discordant sex information were removed; (3) Mitochondrial, Y and X chromosome SNPs, including SNPs that did not match the Genome Reference Consortium Human Build 37 (GRCh37/hg19) reference alleles were removed. After QC ~1.7 million SNPs and 10,903 samples remained56. Imputation Genotype imputation was conducted to increase the coverage of genomic variation and allow fine-mapping. The African Genome Resources reference panel (EAGLE2+PBWT pipeline) at the Sanger Imputation Server (https://imputation.sanger.ac.uk) was used for genotype imputation to increase the coverage of the genome, to narrow-down the location of potential causal variants and to capture most haplotype blocks. Post imputation QC (i.e. removal of indels, rare SNPs) resulted in 13,976,041 SNPs (MAF >0.01 and info score >0. 6)56. Only participants with good quality phenotype and genotype data were used for the GWAS analyses (N=10,775). The genome assembly (base pair position) was the GRCh37/hg19. Genetic association analysis The discovery GWAS for the BP-related traits was conducted in two stages (Figure 1). Discovery GWAS (Stage 1 AWI-Gen GWAS) Potential confounders used as covariates in the GWAS were examined for significance by running a general linear model, using STATA V1565. As the participants originate from East, West and Southern Africa, there was significant population structure across regions (Figure S 2); moreover, preliminary analysis indicated relatedness among individuals from some of the AWI-Gen cohorts. Therefore, adjustments based on PCs (addressing genetic population structure) and kinship-matrix (addressing relatedness) was used as covariates. Previously defined confounders were also used as covariates, using Singh, et al.47 as a guideline to determine adjustments (with exception to BMI which was not adjusted for in the previous studies which were included in the Stage 2 GWAS). All genetic association tests were adjusted for the covariates: age, age2, sex and the first 10 PCs (population structure and geographic region-based adjustments). The H3ABioNet/H3Agwas Association pipeline workflow66 was used to conduct the discovery GWAS (https://github.com/h3abionet/h3agwas/). Novel associations were defined using the GWAS significant threshold significance of p<5E-08, with a suggestive threshold of p<5E-06. Linear mixed models (LMMs) were used to account for fixed and random effects for relatedness. Matrix LMMs were run to test for genetic associations, for an additive genetic model, for four continuous BP traits (SBP, DBP, PP and MAP) and one binary trait (HTN), using the Bayesian LMM association testing approach in BOLT-LMM v2.3.2 mixed model association testing67. This approach accounts for relatedness, ancestral heterogeneity (in samples) and any other unaccounted structure within the data. Independent GWASs for each AWI-Gen region (East, West, South) was conducted and a meta-analysis of summary statistics was conducted, using RE2 (Han and Eskin’s Random Effects model), in METASOFT v2.0.168. This was implemented in H3ABioNet/H3Agwas Meta-analysis pipeline workflow66 (http://github.com/h3abionet/h3agwas/meta/meta.nf), to evaluate the robustness of associations detected in joint analysis of the AWI-Gen dataset (Stage 1 GWAS). This approach assumes no heterogeneity of effect sizes if the null hypothesis is true (i.e. all beta values are zero), thus correcting for the overly conservative standard RE2 meta-analysis approach. Power calculation A power calculation was conducted (study design = continuous trait - independent individuals, hypothesis = gene-interaction, fixed number of samples = 10,903), using Quanto V1. 2. 369. A graph for power versus effect size (beta) at different allele frequencies (see Figure S 3) was constructed in R70. Meta-analysis (Stage 2 GWAS) Previous studies, including only African and African-ancestry participants were used for a meta-analysis (sub-population of African participants from the UKBB (https://biobank.ctsu.ox.ac.uk) and Gurdasani, et al.46), to combine with the Stage 1 AWI-Gen meta-analysis GWAS, to improve study power (Stage 2). Permission was obtained to access the genotype and phenotype dataset of the UKBB (research project number: 63215). The UKBBa (N=3,060) was previously QCed and imputed, following the same methodology used for the Stage 1 GWAS. The H3ABioNet/H3Agwas Association pipeline workflow66 was used to conduct the discovery GWAS (https://github.com/h3abionet/h3agwas/tree/master/assoc) for UKBBa, following the same methodology used for the Stage 1 GWAS. Gurdasani, et al.46 consisted of four African-ancestry cohorts: UGR (N=6,400), DDS (N=1,165), DCC (N=1,542) and AADM (N=5,231). Diabetes causes the walls of the blood vessels to stiffen, which leads to high BP49–51, therefore AADM, DDS and DCC, which included ~50% diabetic participants, were excluded. The H3ABioNet/H3Agwas Meta-analysis pipeline workflow66 was implemented to conduct the Stage 2 meta-analysis (https://github.com/h3abionet/h3agwas/tree/master/meta). The meta-analysis was conducted for SBP and DBP by comparing the Stage 1 GWAS (meta-analysis of AWI-Gen GWAS by region) with the UKBBa dataset (N=3,058) and the UGR cohort (N=6,400)46 summary statistics, using RE2 in METASOFT v2.0.168. Other BP-related traits (HTN, PP MAP) could not be included due to the lack of data availability in cohorts used in the Stage 2 GWAS. Visualization and interpretation of genetic associations Miami plots were generated, to display significantly associated SNPs in associated regions, using the Hudson package in R71. Genomic control and quantile-quantile (Q-Q) plots were conducted as a QC check, to re-evaluate genetic inflation and confounding biases such as cryptic relatedness and population stratification (with the assumption that the regional groupings will be independent of each other). Genomic control was evaluated in R70, by calculating GIF as, i.e. X2 = chi-squared of observed and 0.456 = median chi-squared of expected, where λ= 1 means that the population is homogenous and λ > 1.05 means that correction for population structure was not done efficiently and needs to be re-corrected. Q-Q plots were constructed in FUMA72 to assess deviation in the distribution of expected and observed p-values. Pleiotropy was identified when the same SNP meet suggestive significance (p<5E-06) in more than one BP-related trait. Regional visualisation of associated SNP regions were performed using LD from AWI-Gen in LocusZoom V0.4.873, using a 1MB flanking region, which was compared to data in the GWAS Catalog17. Replication with previous findings Replication of the Stage 1 and 2 GWAS with populations of similar genetic ancestry was performed, using the exact replication strategy.74. Replication was also tested against the Stage 1 and 2 GWAS, using previous studies: (1) GWAS Catalog17; (2) PhenoScanner37; (3) List of all 3,800 published BP associated SNPs38; (4) European-only ancestry population from Evangelou, et al.22, consisting of the UKBB & ICPBP cohorts (currently the largest published study with 757,601 European ancestry individuals). Any duplicate signals from the same study across the previous study databases were removed. Replication of the Stage 1 and 2 GWAS with previous studies, was assessed by comparing GW associations (p<5E-08) against SNPs with suggestive associations (p<5E-04). In addition, replication of GW signals found in previous studies (p<5E-08) were compared against the Stage 1 and 2 GWAS suggestive associations (p<5E-04). The H3ABioNet/H3Agwas Replication pipeline workflow66 was implemented to conduct replication analysis (https://github.com/h3abionet/h3agwas/tree/master/replication). The exact replication method was used to test for replication with the GWAS Catalog. The GWAS Catalog database was downloaded (https://www.ebi.ac.uk/gwas/, accessed on 27 March 2022). Since the genome assembly of GWAS Catalog was the Genome Reference Consortium Human Build 38 (GRCh38/hg38), it was converted to GRCh37/hg19, to allow for comparison, by conducting a lift-over. A subset of the GWAS Catalog data was generated by filtering for key words relevant to BP traits: “Systolic blood pressure”, “Diastolic blood pressure”, “hypertension”, “pulse pressure”, and “mean-arterial pressure”. The GW associations (p<5E-08) were also compared against the PhenoScanner database37, since the GWAS Catalog is limited to GW signals with p<5E-8 with a few suggestive at p<5E-06, to pick up any missed or additional suggestive signals (p<5E-04) found within this database (http://www.phenoscanner.medschl.cam.ac.uk). Replication of the Stage 1 and 2 GWAS was also compared against a list of 3,800 published BP associated SNPs listed within Warren, et al.38. Exact replication was tested by using Evangelou et al. (2018) summary full statistics data (currently the largest published study with 757,601 European ancestry individuals), for SBP, DBP and PP to determine which signals were uniquely identified in studies with African-ancestry populations. Since both genome assemblies were GRCh37/hg19, a direct comparison was evaluated in R70. Novel associated SNPs were determined by searching for SNPs within a 500 kb region of all SNPs with GW associations (p-value<5E-08) found in our study. The different regions of the AWI-Gen datasets were compared for South, East and West Africa (which were found to be significantly different sub-populations). In silico functional analysis FUMA72 was used for in silico functional analysis and annotation was performed to select the most likely causal variants from the GWAS summary statistics. The FUMA pipeline (https://fuma.ctglab.nl) was used for functional gene mapping, using SNP2GENE tool, for positional, expression quantitative trait loci (eQTL) and chromatin interaction mappings. Candidate SNPs were selected in the associated genomic region with R2 ≥ 0.6 to define independent significant SNPs with GW (p < 5E-08) and MAF ≥ 01 for annotation (reference population = 1000G Phase3 African; included variants in reference panel (non-GWAS tagged SNPs in LD); maximum distance between LD blocks to merge into a locus = 250kb). Candidate SNPs functional consequences were predicted by chromosome base-pair position, and reference and alternate alleles, to databases containing known functional annotations. This included: (1) ANNOVAR, a variant annotation tool which is used to obtain functional consequences of SNPs on gene functions75; (2) combined annotation-dependent depletion (CADD), a score of deleteriousness of SNPs predicted by 63 functional annotations with a threshold of > 12.37 to be deleterious76; (3) RegulomeDB (RDB), a categorical score representing regulatory functionality of SNPs based on expression quantitative trait loci (eQTLs) and chromatin marks77, with eQTLs scans using the Genotype-Tissue Expression (GTex) Consortium78. Fine-mapping Fine-mapping to identify potential causal variants was conducted by comparing the GW associations and/or top association SNPs where only suggestive associations were identified (lowest pFigure 2 Discovery GWAS genetic associations in AWI-Gen (Stage 1) and the meta-analysis (Stage 2) showed by (a) Q-Q plots with the genomic control coefficient (λ) and (b) Miami plots for five BP traits. Adjusting for age, age2, sex and the first 10 PCs as covariates. With GW significance = p < 5E-8. QQ-plot shows the distribution of −log10-transformed p-value for observed (y-axis) vs expected (x-axis) with GIF (λ); red line = observed; grey line = expected. Miami plot shows −log10-transformed two-tailed p-value for each BP trait (y-axis) and base pair positions along the chromosomes (x-axis); red line = GW significance (p < 5E-08); purple line = threshold for suggestive association (p <1E-06). Figure 3 Fine mapping of novel GW significant (p < 5E-08) associations for BP-related traits in the AWI-Gen study. Locuszoom73 plots showing association for GW significant associations. Lead SNPs (purple diamond), GWAS Catalog trait labels and genes are labelled. Plots shown for SBP around the P2RY1 region (rs77846204, p = 4.95E-08, intergenic RP11–38P22.2) and PP around the Linc01256 region (rs115808348, p =1.76E-08,intergenic ELL2P2 – also consisting of rs62317311 (p = 8.92E-07, ncRNA_intronic RP11–789C2.1), for the AWI-Gen Stage 1 GWAS (N =10,775). Figure 4 Transferability of Polygenic Risk Score (PRS). Models derived from three ancestry GWASs (discovery) and applied to the AWI-Gen cohort (target): (1) African: Two African-ancestry cohorts i.e. UKBBa (n=3,058) and UGR (n = 6,400)46 (2) European: UK biobank and ICPBP (N = 757,601)22,41 and (3) Multi-ancestry: PAGE (N = 49,839 with 17,152 African-ancestry) cohort40,41. a) PRS stratification of SBP and DBP: Point range-plots comparing the difference in BP-trait mean (mmHg) of the upper PRS quintiles from the lowest, stratified by the discovery datasets (error bars = mean ± 95% confidence intervals) are shown. b Plots showing additional variance explained (% R2) by each PRS for SBP and DBP: P-value (above bar) and number of SNPs (within bar) are stated for the P-threshold value (PT).

Document structure show

Annnotations TAB TSV DIC JSON TextAE Lectin_function IAV-Glycan

  • Denotations: 0
  • Blocks: 0
  • Relations: 0