> top > docs > PMC:4682381 > spans > 1908-7610

PMC:4682381 / 1908-7610 JSONTXT

Discrete structural features among interface residue-level classes Abstract Background Protein-protein interaction (PPI) is essential for molecular functions in biological cells. Investigation on protein interfaces of known complexes is an important step towards deciphering the driving forces of PPIs. Each PPI complex is specific, sensitive and selective to binding. Therefore, we have estimated the relative difference in percentage of polar residues between surface and the interface for each complex in a non-redundant heterodimer dataset of 278 complexes to understand the predominant forces driving binding. Results Our analysis showed ~60% of protein complexes with surface polarity greater than interface polarity (designated as class A). However, a considerable number of complexes (~40%) have interface polarity greater than surface polarity, (designated as class B), with a significantly different p-value of 1.66E-45 from class A. Comprehensive analyses of protein complexes show that interface features such as interface area, interface polarity abundance, solvation free energy gain upon interface formation, binding energy and the percentage of interface charged residue abundance distinguish among class A and class B complexes, while electrostatic visualization maps also help differentiate interface classes among complexes. Conclusions Class A complexes are classical with abundant non-polar interactions at the interface; however class B complexes have abundant polar interactions at the interface, similar to protein surface characteristics. Five physicochemical interface features analyzed from the protein heterodimer dataset are discriminatory among the interface residue-level classes. These novel observations find application in developing residue-level models for protein-protein binding prediction, protein-protein docking studies and interface inhibitor design as drugs. Background Protein-protein binding is a known phenomenon in complex biological networks. The molecular principle of such binding is often elusive in nature. Understanding its driving forces using known protein complexes is essential. The analysis of existing protein-protein interaction (PPI) complexes from the Protein Data Bank (PDB) [1] is the key to gaining insights into recognition mechanisms and binding principles as reviewed elsewhere [2-6]. Sequence and structural investigations on the existing complexes has been carried out for several decades [3,7-10]. In these extensive surveys, structural features over diverse datasets of protein-protein complexes were typically averaged, obscuring information on individual proteins' structural integrity. Each individual complex is specific and sensitive to binding. Although, non-polar (or hydrophobic) interactions are known to play a major role in contributing to the driving force for binding, in a considerable number of complexes, polar interactions are also observed to contribute abundantly to the formation of a stable interface [11]. Therefore, it is often essential to study the relative difference in surface and interface polarity of each PPI complex to determine the major binding forces at the interface and determine their discriminatory features. Interfaces are localized regions of surfaces with different physico-chemical properties compared to the rest of the surfaces, thereby driving binding to other molecules. Both physical and chemical features (including hydrophobicity, electrostatic interactions, binding energy, interface size, hydrogen bonds, salt bridges, disulphide bonds, planarity, sphericity, shape complementarity, amino acid chemical groups, and conserved residue clusters) govern the formation of protein interfaces as described elsewhere [7,9,12-18]. The chemical nature of residues forming a protein interface (amino acid residue composition) determines the hydrophobic effect of an interface. Non-polar (or hydrophobic) residues are observed to occur predominantly at the protein interface, playing a major role in contributing to the driving force for binding [7,13]. Interfaces are observed to be less non-polar (or hydrophobic) than the protein interior [13]. The residue composition of protein-protein interfaces was observed to be more similar to the protein surface than the protein interior [9]. Interfaces were observed to be significantly polar as well as non-polar with few charged groups, similar to the characteristics of the protein surface [12]. Structural analysis also revealed that charged and polar amino acids are involved in protein-protein interactions as reviewed elsewhere [19]. Charged and polar residues contributing to binding specificity and complex formation are demonstrated in a number of complexes such as human IL-4, human CD2 and CD58, barnase-barstar, Colicin E9, integrin αvβ6 membrane protein and in intrinsically disordered proteins [20-25]. Shape complementarity, polar interactions, hydrogen bonding and salt bridges are also known to contribute to binding specificity and free energy of binding [17,24,26,27]. Also, charged and aromatic side chains are crucial for binding, determining the cation-pi, electrostatic and aromatic interactions [8]. The role of electrostatics in binding stability of protein-protein complexes is demonstrated [16]. These observations indicate that although PPIs are driven by non-polar interactions at the interface for a majority of complexes, in some cases polar interactions contribute to binding specificity (characteristic of polar residues) and likewise to complex stabilization. Therefore, a study on the relative percentage difference between surface and interface polarities of each protein complex is often essential. In our previous study, we have identified a class of complexes with more polar residues at the interface than the rest of the surface, where binding is mainly polar with a dataset of 198 complexes [11]. This observation has now been extended for an updated yet non-redundant dataset of 278 protein complexes to verify and identify any discriminatory features among these interface residue-level classes. In this study, we have carried out a comprehensive structural analysis of 278 non-redundant heterodimeric protein complexes from the PDB. We estimated the relative difference in surface and interface polarities of each complex in the dataset, using percentage values of polar residues. This property divides the dataset into two interface classes as also observed in our previous study with a smaller dataset [11]. Class A has less polar residues at the interface than the rest of the surface (~60%) which is the 'classical' definition of a PPI complex and class B has more polar residues at the interface than the rest of the surface (~40%), is 'non-classical.' Therefore, we have investigated essential PPI structural features such as interface area (ΔASA), the relative abundance of polar and non-polar residues at the interface (interface polarity abundance), hydrogen bonds (H-bonds), salt bridges, percentage of charged residues at the interface (interface charged residues%), solvation free energy gain upon interface formation (ΔiG), binding energy (BE), and electrostatics among these interface classes and their gleaned features are documented. We identified five key features (ΔASA, interface polarity abundance, interface charged residues%, ΔiG and BE) that are significantly different between the interface classes. These novel observations have implications for residue-level characterization of protein complexes to develop models for protein-protein binding prediction and docking studies. Methods Heterodimer dataset We created a non-redundant heterodimer dataset of protein complexes from the PDB, using the RCSB PDB's advanced search interface. The following criteria were used for filtering: (i) resolution <= 3Å (ii) protein size >50 residues (iii) contains experimental data (iv) number of chains, entities and oligomeric state is set at 2 (v) devoid of DNA or RNA or a hybrid of such molecules with the protein or otherwise (vi) entries with mutations were not included and (vii) sequence identity cut-off is set to 30%, which is the minimum cut-off available in the PDB. As a second step, the USEARCH program [28] was used to further remove the redundancy among heterodimer complexes at sequence identity cut-off of 20%, as this threshold eliminates remote homology up to 25% sequence identity seen in structures as well [29]. Interface analysis The interface of PPI complex is calculated as the change in solvent accessible surface area (ΔASA) upon complex formation. The Surface Racer 5.0 program [30] was used to calculate ASA with a probe radius of 1.4Å and Lee and Richards implementation [31]. Interface residues with ΔASA > 0.1Å2 were considered for this analysis, as defined by Chakrabarti and Janin [32]. ASA was used to determine surface residues of each complex. The amount of polar, non-polar and charged residues at the interface was then estimated for the dataset. The interface polarity abundance (P%-NP%) is measured as the difference in the percentage of polar residues (P%) and percentage of non-polar residues (NP%) at the interface [11]. Classification based on relative interface-surface polarity Interfaces are part of protein surface formed upon binding of individual subunits. Each protein complex has a specific composition of polar (P) and nonpolar (NP) residues at the surface (S) and at the interface (I). The distribution of polar and nonpolar residues at the interface of a protein complex describes the nature of the interface and the major driving force for binding. We have calculated the percentage of polar and nonpolar residues at the surface and interface for each complex in the dataset. Polar residues considered in the analysis are R, N, D, E, Q, H, K, S, T, and Y and non-polar residues are A, C, G, I, L, M, F, P, V, and W. Complexes were then grouped based on the relative difference in percentage of polar residues between surface (S) and the interface (I). Complexes with interface polarity less than the surface (represented as S>I) are grouped as class A, and those that have interface polarity greater than the surface (represented as SI) implying relatively less polar interactions at the interface (or relative abundance of non-polar interactions at the interface). Class B complex shows surface polariy of 50.69% and an interface polarity of 73.21% (S

Document structure show

projects that have annotations to this span

Unselected / annnotation Selected / annnotation
2_test 40 (40)