PMC:1481596 / 1476-8510
Annnotations
{"target":"https://pubannotation.org/docs/sourcedb/PMC/sourceid/1481596","sourcedb":"PMC","sourceid":"1481596","source_url":"https://www.ncbi.nlm.nih.gov/pmc/1481596","text":"Background\nA major challenge in systems biology is to understand the intricate network of interacting molecules. The complexity in biological systems arises not only from various individual protein molecules but also from their organization into systems with numerous interacting partners. In fact, most cellular processes are carried out by multi-protein complexes, groups of proteins that bind together to perform a specific task. Some proteins form stable complexes, such as the ribosomal complex that consists of more than 50 proteins and three RNA molecules, while other proteins form transient associations and are part of several complexes at different stages of a cellular process. A better understanding of this higher-order organization of proteins into overlapping complexes is an important step towards unveiling functional and evolutionary mechanisms behind biological networks.\nData on protein complexes are collected from the study of individual systems, and more recently through high-throughput experiments, such as yeast two-hybrid (Y2H) [1,2] and tandem affinity purification followed by mass spectrometry (TAP/MS) [3,4]. The TAP/MS approach helps pinpoint proteins that interact with a tagged bait protein, either directly or indirectly, and are thus suited to identify multi-protein complexes. In fact, several research groups have systematically applied TAP/MS technology to study protein complexes involved in different signaling pathways [5].\nProtein interactions are routinely represented as graphs, with proteins as nodes and interactions as edges (links). Therefore, it is not surprising that analysis of protein interaction networks reach out for a variety of graph-theoretical tools. Following the observation that protein interaction networks display a characteristic power-law like node degree distribution [6], a substantial body of research focused on statistical properties of protein interaction networks [7,8]. In 1999, Hartwell et al. [9] introduced a notion of a functional module, a group of cellular components and their interaction that can be attributed a specific biological function. The authors also suggested the modular organization of molecular interaction networks, where each functional module involves a small number of cellular components and is autonomous, i.e., its interaction with other modules is limited to a few cellular components. Subsequently, this assumption was used in several computational methods to identify protein complexes and functional modules in high-throughput protein interaction networks [10-15]. Some methods [10-13] look for densely connected subgraphs within a protein interaction network, either cliques or \"cliquish\" components. For example, Spirin et al. [13] use the term functional module to denote groups of proteins which are densely connected within themselves but sparsely connected with the rest of the network. Other methods [14,15] combine protein interaction with other information to identify functional modules, such as signal transduction pathways, that do not necessarily correspond to densely connected regions of the network.\nIn a recent paper, Gagneur et al. applied modular decomposition to elucidate the organization of protein complexes [16]. The basic principle behind modular decomposition is to iteratively identify and contract nodes that are in a certain sense equivalent, until no more equivalent nodes can be found in the graph. A graph is called prime if it cannot be decomposed any further. Only graphs that belong to a very special graph family called cographs can be completely decomposed (that is, the iterative reduction process does not halt with a non-trivial prime graph). While the modular decomposition provides an excellent description of combinatorial variants within a family of complexes, it does not impose any order on the complexes within the family. As such it lacks the description power to represent the dynamics of complex formation, i.e., the manner in which proteins form transient interactions to participate in the complexes within the family. The order imposed on protein complexes within the family is particularly interesting if the family corresponds to a functional module where biological function is achieved through a dynamic formation of protein complexes and the order reflects this formation.\nIn this work, we model a functional module as a union of overlapping dense subnetworks called here functional groups. A functional group is either a maximal clique (typically representing a protein complex) or a set of alternative variants of such complexes/cliques. As components of a larger functional module, functional groups are not assumed to be well separated and can have significant overlaps. Intuitively, if a functional module performs a function that requires a sequence of steps (like in the case of a signaling pathway) then we would like functional groups to be snapshots of protein associations at these steps. We propose a new method for identifying and representing overlapping functional groups in a functional module. Furthermore, if the module corresponds to a dynamic process that requires certain complexes (or more generally functional groups) come into contact in a specific order, our method attempts to discover this order. Our method is motivated by a fundamental result for chordal graphs [17], which states that every chordal graph has the so called clique tree representation. However, not every protein interaction network is chordal and not every functional group is a clique. Therefore, we developed a graph-theoretical framework that enables automatic construction of a tree-like representation, analogous to the clique tree representation, for much broader family of graphs. We call this representation the Tree of Complexes representation. The nodes in the tree are functional groups, and for every protein, the set of functional groups that contain this protein forms a single subtree. The \"single subtree\" requirement restricts significantly the way in which the nodes of the tree can be interconnected. As a consequence, this representation shows a smooth transition between functional groups and allows for tracking a protein's path through a cascade of functional groups. Therefore, depending on the nature of the network, the representation may be capable of elucidating temporal relations between functional groups.\nWe developed a new method, Complex Overlap Decomposition (COD), that given a protein interaction network identifies its functional groups and constructs the Tree of Complexes representation. Our method requires that the network satisfies certain mathematical properties. We applied the COD method to several protein interaction networks, such as the TNFα/NF-κB signaling pathway and the pheromone signaling pathway. The corresponding subnetworks for all interaction networks are extracted from high throughput experimental data. Our results show that the COD method opens a new avenue for the analysis of protein interaction networks.","divisions":[{"label":"title","span":{"begin":0,"end":10}},{"label":"p","span":{"begin":11,"end":891}},{"label":"p","span":{"begin":892,"end":1466}},{"label":"p","span":{"begin":1467,"end":3124}},{"label":"p","span":{"begin":3125,"end":4339}},{"label":"p","span":{"begin":4340,"end":6399}}],"tracks":[{"project":"2_test","denotations":[{"id":"16722537-10688190-1694473","span":{"begin":1057,"end":1058},"obj":"10688190"},{"id":"16722537-11283351-1694474","span":{"begin":1059,"end":1060},"obj":"11283351"},{"id":"16722537-11805837-1694475","span":{"begin":1135,"end":1136},"obj":"11805837"},{"id":"16722537-11805826-1694476","span":{"begin":1137,"end":1138},"obj":"11805826"},{"id":"16722537-14743216-1694477","span":{"begin":1463,"end":1464},"obj":"14743216"},{"id":"16722537-10521342-1694478","span":{"begin":1839,"end":1840},"obj":"10521342"},{"id":"16722537-15284103-1694479","span":{"begin":1941,"end":1942},"obj":"15284103"},{"id":"16722537-15548452-1694480","span":{"begin":1943,"end":1944},"obj":"15548452"},{"id":"16722537-10591225-1694481","span":{"begin":1973,"end":1974},"obj":"10591225"},{"id":"16722537-12525261-1694482","span":{"begin":2566,"end":2568},"obj":"12525261"},{"id":"16722537-12538875-1694482","span":{"begin":2566,"end":2568},"obj":"12538875"},{"id":"16722537-12711690-1694482","span":{"begin":2566,"end":2568},"obj":"12711690"},{"id":"16722537-14517352-1694482","span":{"begin":2566,"end":2568},"obj":"14517352"},{"id":"16722537-12413400-1694482","span":{"begin":2566,"end":2568},"obj":"12413400"},{"id":"16722537-14576317-1694482","span":{"begin":2566,"end":2568},"obj":"14576317"},{"id":"16722537-12525261-1694483","span":{"begin":2588,"end":2590},"obj":"12525261"},{"id":"16722537-12538875-1694483","span":{"begin":2588,"end":2590},"obj":"12538875"},{"id":"16722537-12711690-1694483","span":{"begin":2588,"end":2590},"obj":"12711690"},{"id":"16722537-14517352-1694483","span":{"begin":2588,"end":2590},"obj":"14517352"},{"id":"16722537-14517352-1694484","span":{"begin":2739,"end":2741},"obj":"14517352"},{"id":"16722537-12413400-1694485","span":{"begin":2917,"end":2919},"obj":"12413400"},{"id":"16722537-14576317-1694486","span":{"begin":2920,"end":2922},"obj":"14576317"},{"id":"16722537-15287979-1694487","span":{"begin":3241,"end":3243},"obj":"15287979"}],"attributes":[{"subj":"16722537-10688190-1694473","pred":"source","obj":"2_test"},{"subj":"16722537-11283351-1694474","pred":"source","obj":"2_test"},{"subj":"16722537-11805837-1694475","pred":"source","obj":"2_test"},{"subj":"16722537-11805826-1694476","pred":"source","obj":"2_test"},{"subj":"16722537-14743216-1694477","pred":"source","obj":"2_test"},{"subj":"16722537-10521342-1694478","pred":"source","obj":"2_test"},{"subj":"16722537-15284103-1694479","pred":"source","obj":"2_test"},{"subj":"16722537-15548452-1694480","pred":"source","obj":"2_test"},{"subj":"16722537-10591225-1694481","pred":"source","obj":"2_test"},{"subj":"16722537-12525261-1694482","pred":"source","obj":"2_test"},{"subj":"16722537-12538875-1694482","pred":"source","obj":"2_test"},{"subj":"16722537-12711690-1694482","pred":"source","obj":"2_test"},{"subj":"16722537-14517352-1694482","pred":"source","obj":"2_test"},{"subj":"16722537-12413400-1694482","pred":"source","obj":"2_test"},{"subj":"16722537-14576317-1694482","pred":"source","obj":"2_test"},{"subj":"16722537-12525261-1694483","pred":"source","obj":"2_test"},{"subj":"16722537-12538875-1694483","pred":"source","obj":"2_test"},{"subj":"16722537-12711690-1694483","pred":"source","obj":"2_test"},{"subj":"16722537-14517352-1694483","pred":"source","obj":"2_test"},{"subj":"16722537-14517352-1694484","pred":"source","obj":"2_test"},{"subj":"16722537-12413400-1694485","pred":"source","obj":"2_test"},{"subj":"16722537-14576317-1694486","pred":"source","obj":"2_test"},{"subj":"16722537-15287979-1694487","pred":"source","obj":"2_test"}]}],"config":{"attribute types":[{"pred":"source","value type":"selection","values":[{"id":"2_test","color":"#ecd393","default":true}]}]}}