Materials and method Dataset A global gene expression (GE) dataset (GSE38376) from 1) cells sensitive to lapatinib (said to be under "parental conditions") and 2) cells with acquired resistance to lapatinib was obtained from Komurov et al. [17]. Expression values were measured using Illumina HumanHT-12 V3.0 expression beadchip (GPL6947). Samples include SKBR3 parental and resistant (SKBR3-R) each under basal conditions and in response to 0.1 μM and 1 μM lapatinib after 24 hours, where the resistant cell line variant (SKBR3-R) showed 100-fold more resistance to lapatinib treatment than the parental SKBR3 cell line, as reported by Komurov et al. [17]. These gene expression datasets used probe-level annotation, which we converted into gene-level annotation. To obtain gene-level GE values, probes were mapped to gene symbols using the corresponding annotation file (GPL6947). While mapping, the average GE values were calculated across all probes if the same gene symbol was annotated to multiple probes. Two GE data matrices were constructed for parental SKBR3 cell lines and resistant SKBR3-R cell lines, respectively, where rows were labelled with gene symbols and columns were labelled with different treatment conditions (0, 0.1 μM and 1 μM of lapatinib). Construction of a gene-gene relationship network We define the gene-gene relationship network as GGR:= (S,R) for each GE data matrix. Here, S is a set of 370 cancer related genes collected from the Cancer Gene Census [23]. R is defined as the set of pair-wise relationships among seed genes. A gene pair (genei, genej) is included in R if the corresponding absolute Pearson Correlation Coefficient (PCC) is above some threshold, and defined as a pair-wise relationship. These threshold values were empirically chosen for parental and resistant conditions individually, based on the corresponding distributions of all pairwise absolute PCC values. Note PCC values resulting from probes mapped to the same gene were trivially ignored. Bayesian statistical modeling of GGR network Network model For statistical modeling of networks, exponential families of distributions offer robust and flexible parametric models [24]. These probabilistic models can be used to evaluate the probability that an edge is present in the network. They can also be used to quantify topological properties of networks by summarizing them in a parametric form and associating sufficient statistics with those parameters [19,24]. In this study, we use a special class of exponential family distributions known as ERGM (Exponential Random Graph Models), also known as the p1-model, which was introduced by Holland and Leinhardt [24]. A gene-gene relationship network with g genes can be regarded as a random variable X taking values from a set G containing all 2g(g−1) possible relationship networks [24,25]. Let u be a generic point of G which can alternatively be denoted as the realization of X by X = u. Let the binary outcome uij = 1 if genei interacts with genej, or uij = 0 otherwise. Then u is a binary data matrix [19]. Let Pr(u) be the probability function on G given by (1) Pr(u)=Pr(X=u)=1κθexp∑pθpzpu where zp(u) is the network statistic of type p, θp is the parameter associated with zp(u) and κ(θ) is the normalizing constant that ensures Pr(u) is a proper probability distribution (sums to 1 over all u in G) [26]. The parameter θ is a vector of model parameters associated with network statistics and needs to be estimated. See [24] for further details. A major limitation of the p1-model is the difficulty of calculating the normalizing constant, κ(θ), since it is a sum over the entire graph space. Estimating the maximum likelihood of this model becomes intractable as there are 2g(g−1) possible directed graphs (or 2g(g−1)2 undirected graphs), each having g nodes (genes). A technique called maximum pseudolikelihood estimation has been developed to address this problem [27]. This technique employs MCMC methods such as Gibbs or Metropolis-Hastings sampling algorithms [28]. The construction of the p1-model for a directed network is described in an Appendix Additional file 1: Appendix I. For the gene-gene relationship network with undirected edges, the description of the p1-model can be simplified by using only two Bernoulli variables Yij0 and Yij1 instead of four as follows: Yijk=1ifuij=k,0otherwise The simplified p1-model can then be defined using the following two equations to predict the probability of an edge being present between genei and genej: (2) logPrYij1=1=λij+θ+αi+αj (3) log Pr Y ij 0 = 1 = λ ij for i