PMC:4307189 / 18276-21521
Annnotations
2_test
{"project":"2_test","denotations":[{"id":"25599599-20100321-14868124","span":{"begin":2925,"end":2927},"obj":"20100321"}],"text":"Bayesian modeling\nWe used a fully Bayesian approach for modeling our gene-gene relationship network. Parameter estimation is a crucial step in statistical modeling, for which a classical approach is maximum likelihood estimation (MLE). However, unlike MLE, Bayesian techniques involve calculation of posterior probabilities of model parameters by training the model with given data. We assume that the data follows the generative model , and assign a prior probability Pθ|ℳ to the parameter vector θ under the model . Then Bayes’ rule for calculating posterior probability is as follows: (4) Prθ|ℳ,D=PrD|θ,ℳ×Prθ|ℳZ\nwhere PrD|θ,ℳ is the likelihood function. Now, the marginal likelihood can be expressed as (5) Z=PrD|ℳ=∫PrD|ℳ,θ×Pθ|ℳdθ,\nComputing the exact solution for the marginal likelihood is often intractable since it is prone to the curse of dimensionality. Fortunately, Markov Chain Monte Carlo (MCMC) methods such as Gibbs sampling and Metropolis-Hastings methods do not require to be explicitly computed. In general, MCMC methods are stochastic simulation techniques which generate samples from the joint distribution Pℳ,θ|D for calculating the posterior probabilities of parameters. Here we used Gibbs sampling methods, which sample iteratively, one parameter at a time, from the full conditional distribution given the current and previous values of all other parameters. To implement Gibbs sampling, we employed WinBUGS [29], which is a high-level software package providing an easy interface for implementing complex Bayesian models. In WinBUGS, users are free from background lower-level programming details, and only have to express the model precisely.\nWe hypothesized that gene-pairs involved in drug resistance are likely to be found with high probabilities in the resistant network but low probabilities in the parental network. Therefore, we built two networks, one from resistant datasets and the other from parental datasets. In this Bayesian approach, the model likelihood is defined in Equations (2) and (3), where Yk is the data matrix calculated from the observed data u. Here we have two Yk data matrices, namely a gene-gene relationship network YkR derived from resistant samples and YkP derived from parental samples.\nOur approach is a hierarchical Bayesian model in that model parameters are in turn dependent on hyperparameters. We assign the density parameter θ in Equation (2) a normal prior distribution with mean 0 and standard deviation σθ. (6) θ∼N0,σθ2\nNote, in WinBUGS the parameter τ, called the precision, replaces the standard deviation parameter σ of the normal distribution, where, τ=σ−2. For the hyperparameter τθ we specify a gamma prior distribution as follows, since it is a conjugate prior for the normal distribution: (7) τθ∼Gammaa0,b0\nWe set a0 = 0.001 and b0 = 0.001 to make the prior for θnoninformative, making its standard deviation wide to express large uncertainty [19]. For attractiveness/ expansiveness parameters αi and αj, we followed the approach used by Adams et al. [30]. (8) αiRαiP∼N00,Σ\n(9) Σ − 1 ∼ Wishart 1 0 0 1 , 2\nHere, αiR and αiP represent the expansiveness/attractiveness parameters for the network model of resistant and parental conditions, respectively."}