Yapısal Bağımlılık Altında Karmaşık MAPK Yolunun Bayesci Tahmini

MAPK yolu, tüm ökaryotlarda bulunan hücresel büyüme kontrolünü düzenleyen başlıca sinyal iletim sistemlerinden biridir. Hayati görevinden dolayı sistemin idaresi çok sayıda protein vasıtasıyla yürütülür, buna bağlı olarak karmaşık bir yapı oluşturur. Çalışmada, Euler yaklaşımına dayalı MCMC teknikleriyle bu sistemin tahmininde diğer proteinlerle yüksek yapısal bağımlılıklar gösteren birçok proteinin varolduğu gözlenmiştir. Bu proteinler kabul edilme olasılıklarını imkansız yapan tekil difüzyon/varyans matrislerine neden olmuşlardır. Bu nedenle bu sorunlu proteinler tahmin hesabının başında çıkarılmış ve parametreler sadece sistemdeki doğrusal bağımsız türler kullanarak tahmin edilmiştir. Ancak bu durumda da özellikle bağımlı türlerin sayısı arttıkça, tahminin doğruluğu bahsedilen eliminasyondan oldukça etkilenmektedir. Bu proteinlerin elenmesi MCMC’deki mevcut kayıp terim sayısının belirgin derecede artmasına neden olmaktadır. Bu çalışmada dolaylı yoldan bu proteinler, bağımlı terimlerin bağımsız türlerin doğrusal kombinasyonu şeklinde simülasyon eden alternatif bir yaklaşımla hesaplamaların içine katılmaktadır. Bu şekilde reaksiyon oranlarının ve durumlarının kabul edilme olasılıklarını hesaplamada bağımlı türlerin etkileri ilave edilebilmektedir. Analizlerden, bahsedilen yeniliğin tahminlerin ortalama hatalarını azalttığı ve MAPK yolunun tahmininde daha az hesaplama maliyeti önerdiği sonucuna varılmıştır.

Bayesian Inference of the Complex MAPK Pathway Under the Structural Dependency

The MAPK pathway is one of the main signal transaction system in all eukaryotes which regulates the cellular growth control. Because of its vital role, the regulation of the pathway is conducted via many proteins, thereby constitutes a complex structure. In inference of this system via MCMC techniques based on the Euler approximation, we have observed that there are many proteins which indicate high structural dependencies on other proteins and these species have caused singular diffusion matrices, hereby resulted in infeasible acceptance probabilities. Therefore, we have discarded these problematic substrates at the beginning of the inference and estimated the parameters by using merely linearly independent species in the system. However in that case, the accuracy of the estimation has been highly affected by the underlying exclusion, particularly, when the number of dependent species was big. The elimination of those proteins has led to a significant rise in the number of current missing components in MCMC. In this study, we implicitly include these proteins in our computation via an alternative approach which simulates dependent terms as a linear combination of linearly independent species. In that way, we can add the effect of dependent species in the calculation of acceptance probabilities of reaction rates and states. From the analysis, we conclude that the highlighted innovation decreases the average error of estimates and suggests less computational cost in inference of the MAPK pathway.

___

  • Bower, J.M., and Bolouri, H., 2001. Computational modelling of genetic and biochemical networks (Second edition). Massachusetts Institute of Technology. Cambridge. Massachusetts.
  • Boys, R.J., Wilkinson, D.J., and Kirkwood, T.B.L., 2008. Bayesian inference for a discretely observed stochastic kinetic model. Statistical Computing, 18, 125-135.
  • Elerian, B.O., Chib, S., and Shephard, N., 2001. Likelihood inference for discretely observed nonlinear diffusions. Econometrica, 69 (4), 959-993.
  • Eraker, B., 2001. MCMC analysis of diffusion models with application to finance. Journal of Business and Economic Statistics, 19 (2), 177–191.
  • Gelman, A., Carlin, J.B., Stern, H.S., and Rubin, D.B., 2004. Bayesian data analysis. Chapman and Hall/CRC. Florida. U.S.A.
  • Geweke, J., 1992. Evaluating the accuracy of sampling-based approaches to the calculation of posterior moments. Bayesian Statistics 4 in Bernando, J.M., Berger, J.O., Dawid, A. P., and Smith, A.F.M. (Eds), 169-193. Oxford University Press. Oxford.
  • Gibson, M.A., and Bruck, J., 2000. Efficient exact stochastic simulation of chemical systems with many species and many channels. Journal of Physical Chemistry, A(104), 1876–1889.
  • Gillespie, D.T., 1977. Exact stochastic simulation of coupled chemical reactions. Journal of Physical Chemistry, 81 (25), 2340–2361.
  • Gillespie, D.T., 1992. A rigorous derivation of the chemical master equation. Physica, A 188, 404–425.
  • Golightly, A., and Wilkinson, D.J., 2005. Bayesian inference for stochastic kinetic models using a diffusion approximation. Biometrics, 61 (3), 781–788.
  • Golightly, A., and Wilkinson, D.J., 2006a. Bayesian sequential inference for nonlinear multivariate diffusions. Statistics and Computing, 16, 323-338.
  • Golightly, A., and Wilkinson, D.J., 2006b. Bayesian sequential inference for stochastic kinetic biochemical network models. Journal of Computational Biology, 13 (3), 838-851.
  • Kolch, W., Calder, M., and Gilbert, D., 2005. When kinases meet mathematics: the systems biology of MAPK signaling. FEBS Letters, 579, 1891–1895.
  • Orton, R., Sturm, O.E., Vyshemirsky, V., Calder, M., Gilbert, D.R., and Kolch, W., 2005. Computational modelling of the receptor-tyrosine-kinase-activated MAPK pathway. Biochemical Journal, 392, 249–261.
  • Purutçuoğlu, V., and Wit, E., 2006. Exact and approximate stochastic simulations of the MAPK pathway and comparisons of simulations’ results. Journal of Integrative Bioinformatics, 3, 231-243.
  • Purutçuoğlu, V., and Wit, E., 2008a. Inclusion of convoluted measurements in Bayesian inference of the MAPK/ERK pathway via multivariate diffusion model. Proceeding of the Third International Symposium on Health, Informatics and Bioinformatics in Sezerman, U. (Ed), Sabancı University, İstanbul, Turkey, CD-Rom.
  • Purutçuoğlu, V., and Wit, E., 2008b. Bayesian inference for the MAPK/ERK pathway by considering the dependency of the kinetic parameters. Bayesian Analysis, 3 (4), 851-86.
  • Roberts, G.O., Gelman, A., and Gilks, W.R., 1997. Weak convergence and optimal scaling of random walk metropolis algorithms. The Annals of Applied Probability, 77(1), 110-120.
  • Roberts, G.O., and Rosenthal, J.S., 1998. Optimal scaling of discrete approximations to Langevin diffusions. Journal of Royal Statistical Society, Series B, 60 (1), 255-268.
  • Roberts, G.O., and Stramer, O., 2001. On inference for partially observed nonlinear diffusion models using the Metropolis-Hastings algorithm. Biometrika, 88 (3), 603–621.
  • Turner, T.E., Schnell, S., and Burrage, K., 2004. Stochastic approaches for modelling in vivo reactions. Computational Biology and Chemistry, 28, 165–178.
  • Wilkinson, D.J., 2006. Stochastic modelling for systems biology. Chapman and Hall/CRC. Florida. U.S.A.