Prediction of host-pathogen protein interactions by extended network model

Knowledge of the pathogen-host interactions between the species is essentialin order to develop a solution strategy against infectious diseases. In vitro methods take extended periods of time to detect interactions and provide very few of the possible interaction pairs. Hence, modelling interactions between proteins has necessitated the development of computational methods. The main scope of this paper is integrating the known protein interactions between thehost and pathogen organisms to improve the prediction success rate of unknown pathogen-host interactions. Thus, the truepositive rate of the predictions was expected to increase.In order to perform this study extensively, encoding methods and learning algorithms of several proteins were tested. Along with human as the host organism, two different pathogen organisms were used in the experiments. For each combination of protein-encoding and prediction method, both the original prediction algorithms were tested using only pathogen-host interactions and the same methodwas testedagain after integrating the known protein interactions within each organism. The effect of merging the networks of pathogen-host interactions of different species on the prediction performance of state-of-the-art methods was also observed. Successwas measured in terms of Matthews correlation coefficient, precision, recall, F1 score, and accuracy metrics. Empirical results showed that integrating the host and pathogen interactions yields better performance consistently in almost all experiments.

___

  • Baldi P., 2001, BIOINFORMATICS MACHI, V2nd ed
  • Bhargava N, 2013, P INT J ADV RES COMP, V3
  • Bhasin M, 2004, J BIOL CHEM, V279, P23262, DOI 10.1074/jbc.M401932200
  • Bock JR, 2003, BIOINFORMATICS, V19, P125, DOI 10.1093/bioinformatics/19.1.125
  • Bock JR, 2001, BIOINFORMATICS, V17, P455, DOI 10.1093/bioinformatics/17.5.455
  • Breiman L, 2001, MACH LEARN, V45, P5, DOI 10.1023/A:1010933404324
  • Calderone A, 2015, NUCLEIC ACIDS RES, V43, pD588, DOI 10.1093/nar/gku830
  • Chen J, 2007, AMINO ACIDS, V33, P423, DOI 10.1007/s00726-006-0485-9
  • Cleary S, 1995, EDUCATIONAL INNOVATION IN ECONOMICS AND BUSINESS ADMINISTRATION, P108
  • Dasarathy BV, 1991, IEEE COMPUTER SOC TU
  • Davis J., 2006, P 23 INT C MACHINE L, P233, DOI DOI 10.1145/1143844.1143874
  • De Bodt S, 2009, BMC GENOMICS, V10, DOI 10.1186/1471-2164-10-288
  • Dyer MD, 2011, INFECT GENET EVOL, V11, P917, DOI 10.1016/j.meegid.2011.02.022
  • Friedman N, 1997, MACH LEARN, V29, P131, DOI 10.1023/A:1007465528199
  • Guirimand T, 2015, NUCLEIC ACIDS RES, V43, pD583, DOI 10.1093/nar/gku1121
  • Hall M., 2009, SIGKDD EXPLORATIONS, V11, P10, DOI [DOI 10.1145/1656274.1656278, 10.1145/1656274.1656278]
  • Kanehisa M, 2017, NUCLEIC ACIDS RES, V45, pD353, DOI 10.1093/nar/gkw1092
  • Kosesoy I, 2019, COMPUT BIOL CHEM, V78, P170, DOI 10.1016/j.compbiolchem.2018.12.001
  • Ksesoy I, 2018, J COMPUT BIOL, V25, P1120, DOI 10.1089/cmb.2018.0049
  • Kshirsagar M, 2016, LECT NOTES COMPUT SC, V9649, P53
  • Kshirsagar M, 2013, NIPS WORK MACH LEARN, P3
  • Kshirsagar M, 2013, BIOINFORMATICS, V29, P217, DOI 10.1093/bioinformatics/btt245
  • Martin S, 2005, BIOINFORMATICS, V21, P218, DOI 10.1093/bioinformatics/bth483
  • Mei SY, 2013, PLOS ONE, V8, DOI 10.1371/journal.pone.0079606
  • Mondal KC, 2012, BIOINFORMATICS: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON BIOINFORMATICS MODELS, METHODS AND ALGORITHMS, P164, DOI 10.5220/0003769001540173
  • Mukhopadhyay A, 2010, 2010 International Conference on Systems in Medicine and Biology (ICSMB), P344, DOI 10.1109/ICSMB.2010.5735401
  • Muralidharan V, 2012, APPL SOFT COMPUT, V12, P2023, DOI 10.1016/j.asoc.2012.03.021
  • Naghavi M, 2015, LANCET, V385, P117, DOI 10.1016/S0140-6736(14)61682-2
  • Nanni L, 2005, NEUROCOMPUTING, V68, P289, DOI 10.1016/j.neucom.2005.03.004
  • Nourani E, 2015, FRONT MICROBIOL, V6, DOI 10.3389/fmicb.2015.00094
  • Orchard S, 2014, NUCLEIC ACIDS RES, V42, pD358, DOI 10.1093/nar/gkt1115
  • Qi YJ, 2010, BIOINFORMATICS, V26, pi645, DOI 10.1093/bioinformatics/btq394
  • Ray S, 2012, PROC INT CONF EMERG, P28, DOI 10.1109/EAIT.2012.6407854
  • Shen JW, 2007, P NATL ACAD SCI USA, V104, P4337, DOI 10.1073/pnas.0607879104
  • Szklarczyk D, 2017, NUCLEIC ACIDS RES, V45, pD362, DOI 10.1093/nar/gkw937
  • Tekir SD, 2013, BIOINFORMATICS, V29, P1357, DOI 10.1093/bioinformatics/btt137
  • Wattam AR, 2017, NUCLEIC ACIDS RES, V45, pD535, DOI 10.1093/nar/gkw1017
  • Wu XM, 2006, NUCLEIC ACIDS RES, V34, P2137, DOI 10.1093/nar/gkl219
  • Xu Q, 2010, IEEE INT C BIOINFORM, P62, DOI 10.1109/BIBM.2010.5706537
  • You ZH, 2015, PLOS ONE, V10, DOI 10.1371/journal.pone.0125811
  • Zhou HF, 2013, J BIOINF COMPUT BIOL, V11, DOI 10.1142/S0219720012300018