Tourism demand modelling and forecasting using data mining techniques in multivariate time series: a case study in Turkey

Tourism demand modelling and forecasting using data mining techniques in multivariate time series: a case study in Turkey

:In this study multiple linear regression, multilayer perceptron (MLP) regression, and support vector regression (SVR) are used to make multivariate tourism forecasting for Turkey. This paper is a comparative study of data mining techniques based on multivariate regression modelling with monthly data points to forecast tourism demand; it focuses on Turkey. Both MLP and SVR methods are widely employed in the variety forecasting problems. Most of the previous research on tourism forecasting used univariate time series or a limited number of variables with mostly yearly or quarterly, and rarely monthly frequencies. However, the application of data mining techniques for multivariate forecasting in the context of tourism demand has not been widely explored. This paper differs from earlier research in two ways: 1) it proposes multivariate regression modelling with monthly data points to forecast tourism demand; and 2) it focuses on Turkey by using a dataset with the most recently accumulated (between January 1996 and Dec 2013) 67 time series with respect to Turkey and its top 26 major tourism clients. Comparison of forecasting performances in terms of relative absolute error (RAE) and root relative squared error (RRSE) measurements shows that the SVR model, with RAE = 12.34% and RRSE = 14.02%, gives a better performance. The results obtained in this study provide information for researchers interested in applying data mining techniques to tourism demand forecasting and help policy makers, government bodies, investors, and managers for their regularization, planning, and investments by way of accurate tourism demand forecasting.

___

  • [1] Archer B. Demand forecasting and estimation. In: Ritchie JRB, Charles RG, editors. Travel, Tourism, and Hospitality Research - A Handbook for Managers and Researchers. New York, NY, USA: John Wiley & Sons, 1987. pp. 77-85.
  • [2] Petropoulos C, Nikolopoulos K, Patelis VA. A technical analysis approach to tourism demand forecasting. Appl Econ Lett 2005; 12: 327-333.
  • [3] Pai PF, Hong WC. An improved neural network model in forecasting tourist arrivals. Ann Tourism Res 2005; 32: 1138-1141.
  • [4] Aslan A, Kaplan M, Kula F. International tourism demand for Turkey: a dynamic panel data approach. Research Journal of International Studies 2009; 9: 65-73.
  • [5] Turkish Tourism Industry Report. Investment Support and Promotion Agency of Turkey. Prime Ministry of the Republic of Turkey, 2010.
  • [6] World Tourism Organization. UNWTO Tourism Highlights. UNWTO, 2013.
  • [7] Granger CWJ, Terasvirta T. Modelling Nonlinear Economic Relationships. Oxford, UK: Oxford University Press, 1993.
  • [8] Efendigil T, On¨ut S, Kahraman C. A decision support system for demand forecasting with artificial neural networks ¨ and neuro-fuzzy models: a comparative analysis. Expert Syst Appl 2009; 36: 6697-6707.
  • [9] Gooijer JGD, Hyndman RJ. 25 years of time series forecasting. Int J Forecasting 2006; 22: 443-473.
  • [10] Zhang GP, Patuwo EB, Hu MY. Forecasting with artificial neural networks: the state of the art. Int J Forecasting 1998; 14: 35-62.
  • [11] Song H, Li G. Tourism demand modelling and forecasting - a review of recent research. Tourism Manage 2008; 29: 203-220.
  • [12] Witt SF, Witt CA. Forecasting tourism demand: a review of empirical research. Int J Forecasting 1995; 11: 447-475.
  • [13] Aslanargun A, Mammadov M, Yazici B, Yolacan S. Comparison of ARIMA, neural networks and hybrid models in time series: tourist arrival forecasting. J Stat Comput Sim 2007; 77: 29-53.
  • [14] Chen KY, Wang CH. Support vector regression with genetic algorithms in forecasting tourism demand. Tourism Manage 2005; 28: 215-226.
  • [15] Vapnik V, Golowich S, Smola A. Support vector machine for function approximation regression estimation and signal processing. Adv Neur In 1996; 9: 281-287.
  • [16] Hong WC. Electric load forecasting by support vector model. Appl Math Model 2009; 33: 2444-2454.
  • [17] Lin KP, Pai PF. A fuzzy support vector regression model for business cycle predictions. Expert Syst Appl 2010; 37: 5430-5435.
  • [18] Uysal, M. The determinants of tourism demand: a theoretical perspective. In: Debbage KG, Ioannides D, editors. The Economic Geography of the Tourist Industry: A Supply-Side Analysis. London, UK: Routledge, 1998. pp. 79-95.
  • [19] Haykin SO. Neural Networks: A Comprehensive Foundation. 2nd ed. Upper Saddle River, NJ, USA: Prentice Hall, 1998.
  • [20] Jones MT. Artificial Intelligence: A Systems Approach. Hingham, MA, USA: Infinity Science Press LLC, 2008.
  • [21] Bishop CM. Neural Networks for Pattern Recognition. Oxford, UK: Clarendon press, 1995.
  • [22] Vapnik V. The Nature of Statistical Learning Theory. New York, NY, USA: Springer, 1995.
  • [23] Witten IH, Frank E. Data Mining: Practical Machine Learning Tools and Techniques. 2nd ed. San Francisco, CA, USA: Morgan Kaufmann, 2005.
  • [24] Drucker H, Burges CJC, Kaufman L, Smola A, Vapnik V. Support vector regression machines. Adv Neural In 1997; 9: 155-161.
  • [25] Kecman V. Learning and soft computing: support vector machines, neural networks and fuzzy logic models. Cambridge, MA, USA: The MIT Press, 2001.
  • [26] McNelis PD. Neural Networks in Finance: Gaining Predictive Edge in the Market. Burlington, MA, USA: Elsevier Academic Press, 2005.
  • [27] Hall M, Frank E, Holmes G, Pfahringer B, Reutemann P, Witten IH. The WEKA data mining software: an update. ACM SIGKDD Explorations Newsletter 2009; 11: 10-18.
  • [28] Choy KL, Lee WB, Lo V. Design of an intelligent supplier relationship management system: a hybrid case based neural network approach. Expert Syst Appl 2003; 24: 225-237.
  • [29] Lenard MJ, Alam P, Madey GR. The application of neural networks and a qualitative response model to the auditor’s going concern uncertainty decision. Decision Sci 1995; 26: 209-227.
  • [30] Piramuthu S, Shaw M, Gentry J. A classification approach using multilayered neural networks. Decis Support Syst 1994; 11: 509-525.
  • [31] Rumelhart DE, Hinton GE, Williams RJ. Learning internal representations by error propagation. In: Rumelhart DE, McClelland JL, editors. Parallel Distributed Processing. Cambridge, MA, USA: MIT Press, 1986. pp. 318-362.
  • [32] Palmer A, Montano JJ, Sese A. Designing an artificial neural network for forecasting tourism time series. Tourism Manage 2006; 27: 781-790.
  • [33] Platt JC. Fast training of support vector machines using sequential minimal optimization. In: Sch¨olkopf B, Burges C, Smola A, editors. Advances in Kernel Methods: Support Vector Learning. Cambridge, MA, USA: MIT Press, 1998. pp. 185-208.
  • [34] Sch¨olkopf B, Smola AJ. Learning with Kernels - Support Vector Machines, Regularization, Optimization, and Beyond. Cambridge, MA, USA: MIT Press, 2001.
  • [35] Shevade SK, Keerthi SS, Bhattacharyya C, Murthy KRK. Improvements to SMO algorithm for SVM regression. IEEE Neural Networ 2000; 11: 1188-1193.
  • [36] Duan K, Keerthi SS, Poo AN. Evaluation of simple performance measures for tuning SVM hyperparameters. Neurocomputing 2003; 51: 41-59.
  • [37] Keerthi SS. Efficient tuning of SVM hyper-parameters using radius/margin bound and iterative algorithms. IEEE Neural Networ 2002; 13: 1225-1229.
  • [38] Fei SW, Liu CL, Miao YB. Support vector machine with genetic algorithm for forecasting of key-gas ratios in oil-immersed transformer. Expert Syst Appl 2009; 36: 6326-6331.
  • [39] Cristianini N, Shawe-Taylor J. An introduction to support vector machines and other kernel-based learning methods. New York, NY, USA: Cambridge University Press, 2000.
  • [40] Gunn SR. Support Vector Machines for Classification and Regression. Southampton, UK: University of Southampton, 1997.
  • [41] Vapnik V. Statistical Learning Theory. New York, NY, USA: Wiley, 1998.
  • [42] Basak D, Pal S, Patranabis DC. Support vector regression. Neural Information Processing – Letters and Reviews 2007; 11, 203-224.
  • [43] Deng S, Liu DM, Hsieh SL. Applying machine learning methods to the shooting accuracy prediction: a case study of T-75 pistol shooting. Information Technology Journal 2011; 10: 1508-1517.
  • [44] Campbell C, Cristianini N. Simple training algorithms for support vector machines. Technical Report CIG-TR-KA, University of Bristol, Engineering Mathematics, Computational Intelligence Group, 1999.
  • [45] Brown MPS, Grundy WN, Lin D, Cristianini N, Sugnet C, Ares M, Haussler D. Support Vector Machine Classi- fication of Microarray Gene Expression Data. Technical Report UCSC-CRL-99-09, University of California, Santa Cruz, CA, USA, 1999.
  • [46] Fei SW, Wang MJ, Miao YB, Tu J, Liu CL. Particle swarm optimization-based support vector machine for forecasting dissolved gases content in power transformer oil. Energ Convers Manage 2009; 50: 1604-1609.
  • [47] Pesaran MH, Timmermann A. Predictability of stock returns: robustness and economic significance. J Financ 1995; 50: 1201-1227.
  • [48] Thawornwong S, Enke D. The adaptive selection of financial and economic variables for use with artificial neural networks. Neurocomputing 2004; 56: 205-232.
  • [49] Cheng J, Qian JS, Niu GD, Guo YN. Gas concentration forecasting based on support vector regression in correlation space via KPCA. In: Yu W, He H, Zhang N, editors. Advances in Neural Networks – ISNN, 2009. Berlin, Germany:Springer-Verlag, 2009, pp. 937-945.
  • [50] Li D, Mersereau RM, Simske S. Blind image deconvolution through support vector regression. IEEE Neural Networ 2007; 18: 931-935.
  • [51] Reinsel GC. Elements of Multivariate Time Series Analysis. 2nd ed. New York, NY, USA: Springer-Verlag, 1997.