Ekonometride Yeni Bir Ufuk: Büyük Veri ve Makine Öğrenmesi

Bu çalışmada büyük veri kavramı, büyük verinin özellikleri, avantajları, zorlukları ve büyük verinin makine öğrenmesi ile ilişkisi araştırılmıştır. Büyük veri ve makine öğrenmesinin ekonometri alanındaki yeri, önemi ve bu konu ile ilgili literatürdeki gelişmeler incelenmiştir. Bunların yanında makine öğrenmesi ile ekonometrinin farklılığı üzerinde de durulmuştur.   Büyük veri kavramı ve makine öğrenmesi uzun yıllardır birçok bilim dallarında yaygın bir şekilde kullanılırken, ekonometri alanında son birkaç yıldır ilgi görmeye başlamıştır. Bu artan ilgi varolan makine öğrenmesi algoritma ve yöntemlerinin ekonometri alanında da kullanılmasına yol açmaktadır.

A New Horizon in Econometrics: Big Data and Machine Learning

Big data and machine learning begun to be interested in econometrics for only past few years while they have been widely used in different areas of science for many years. This growing interest  has led to the use of machine learning algorithms and methods in the field of econometrics.   In this study, the concept of big data, features of big data, its advantages, challenges and the relationship between big data and machine learning were investigated. The role and importance of big data and machine learning in econometrics were researched and the developments in related literature were investigated. In addition, the difference between machine learning and econometrics was also emphasized.

___

  • Abaker I, Hashem T., Yaqoob I., Anuar N.B, Mokhtar S., Gani A., Khan S.U. (2015). The Rise Of “Big Data” On Cloud Computing: Review And Open Research İssues, Information Systems, 47, 98-115.
  • Athey S., Imbens G. (2015). A Measure of Robustness to Misspecification, American Economic Review, 105 (5): 476-80. DOI: 10.1257/aer.p20151020.
  • Athey S. (2017). Beyond prediction: Using Big Data For Policy Problems. Science 355, 483–485.
  • Athey S., Tibshirani J., Wager S.(2017). Generalized Random Forests. arXiv preprint, arXiv:1610.01271, URL https://arxiv.org/abs/1610.01271.
  • Athey, S., Wager S. (2017). Efficient policy learning. arXiv preprint arXiv:1702.02896.
  • Athey, S. (2018). The Impact of Machine Learning on Economics in “The Economics of Artificial Intelligence: An Agenda,” National Bureau of Economic Research, Inc,http://www.nber.org/chapters/c14009.pdf
  • Athey, S. (2015). Machine Learning and Causal Inference for Policy Evaluation, In Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 5–6. ACM.
  • Badaoui F., Amar A., Hassou L., Zoglat A., Okou C.G. (2017). Dimensionality Reduction And Class Prediction Algorithm With Application To Microarray Big Data, Journal of Big Data, 4: 32. https://doi.org/10.1186/s40537-017-0093-4
  • Belloni, A., Chen D., Chernozhukov V., Hansen C. (2012). Sparse Models and Methods for Optimal Instruments with an Application to Eminent Domain. Econometrica, 80: 2369–2429.
  • Belloni, A., Chernozhukov V., Fern´andez-Val I., Hansen C. (2013). Program Evaluation With High-Dimensional Data. arXiv preprint arXiv:1311.2645.
  • Belloni, A., Chernozhukov V., Hansen C. (2011). Inference For Highdimensional Sparse Econometric Models. arXiv preprint arXiv:1201.0220.
  • Belloni, A., Chernozhukov V., Hansen C. (2014). Inference on Treatment Effects After Selection Amongst High-Dimensional Controls, Review of Economic Studies, 81: 608–650.
  • Bickel, P., Ritov Y. and Tsybakov A. (2009). Simultaneous Analysis of Lasso And Dantzig Selector, Annals of Statistics. Vol. 37, No. 4, 1705–1732.
  • Breiman L., Friedman J.H., Olshen R.A., Stone C.J., (1984). Classification and Regression Trees, Wadsworth and Brooks/Cole, Monterey.
  • Chen, H., Chiang, R. H., Storey, V. C. (2012). Business Intelligence and Analytics: From Big Data to Big Impact. MIS quarterly, 36(4), 11651188.
  • Chen C.L.P, Zhang C. (2014). Data-intensive Applications, Challenges, Techniques and Technologies: A Survey on Big Data, Information Sciences, 275, 314-347.
  • Chernozhukov V., Hansen C., Spindler M. (2015). Post-selection and Postregularization Inference in Linear Models With Many Controls and Instruments. The American Economic Review, 105(5):486–490.
  • Doornik, J.A., Hendry, D.F. (2015). Statistical Model Selection With “Big Data”. Cogent Economics & Finance, 3, 1045216, 1-15. doi. 10.1080/23322039.2015.1045216.
  • Dudik, M. Erhan, D. Langford, J. Li, L. (2014). Doubly Robust Policy Evaluation and Optimization. Statistical Science, 29(4):485–511.
  • Einav L., Levin J. (2014 a). The Data Revolution and Economic Analysis, Technical Report, NBER Innovation Policy and the Economy Conference.
  • Einav L., Levin J. (2014 b). Economics in The Age of Big Data, Science, 346, DOI: 10.1126/science.1243089.
  • Fan J., Liao Y.(2014). Endogeneity in High Dimensions. Annals of Statistics, 42(3): 872.
  • Fan, J.; Han, F; Liu, H. (2014). Challanges of Big Data Analysis, National Science Review, 1(2),293-314. DOI:10.1093/nsr/nwt032.
  • Firican G. (2017). The 10 Vs of Big Data, https://tdwi.org/articles/2017/02/08/10-vs-of-big-data.aspx.
  • Fürnkranz J., Gamberger D., Lavrač N. ( 2012). Foundations of Rule Learning, Springer Verlag, Berlin Heidelberg.
  • Gandomi A., Haider M. (2015). Beyond the Hype:Big Data Concepts, Methods, and Analytics, International Journal of Information Management, 35, 137-144.
  • Hastie T., Tibshirani R., Friedman J. (2009). The Elements of Statistical Learning: Data Mining, Inference And Prediction, Springer-Verlag.
  • Hastie T., Tibshirani R., Wainwright M. (2015). Statistical Learning with Sparsity: the Lasso and Generalizations, CRC Press, Taylor and Francis Group.
  • Jin X., Wah B.W., Cheng X., Wang Y. (2015). Significance and Challenges of Big Data Research, Big Data Research, 2, 59-64.
  • Jiang, N., Li, L. (2016). Doubly robust off-policy value evaluation for reinforcement learning. In International Conference on Machine Learning, 652–661.
  • Kaisler S., Armour F., Espinosa J.A., Money W. (2013). Big Data: Issues and Challanges Moving Forward, 46th Hawaii International Conference on System Sciences, IEEE, DOI: 10.1109/HICSS.2013.645.
  • Katal, A., Wazid, M., and Goudar, R. (2013). Big Data: Issues, Challenges, Tools and Good Practices, in Proceeding of the Sixth International Conference on Contemporary Computing (IC3), IEEE, 404–409 (available at http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=6612229).
  • Laney D, (2001). 3-D Data Management: Controlling Data Volume, Velocity And Variety. Application Delivery Strategies by META Group Inc. (2001, February 6),Retrieved from http://blogs.gartner.com/doug-laney/files/2012/01/ad949-3D-Data-Management-Controlling-Data-Volume-Velocity-and-Variety.pdf
  • Mullainathan S. and Spiess J. (2017). Machine Learning: An Applied Econometric Approach, Journal of Economic Perspectives, 31(2): 87–106.
  • Murphy K.P. (2012). Machine Learning a Probalistic Perspective. MIT Press.
  • Ohlhorst, F. (2013). Big Data Analytics: Turning Big Data into Big Money, New Jersey: Wiley Publicity.
  • Owais, S. S., Hussein, N. S. (2016). Extract Five Categories CPIVW from the 9V’s Characteristics of the Big Data. International Journal of Advanced Computer Science & Applications, 1(7), 254-258.
  • Raghupathi W., Raghupathi W. (2014). Big Data Analytics in Healthcare: Promise and Potential, Health Information Science and Systems, 2:3 http://www.hissjournal.com/content/2/1/3
  • Rao, B.L.S. Prakasa (2014). Big data and High Dimensional Data Analysis, CR Rao Advanced Institute Of Mathematics, Statistics And Computer Science (AIMSCS) http://www.crraoaimscs.org/lecture_notes/BLSP_Rao_BiDA2014.pdf.
  • Schroeck M., Shockley R., Smart, J., Romero Morales D., Tufano, P. (2012). Analytics: The Real-World Use of Big Data. How Innovative Enterprises Extract Value From Uncertaindata. www.stthomas.edu/gradsoftware/files/BigData_RealWo rldUse.pdf
  • Taylor L., Schroeder R., Meyer E. (2014). Emerging Practices And Perspectives on Big Data Analysis in Economics: Bigger and Better or More of The Same?, Big Data & Society, 1-10. DOI:10.1177/2053951714536877.
  • TechAmerica Foundation’s Federal Big Data Commission. (2012). Demystifying Bigdata: A Practical Guide To Transforming The Business Of Government. Retrieved from http://www.techamerica.org/Docs/fileManager.cfm?f=techamerica-bigdatareport-final.pdf
  • Tibshirani R. 1996. Regression Shrinkage and Selection via The LASSO. Journal of the Royal Statistical Society B 58(1): 267–288.
  • Varian H.R. (2013). Beyond Big Data, NABE Annual Meeting, San Francisco, CA. http://people.ischool.berkeley.edu/~hal/Papers/2013/BeyondBigDataPaperFINAL.pdf.
  • Varian H.R. (2014). Big Data: New Tricks for Econometrics." Journal of Economic Perspectives, 28(2): 3-28. DOI: 10.1257/jep.28.2.3.
  • Venables W.N., Ripley B.D., (2002). Modern Applied Statistics with S., Springer-Verlag, Newyork.
  • Wager S., Athey S. (2017). Estimation and Inference of Heterogeneous Treatment Effects Using Random Forests, arXiv.org > stat > arXiv:1510.04342, https://arxiv.org/abs/1510.04342.