Data analysis through social media according to the classified crime

The amount and variety of data generated through social media sites has increased along with the widespread use of social media sites. In addition, the data production rate has increased in the same way. The inclusion of personal information within these data makes it important to process the data and reach meaningful information within it. This process can be called intelligence and this meaningful information may be for commercial, academic, or security purposes. An example application is developed in this study for intelligence on Twitter. Crimes in Turkey are classified according to Turkish Statistical Institute criminal data and keywords are defined according to this data. A total of 150,000 tweet data in the Turkish language are collected from Twitter between specified dates and processed by Turkish Zemberek natural language processing. It is seen that 56 % of the people are talking about terrorist attacks and bombing attacks on the study dates. The words "bomb", "terror", "attack", "organization", and "explode" have percentages of 24 %, 12 %, 8 %, 6 %, and 6 %, respectively. Moreover, associations between words and situations are found. Correlations are important to create new subclusters like "terror" and "rape" in this study with 0.90 correlation. Bigger masses can be accessible by expanding keyword groups to have a clear picture of the real situation.

___

  • Savaş S, Topaloğlu N, Güler O. The determination of user’s preferences on some domain names in Turkey: a survey application. International Journal of Informatics Technologies 2015; 8: 51-58.
  • Setty K, Bakhshi R. What is big data and what does it have to do with it audit? ISACA Journal 2013; 3: 23-25.
  • Zikopoulos PC, Eaton C, deRoos D, Deutsch T, Lapis G. Understanding Big Data. Analytics for Enterprise Class Hadoop and Streaming Data. New York, NY, USA: McGraw-Hill Osborne, 2012.
  • Bollen J, Mao H, Zeng X. Twitter mood predicts the stock market. J Comput Sci-Neth 2011; 2: 1-8.
  • Zhang X, Fuehres H, Gloor PA. Predicting stock market indicators through twitter “I hope it is not as bad as I fear”. In: COINs2010: Collaborative Innovation Networks Conference; Procedia - Social and Behavioral Sciences; 2013. pp. 55-62.
  • Agnihotri R, Kothandaraman P, Kashyap R, Singh R. Bringing social into sales - the impact of salespeople’s social media use on service behaviors and value creation. Journal of Personal Selling & Sales Management 2012; 32: 333-345.
  • Mostafa MM. More than words: social networks’ text mining for consumer brand sentiments. Expert Syst Appl 2013; 40: 4241-4251.
  • He W, Zha S, Li L. Social media competitive analysis and text mining: a case study in the pizza industry. Int J Inform Manage 2013; 33: 464-472.
  • Court D, Elzinga D, Mulder S, Vetvik OJ. The consumer decision journey. McKinsey Quarterly 2009; 3: 1-11.
  • Vázqueza S, Muñoz-Garcíac Ó, Campanella I, Pocha M, Fisas B, Bel N, Andreu G. A classification of user-generated content into consumer decision journey stages. Neural Networks 2014; 58: 68-81.
  • Li YM, Chen HM, Liou JH, Lin LF. Creating social intelligence for product portfolio design. Decis Support Syst 2014; 66: 123-134.
  • Abrahams AA, Jiao J, Wang GA, Fan W. Vehicle defect discovery from social media. Decis Support Syst 2012; 54: 87-97.
  • Abrahams AA, Jiao J, Wang GA, Fan W, Zhang Z. What’s buzzing in the blizzard of buzz? Automotive component isolation in social media postings. Decis Support Syst 2013; 55: 871-882.
  • Tang X, Yang CC. Detecting social media hidden communities using dynamic stochastic block model with temporal Dirichlet process. ACM T Intel Syst Tec 2014; 5: 36.
  • Weichselbraun A, Gindl S, Scharl A. Enriching semantic knowledge bases for opinion mining in big data applications. Knowl-Based Syst 2014; 69: 78-85.
  • Yang MC, Rim HC. Identifying interesting twitter contents using topical analysis. Expert Syst Appl 2014; 41: 4330-4336.
  • Atzmueller M. Mining social media: key players, sentiments and communities. Data Min Knowl Disc 2012; 2: 411-419.
  • Poria S, Cambria E, Winterstein G, Huang GB. Sentic patterns: dependency-based rules for concept-level sentiment analysis. Knowl-Based Syst 2014; 69: 45-63.
  • Khan FH, Bashir S, Qamar U. TOM: Twitter opinion mining framework using hybrid classification scheme. Decis Support Syst 2014; 57: 245-257.
  • Behrendt S, Richter, A, Trier M. Mixed methods analysis of enterprise social networks. Comput Netw 2014; 75: 560-577.
  • Fu X, Shen Y. Study of collective user behaviour in twitter: a fuzzy approach. Neural Comput Appl 2014; 25: 1603-1614.
  • Lau RYK, Li C, Liao SSY. Social analytics: learning fuzzy product ontologies for aspect-oriented sentiment analysis. Decis Support Syst 2014; 65: 80-94.
  • Dang Y, Zhang Y, Hub PJH, Brown SA, Ku Y, HwangWang J, Chen H. An integrated framework for analyzing multilingual content in web 2.0 social media. Decis Support Syst 2014; 61: 126-135.
  • Wu Y, Liu S, Yan K, Liu M, Wu F. OpinionFlow: Visual analysis of opinion diffusion on social media. IEEE T Vis Comput Gr 2014; 20: 1763-1772.
  • Grieve R, Witteveen K, Tolan GA. Social media as a tool for data collection: examining equivalence of socially value-laden constructs. Curr Psychol 2014; 33: 532-544.
  • Weidemann C, Swift J. Social media location intelligence: the next privacy battle - an arcgis add-in and analysis of geospatial data collected from twitter.com. International Journal of Geoinformatics 2013; 9: 21-27.
  • Bell D, Koulouri T, Lauria S, Macredie RD, Sutton J. Microblogging as a mechanism for human–robot interaction. Knowl-Based Syst 2014; 69: 64-77.
  • Günüç S, OdabaşıHF, Kuzu A. The defining characteristics of students of the 21st century by student teachers: a Twitter activity. Journal of Theory and Practice in Education 2013; 9: 436-455.
  • Martinčić-Ipsić S, Močibob E, Perc M. Link prediction on twitter. PLoS ONE 2017; 12: e0181079.
  • Jalili M, Orouskhani Y, Asgari M, Alipourfard N, Perc M. Link prediction in multiplex online social networks. R Soc Open Sci 2017; 4: 160863.
  • Ban K, Perc M, Levnajić Z. Robust clustering of languages across Wikipedia growth. R Soc Open Sci 2017; 4: 171217.
  • Helbing D, Brockmann D, Chadefaux C, Donnay K, Blanke U, Woolley-Meza O, Moussaid M, Johansson A, Krause J, Schutte S et al. Saving human lives: what complexity science and information systems can contribute. J Stat Phys 2015; 158: 735-781.
  • D’Orsogna MR, Perc M. Statistical physics of crime: a review. Phys Life Rev 2015; 12: 1-21.
  • Omand S, Bartlett J, Miller C. Introducing social media intelligence (SOCMINT). Intelligence and National Security 2012; 27: 801-823.
  • Bayraktar G. The new requirement for the fifth dimension of the war: cyber intelligence. Journal of Security Strategies 2014; 20: 119-147.
  • Kırık AM. Social media-individual interaction and social transformation in the context of Arab Spring. Education and Society in the 21st Century 2012; 1: 87-98 (in Turkish with abstract in English).
  • Dündar E. Sosyal medya: bir protesto aracı. Türk Kütüphaneciliği 2011; 25: 165-172 (translation in Turkish of original article in English).
  • Korkmaz A. Arap baharısürecinde internet ve sosyal medyanın rolü. In: International Symposium on Language and Communication: Research Trends and Challenges; 10–13 June 2012; İzmir, Turkey. pp. 2147-2153.
  • Chakraborty B, Banerjee S. Modeling the evolution of post disaster social awareness from social web sites. In: 2013 IEEE International Conference on Cybernetics; 13–15 June 2013; Lausanne, Switzerland. New York, NY, USA: IEEE. pp. 51-56.
  • Oh, O, Agrawal M, Rao HR. Community intelligence and social media services: A rumor theoretic analysis of tweets during social crises. MIS Quarterly 2013; 37: 407-426.
  • Werbin KC. Spookipedia: Intelligence, social media and biopolitics. Media Cult Soc 2011; 33: 1254–1265.
  • Bertot JC, Jaeger PT. Transparency and technological change: ensuring equal and sustained public access to government information. Gov Inform Q 2010; 27: 371-376.
  • Bertot JC, Jaeger PT, Grimes JM. Using ICTs to create a culture of transparency: E-Government and social media as openness and anti-corruption tools for societies. Gov Inform Q 2010; 27: 264-271.
  • Bertot JC, Jaeger PT, Hansen D. The impact of polices on government social media usage: issues, challenges and recommendations. Gov Inform Q 2012; 29: 30-40.
  • Gerber MS. Predicting crime using twitter and kernel density estimation. Decis Support Syst 2014; 61: 115-125.
  • Kavanaugh AL, Fox EA, Sheetz SD, Yang S, Li LT, Shoemaker DJ, Natsev A, Xie L. Social media use by government: from the routine to the critical. Gov Inform Q 2012; 29: 480-491.
  • Surowiecki J. The Wisdom of Crowds. New York, NY, USA: Anchor Books, 2004.
  • Robu D, Sandu F, Petreus D, Nedelcu A, Balica A. Social networking of instrumentation – a case study in telematics. Adv Electr Comput En 2014; 14: 153-160.
  • Çöltekin C. A freely available morphological analyzer for Turkish. In: Proceedings of the International Conference on Language Resources and Evaluation; 17–23 May 2010; Valletta, Malta. pp. 820-827.
  • Park Y, Kang S, Seo J. information extraction using distant supervision and semantic similarities. Adv Electr Comput En 2016; 16: 11-18.
  • Thoplan R. Text mining the works of Christopher Marlowe. Research Journal of Science & IT Management 2014; 3: 43-51.