Extended correlated principal component analysis with SVM-PUK in opinion mining

Extended correlated principal component analysis with SVM-PUK in opinion mining

With the rapid growth of microblogs and online sites, an inordinate number of product reviews are availableon the Internet. They not only help in analyzing, but also assist in making informed decisions about product quality. Inthe proposed work, an extended correlated principal component analysis (ECPCA) is used for dimensionality reduction.A comparative analysis is conducted on movie reviews (DB-1) and Twitter datasets (DB-2 and DB-3) in opinion miningextraction. The performance of naïve Bayes, CHIRP, and support vector machine (SVM) with kernel methods such asradial basis function (RBF), polynomial, and Pearson (PUK) are compared and analyzed on the three datasets. Theexperimental results using ECPCA for selecting relevant features and SVM-PUK as a classifier exhibit better performanceon movie reviews and Twitter datasets. The performance of the proposed approach is 99.69%, 99.4%, and 99.54% onthe DB-1, DB-2, and DB-3 datasets, respectively, and comparatively outperforms the existing methods.

___

  • Lin C, He Y, Everson R, Ruger S. Weakly supervised joint sentiment-topic detection from text. IEEE T Knowl Data En 2012; 24: 1134-1145.
  • Cambria E, Schuller B, Xia Y, Havasi C. New avenues in opinion mining and sentiment analysis. IEEE Intell Syst 2013; 28: 15-21.
  • Zhu J, Wang H, Zhu M, Tsou B K, Ma M. Aspect-based opinion polling from customer reviews. IEEE T Affective Computing 2011; 2: 37-49.
  • Jain AP, Katkar VD. Sentiments analysis of Twitter data using data mining. In: 2015 International Conference on Information Processing (ICIP); 16 Dec 2015, USA: IEEE, pp. 807-810.
  • Blei DM, Ng AY, Jordan MI. Latent dirichlet allocation. J Mach Learn Res 2003; 3: 993-1022.
  • Eirinaki M, Pisal S, Singh J. Feature-based opinion mining and ranking. J Comput Syst Sci 2012; 78: 1175-1184.
  • Go A, Bhayani R, Huang L. Twitter sentiment classification using distant supervision. CS224N Project Report, Stanford, 2009.
  • Ortega R, Fonseca A, Montoyo A. SSA-UO: unsupervised Twitter sentiment analysis. In: Second Joint Conference on Lexical and Computational Semantics; 13–14 June 2013; Atlanta, Georgia, USA: ACL. pp. 501-507.
  • Pang B, Lee L. Opinion mining and sentiment analysis. In: Foundations and Trends. Information Retrieval. USA: ACM, 2008.
  • Zhang W, Yu C, Meng W. Opinion retrieval from blogs. In: Sixteenth Conference on Information and Knowledge Management; 6–10 Nov 2007; Lisbon, Portugal: ACM. pp. 831-840.
  • Ikram M, Butt N, Afzal M. Open source software adoption evaluation through feature level sentiment analysis using Twitter data. Turk J Elec Eng & Comp Sci 2016; 24: 4481-4496.
  • Hofmann T. Probabilistic latent semantic indexing. In: 22nd annual International SIGIR Conference on Research and Development in Information Retrieval; 15–19 Aug 1999; Berkeley, CA, USA: ACM. pp. 50-57.
  • Hu M, Liu B. Mining opinion features in customer reviews. In: AAAI; 25–29 July 2004; San Jose, CA, USA: IEEE. pp. 755-760.
  • Poria S, Cambria E, Gelbukh A. Aspect extraction for opinion mining with a deep convolutional neural network. Knowl-Based Syst 2016; 108: 42-49.
  • Ahmed S, Danti A. A novel approach for Sentimental Analysis and Opinion Mining based on SentiWordNet using web data. In: International Conference on Trends in Automation, Communications and Computing Technology; 21–22 December 2015; Bangalore, India: IEEE. pp. 1-5.
  • Sahu T, Ahuja S. Sentiment analysis of movie reviews: a study on feature selection & classification algorithms. In: International Conference on Microelectronics, Computing & Communications; 23–25 January 2016; Durgapur, India: IEEE, pp.1-6.
  • Claypo N, Jaiyen S. Opinion mining for Thai restaurant reviews using K-Means clustering and MRF feature selection. In: 7th International Conference on Knowledge and Smart Technology; 28–31 January 2015; Chonburi, Thailand: IEEE. pp. 105-108.
  • Bouazizi M, Ohtsuki T. Opinion mining in Twitter: how to make use of sarcasm to enhance sentiment analysis. In: 2015 International Conference on Advances in Social Networks Analysis and Mining; 25–28 August 2015; Paris, France: IEEE. pp. 1594-1597.
  • Chen C, Chen Z, Wu C. An Unsupervised approach for person name bipolarization using principal component analysis. IEEE T Knowl Data En 2012; 24: 1963-1976.
  • Li P, Yan Y, Wang C, Ren Z, Cong P, Wang H, Feng J. Customer voice sensor: a comprehensive opinion mining system for call center conversation. In: 2016 IEEE International Conference on Cloud Computing and Big Data Analysis; 5–7 July 2016; Chengdu, China: IEEE; pp. 324-329.
  • Zhang X, Cui L, Wang Y. CommTrust: Computing Multi-Dimensional Trust by Mining E-Commerce Feedback Comments. IEEE T Knowl Data En 2014; 26: 1631-1643.
  • Pang B, Lee L. Seeing stars: Exploiting class relationships for sentiment categorization with respect to rating scales. In: 43rd annual meeting on association for computational linguistics; 25–30 June 2005; Michigan, USA: ACM. pp. 115-124.
  • Mukherjee S, Pushpak B. Sentiment analysis in Twitter with lightweight discourse analysis. In: Coling; 8–15 December 2012; Bombay, India: ACL. pp. 1847-1864.
  • Esuli A, Sebastiani F. Sentiment quantification. IEEE Intell Syst 2010; 25: 72-75