Göksel BİRİCİK, Banu DİRİ, Ahmet Coşkun SÖNMEZ

Abstract feature extraction for text classification

Feature selection and feature extraction are frequently used to overcome the curse of dimensionality in text classification problems. We introduce an extraction method that summarizes the features of document samples: each new feature aggregates how much evidence a document contains for a given class. To form these abstract features, we project the high-dimensional document features onto a new feature space whose dimensionality equals the number of classes. We test the method with seven text classification algorithms that follow different classifier design approaches. We examine classifier performance on standard text categorization test collections and show the improvements gained by applying our extraction method. We also compare its classification performance with that of popular, well-known feature selection and feature extraction schemes. The results show that our abstract feature extraction method improves classification performance for most of the classifiers when compared with the other methods.

Keywords:

Dimensionality reduction, feature extraction, preprocessing for classification, probabilistic abstract features
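
The projection described in the abstract can be illustrated with a minimal sketch. It assumes, purely for illustration, that the evidence a term carries for a class is the summed count of that term over the training documents of that class, and that a document's scores are normalized to sum to one; the weighting used in the paper itself may differ. The names `fit_class_term_weights` and `abstract_features` are hypothetical.

```python
from collections import defaultdict


def fit_class_term_weights(docs, labels):
    """Accumulate per-class term evidence from labeled training documents.

    docs   : list of dicts mapping term -> count
    labels : list of class labels, one per document
    Assumption: the weight of a term for a class is its raw summed count
    in that class (the paper's actual weighting may differ).
    """
    classes = sorted(set(labels))
    index = {c: k for k, c in enumerate(classes)}
    weights = defaultdict(lambda: [0.0] * len(classes))
    for doc, label in zip(docs, labels):
        for term, count in doc.items():
            weights[term][index[label]] += count
    return classes, dict(weights)


def abstract_features(doc, weights, n_classes):
    """Project one document onto a space with one dimension per class."""
    scores = [0.0] * n_classes
    for term, count in doc.items():
        term_weights = weights.get(term)
        if term_weights is None:
            continue  # term never seen in training: contributes no evidence
        for k in range(n_classes):
            scores[k] += count * term_weights[k]
    total = sum(scores)
    # Normalize so the features read as relative evidence per class.
    return [s / total for s in scores] if total > 0 else scores


# Toy data, invented for this example only.
train_docs = [{"goal": 2, "match": 1}, {"vote": 3, "match": 1}]
train_labels = ["sport", "politics"]

classes, weights = fit_class_term_weights(train_docs, train_labels)
features = abstract_features({"goal": 1, "match": 1}, weights, len(classes))
print(classes, features)  # ['politics', 'sport'] [0.25, 0.75]
```

Whatever the vocabulary size, each document ends up with as many features as there are classes, which is the dimensionality reduction the abstract refers to.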

Turkish Journal of Electrical Engineering and Computer Science

ISSN: 1300-0632
Publication frequency: 6 issues per year
Publisher: TÜBİTAK

Other Articles in This Issue

Observer path design by imitation of competing constraints for bearing only tracking

Rıdvan GÜRCAN, Mesut KARTAL

Application of the Posicast control method to static shunt compensators

Amir GHORBANI, Siamak MASOUDI, Arash SHABANI

An implementation of modified scatter search algorithm to transmission expansion planning

Majid Zeinaddini MEYMAND, Masoud RASHIDINEJAD, Hamid KHORASANI

Broken rotor bar fault detection in inverter-fed squirrel cage induction motors using stator current analysis and fuzzy logic

Mehmet AKAR, İlyas ÇANKAYA

Optimal design of UPFC-based damping controller using imperialist competitive algorithm

Ali AJAMI, Reza GHOLIZADEH

Identification of linear dynamic systems using the artificial bee colony algorithm

Özden ERÇİN, Ramazan ÇOBAN

A novel approach for optimal allocation of distributed generations based on static voltage stability margin

Mohsen Rezaie ESTABRAGH, Mohsen MOHAMMADIAN, Mehdi SHAFIEE

Analysis and characterization of longitudinal flux single-sided linear switched reluctance machines

Lenin CHOKKALINGAM, Rengasamy ARUMUGAM

A novel motor speed calculation method using square wave speed sensor signals via fast Fourier transform

Hayri ARABACI, Osman BİLGİN
