A novel hybrid feature selection method based on rough set and improved harmony search

Inbarani, H.; Bagyamathi, M.; Azar, Ahmad
November 2015
Neural Computing & Applications;Nov2015, Vol. 26 Issue 8, p1859
Academic Journal
Feature selection is a process of selecting optimal features that produce the most prognostic outcome. It is one of the essential steps in knowledge discovery. The crisis is that not all features are important. Most of the features may be redundant, and the rest may be irrelevant and noisy. This paper presents a novel feature selection approach to deal with issues of high dimensionality in the medical dataset. Medical datasets are habitually classified by a large number of measurements and a comparatively small number of patient records. Most of these measurements are irrelevant or noisy. This paper proposes a supervised feature selection method based on Rough Set Quick Reduct hybridized with Improved Harmony Search algorithm. Rough set theory is one of the most thriving methods used for feature selection. The Rough Set Improved Harmony Search Quick Reduct (RS-IHS-QR) algorithm is a relatively new population-based meta-heuristic optimization algorithm. This approach imitates the music improvisation process, where each musician improvises their instrument's pitch by searching for a perfect state of harmony. The quality of the reduced data is measured by the classification performance. The proposed algorithm is experimentally compared with the existing algorithms Rough Set Quick Reduct (RS-QR) and Rough Set Particle Swarm Optimization Quick Reduct (RS-PSO-QR). The number of features selected by the proposed method is comparatively low. The proposed algorithm reveals more than 90 % classification accuracy in most of the cases and the time taken to reduct the dataset also decreased than the existing methods. The experimental result demonstrates the efficiency and effectiveness of the proposed algorithm.


Related Articles

  • A Feature Selection Approach of Inconsistent Decision Systems in Rough Set. Lin Sun; Jiucheng Xu; Yuhui Li // Journal of Computers;Jun2014, Vol. 9 Issue 6, p1333 

    Feature selection has been widely discussed as an important preprocessing step in data mining applications since it reduces a model's complexity. In this paper, limitations of several representative reduction methods are analyzed firstly, and then by distinguishing consistent objects form...

  • A Novel Feature Selection Method for Effective Breast Cancer Diagnosis and Prognosis. Sridevi, T.; Murugan, A. // International Journal of Computer Applications;Feb2014, Vol. 88, p28 

    A major area of current research in data mining is the field of medical diagnosis. In the present study using the Breast cancer Wisconsin data sets, a feature selection algorithm Modified Correlation Rough Set Feature Selection (MCRSFS) predicts both diagnosis and prognosis by comparing several...

  • A Novel Framework Based on Rough Set, Ant Colony Optimization and Genetic Algorithm for Spam Filtering. Yang Yang // International Journal of Advancements in Computing Technology;Aug2012, Vol. 4 Issue 14, p516 

    In this paper, we propose a rough set, ant colony optimization and genetic algorithm based framework (RCGF) for spam filtering. The proposed method is divided into three stages. In the first stage, we propose a filtering method based on ant colony system and rough set for initial feature select....

  • Selección de atributos relevantes aplicando algoritmos que combinan conjuntos aproximados y optimización en colonias de hormigas. Rodríguez, Yanela; Fernández, Yumilka; Bello, Rafael; Caballero, Yailé // Revista Cubana de Ciencias Informáticas;ene-mar2014, Vol. 8 Issue 1, p140 

    Feature selection can be viewed as one of the most fundamental problems in the field of machine learning. An analysis on the methods of feature selection is done in this investigation; stressing those that use techniques of Ant Colony Optimization and the Rough Set Theory. Also, in this...

  • Feature Selection Based on Feature Distinguish Ability And Meta-Information. Hao-dong Zhu; Hong-chan Li; Jin-Chao Zhao // International Journal of Advancements in Computing Technology;Jun2012, Vol. 4 Issue 11, p344 

    Feature selection is one of the key steps in text categorization, selected feature subset directly influences results of text categorization. Firstly, the features distinguish ability based on word frequency and document frequency was presented. Next, meta-information was introduced into rough...

  • A Novel Algorithm of Improved Semi-Supervised Clustering Based on Rough Set Theory. Lei Ge; DanDan Cui // Journal of Convergence Information Technology;May2013, Vol. 8 Issue 9, p911 

    Semi-supervised clustering using pair wise constraints such as seed set or a priori knowledge to obtain better clustering results. Compared with the unsupervised clustering, semi-supervised clustering using the small amount of supervision information to help guide the clustering process....

  • A novel feature selection algorithm based on LVQ hypothesis margin. Hu, Yaomin; Liu, Weiming // Neural Computing & Applications;May2014, Vol. 24 Issue 6, p1431 

    Feature selection has been widely discussed as an important preprocessing step in machine learning and data mining. In this paper, a new feature selection evaluation criterion based on low-loss learning vector quantization (LVQ) classification is proposed. Based on the evaluation criterion, a...

  • A novel image mining technique for classification of mammograms using hybrid feature selection. Mohanty, Aswini; Senapati, Manas; Lenka, Saroj // Neural Computing & Applications;May2013, Vol. 22 Issue 6, p1151 

    The image mining technique deals with the extraction of implicit knowledge and image with data relationship or other patterns not explicitly stored in the images. It is an extension of data mining to image domain. The main objective of this paper is to apply image mining in the domain such as...

  • ENHANCED PRIVACY PRESERVATION WITH PERTURBED DATA USING FEATURE SELECTION. PRAKASH, V. S.; SHANMUGAM, A. // Journal of Theoretical & Applied Information Technology;12/31/2013, Vol. 58 Issue 3, p641 

    In data mining applications, privacy plays an imperative role. This has triggered the development of many privacy preserving data mining techniques. To facilitate privacy preservation in data mining or machine learning algorithms over horizontally partitioned or vertically partitioned data, many...


Read the Article


Sorry, but this item is not currently available from your library.

Try another library?
Sign out of this library

Other Topics