Abstract

Imbalanced data classification problem has always been one of the hot issues in the field of machine learning. Synthetic minority over-sampling technique (SMOTE) is a classical approach to balance datasets, but it may give rise to such problem as noise. Stacked De-noising Auto-Encoder neural network (SDAE), can effectively reduce data redundancy and noise through unsupervised layer-wise greedy learning. Aiming at the shortcomings of SMOTE algorithm when synthesizing new minority class samples, the paper proposed a Stacked De-noising Auto-Encoder neural network algorithm based on SMOTE, SMOTE-SDAE, which is aimed to deal with imbalanced data classification. The proposed algorithm is not only able to synthesize new minority class samples, but it also can de-noise and classify the sampled data. Experimental results show that compared with traditional algorithms, SMOTE-SDAE significantly improves the minority class classification accuracy of the imbalanced datasets.

Details

Title
An Imbalanced Data Classification Algorithm of De-noising Auto-Encoder Neural Network Based on SMOTE
Author
Zhang, Chenggang; Song, Jiazhi; Pei, Zhili; Jiang, Jingqing
Section
Computer and Information technologies
Publication year
2016
Publication date
2016
Publisher
EDP Sciences
ISSN
22747214
e-ISSN
2261236X
Source type
Conference Paper
Language of publication
English
ProQuest document ID
1786240651
Copyright
© 2016. This work is licensed under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and conditions, you may use this content in accordance with the terms of the License.